Why is the knapsack problem pseudo-polynomial?



I know that Knapsack is NP-complete while it can be solved by DP. They say that the DP solution is pseudo-polynomial, since it is exponential in the "length of input" (i.e. the number of bits required to encode the input). Unfortunately I did not get it. Can anybody explain that pseudo-polynomial thing to me slowly?


指尖上得阳光 2024-10-16 10:59:28


The running time is O(NW) for an unbounded knapsack problem with N items and knapsack of size W. W is not polynomial in the length of the input though, which is what makes it pseudo-polynomial.

Consider W = 1,000,000,000,000. It only takes 40 bits to represent this number, so input size = 40, but the computational runtime uses the factor 1,000,000,000,000, which is O(2^40).

So the runtime is more accurately said to be O(N · 2^(bits in W)), which is exponential.
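To make the O(NW) factor concrete, here is a minimal C++ sketch of the unbounded-knapsack DP this answer refers to (the function name and types are illustrative, not taken from the answer):

#include <vector>
#include <algorithm>
using std::vector;

// best[c] = maximum value achievable with capacity c (items may be reused).
// The nested loops do Theta(N * W) work and the table has W + 1 entries, so
// both time and memory scale with the numeric value of W, not with the
// roughly log2(W) bits it takes to write W down.
long long UnboundedKnapsack(const vector<long long>& value,
                            const vector<long long>& weight,
                            long long W) {
    vector<long long> best(W + 1, 0);
    for (long long c = 1; c <= W; ++c) {            // one pass per unit of capacity
        for (size_t i = 0; i < value.size(); ++i) { // try every item at this capacity
            if (weight[i] <= c)
                best[c] = std::max(best[c], best[c - weight[i]] + value[i]);
        }
    }
    return best[W];
}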


樱花落人离去 2024-10-16 10:59:28


In most of our problems, we're dealing with large lists of numbers which fit comfortably inside standard int/float data types. Because of the way most processors are built to handle 4-8 byte numbers at a time at no additional cost (relative to numbers that fit in, say, 1 byte), we rarely encounter a change in running time from scaling our numbers up or down within ranges we encounter in real problems - so the dominant factor remains just the sheer quantity of data points, the n or m factors that we're used to.

(You can imagine that the Big-O notation is hiding a constant factor that divides-out 32 or 64 bits-per-datum, leaving only the number-of-data-points whenever each of our numbers fit in that many bits or less)

But try reworking with other algorithms to act on data sets involving big ints - numbers that require more than 8 bytes to represent - and see what that does to the runtime. The magnitude of the numbers involved always makes a difference, even in the other algorithms like binary sort, once you expand beyond the buffer of safety conventional processors give us "for-free" by handling 4-8 byte batches.

The trick with the Knapsack algorithm that we discussed is that it's unusually sensitive (relative to other algorithms) to the magnitude of a particular parameter, W. Add one bit to W and you double the running time of the algorithm. We haven't seen that kind of dramatic response to changes in value in other algorithms before this one, which is why it might seem like we're treating Knapsack differently - but that's a genuine analysis of how it responds in a non-polynomial fashion to changes in input size.

冬天的雪花 2024-10-16 10:59:28


The way I understand this is that the capacity would've been O(W) if the capacity input were an array of [1,2,...,W], which has a size of W. But the capacity input is not an array of numbers, it's instead a single integer. The time complexity is about the relationship to the size of input. The size of an integer is NOT the value of the integer, but the number of bits representing it. We do later convert this integer W into an array [1,2,...,W] in the algorithm, leading people into mistakenly thinking W is the size, but this array is not the input, the integer itself is.

Think of the input as "an array of stuff", and the size as "how many stuff in the array". The item input is actually an array of n items, so size = n. The capacity input is NOT an array with W numbers in it, but a single integer, represented by log(W) bits. Increase its size by 1 (adding 1 meaningful bit) and W doubles, so the run time doubles, hence the exponential time complexity.
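A quick way to see the size-versus-value distinction is to print how many bits the capacity actually occupies; this is a small illustrative sketch (not part of the original answer):

#include <cstdio>

// Number of bits needed to write W in binary - the true "input size" of the capacity.
int BitLength(unsigned long long w) {
    int bits = 0;
    while (w > 0) { ++bits; w >>= 1; }
    return bits;
}

int main() {
    // Doubling W (which doubles the DP's running time) adds only one bit to the
    // encoding, so the running time is exponential in the encoded size.
    for (unsigned long long w = 2; w <= 1024; w *= 2)
        std::printf("W = %4llu  ->  %2d bits\n", w, BitLength(w));
    return 0;
}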

橘香 2024-10-16 10:59:28


The Knapsack algorithm's run-time is bounded not only by the size of the input (n, the number of items) but also by the magnitude of the input (W, the knapsack capacity): O(nW), which is exponential in the number of bits used to represent W in binary. Computational complexity (i.e. how processing is done inside a computer, through bits) is concerned only with the size of the inputs, not their magnitudes/values.

Disregard the value/weight list for a moment. Let's say we have an instance with knapsack capacity 2. W would take two bits in the input data. Now we shall increase the knapsack capacity to 4, keeping the rest of the input. Our input has only grown by one bit, but the computational complexity has increased twofold. If we increase the capacity to 1024, we would have just 10 bits of the input for W instead of 2, but the complexity has increased by a factor of 512. Time complexity grows exponentially in the size of W in binary (or decimal) representation.

Another simple example that helped me understand the pseudo-polynomial concept is the naive primality testing algorithm. For a given number n we check whether it is evenly divisible by each integer in the range 2..√n, so the algorithm takes about √n − 1 steps. But here, n is the magnitude of the input, not its size.
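A minimal sketch of that naive test (the name is illustrative; this is not code from the answer):

#include <cstdint>

// Trial-division primality test. The loop runs about sqrt(n) times, i.e. roughly
// 2^(b/2) iterations for a b-bit n: polynomial in the value of n, but exponential
// in the length of its binary encoding - the same pseudo-polynomial pattern.
bool IsPrimeNaive(std::uint64_t n) {
    if (n < 2) return false;
    for (std::uint64_t d = 2; d <= n / d; ++d) {
        if (n % d == 0) return false;
    }
    return true;
}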

Now, the regular O(n) case:

By contrast, searching an array for a given element runs in polynomial time: O(n). It takes at most n steps and here n is the size of the input (the length of the array).

See also: Calculating bits required to store decimal number

黑色毁心梦 2024-10-16 10:59:28


Complexity is based on the input. In the knapsack problem, the inputs are the size, the max capacity W, and the profit and weight arrays. We construct the DP table as size * W, so it feels like polynomial time complexity. But the input W is an integer, not an array. So the running time is O(size * 2^b), where b is the number of bits required to store the given W. If the number of bits increases by 1, the running time doubles. Thus it is exponential, and thereby pseudo-polynomial.

も星光 2024-10-16 10:59:28


Time complexity is defined as a function of the input size (not of the input value).

Knapsack has complexity O(nW); here n = the size of the coins array.
To represent W, we need log2(W) bits. This gives us O(n * 2^(log2 W)) = O(n * 2^(input size)), hence exponential.

By the same logic, coin change solved in O(nW) is also exponential in the input length. However, a greedy algorithm with canonical coins is polynomial:

#include <vector>
using std::vector;

// Greedy change-making: assumes a canonical coin system, sorted in ascending order.
int CoinChange(const vector<int>& coins, int amount) {
    int coinsUsed = 0;
    // Take as many of the largest coin as possible, then fall through to smaller ones.
    for (int i = static_cast<int>(coins.size()) - 1; i >= 0; i--) {
        coinsUsed += amount / coins[i];
        amount %= coins[i];
    }
    return coinsUsed;
}