
Algorithms Foundations

Knapsack Problem

The canonical constrained optimization problem: 0/1 knapsack (NP-hard, pseudo-polynomial DP), fractional knapsack (greedy), FPTAS, and connections to Lagrangian relaxation in ML.


Why This Matters

The knapsack problem is the prototype of constrained optimization: maximize value subject to a resource constraint. This pattern is everywhere in machine learning. Selecting features subject to a computation budget, allocating GPU memory across model components, choosing which data points to label given an annotation budget: these are all knapsack-type problems.

Understanding knapsack gives you three fundamental algorithm design techniques in one problem: dynamic programming (exact solution), greedy (fast approximation for the fractional case), and FPTAS (near-optimal in polynomial time).

Mental Model

You have a backpack with a weight limit $W$. There are $n$ items, each with a weight and a value. You want to pack the most valuable subset that fits. The catch: either you cannot split items (0/1 knapsack) or you can (fractional knapsack). This simple change (can you take fractions?) flips the problem from NP-hard to polynomial.

Problem Definitions

Definition

0/1 Knapsack Problem

Given $n$ items with weights $w_1, \ldots, w_n$ and values $v_1, \ldots, v_n$, and a capacity $W$, find a subset $S \subseteq \{1, \ldots, n\}$ that maximizes:

$$\sum_{i \in S} v_i \quad \text{subject to} \quad \sum_{i \in S} w_i \leq W$$

Each item is either included entirely or not at all.

Definition

Fractional Knapsack Problem

Same setup, but you may take any fraction $x_i \in [0, 1]$ of each item. Maximize:

$$\sum_{i=1}^n x_i v_i \quad \text{subject to} \quad \sum_{i=1}^n x_i w_i \leq W$$

The 0/1 Knapsack: Dynamic Programming

Theorem

0/1 Knapsack DP Solution

Statement

Define $\text{dp}[i][c]$ as the maximum value achievable using items $1, \ldots, i$ with capacity $c$. The recurrence is:

$$\text{dp}[i][c] = \begin{cases} \text{dp}[i-1][c] & \text{if } w_i > c \\ \max(\text{dp}[i-1][c], \; \text{dp}[i-1][c - w_i] + v_i) & \text{if } w_i \leq c \end{cases}$$

with base case $\text{dp}[0][c] = 0$ for all $c$. The optimal value is $\text{dp}[n][W]$. This algorithm runs in $O(nW)$ time and $O(nW)$ space (reducible to $O(W)$ space).
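As a minimal sketch (not from the source; the function name is mine), the recurrence translates into a few lines of Python. Iterating capacities downward reuses a single row, which gives the $O(W)$-space version mentioned above:

```python
def knapsack_01(weights, values, capacity):
    """Maximum total value for the 0/1 knapsack, O(n*W) time, O(W) space."""
    dp = [0] * (capacity + 1)  # dp[c] = best value achievable with capacity c
    for w, v in zip(weights, values):
        # Iterate capacities downward so each item is used at most once:
        # dp[c - w] still refers to the previous item's row.
        for c in range(capacity, w - 1, -1):
            dp[c] = max(dp[c], dp[c - w] + v)
    return dp[capacity]
```

For example, `knapsack_01([1, 2, 3], [6, 10, 12], 5)` returns 22, matching the counterexample instance discussed in the greedy section.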

Intuition

For each item, you have two choices: skip it or take it. If you skip item $i$, the best value with capacity $c$ is the same as with items $1, \ldots, i-1$. If you take item $i$, you gain $v_i$ but lose $w_i$ of capacity. The recurrence considers both choices and picks the better one.

Proof Sketch

By induction on $i$. Base case: with zero items, the value is 0 for any capacity. Inductive step: assume $\text{dp}[i-1][\cdot]$ is correct. Any optimal solution for items $1, \ldots, i$ either includes item $i$ or not. If it includes $i$, the remaining items form an optimal solution for items $1, \ldots, i-1$ with capacity $c - w_i$ (by a cut-and-paste argument). If not, it is an optimal solution for items $1, \ldots, i-1$ with capacity $c$. The max of these two cases gives the optimal value.

Why It Matters

This is the textbook example of pseudo-polynomial time: $O(nW)$ looks polynomial, but $W$ can be exponential in the input size (which is $O(n \log W)$ bits). This distinction between polynomial and pseudo-polynomial is essential for understanding NP-hardness and approximation algorithms.

Failure Mode

The algorithm requires integer weights. For real-valued weights, you must discretize (which introduces approximation error) or use a different approach. The $O(nW)$ runtime is impractical when $W$ is very large (e.g., $W = 10^{18}$).

The Fractional Knapsack: Greedy

Theorem

Greedy Optimality for Fractional Knapsack

Statement

Sort items by value-to-weight ratio $v_i / w_i$ in decreasing order. Greedily take as much as possible of each item in this order until the knapsack is full. This greedy algorithm produces an optimal solution to the fractional knapsack problem in $O(n \log n)$ time.

Intuition

Each "unit of weight" of item $i$ is worth $v_i / w_i$. To maximize total value, you should fill the knapsack with the most valuable units first. Since you can take fractions, you can perfectly fill the knapsack by taking the highest-density items and splitting the last one.
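A short sketch of the greedy procedure (the function name is mine; a minimal illustration, not a hardened implementation):

```python
def fractional_knapsack(weights, values, capacity):
    """Optimal fractional-knapsack value: greedy by value-to-weight ratio."""
    items = sorted(zip(weights, values), key=lambda wv: wv[1] / wv[0], reverse=True)
    total = 0.0
    for w, v in items:
        if capacity <= 0:
            break
        take = min(w, capacity)   # whole item if it fits, else the fraction that does
        total += v * (take / w)   # value is proportional to the fraction taken
        capacity -= take
    return total
```

On the instance $(w, v) = (1, 6), (2, 10), (3, 12)$ with $W = 5$, this takes items 1 and 2 whole plus two-thirds of item 3, for a total value of 24.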

Proof Sketch

Suppose the greedy solution is not optimal, and consider an optimal solution that takes less of some high-ratio item $i$ and more of some low-ratio item $j$. Swap: replace some weight of item $j$ with the same weight of item $i$. Since $v_i / w_i > v_j / w_j$, the swap strictly increases the total value, contradicting the optimality of that solution.

Why It Matters

This is a clean example of the greedy method working. The key insight: the fractional relaxation removes the combinatorial structure that makes 0/1 knapsack hard. This connection between fractional relaxation and greedy optimality appears throughout combinatorial optimization.

Failure Mode

The greedy algorithm is not optimal for 0/1 knapsack. Classic counterexample: items with $(w, v) = (1, 6), (2, 10), (3, 12)$ and $W = 5$. Greedy by ratio takes items 1 and 2 (value 16), but the optimum takes items 2 and 3 (value 22).
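The counterexample is easy to check numerically. Here is a small sketch (function names are mine) comparing ratio-greedy on whole items against a brute-force optimum:

```python
from itertools import combinations

def greedy_01(weights, values, capacity):
    """Ratio greedy applied (suboptimally) to 0/1 knapsack: whole items only."""
    items = sorted(zip(weights, values), key=lambda wv: wv[1] / wv[0], reverse=True)
    total = 0
    for w, v in items:
        if w <= capacity:   # take the item whole if it fits, else skip it
            total += v
            capacity -= w
    return total

def best_01(weights, values, capacity):
    """Exact optimum by enumerating all 2^n subsets (fine for tiny n)."""
    items = list(zip(weights, values))
    best = 0
    for r in range(len(items) + 1):
        for subset in combinations(items, r):
            if sum(w for w, _ in subset) <= capacity:
                best = max(best, sum(v for _, v in subset))
    return best
```

Here `greedy_01([1, 2, 3], [6, 10, 12], 5)` returns 16 while `best_01([1, 2, 3], [6, 10, 12], 5)` returns 22, confirming the gap.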

The FPTAS: Near-Optimal in Polynomial Time

Theorem

FPTAS for 0/1 Knapsack

Statement

There exists a fully polynomial-time approximation scheme (FPTAS) for 0/1 knapsack that, for any $\epsilon > 0$, produces a solution with value at least $(1 - \epsilon) \cdot \text{OPT}$ in time $O(n^2 / \epsilon)$.

Intuition

The DP runs in $O(nW)$ time. Instead of DP over weights, do DP over values: $\text{dp}[i][v]$ is the minimum weight to achieve value $v$ using items $1, \ldots, i$. The issue is that values can be large. The FPTAS fixes this by rounding all values down to multiples of a small number $K = \epsilon \cdot v_{\max} / n$, reducing the number of distinct value levels to $O(n / \epsilon)$.

Proof Sketch

Rounding each value down by at most $K$ changes the optimal value by at most $nK = \epsilon \cdot v_{\max} \leq \epsilon \cdot \text{OPT}$. So the rounded solution has value $\geq (1 - \epsilon) \cdot \text{OPT}$. The DP over rounded values has $O(n / \epsilon)$ value levels and $n$ items, giving $O(n^2 / \epsilon)$ time. Since this is polynomial in both $n$ and $1/\epsilon$, it is an FPTAS.
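As a sketch (the function name is mine), the rounding idea can be implemented directly as a min-weight DP over rounded value levels. This straightforward version enumerates every reachable level, so it is somewhat slower than the stated bound, but it exhibits the same $(1-\epsilon)$ guarantee:

```python
def knapsack_fptas(weights, values, capacity, eps):
    """(1 - eps)-approximate 0/1 knapsack via value rounding."""
    n = len(values)
    K = eps * max(values) / n
    scaled = [int(v / K) for v in values]   # round each value down to a multiple of K
    top = sum(scaled)                       # largest reachable rounded level
    INF = float("inf")
    min_w = [0] + [INF] * top   # min_w[t]: least weight achieving rounded value t
    true_v = [0] * (top + 1)    # true (unrounded) value of the set behind min_w[t]
    for w, v, sv in zip(weights, values, scaled):
        for t in range(top, sv - 1, -1):    # downward: each item used at most once
            if min_w[t - sv] + w < min_w[t]:
                min_w[t] = min_w[t - sv] + w
                true_v[t] = true_v[t - sv] + v
    # Best true value among rounded levels reachable within capacity.
    return max(v for wt, v in zip(min_w, true_v) if wt <= capacity)
```

On the counterexample instance with `eps = 0.1` this happens to recover the exact optimum, 22; in general the guarantee only promises at least $(1 - \epsilon) \cdot \text{OPT}$.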

Why It Matters

The FPTAS shows that while 0/1 knapsack is NP-hard exactly, it is easy to approximate. You can get within any desired fraction of optimal in polynomial time. This is the gold standard for NP-hard problems. Not all NP-hard problems admit an FPTAS (unless P = NP).

Failure Mode

The $O(n^2 / \epsilon)$ runtime can still be large. For $\epsilon = 0.01$ and $n = 10^6$, you need $10^{14}$ operations. In practice, branch-and-bound methods are often faster for real-world instances.

Connection to Lagrangian Relaxation

The constrained optimization structure of knapsack connects directly to Lagrangian methods used throughout ML. The Lagrangian relaxation of the 0/1 knapsack is:

$$L(\lambda) = \max_{x \in \{0,1\}^n} \sum_{i=1}^n v_i x_i - \lambda\left(\sum_{i=1}^n w_i x_i - W\right)$$

For a given multiplier $\lambda \geq 0$, this decomposes into $n$ independent decisions: include item $i$ if $v_i - \lambda w_i > 0$, i.e., if the "value minus cost" is positive. The optimal $\lambda$ is found by solving the dual problem $\min_{\lambda \geq 0} L(\lambda)$.
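A small numerical sketch (the function names and the grid search over $\lambda$ are mine, not from the source): evaluate $L(\lambda)$ via the per-item decisions above, then minimize over a grid to obtain a dual upper bound on OPT.

```python
def lagrangian_value(weights, values, capacity, lam):
    """L(lambda): the relaxation decouples, so include item i iff v_i - lam*w_i > 0."""
    gain = sum(v - lam * w for w, v in zip(weights, values) if v - lam * w > 0)
    return gain + lam * capacity            # the -lambda*(sum w_i x_i - W) term

def dual_bound(weights, values, capacity, grid=1000):
    """Approximate min over lambda >= 0 of L(lambda) on a grid: an upper bound on OPT."""
    lam_hi = max(v / w for w, v in zip(weights, values))  # beyond this, no item is kept
    return min(lagrangian_value(weights, values, capacity, k * lam_hi / grid)
               for k in range(grid + 1))
```

On the instance $(w, v) = (1, 6), (2, 10), (3, 12)$ with $W = 5$, the dual bound is about 24 while the integer optimum is 22. The gap is an integrality gap (the fractional optimum is exactly 24): the dual bound is valid, by weak duality, but not tight.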

This Lagrangian perspective appears whenever you have a constrained optimization in ML: the multiplier $\lambda$ plays the same role as the regularization parameter in regularized ERM, the KL penalty coefficient in RLHF, or the constraint threshold in constrained optimization.

Common Confusions

Watch Out

Pseudo-polynomial is not polynomial

The $O(nW)$ DP looks polynomial, but $W$ is an integer whose representation requires only $O(\log W)$ bits. The runtime is therefore exponential in the size of the input encoding. If someone gives you a knapsack instance with $W = 2^{100}$, the DP is utterly infeasible despite being "$O(nW)$".

Watch Out

Greedy works for fractional but not 0/1

The greedy algorithm (sort by value-to-weight ratio) is optimal for fractional knapsack but can be arbitrarily bad for 0/1 knapsack without modification. The integrality constraint changes the problem from continuous optimization (easy) to combinatorial optimization (hard).

Watch Out

NP-hard does not mean practically impossible

The FPTAS gives near-optimal solutions in polynomial time. Branch-and-bound solves most practical instances quickly. NP-hardness is a worst-case complexity statement, not a practical impossibility statement. Real-world knapsack instances with millions of items are routinely solved.

Summary

  • 0/1 knapsack: NP-hard, solved by DP in $O(nW)$ pseudo-polynomial time
  • Fractional knapsack: greedy (sort by $v_i/w_i$) is optimal, $O(n \log n)$
  • FPTAS: $(1-\epsilon)$-approximate solution in $O(n^2/\epsilon)$ time
  • Pseudo-polynomial is not polynomial: $O(nW)$ depends on the magnitude of $W$
  • Lagrangian relaxation connects knapsack to regularized optimization in ML
  • The gap between fractional and 0/1 knapsack illustrates how integrality makes problems hard

Exercises

ExerciseCore

Problem

Solve the 0/1 knapsack instance: items $(w, v) = (2, 3), (3, 4), (4, 5), (5, 6)$ with capacity $W = 7$. Fill in the DP table and trace back the optimal solution.

ExerciseAdvanced

Problem

Prove that the greedy algorithm for 0/1 knapsack (take items in decreasing order of $v_i/w_i$ until one does not fit, then skip it) can achieve value as low as $\text{OPT}/2$ in the worst case. Give a concrete example.

ExerciseAdvanced

Problem

Explain why the knapsack FPTAS does not violate the fact that 0/1 knapsack is NP-hard. What is the distinction between exact and approximate solutions?

References

Canonical:

  • Korte & Vygen, Combinatorial Optimization, Chapter 17
  • Garey & Johnson, Computers and Intractability (1979)

Current:

  • Kellerer, Pferschy, Pisinger, Knapsack Problems (2004)
  • Williamson & Shmoys, The Design of Approximation Algorithms (2011), Chapter 3

Next Topics

The natural next steps from knapsack:

  • Lagrangian relaxation: the general framework for constrained optimization
  • Integer programming: the broader class of problems knapsack belongs to
  • Approximation algorithms: systematic techniques for NP-hard problems

Last reviewed: April 2026
