What Each Bounds
Both Azuma-Hoeffding and Freedman bound the tail probability of a martingale. Let be a martingale with differences .
Azuma-Hoeffding uses only the bounded range of each increment.
Freedman uses both the bounded range and the cumulative conditional variance (the predictable quadratic variation).
Side-by-Side Statement
Azuma-Hoeffding Inequality
If almost surely for each , then:
For identically bounded increments :
Freedman's Inequality
If almost surely and is the predictable quadratic variation, then for any :
Where Each Is Stronger
Azuma-Hoeffding wins on simplicity
Azuma-Hoeffding requires one piece of information per increment: the bound . There is no conditional variance to track or bound. The statement is clean and the proof is a direct application of Hoeffding's lemma to martingale differences. When you have bounded increments and no variance information, Azuma-Hoeffding is the right choice.
Freedman wins when variance is small
Consider a martingale where each increment is bounded by but has conditional variance . Azuma-Hoeffding treats the increment as if it could take any value in with equal ease. Freedman knows the increment is usually small.
For concreteness, suppose but for some . After steps:
| Azuma exponent | Freedman exponent | |
|---|---|---|
| Deviation |
When and is moderate, Freedman is tighter by a factor of roughly .
The Two Regimes of Freedman
Freedman's Two Regimes
Statement
Freedman's bound interpolates between two behaviors:
Small deviations (): The denominator , giving:
This is a sub-Gaussian tail with the actual variance, not the worst-case variance from the bounded range.
Large deviations (): The denominator , giving:
This is a sub-exponential (Poisson-like) tail, linear in .
Intuition
Near the mean, the martingale behaves like a Gaussian with its true variance. Far from the mean, the bounded increments cause the tail to transition from Gaussian () to exponential (). Azuma-Hoeffding gives a single Gaussian-like bound everywhere, but with the worst-case variance instead of the true conditional variance. Freedman captures the correct behavior in both regimes.
Failure Mode
Freedman requires bounding the predictable quadratic variation , which is a random quantity. In many applications, you bound by a deterministic quantity using problem-specific arguments. If you cannot bound tightly, Freedman's advantage over Azuma is lost.
The Relationship to Hoeffding vs. Bernstein
Azuma-Hoeffding is to Freedman as Hoeffding is to Bernstein. The parallel is exact:
| Independent sums | Martingales |
|---|---|
| Hoeffding (range only) | Azuma-Hoeffding (range only) |
| Bernstein (range + variance) | Freedman (range + conditional variance) |
Azuma-Hoeffding generalizes Hoeffding from independent sums to martingales. Freedman generalizes Bernstein from independent sums to martingales. The proofs follow the same pattern: Hoeffding's lemma for Azuma, Bernstein's moment condition for Freedman.
When to Use Each
Use Azuma-Hoeffding when:
- You only know the bounded range of each increment.
- The conditional variance is comparable to the square of the bound.
- You want a simple, quick bound for a rough estimate.
Use Freedman when:
- The conditional variance is much smaller than .
- You need tight bounds for rare events (importance sampling, bandit algorithms).
- You are working in the "small variance" regime where Azuma wastes a factor of .
Common Confusions
Freedman bounds a joint event
Freedman's inequality as stated bounds . This is a joint event. To get a bound on , you need to be small, and then apply a union bound. Forgetting the condition is a common error.
Azuma-Hoeffding is not just Hoeffding applied to the sum
The martingale differences are not independent. Azuma-Hoeffding is a genuine martingale result. The proof uses Hoeffding's lemma on each conditional increment, then telescopes. You cannot simply apply the independent-sum Hoeffding inequality to correlated increments.
Freedman does not require bounded increments in the same way Azuma does
Azuma requires with possibly different bounds per step. Freedman requires a uniform bound for all . If the bounds vary wildly, Azuma can be more natural to apply (each step gets its own bound), while Freedman requires a single worst-case .
References
Canonical:
- Freedman, "On Tail Probabilities for Martingales" (Annals of Probability, 1975)
- Azuma, "Weighted Sums of Certain Dependent Random Variables" (Tohoku Math Journal, 1967)
Current:
- Boucheron, Lugosi, Massart, Concentration Inequalities (2013), Chapter 3
- Wainwright, High-Dimensional Statistics (2019), Chapter 2