Sequential Inference
e-processes
The sequential version of an e-value: a nonnegative process whose value at every stopping time is an e-value for the null. Constructed as the running product of conditional e-values adapted to a filtration; equivalently, a nonnegative supermartingale under the null. The object that powers anytime-valid inference, sequential testing, and confidence sequences.
Why This Matters
An e-value is a single nonnegative statistic with . An e-process is the sequential version: a collection where each is an e-value with respect to the information available at time , and the construction is consistent across times so that the value at any stopping time is still an e-value. The construction lets a tester accumulate evidence, peek as often as they like, stop whenever the evidence is convincing, and retain the same Type I error guarantee they would have had at a single fixed sample size.
The technical bridge is one of the cleanest results in modern statistics. If is a conditional e-value at time given the past, the running product is a nonnegative supermartingale under the null. Ville's inequality bounds the maximum of a nonnegative supermartingale: . The supremum is over all times, including any stopping time the analyst might choose adaptively. Optional stopping, optional continuation, and data-dependent decision rules all preserve the bound.
This is the technical content of "anytime-valid inference." Every confidence sequence, every always-valid p-value, every safe sequential test in the modern literature is a special case of the e-process construction, dating back to Wald's 1945 SPRT (likelihood-ratio e-process between two simple hypotheses) and reaching forward to LLM evaluation pipelines and continuous-monitoring clinical trials.
Formal Setup
Filtration
A nondecreasing family of -algebras on a probability space, with representing the information available at time . In practice, for an observed sequence.
Adapted process
A stochastic process is adapted to if is -measurable for every . Equivalently, is a function of the observations through time only.
e-process
A nonnegative process adapted to the filtration , with , such that for every stopping time (possibly infinite), A test that rejects when has Type I error at most , by Markov's inequality applied to .
Stopping time
A random variable such that the event is -measurable. Intuitively, the decision to stop at time depends only on the information available by time .
Construction From Conditional e-values
The standard recipe builds an e-process from a sequence of conditional e-values. The constraint is that each new term has unit conditional expectation given the past.
Running Product of Conditional e-values Is a Supermartingale
Statement
Let be a sequence of nonnegative random variables adapted to with for every . Define and for . Then is a nonnegative supermartingale under adapted to , and for every .
Consequently, is an e-process, and for every stopping time ,
Intuition
Each is a conditional bet: a payoff function with unit expected value given everything seen so far. Wealth multiplies across rounds. The conditional unit-expectation property is the martingale-like constraint that survives compounding. Equality at every step gives a martingale; an inequality gives a supermartingale, which is fine because we only need an upper bound on expected wealth.
Proof Sketch
Compute the conditional expectation of given : The factoring of out of the conditional expectation uses -measurability. The supermartingale property is the displayed inequality. The unconditional bound follows from the tower property: .
For the stopping-time statement, the optional stopping theorem (Doob) for nonnegative supermartingales gives for any stopping time .
Why It Matters
This construction is the engine of every modern sequential test. Wald's SPRT is the special case where (the likelihood ratio of one observation), and the running product is the joint likelihood ratio. Confidence sequences are built from running products of inverted-bet e-processes. The Sequential Hypothesis Testing pipeline at large-scale A/B platforms (Optimizely, Statsig) reduces to this theorem applied to bounded-mean estimators.
Failure Mode
The bound requires the conditional unit-expectation property under the null. If the analyst chooses based on data not in (e.g. by looking ahead, or by retrospectively reordering observations), the construction fails. The "adapted to " requirement is non-negotiable. Reusing the same observations across multiple definitions also breaks the conditional expectation, even though the resulting process is still adapted.
Optional ProofVille's inequality for nonnegative supermartingalesShow
The single-shot Markov inequality for an e-value generalizes to a maximal inequality for the entire e-process trajectory.
Ville's Inequality
Statement
Let be a nonnegative supermartingale under with . Then for every , Setting gives the anytime-valid Type I error guarantee: under the null, the probability that the e-process ever exceeds is at most .
Proof Sketch
Define , a stopping time. On the event , and . By the optional stopping theorem, On the complement, the bound is trivial. Apply Markov: so .
Why It Matters
The supremum is over all times. Ville's inequality says optional stopping at any time, including the time that maximizes the e-process value, cannot inflate Type I error beyond . This is the technical content of "anytime-valid."
Failure Mode
Ville's inequality requires the uniform supermartingale property: for every , not just unconditionally. Mixing e-values from different filtrations or recomputing the conditional expectation only at certain times breaks the bound.
Canonical Example: Wald's SPRT
The sequential probability ratio test (Wald 1945) is the prototype e-process. Test against , both simple, observe iid. Define Each likelihood ratio is a conditional (in fact unconditional, by independence) e-value: . The running product is a nonnegative supermartingale under (actually a martingale, since the inequality is an equality).
Wald's original test stops the first time (reject ) or (accept ), with and for desired error rates. The Type I error guarantee follows from Ville: under , . The expected sample size to reject under scales like , the Kullback-Leibler rate.
The same construction works for composite nulls via universal inference (Wasserman-Ramdas-Balakrishnan 2020): plug in any density estimator for on a held-out half of the data, evaluate the likelihood ratio on the other half. The result is an e-process for any composite null.
Worked Exercise
Problem
Let be iid and consider testing against for fixed . Construct the likelihood-ratio e-process explicitly. Compute its value at when and . At what time does the e-process exceed if the true mean is exactly and for all ?
Practical Example: Hourly A/B Test Monitoring
A standard A/B test compares conversion rates of two website variants over a planned days of traffic. The product team wants to peek hourly and stop early if the result is convincing. With classical -test -values, peeking inflates Type I error. With an e-process:
- Define .
- For each hour , the streaming users contribute conditional bets based on the observed click-rate differential and the running variance estimate. Empirical-Bernstein constructions (Howard-Ramdas-McAuliffe-Sekhon 2021) give explicit formulas for with bounded outcomes.
- The running product is the e-process. Stop the test at the first where (for ) or at the budget , whichever comes first.
Ville's inequality guarantees that under , the probability of falsely rejecting at any time is at most , regardless of how many hourly looks the analyst performed. Confidence sequences (the next topic) provide the matching interval estimates.
Connection to Classical Tests
Every classical sequential test maps onto an e-process construction:
- Wald's SPRT: likelihood-ratio e-process with two-sided thresholds.
- Group-sequential tests (Pocock, O'Brien-Fleming): e-process is implicit; the boundaries are tuned to the fixed schedule of looks. Adaptive peeking outside the schedule still breaks them; the e-process framing handles arbitrary stopping.
- Always-valid p-values: defined as , which inherits Ville's inequality.
- Bayes factors: under a prior, the Bayes factor is a likelihood ratio averaged over the prior, which is again an e-value. The product form across observations gives an e-process.
The modern view (Ramdas, Grünwald, Vovk, Shafer 2023) is that all sequential tests are e-processes under different choices of betting strategy.
Implementation Note
The standard implementation pattern is to maintain the log-e-process for numerical stability:
log_E = 0.0
for x_t in stream:
log_B_t = compute_log_conditional_bet(x_t, history)
log_E += log_B_t
if log_E >= np.log(1 / alpha):
return "reject H_0", t
history.append(x_t)
The confseq Python package (Howard et al. 2021) provides ready-made empirical-Bernstein e-processes for bounded outcomes. For Gaussian observations with unknown variance, the Robbins (1970) mixture e-process is standard; for two-sample binomial tests, the betting construction from Waudby-Smith and Ramdas (2024) gives the tightest confidence sequences in the bounded-outcome regime.
Care must be taken with the filtration. Each must depend only on observations strictly before time (or, by convention, on and any external randomness independent of ). Using the current observation inside violates adaptation and breaks the supermartingale property.
References
Canonical:
- Ramdas, A., Grünwald, P., Vovk, V., and Shafer, G. (2023). "Game-theoretic statistics and safe anytime-valid inference." Statistical Science 38(4), pp. 576-601. The modern survey, with Section 3 on e-processes and Section 4 on construction methods.
- Shafer, G. and Vovk, V. (2019). Game-Theoretic Foundations for Probability and Finance (Wiley). Chapters 3-7 develop the betting/martingale framework that e-processes formalize.
- Howard, S. R., Ramdas, A., McAuliffe, J., and Sekhon, J. (2021). "Time-uniform Chernoff bounds via nonnegative supermartingales." Probability Surveys 18, pp. 257-317. Quantitative time-uniform concentration inequalities derived from e-processes; the reference for empirical-Bernstein constructions.
Historical:
- Wald, A. (1945). "Sequential tests of statistical hypotheses." Annals of Mathematical Statistics 16(2), pp. 117-186. The SPRT.
- Wald, A. (1947). Sequential Analysis (Wiley). Book-length treatment of the SPRT and its operating characteristics.
- Ville, J. (1939). Étude critique de la notion de collectif (Gauthier-Villars). The original supermartingale maximal inequality, in a foundations-of-probability context.
Current:
- Waudby-Smith, I. and Ramdas, A. (2024). "Estimating means of bounded random variables by betting." Journal of the Royal Statistical Society, Series B 86(1), pp. 1-27. The state-of-the-art betting construction for confidence sequences.
- Grünwald, P., de Heide, R., and Koolen, W. (2024). "Safe testing." Journal of the Royal Statistical Society, Series B. e-process construction via reverse information projection for composite nulls.
Next Topics
- Anytime-valid inference: the broader framing of inference under continuous monitoring.
- Confidence sequences: time-uniform interval estimates derived from e-processes.
- Safe testing: the formal framework that builds tests directly on e-processes.
- E-values and anytime-valid inference: the umbrella reference with proofs and multiple-testing applications.
- Martingale theory: the mathematical foundation underneath the supermartingale construction.
Last reviewed: May 13, 2026
Canonical graph
Required before and derived from this topic
These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.
Required prerequisites
6- Measure-Theoretic Probabilitylayer 0B · tier 1
- Modes of Convergence of Random Variableslayer 0B · tier 1
- e-valueslayer 2 · tier 1
- Likelihood-Ratio, Wald, and Score Testslayer 2 · tier 1
- Martingale Theorylayer 0B · tier 2
Derived topics
3- Confidence Sequenceslayer 2 · tier 1
- Anytime-Valid Inferencelayer 3 · tier 1
- Safe Testinglayer 3 · tier 1
Graph-backed continuations