Statistics
Tweedie Distribution
The Tweedie distribution is the one-parameter subfamily of the exponential dispersion model (EDM) family characterized by a power variance function V(mu) = mu^p. Special cases recover the Normal (p=0), Poisson (p=1), Gamma (p=2), and Inverse Gaussian (p=3). The intermediate range 1<p<2 produces a compound Poisson-Gamma distribution with a point mass at zero and a continuous positive part, which is the canonical model for insurance loss severity. The page covers the EDM construction, the four special-case identifications, and the compound-Poisson-Gamma representation; the applied actuarial treatment lives on ActuaryPath.
Prerequisites
Why This Matters
The Tweedie family is the answer to one question: which exponential dispersion model distributions have variance proportional to a power of the mean? The answer, due to Tweedie (1984), is a one-parameter family indexed by the power . Setting to specific integers recovers four standard distributions: Normal (), Poisson (), Gamma (), Inverse Gaussian (). The intermediate regime is the surprising one. It produces a distribution with a point mass at zero plus a continuous Gamma-shaped positive part, and it is the standard model for insurance claim severity, rainfall totals, and other quantities that are often exactly zero and otherwise positive and skewed.
The Tweedie family also gives the variance function for generalized linear models with power-law variance, which is why it appears in modern actuarial software, environmental statistics, and high-frequency-zero count modeling.
Background: Exponential Dispersion Models
An exponential dispersion model (EDM) has density (or PMF) of the form where is the natural parameter, is the dispersion parameter, is the cumulant function, and is a normalization. The mean and variance are where the variance function encodes the mean-variance relationship of the family. The EDM family includes Normal, Poisson, Gamma, Binomial, and Inverse Gaussian; each is determined by its variance function.
The Tweedie Variance Function
Tweedie Family as Power-Variance EDMs
Statement
For each , there is an EDM with variance function . The corresponding distribution is called the Tweedie distribution with power , written . The mean is and the variance is . Special cases identify standard families:
| Tweedie distribution | Variance function | |
|---|---|---|
| Normal | ||
| Poisson (with ); over-dispersed Poisson otherwise | ||
| Compound Poisson-Gamma (point mass at zero plus positive continuous part) | ||
| Gamma | ||
| Inverse Gaussian | ||
| , | Positive stable distributions (heavy-tailed continuous on ) |
The gap is excluded: no Tweedie distribution exists for those powers because the corresponding cumulant function would not be convex on a non-degenerate parameter space.
Intuition
The variance function encodes how the spread of the data grows with the mean. Constant variance () is Normal. Variance equal to the mean () is Poisson. Variance proportional to the squared mean () is Gamma. The whole continuum is achievable, with the surprising compound case appearing in .
Proof Sketch
The cumulant function for Tweedie with power is, up to constants, defined on the appropriate domain of so is positive. Computing and inverting gives , then by direct calculation. The boundary cases are continuous limits requiring separate treatment and recover the Normal, Poisson, and Gamma cumulant functions respectively. See Jorgensen (1987, 1997) for the existence proof of the EDM for each admissible .
Why It Matters
The Tweedie family is closed under several operations relevant in GLM fitting: scaling, summing iid copies, and translating the mean via a link function. The MLE for a Tweedie GLM can be computed by IRLS with the variance function plugged into the standard machinery. Modern actuarial-pricing software (R tweedie, statmod, Python glm packages) implements exactly this.
Failure Mode
The dispersion and power are both estimated from data in applied GLMs; identifying them simultaneously requires either a profile likelihood over or a saddlepoint approximation. With small samples the joint MLE can be unstable, particularly near the boundaries (Poisson) and (Gamma). Practical workflow: fix on a grid, fit the GLM for each, compare via profile log-likelihood.
The Compound Poisson-Gamma Regime:
Compound Poisson-Gamma Representation in the Range 1 to 2
Statement
For , is the distribution of where and are iid independent of , with parameters In particular, is strictly positive, and conditional on , has a continuous density on .
Intuition
Think of an insurance portfolio in a one-year period. Each policyholder either has zero claims (with positive probability) or has a positive number of claims, each of positive Gamma-distributed size. The total annual claim amount is exactly zero on policies with no claims and Gamma-sum-shaped on policies with at least one. The Tweedie distribution captures both regimes in a single parametric family.
Proof Sketch
Compute the MGF of by conditioning on : where is the Gamma MGF. Matching the cumulant function to the Tweedie cumulant identified in the proof above gives the stated parameter map. The boundary cases (pure Poisson with and Gamma collapsing to a point mass at ) and (pure Gamma with , no zero mass) are continuity arguments.
Why It Matters
This is why the Tweedie distribution is the workshop tool for any quantity that is exactly zero with positive probability and otherwise positive and right-skewed. Insurance claim severity by policy. Rainfall by day in a dry region. Health-care expenditure by individual in a year. The compound-Poisson-Gamma decomposition makes the model interpretable (claim frequency, claim size, both as separate parameters) while keeping the unified Tweedie GLM tractable.
Failure Mode
The density on does not have a closed-form expression for general and must be computed by series expansion or saddlepoint approximation. Statistical software handles this; do not try to write the density yourself for production use.
Common Confusions
Tweedie distribution is not the Tweedie formula
The Tweedie distribution (this page) is the EDM family with power variance function. The Tweedie formula, sometimes also called the Robbins-Tweedie formula, is an empirical-Bayes identity that expresses the posterior mean as for a Gaussian likelihood with prior . The two share only the surname; the formula appears in score matching and denoising diffusion (see score matching line 310 for the formula in that context), the distribution appears in GLMs and actuarial pricing.
The gap p in 0 to 1 is mathematical, not a software bug
The Tweedie family has no member for . This is a genuine restriction of the EDM structure, not an implementation choice in tweedie or similar packages. Asking for a Tweedie GLM with will produce a software error or silently fall back to a nearby admissible power; check the package documentation.
The dispersion phi and the power p are both parameters
A Tweedie GLM has two parameters beyond the mean structure: the dispersion (analogous to variance in a Gaussian GLM) and the power . Both must be estimated or fixed. In actuarial pricing, is often profiled over and chosen by AIC.
Compound Poisson-Gamma is not zero-inflated Gamma
A zero-inflated Gamma is a mixture: a Bernoulli draw for "zero or positive", then a Gamma if positive. A compound Poisson-Gamma has a structural reason for the zero mass: the underlying claim count is Poisson, so zero claims happens with probability . The two models are different and produce different inferences when claims-per-policy can exceed one; the Tweedie distribution corresponds to the second.
Exercises
Problem
Verify that the Tweedie variance function specializes correctly to the Normal, Poisson, and Gamma cases.
Problem
For with , compute in terms of using the compound Poisson-Gamma representation.
References
Canonical:
- Tweedie, "An index which distinguishes between some important exponential families" (in Statistics: Applications and New Directions, Proceedings of the Indian Statistical Institute Golden Jubilee International Conference, 1984), pages 579-604
- Jorgensen, "Exponential dispersion models" (Journal of the Royal Statistical Society, Series B, 1987), volume 49, pages 127-162
- Jorgensen, The Theory of Dispersion Models (1997). The monograph treatment.
- Smyth, "Regression analysis of quantity data with exact zeros" (Proceedings of the Second Australia-Japan Workshop on Stochastic Models, 1996), pages 572-580. The GLM treatment of the case.
Actuarial applications:
- Frees, Derrig, and Meyers (eds.), Predictive Modeling Applications in Actuarial Science, Volume 1 (2014), Chapters 5 and 8
- See also the ActuaryPath topic page on compound Poisson and Tweedie for the applied pricing treatment.
Next Topics
- Maximum likelihood estimation: the standard estimator for Tweedie GLMs; profile likelihood over is the standard approach.
- Common probability distributions: the Normal, Poisson, and Gamma members of the Tweedie family.
Last reviewed: May 12, 2026
Canonical graph
Required before and derived from this topic
These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.
Required prerequisites
3- Common Probability Distributionslayer 0A · tier 1
- Maximum Likelihood Estimation: Theory, Information Identity, and Asymptotic Efficiencylayer 0B · tier 1
- Sufficient Statistics and Exponential Familieslayer 0B · tier 2
Derived topics
0No published topic currently declares this as a prerequisite.