Skip to main content

Foundations

Weibull Distribution

The Weibull distribution is the parametric workmate of reliability and survival analysis. Two parameters: a shape k and a scale lambda. The hazard rate is monotone in time, increasing when k > 1 (wear-out), constant when k = 1 (Exponential, memoryless), decreasing when k < 1 (early-failure / infant-mortality). Mean is lambda Gamma(1 + 1/k); variance is lambda squared times (Gamma(1 + 2/k) minus Gamma(1 + 1/k) squared). MLE for k has no closed form and is solved by Newton-Raphson on the profile score equation. Applications: component lifetimes, time to event in clinical trials, wind speeds, extreme-value Type III.

AdvancedAdvancedTier 2StableSupporting~45 min
For:StatsActuarial

Plain-Language Definition

The Weibull distribution is the working parametric model for the time until an event when the hazard rate is not constant. The hazard rate of a positive random variable is the conditional instantaneous failure rate given survival to time xx. For an Exponential, the hazard is constant; for many real-world lifetimes, the hazard increases with age (wear-out) or decreases with age (early-life failures dropping off as defective units fail and are replaced).

The Weibull is built around a hazard rate that is a power of xx. The shape parameter kk controls the power, and therefore the trend of the hazard. The scale parameter λ\lambda rescales the time axis. Together they give a two-parameter family flexible enough for reliability engineering, survival analysis in medicine, and wind-speed modeling, while remaining easy to fit and interpret.

Definition

Definition

Weibull Distribution

A positive random variable XX has a Weibull distribution with shape k>0k > 0 and scale λ>0\lambda > 0 when its density is

fX(x)=kλ(xλ)k1exp ⁣((xλ)k),x0,f_X(x) = \frac{k}{\lambda}\left(\frac{x}{\lambda}\right)^{k - 1} \exp\!\left(-\left(\frac{x}{\lambda}\right)^k\right), \quad x \geq 0,

equivalently, its survival function is SX(x)=P(X>x)=exp((x/λ)k)S_X(x) = \mathbb{P}(X > x) = \exp(-(x/\lambda)^k).

The hazard function h(x)=fX(x)/SX(x)h(x) = f_X(x) / S_X(x) equals (k/λ)(x/λ)k1(k/\lambda)(x/\lambda)^{k - 1}, a power of xx with exponent k1k - 1.

The parameter names vary across references. Some texts use β\beta for the shape and η\eta for the scale; some use aa and bb. The shape is the parameter that controls the hazard trend, and is what most readers will care about; the scale just sets the time units.

Why This Matters

Three reasons the Weibull is the most-used parametric lifetime model.

  1. Hazard flexibility. A constant hazard is a strong assumption that few real-world systems satisfy. Mechanical components have an early-life region where defective units fail (decreasing hazard) and a wear-out region where surviving units accumulate damage (increasing hazard). The Weibull spans the full range of monotone hazards with one parameter.

  2. Extreme-value connection. The Weibull is the Type III extreme-value distribution: the limiting law for the minimum of iid samples from a distribution bounded below, after suitable centering and scaling. This is the formal reason wind speeds and failure times often follow Weibulls; both are minima of many local failures and can be characterized via the Fisher-Tippett-Gnedenko theorem in extreme-value theory.

  3. Reliability engineering practice. The Weibull is the default fit in reliability software (Minitab, Reliasoft, R's survreg). Engineers can interpret the shape parameter directly as "infant mortality" (k<1k < 1), "random failures" (k=1k = 1), or "wear-out" (k>1k > 1). The Weibull also generalizes the Exponential and the Rayleigh as special cases, so the same workflow covers all three.

Survival, Mean, Variance

Theorem

Weibull Survival Function and Moments

Statement

P(X>x)=e(x/λ)k,P(Xx)=1e(x/λ)k,x0.\mathbb{P}(X > x) = e^{-(x/\lambda)^k}, \quad \mathbb{P}(X \leq x) = 1 - e^{-(x/\lambda)^k}, \quad x \geq 0.

All moments exist. The first two are E[X]=λΓ ⁣(1+1k),Var(X)=λ2[Γ ⁣(1+2k)Γ ⁣(1+1k)2].\mathbb{E}[X] = \lambda\, \Gamma\!\left(1 + \frac{1}{k}\right), \qquad \operatorname{Var}(X) = \lambda^2 \left[\Gamma\!\left(1 + \frac{2}{k}\right) - \Gamma\!\left(1 + \frac{1}{k}\right)^2\right].

Intuition

The integral that defines the mm-th moment is 0xm(k/λ)(x/λ)k1e(x/λ)kdx\int_0^\infty x^m \cdot (k/\lambda)(x/\lambda)^{k - 1} e^{-(x/\lambda)^k}\, dx. The substitution u=(x/λ)ku = (x/\lambda)^k converts it to λm0um/keudu=λmΓ(1+m/k)\lambda^m \int_0^\infty u^{m/k} e^{-u}\, du = \lambda^m \Gamma(1 + m/k). All Weibull moments are finite because the Gamma function is finite at every positive argument and the substitution removes the power-law factor cleanly.

Proof Sketch

For the survival function, integrate the density: P(X>x)=xfX(t)dt\mathbb{P}(X > x) = \int_x^\infty f_X(t)\, dt with substitution u=(t/λ)ku = (t/\lambda)^k giving du=(k/λ)(t/λ)k1dtdu = (k/\lambda)(t/\lambda)^{k-1}dt, so the integral evaluates to (x/λ)keudu=e(x/λ)k\int_{(x/\lambda)^k}^\infty e^{-u}\, du = e^{-(x/\lambda)^k}.

For the mm-th moment, E[Xm]=0mxm1P(X>x)dx=0mxm1e(x/λ)kdx\mathbb{E}[X^m] = \int_0^\infty m x^{m-1} \mathbb{P}(X > x)\, dx = \int_0^\infty m x^{m-1} e^{-(x/\lambda)^k}\, dx. Substitute u=(x/λ)ku = (x/\lambda)^k to obtain λmΓ(1+m/k)\lambda^m \Gamma(1 + m/k).

Why It Matters

Both moments require the Gamma function, which is a closed-form quantity only in the integer-shape case. For k=1k = 1 (Exponential), Γ(2)=1\Gamma(2) = 1 and E[X]=λ\mathbb{E}[X] = \lambda, Var(X)=λ2\operatorname{Var}(X) = \lambda^2. For k=2k = 2 (Rayleigh), E[X]=λπ/2\mathbb{E}[X] = \lambda \sqrt{\pi}/2 and Var(X)=λ2(1π/4)\operatorname{Var}(X) = \lambda^2(1 - \pi/4). For other kk, the moments are computed numerically from a Gamma routine.

Failure Mode

Statistical software is split on the scale parameterization. The Wikipedia / R pweibull convention uses λ\lambda as defined here. The reliability-engineering convention sometimes uses η=λ\eta = \lambda (same thing) but with the survival function written as e(x/η)ke^{-(x/\eta)^k} and the density formula modified accordingly. SciPy's weibull_min takes shape and scale and matches the convention used here. Check the docstring before fitting.

Hazard Function and Its Three Regimes

Theorem

Weibull Hazard Is Monotone, with Shape Setting the Trend

Statement

The hazard function h(x)=fX(x)/SX(x)=(k/λ)(x/λ)k1h(x) = f_X(x) / S_X(x) = (k / \lambda) (x / \lambda)^{k - 1} is

  • strictly decreasing in xx when 0<k<10 < k < 1 ("infant mortality" or early-failure regime);
  • constant at 1/λ1/\lambda when k=1k = 1 (the Exponential / memoryless case);
  • strictly increasing in xx when k>1k > 1 ("wear-out" or aging regime).

Intuition

The hazard is the failure rate conditional on survival. The Weibull hazard is a monotone power of time. When k=1k = 1, the power is 00 and the hazard is flat. When kk exceeds 11, the hazard grows; when kk is below 11, it falls. The Weibull is the simplest parametric family that lets you fit a monotone hazard with one parameter.

Proof Sketch

Direct from the formula h(x)=(k/λ)(x/λ)k1h(x) = (k/\lambda)(x/\lambda)^{k - 1}. The derivative with respect to xx is h(x)=(k(k1)/λ2)(x/λ)k2h'(x) = (k(k - 1)/\lambda^2)(x/\lambda)^{k - 2}, which is positive when k>1k > 1 and negative when k<1k < 1. Setting k=1k = 1 gives h(x)=1/λh(x) = 1/\lambda, a constant.

Why It Matters

The shape parameter is the most interpretable quantity in a Weibull fit. A reliability engineer who sees k=2.3k = 2.3 on a fitted Weibull immediately knows the failure mode is wear-out, not random. An k=0.8k = 0.8 suggests early-failure issues that burn-in testing can mitigate. The numerical scale λ\lambda only sets the time axis; the qualitative regime comes from kk.

Failure Mode

The Weibull cannot fit a hazard that goes up then down (a "bathtub curve" without monotonicity inside each segment) with one set of parameters. Such data needs a mixture of Weibulls or a parametric extension (Generalized Gamma, Burr, or a competing-risks model). Forcing a single Weibull on bathtub data gives a poor fit and an uninterpretable kk.

Worked Example: Component Lifetime

A manufacturer fits a Weibull to time-to-failure data on a sealed bearing. The MLE gives k^=2.5\hat k = 2.5 and λ^=8000\hat\lambda = 8000 hours.

The fit interpretation: k^>1\hat k > 1 indicates a wear-out regime, consistent with the physical picture that the bearing accumulates damage over time and fails more readily after extended use.

Key quantities:

  • E[X]=8000Γ(1+1/2.5)=8000Γ(1.4)80000.88737099\mathbb{E}[X] = 8000 \cdot \Gamma(1 + 1/2.5) = 8000 \cdot \Gamma(1.4) \approx 8000 \cdot 0.8873 \approx 7099 hours.
  • Var(X)=80002[Γ(1.8)Γ(1.4)2]6.4×107[0.93140.7873]9.2×106\operatorname{Var}(X) = 8000^2 \cdot [\Gamma(1.8) - \Gamma(1.4)^2] \approx 6.4 \times 10^7 \cdot [0.9314 - 0.7873] \approx 9.2 \times 10^6; SD(X)3036\operatorname{SD}(X) \approx 3036 hours.
  • Median: q0.5=λ(ln2)1/k=8000(0.693)0.480000.8666928q_{0.5} = \lambda \cdot (\ln 2)^{1/k} = 8000 \cdot (0.693)^{0.4} \approx 8000 \cdot 0.866 \approx 6928 hours.
  • 10th percentile (reliability R(t)=0.9R(t) = 0.9): q0.1=λ(ln(1/0.9))1/k=8000(0.1054)0.480000.4083267q_{0.1} = \lambda \cdot (\ln(1 / 0.9))^{1/k} = 8000 \cdot (0.1054)^{0.4} \approx 8000 \cdot 0.408 \approx 3267 hours.

The 10 percent of bearings fail before about 3267 hours. The median is 6928 hours. The mean is 7099 hours, only slightly above the median because the right skew of a k=2.5k = 2.5 Weibull is modest.

Compare to the Exponential fit (assuming constant hazard, which gives k=1k = 1). For the same mean of 70997099 hours, λexp=7099\lambda_{\text{exp}} = 7099 and 10 percent fail before 7099ln0.9748-7099 \cdot \ln 0.9 \approx 748 hours. The Exponential dramatically overstates early failures and understates the median lifetime. The Weibull fit captures the wear-out behavior that the Exponential cannot.

Common Misconceptions

Watch Out

The Weibull is not memoryless

The memoryless property P(X>s+tX>s)=P(X>t)\mathbb{P}(X > s + t \mid X > s) = \mathbb{P}(X > t) is unique to the Exponential among continuous distributions on the half-line. The Weibull with k1k \neq 1 is explicitly age-dependent: the conditional probability of further survival depends on how long the system has already survived. This is the whole point of using a Weibull instead of an Exponential.

Watch Out

The shape k is not the mean or the median

kk is dimensionless and controls the hazard trend. The scale λ\lambda has units of time (or whatever the random variable measures). E[X]=λΓ(1+1/k)\mathbb{E}[X] = \lambda \Gamma(1 + 1/k) depends on both. A larger kk at fixed λ\lambda gives a slightly smaller mean (because Γ(1+1/k)\Gamma(1 + 1/k) decreases towards 11 as kk grows past 1). The two parameters are not interchangeable.

Watch Out

Maximum likelihood for k has no closed form

The Weibull log-likelihood is concave in (k,λ)(k, \lambda) but the score equation for kk is transcendental. Solving requires a one-dimensional numerical solver (Newton-Raphson is standard). The closed form λ^=(1/nXik)1/k\hat \lambda = (1/n \sum X_i^k)^{1/k} exists for λ\lambda once kk is fixed, which is why the standard algorithm profiles out λ\lambda and solves for kk alone.

Watch Out

A bathtub curve is not a single Weibull

A "bathtub" hazard rate (high early, low middle, high late) is a common reliability-textbook picture, but no single Weibull produces it. The Weibull hazard is monotone. A bathtub curve is typically modeled as a mixture or a competing-risks decomposition. Fitting a single Weibull to bathtub data and reading off "the kk" is meaningless: the algorithm will fit the dominant tail and ignore the others.

Comparison: Weibull vs Exponential vs Gamma vs Lognormal

The four are the standard nonnegative lifetime distributions in applied work, and they differ on the hazard shape and the tail.

  • Exponential. Constant hazard. Memoryless. Tail decays as eλxe^{-\lambda x}. Only one parameter.
  • Weibull. Monotone hazard. Tail decays as e(x/λ)ke^{-(x/\lambda)^k}, faster than Exponential if k>1k > 1, slower if k<1k < 1. Two parameters; k=1k = 1 recovers Exponential.
  • Gamma. Initially-increasing-then-leveling hazard for shape >1> 1; saturates at 1/β1/\beta rather than rising indefinitely. Tail decays as eβxe^{-\beta x} times a polynomial. Two parameters.
  • Lognormal. Non-monotone hazard (rises and then falls). Tail decays sub-exponentially, heavier than Gamma or Weibull with k>1k > 1.

Diagnostics: a log-log plot of lnS(x)-\ln S(x) against lnx\ln x is linear for a Weibull (the slope is kk and the intercept involves λ\lambda). A curved version of the same plot suggests Gamma or Lognormal. A plot of empirical hazard against time shows whether the hazard is monotone (Weibull or Gamma) or non-monotone (Lognormal); whether monotone hazards trend up (wear-out, k>1k > 1) or down (early-life, k<1k < 1); and whether the hazard ever flattens (Exponential).

For loss-modeling and severity-fitting applications, see ActuaryPath's Weibull page at https://www.actuarypath.com/concepts/weibull-distribution/.

Maximum-Likelihood Estimation

The log-likelihood from an iid sample X1,,XnX_1, \dots, X_n is

(k,λ)=nlnknklnλ+(k1)ilnXii(Xi/λ)k.\ell(k, \lambda) = n \ln k - n k \ln \lambda + (k - 1) \sum_i \ln X_i - \sum_i (X_i / \lambda)^k.

Setting the score /λ=0\partial \ell / \partial \lambda = 0 gives λ^(k)=[(1/n)iXik]1/k\hat\lambda(k) = \left[(1/n) \sum_i X_i^k\right]^{1/k} as a function of kk. Substituting into the score for kk produces a one-dimensional equation

1k+1nilnXiiXiklnXiiXik=0,\frac{1}{k} + \frac{1}{n}\sum_i \ln X_i - \frac{\sum_i X_i^k \ln X_i}{\sum_i X_i^k} = 0,

which is solved by Newton-Raphson. The MLE is consistent, asymptotically Normal, and asymptotically efficient under standard regularity. Censored-data extensions (right censoring is the common case in reliability and survival data) require including censored observations through the survival function in the likelihood rather than the density.

Exercises

ExerciseCore

Problem

A battery lifetime is modeled as XWeibull(k=1.5,λ=1000)X \sim \operatorname{Weibull}(k = 1.5, \lambda = 1000) hours. Find the survival function at x=500x = 500 and x=1500x = 1500, the median lifetime, and the mean lifetime.

ExerciseCore

Problem

A patient cohort in a clinical trial has time-to-event modeled as XWeibull(k=0.8,λ=24)X \sim \operatorname{Weibull}(k = 0.8, \lambda = 24) months. Compute the hazard rate at 6 months and 24 months, and explain what the shape parameter implies clinically.

ExerciseCore

Problem

Wind speeds at a hub-height anemometer are modeled as XWeibull(k=2,λ=8)X \sim \operatorname{Weibull}(k = 2, \lambda = 8) m/s (the k=2k = 2 case is the Rayleigh distribution). Find the median, the mean, the 95th percentile, and the probability the wind exceeds 15 m/s.

ExerciseAdvanced

Problem

Show that Y=(X/λ)kY = (X / \lambda)^k is Exponential with rate 1 when XWeibull(k,λ)X \sim \operatorname{Weibull}(k, \lambda).

ExerciseAdvanced

Problem

For an iid sample X1,,XnX_1, \dots, X_n from Weibull(k,λ)\operatorname{Weibull}(k, \lambda) with known shape kk, show that the MLE of λ\lambda is λ^=((1/n)iXik)1/k\hat\lambda = \left((1/n) \sum_i X_i^k\right)^{1/k} and derive its asymptotic distribution.

ExerciseAdvanced

Problem

A reliability dataset of n=100n = 100 failure times has ilnXi=750\sum_i \ln X_i = 750. Suppose the Weibull MLE for the shape parameter has been computed numerically as k^=2.1\hat k = 2.1. Find λ^\hat\lambda.

Extensions

The Weibull is the starting point for several richer parametric families used in reliability and survival work.

  • Three-parameter Weibull. Adds a location (shift) parameter θ\theta: XθWeibull(k,λ)X - \theta \sim \operatorname{Weibull}(k, \lambda). Used when failure times have a known minimum (a guaranteed warranty period, or a manufacturer-specified minimum life).
  • Generalized Gamma. Adds a third parameter to interpolate between Weibull, Gamma, and Lognormal. Used when the shape diagnostic does not clearly favor any one of the three.
  • Weibull mixtures and competing risks. Two or more Weibulls combined by a mixing distribution; produces bathtub hazard curves and other non-monotone hazards.

Each extension keeps the Weibull as a special case and adds flexibility when the two-parameter Weibull is rejected by a goodness-of-fit test.

References

  • Casella, G., and Berger, R. L. (2002). Statistical Inference, 2nd ed., Duxbury. Section 3.3 includes the Weibull as a transformation of the Exponential and gives the moment formulas via the Gamma function.
  • Blitzstein, J. K., and Hwang, J. (2019). Introduction to Probability, 2nd ed., Chapman and Hall / CRC. Chapter 5 covers the Weibull alongside the other lifetime distributions, with the k=2k = 2 Rayleigh case worked out.
  • For the actuarial / severity-modeling perspective on the Weibull, see ActuaryPath's Weibull page at https://www.actuarypath.com/concepts/weibull-distribution/ and Klugman, Panjer, Willmot (2019), Loss Models, 5th ed., Wiley, Chapter 5.
  • For the survival-analysis perspective with censored data, see Klein, J. P., and Moeschberger, M. L. (2003), Survival Analysis: Techniques for Censored and Truncated Data, 2nd ed., Springer. Chapter 12 covers Weibull regression with right censoring.

Last reviewed: May 12, 2026

Canonical graph

Required before and derived from this topic

These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.

Required prerequisites

4

Derived topics

0

No published topic currently declares this as a prerequisite.