Foundations
Weibull Distribution
The Weibull distribution is the parametric workmate of reliability and survival analysis. Two parameters: a shape k and a scale lambda. The hazard rate is monotone in time, increasing when k > 1 (wear-out), constant when k = 1 (Exponential, memoryless), decreasing when k < 1 (early-failure / infant-mortality). Mean is lambda Gamma(1 + 1/k); variance is lambda squared times (Gamma(1 + 2/k) minus Gamma(1 + 1/k) squared). MLE for k has no closed form and is solved by Newton-Raphson on the profile score equation. Applications: component lifetimes, time to event in clinical trials, wind speeds, extreme-value Type III.
Prerequisites
Plain-Language Definition
The Weibull distribution is the working parametric model for the time until an event when the hazard rate is not constant. The hazard rate of a positive random variable is the conditional instantaneous failure rate given survival to time . For an Exponential, the hazard is constant; for many real-world lifetimes, the hazard increases with age (wear-out) or decreases with age (early-life failures dropping off as defective units fail and are replaced).
The Weibull is built around a hazard rate that is a power of . The shape parameter controls the power, and therefore the trend of the hazard. The scale parameter rescales the time axis. Together they give a two-parameter family flexible enough for reliability engineering, survival analysis in medicine, and wind-speed modeling, while remaining easy to fit and interpret.
Definition
Weibull Distribution
A positive random variable has a Weibull distribution with shape and scale when its density is
equivalently, its survival function is .
The hazard function equals , a power of with exponent .
The parameter names vary across references. Some texts use for the shape and for the scale; some use and . The shape is the parameter that controls the hazard trend, and is what most readers will care about; the scale just sets the time units.
Why This Matters
Three reasons the Weibull is the most-used parametric lifetime model.
-
Hazard flexibility. A constant hazard is a strong assumption that few real-world systems satisfy. Mechanical components have an early-life region where defective units fail (decreasing hazard) and a wear-out region where surviving units accumulate damage (increasing hazard). The Weibull spans the full range of monotone hazards with one parameter.
-
Extreme-value connection. The Weibull is the Type III extreme-value distribution: the limiting law for the minimum of iid samples from a distribution bounded below, after suitable centering and scaling. This is the formal reason wind speeds and failure times often follow Weibulls; both are minima of many local failures and can be characterized via the Fisher-Tippett-Gnedenko theorem in extreme-value theory.
-
Reliability engineering practice. The Weibull is the default fit in reliability software (Minitab, Reliasoft, R's
survreg). Engineers can interpret the shape parameter directly as "infant mortality" (), "random failures" (), or "wear-out" (). The Weibull also generalizes the Exponential and the Rayleigh as special cases, so the same workflow covers all three.
Survival, Mean, Variance
Weibull Survival Function and Moments
Statement
All moments exist. The first two are
Intuition
The integral that defines the -th moment is . The substitution converts it to . All Weibull moments are finite because the Gamma function is finite at every positive argument and the substitution removes the power-law factor cleanly.
Proof Sketch
For the survival function, integrate the density: with substitution giving , so the integral evaluates to .
For the -th moment, . Substitute to obtain .
Why It Matters
Both moments require the Gamma function, which is a closed-form quantity only in the integer-shape case. For (Exponential), and , . For (Rayleigh), and . For other , the moments are computed numerically from a Gamma routine.
Failure Mode
Statistical software is split on the scale parameterization. The Wikipedia / R pweibull convention uses as defined here. The reliability-engineering convention sometimes uses (same thing) but with the survival function written as and the density formula modified accordingly. SciPy's weibull_min takes shape and scale and matches the convention used here. Check the docstring before fitting.
Hazard Function and Its Three Regimes
Weibull Hazard Is Monotone, with Shape Setting the Trend
Statement
The hazard function is
- strictly decreasing in when ("infant mortality" or early-failure regime);
- constant at when (the Exponential / memoryless case);
- strictly increasing in when ("wear-out" or aging regime).
Intuition
The hazard is the failure rate conditional on survival. The Weibull hazard is a monotone power of time. When , the power is and the hazard is flat. When exceeds , the hazard grows; when is below , it falls. The Weibull is the simplest parametric family that lets you fit a monotone hazard with one parameter.
Proof Sketch
Direct from the formula . The derivative with respect to is , which is positive when and negative when . Setting gives , a constant.
Why It Matters
The shape parameter is the most interpretable quantity in a Weibull fit. A reliability engineer who sees on a fitted Weibull immediately knows the failure mode is wear-out, not random. An suggests early-failure issues that burn-in testing can mitigate. The numerical scale only sets the time axis; the qualitative regime comes from .
Failure Mode
The Weibull cannot fit a hazard that goes up then down (a "bathtub curve" without monotonicity inside each segment) with one set of parameters. Such data needs a mixture of Weibulls or a parametric extension (Generalized Gamma, Burr, or a competing-risks model). Forcing a single Weibull on bathtub data gives a poor fit and an uninterpretable .
Worked Example: Component Lifetime
A manufacturer fits a Weibull to time-to-failure data on a sealed bearing. The MLE gives and hours.
The fit interpretation: indicates a wear-out regime, consistent with the physical picture that the bearing accumulates damage over time and fails more readily after extended use.
Key quantities:
- hours.
- ; hours.
- Median: hours.
- 10th percentile (reliability ): hours.
The 10 percent of bearings fail before about 3267 hours. The median is 6928 hours. The mean is 7099 hours, only slightly above the median because the right skew of a Weibull is modest.
Compare to the Exponential fit (assuming constant hazard, which gives ). For the same mean of hours, and 10 percent fail before hours. The Exponential dramatically overstates early failures and understates the median lifetime. The Weibull fit captures the wear-out behavior that the Exponential cannot.
Common Misconceptions
The Weibull is not memoryless
The memoryless property is unique to the Exponential among continuous distributions on the half-line. The Weibull with is explicitly age-dependent: the conditional probability of further survival depends on how long the system has already survived. This is the whole point of using a Weibull instead of an Exponential.
The shape k is not the mean or the median
is dimensionless and controls the hazard trend. The scale has units of time (or whatever the random variable measures). depends on both. A larger at fixed gives a slightly smaller mean (because decreases towards as grows past 1). The two parameters are not interchangeable.
Maximum likelihood for k has no closed form
The Weibull log-likelihood is concave in but the score equation for is transcendental. Solving requires a one-dimensional numerical solver (Newton-Raphson is standard). The closed form exists for once is fixed, which is why the standard algorithm profiles out and solves for alone.
A bathtub curve is not a single Weibull
A "bathtub" hazard rate (high early, low middle, high late) is a common reliability-textbook picture, but no single Weibull produces it. The Weibull hazard is monotone. A bathtub curve is typically modeled as a mixture or a competing-risks decomposition. Fitting a single Weibull to bathtub data and reading off "the " is meaningless: the algorithm will fit the dominant tail and ignore the others.
Comparison: Weibull vs Exponential vs Gamma vs Lognormal
The four are the standard nonnegative lifetime distributions in applied work, and they differ on the hazard shape and the tail.
- Exponential. Constant hazard. Memoryless. Tail decays as . Only one parameter.
- Weibull. Monotone hazard. Tail decays as , faster than Exponential if , slower if . Two parameters; recovers Exponential.
- Gamma. Initially-increasing-then-leveling hazard for shape ; saturates at rather than rising indefinitely. Tail decays as times a polynomial. Two parameters.
- Lognormal. Non-monotone hazard (rises and then falls). Tail decays sub-exponentially, heavier than Gamma or Weibull with .
Diagnostics: a log-log plot of against is linear for a Weibull (the slope is and the intercept involves ). A curved version of the same plot suggests Gamma or Lognormal. A plot of empirical hazard against time shows whether the hazard is monotone (Weibull or Gamma) or non-monotone (Lognormal); whether monotone hazards trend up (wear-out, ) or down (early-life, ); and whether the hazard ever flattens (Exponential).
For loss-modeling and severity-fitting applications, see ActuaryPath's Weibull page at https://www.actuarypath.com/concepts/weibull-distribution/.
Maximum-Likelihood Estimation
The log-likelihood from an iid sample is
Setting the score gives as a function of . Substituting into the score for produces a one-dimensional equation
which is solved by Newton-Raphson. The MLE is consistent, asymptotically Normal, and asymptotically efficient under standard regularity. Censored-data extensions (right censoring is the common case in reliability and survival data) require including censored observations through the survival function in the likelihood rather than the density.
Exercises
Problem
A battery lifetime is modeled as hours. Find the survival function at and , the median lifetime, and the mean lifetime.
Problem
A patient cohort in a clinical trial has time-to-event modeled as months. Compute the hazard rate at 6 months and 24 months, and explain what the shape parameter implies clinically.
Problem
Wind speeds at a hub-height anemometer are modeled as m/s (the case is the Rayleigh distribution). Find the median, the mean, the 95th percentile, and the probability the wind exceeds 15 m/s.
Problem
Show that is Exponential with rate 1 when .
Problem
For an iid sample from with known shape , show that the MLE of is and derive its asymptotic distribution.
Problem
A reliability dataset of failure times has . Suppose the Weibull MLE for the shape parameter has been computed numerically as . Find .
Extensions
The Weibull is the starting point for several richer parametric families used in reliability and survival work.
- Three-parameter Weibull. Adds a location (shift) parameter : . Used when failure times have a known minimum (a guaranteed warranty period, or a manufacturer-specified minimum life).
- Generalized Gamma. Adds a third parameter to interpolate between Weibull, Gamma, and Lognormal. Used when the shape diagnostic does not clearly favor any one of the three.
- Weibull mixtures and competing risks. Two or more Weibulls combined by a mixing distribution; produces bathtub hazard curves and other non-monotone hazards.
Each extension keeps the Weibull as a special case and adds flexibility when the two-parameter Weibull is rejected by a goodness-of-fit test.
References
- Casella, G., and Berger, R. L. (2002). Statistical Inference, 2nd ed., Duxbury. Section 3.3 includes the Weibull as a transformation of the Exponential and gives the moment formulas via the Gamma function.
- Blitzstein, J. K., and Hwang, J. (2019). Introduction to Probability, 2nd ed., Chapman and Hall / CRC. Chapter 5 covers the Weibull alongside the other lifetime distributions, with the Rayleigh case worked out.
- For the actuarial / severity-modeling perspective on the Weibull, see ActuaryPath's Weibull page at https://www.actuarypath.com/concepts/weibull-distribution/ and Klugman, Panjer, Willmot (2019), Loss Models, 5th ed., Wiley, Chapter 5.
- For the survival-analysis perspective with censored data, see Klein, J. P., and Moeschberger, M. L. (2003), Survival Analysis: Techniques for Censored and Truncated Data, 2nd ed., Springer. Chapter 12 covers Weibull regression with right censoring.
Last reviewed: May 12, 2026
Canonical graph
Required before and derived from this topic
These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.
Required prerequisites
4- Common Probability Distributionslayer 0A · tier 1
- Distributions Atlaslayer 0A · tier 1
- Exponential Distributionlayer 0A · tier 1
- Gamma Distributionlayer 0A · tier 1
Derived topics
0No published topic currently declares this as a prerequisite.