Foundations
Scale, Location, and Shape Parameters
Three roles a parameter can play in a distribution family: location shifts the support, scale stretches it, and shape changes the form. Conventions vary by source (rate vs scale, especially in Exponential and Gamma), and the group structure of location-scale families is what makes standardization and pivot quantities work.
Why This Matters
A reader who has met the Normal distribution as has already met all three parameter roles, even if no textbook labeled them. The Normal has a location parameter that slides the bell along the real line, a scale parameter that stretches it, and (uniquely among distributions) no shape parameter. The Gamma family adds the third role: is a shape parameter, and changing it does something neither shifting nor stretching can reproduce.
Naming the roles matters for two reasons. First, the same one-letter Greek symbol means different things in different books. Casella and Berger write the Exponential as where is the scale (mean , density ). Klugman, Panjer, and Willmot, and most actuarial sources, write the same distribution but with for the scale. A statistics-track student reading a textbook that uses for the rate ( with density ) sees the same family with . Errors creep in when rate and scale are conflated. Second, the group structure of location-scale families is what makes standardization behave like a universal recipe, and what makes pivot statistics and t-tests work.
Quick Version
| Role | Effect on the density | Example |
|---|---|---|
| Location | Shifts the density horizontally: | Normal mean, Cauchy median, Uniform midpoint |
| Scale | Stretches the density: | Normal SD, Exponential scale, Cauchy spread |
| Shape | Changes the form, not just location or scale | Gamma , Weibull , Pareto , degrees of freedom |
A distribution can have any subset. The Exponential has scale only. The Cauchy has location and scale. The Gamma and Weibull have shape and scale. The Generalized Pareto has all three.
Core Definitions
Location Parameter
A parameter is a location parameter of a family iff for some base density . Equivalently, if has density , then has density .
The CDF translates: . Quantiles shift by . The mean (when it exists) shifts by ; the variance, skewness, and kurtosis are unchanged.
Scale Parameter
A parameter is a scale parameter of a family iff for some base density . Equivalently, if has density , then has density .
The CDF stretches: . The mean and median (when finite) scale by ; the variance scales by . The standardized moments (skewness, kurtosis) are unchanged because they are scale-invariant by construction.
Shape Parameter
A parameter is a shape parameter iff changing it alters the family in a way that cannot be reproduced by any combination of location shift and scale rescaling. Two members of a family with different shape values are not affine transformations of each other.
Standardized moments (skewness, kurtosis, higher cumulant ratios) are functions of shape only. The Gamma family with shape has skewness regardless of scale; the Student-t with degrees of freedom has kurtosis for , also a shape-only function.
Location-Scale Family
A two-parameter family is a location-scale family generated by base density iff Equivalently, if , then has density .
Examples: Normal, Cauchy, Uniform on (parameterized as midpoint half-width), Logistic, Laplace, Student-t with fixed . The Exponential is scale only (no location shift keeps the support on ). The Gamma is shape-scale, not location-scale.
The Group Structure
Location-scale families are closed under affine transformations of the random variable. This is what makes standardization universal.
Affine Closure of Location-Scale Families
Statement
Let where . For any and , the affine transformation satisfies . The family is therefore closed under positive affine maps.
In particular, the standardized variable has density and does not depend on or .
Intuition
Shifting and stretching a member of a location-scale family produces another member of the same family. The standardized base density is a fixed point of this group action: every member of the family is a translate-and-stretch of . This is what licenses the universal recipe "standardize first, then look up the tail probability in a table."
Proof Sketch
Change of variables. If , then and . So the density of is This is .
Why It Matters
Standardization produces a pivot quantity: a function of data and parameters whose distribution does not depend on the parameters. Pivots are what confidence intervals are built from. The classical t-statistic is a pivot for the location-scale Normal family. The same construction does not work for shape families (Gamma, Weibull) because there is no fixed base member to standardize against.
Failure Mode
The closure property requires the affine map to be positive (). For symmetric base densities (), a sign flip still keeps you in the family. For asymmetric base densities (like a skewed Logistic), a sign flip lands you outside.
Parameter Conventions: Rate vs Scale
The single most common source of confusion. Different texts parameterize the same distribution differently, and the parameter symbol can mean either the rate or its reciprocal.
| Distribution | Rate convention | Scale convention | Relation |
|---|---|---|---|
| Exponential | , density , mean | or , density , mean | |
| Gamma | rate, mean | shape-scale, mean | , |
| Inverse Gamma | rate | scale | |
| Weibull | (rare) | shape-scale, density | scale convention dominates |
Rate and scale are reciprocals. Read the density before you trust the symbol.
A textbook that writes "let " is using either rate or scale depending on the book. Casella-Berger uses scale with mean . Klugman uses scale with mean . Many introductory statistics texts use rate with mean . The density tells you which: if the parameter appears as a coefficient out front, it is the rate; if it appears as the divisor inside the exponent, it is the scale.
Inverse Gamma scale is not the inverse of Gamma scale
The inverse-Gamma distribution is the distribution of where . The scale parameter of is the scale of the reciprocated variable, not the reciprocal of the Gamma scale. Easy to misread.
Shape-Only Quantities
Standardized moments are functions of shape parameters only. Two members of the same family with different location or scale share the same skewness, kurtosis, and higher standardized cumulants.
| Distribution | Skewness | Excess kurtosis | Notes |
|---|---|---|---|
| Normal | Shape-free; all higher cumulants vanish | ||
| Exponential | Scale-only family; skewness and kurtosis are constants | ||
| Gamma | Both depend on shape only | ||
| Student- | for | for | Shape is ; location-scale extensions add |
| Weibull | function of | function of | Both depend on shape only |
This is the practical version of the rule "shape parameters carry the genuinely new information." Once you have computed a standardized moment, you have computed a function of shape.
The Cauchy Case
The Cauchy distribution is a location-scale family. It has location (the median, since the mean does not exist) and scale (the half-width at half-maximum, since the variance does not exist). Standardization produces a standard Cauchy with density .
The lesson: location and scale parameters can exist even when the corresponding moment (mean for location, variance for scale) does not. Treat them as parameters of the family, not as moments of the variable.
Examples by Distribution
- Normal : location , scale , no shape.
- Exponential : scale , no location, no shape. Constant skewness and excess kurtosis .
- Gamma : shape , scale , no location. Setting recovers Exponential. Sum of i.i.d. when is a positive integer.
- Weibull : shape , scale . Setting recovers Exponential; recovers Rayleigh.
- Lognormal: location and scale parameters live on the log scale ( are the mean and SD of , not of ). Sometimes called "log-location" and "log-scale" to avoid confusion.
- Pareto : scale (the minimum), shape (tail index). Heavy tail controlled by shape; -th moment exists iff .
- Generalized Pareto : location , scale , shape . The shape controls tail behavior: is exponential tail, is heavy tail, has a bounded right endpoint.
- Student-: shape (degrees of freedom). Location-scale extension adds the two remaining roles; controls tail weight.
- Cauchy : location , scale . The case of Student-.
Common Confusions
Scale is not standard deviation in general
The Normal is the special case where the scale parameter equals the standard deviation. For most other distributions, the scale parameter does not equal the SD. The Exponential with scale has SD (here they happen to coincide), but the Gamma has SD — the SD is the scale times a shape-dependent constant. For the Cauchy, the SD does not exist at all; the scale parameter is a half-width, not a standard deviation.
Standardization works for location-scale families, not shape families
Subtracting the mean and dividing by the SD always produces a variable with mean 0 and variance 1, provided both exist. But the distribution of the standardized variable depends on the original shape parameter unless the family is location-scale. Standardizing two different Gamma samples with different gives two different distributions with mean 0 and variance 1; that is why pivot quantities and tabulated tail tables work for the Normal but not for the Gamma.
A parameter symbol does not commit to a role
Greek letters carry no semantic load. The same can be the location for one distribution, the scale for another, the shape for a third. Resolve the role by inspecting how the parameter enters the density, not by assuming the name.
Exercises
Problem
Let in the rate parameterization, so the density is for . Show that has the density of a standard exponential ( in the rate parameterization). Then express the same in terms of as the standardized version of in the scale parameterization.
Problem
The Gamma distribution (shape-scale parameterization) has mean and variance . Compute the coefficient of variation . Which parameter does it depend on?
Problem
Suppose with location and scale . Show that has the standard Cauchy density . Then show that the sample mean of i.i.d. Cauchy variables is distributed as — the scale does not shrink with .
References
Canonical:
- Casella & Berger, Statistical Inference (2nd ed., 2002), Chapter 3.5 (location-scale families)
- Lehmann & Casella, Theory of Point Estimation (2nd ed., 1998), Chapter 3 (invariance and equivariance under location-scale groups)
Current:
- Klugman, Panjer & Willmot, Loss Models: From Data to Decisions (5th ed., 2019), Chapters 5-6 (actuarial scale convention and parameter inventory)
- Johnson, Kotz & Balakrishnan, Continuous Univariate Distributions, Volume 1 (2nd ed., 1994), Chapters 13, 17, 19 (Exponential, Gamma, Weibull parameterization conventions)
Next Topics
- Moment generating functions: how location and scale enter the MGF cleanly via
- Method of moments: estimating location, scale, and shape from sample moments
- Multivariate distributions atlas: how the location-scale framework generalizes to vector and matrix valued laws
Last reviewed: May 13, 2026
Canonical graph
Required before and derived from this topic
These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.
Required prerequisites
2- Common Probability Distributionslayer 0A · tier 1
- Expectation, Variance, Covariance, and Momentslayer 0A · tier 1
Derived topics
0No published topic currently declares this as a prerequisite.