Decision Theory
Prospect Theory
How people actually make decisions under risk. Kahneman and Tversky's model: reference dependence, loss aversion, probability weighting, and why expected utility fails as a descriptive theory.
Prerequisites
Why This Matters
Expected utility theory (EUT) prescribes how a rational agent should decide. Prospect theory describes how people actually decide. The gap between these is large and systematic. People overweight small probabilities, are more sensitive to losses than to gains, and evaluate outcomes relative to a reference point rather than in absolute terms.
For ML, this matters in at least three ways. First, RLHF relies on human preference judgments, and those judgments follow prospect theory, not EUT. Second, any system that presents risk information to users (medical AI, financial models) must account for how users perceive probabilities. Third, reward function design in RL implicitly assumes a value function shape; prospect theory shows why symmetric reward functions may not match human preferences.
The Failures of Expected Utility
Before building prospect theory, we need to see why EUT fails as a descriptive model.
Allais Paradox
Consider two choice problems:
Problem 1: Choose between A (5M with probability 0.10, 0 with probability 0.01).
Problem 2: Choose between C (0 with probability 0.89) and D (0 with probability 0.90).
Most people choose A over B and D over C. Under EUT with any utility function, choosing A over B implies choosing C over D (by the independence axiom). The observed pattern violates EUT.
The Allais paradox reveals the certainty effect: people overweight outcomes that are certain relative to outcomes that are merely probable. Moving from certainty (100%) to near-certainty (99%) feels much worse than moving from 11% to 10%, even though the probability difference is the same.
Reflection Effect
The risk preferences observed for gains reverse for losses. People are risk-averse for gains (preferring a sure 1000) but risk-seeking for losses (preferring a 50/50 chance of losing 500). Under EUT with a concave utility function, the agent should be risk-averse everywhere.
The Prospect Theory Model
Kahneman and Tversky (1979) proposed a two-phase model:
Phase 1: Editing. The decision-maker frames outcomes as gains or losses relative to a reference point, simplifies dominated options, and isolates shared components. This phase is informal and context-dependent.
Phase 2: Evaluation. The edited prospects are evaluated using a value function and a probability weighting function , and the prospect with the highest weighted value is chosen.
For a simple prospect yielding with probability and with probability (with and outcome otherwise):
Prospect Theory Value Function Properties
Statement
The value function in prospect theory has three key properties:
- Reference dependence: . Outcomes are evaluated as deviations from a reference point, not as final wealth levels.
- Diminishing sensitivity: is concave for gains ( for ) and convex for losses ( for ).
- Loss aversion: is steeper for losses than for gains. for all . The loss aversion coefficient is typically around 2.25 empirically.
A commonly used parametric form is:
with , , and .
Intuition
The concavity for gains and convexity for losses follows from diminishing sensitivity. The difference between 20 feels larger than the difference between 1020, whether these are gains or losses. Loss aversion is a separate phenomenon: losing 100 feels good. The asymmetry is roughly 2:1.
Why It Matters
This value function explains the reflection effect (risk aversion for gains, risk seeking for losses), the endowment effect (people demand more to give up an object than they would pay to acquire it), and the status quo bias (losses from switching loom larger than gains). These are not irrational quirks; they are systematic patterns that any model of human preferences must accommodate.
Failure Mode
The reference point is not always well-defined. In complex multi-attribute decisions, what counts as "the reference point" can change with framing. The parametric form with is a simplification; individual heterogeneity is large. Loss aversion may also be domain-dependent, with different coefficients for money, health, and time.
Probability Weighting
Probability Weighting Function
The probability weighting function transforms objective probabilities into decision weights. It satisfies and but is not linear. Key properties:
- Overweighting of small probabilities: for small . This explains why people buy lottery tickets (overweight the small chance of winning) and insurance (overweight the small chance of a catastrophe).
- Underweighting of moderate to large probabilities: for moderate to large .
- Subcertainty: for . Decision weights do not sum to 1.
A common parametric form (Prelec, 1998):
with estimated empirically.
The weighting function is concave near 0 (overweighting) and convex near 1 (underweighting), producing an inverse-S shape. This single function explains both gambling (overweight rare large gains) and insurance (overweight rare large losses).
Cumulative Prospect Theory
Original prospect theory (1979) had a problem: it could violate stochastic dominance. If prospect yields outcomes at least as good as prospect in every state, a rational agent should prefer . But the probability weighting in the original model could reverse this.
Cumulative Prospect Theory
Cumulative prospect theory (Tversky and Kahneman, 1992) fixes this by applying the weighting function to cumulative probabilities rather than individual probabilities. For a prospect with outcomes occurring with probabilities , separate gains and losses:
For gains (), the decision weight is:
For losses (), the decision weight is:
The overall value is .
Cumulative prospect theory preserves stochastic dominance while retaining the key empirical predictions: loss aversion, diminishing sensitivity, and probability weighting. It allows different weighting functions and for gains and losses.
The Fourfold Pattern of Risk Attitudes
Prospect theory predicts a distinctive pattern:
| High probability | Low probability | |
|---|---|---|
| Gains | Risk averse | Risk seeking |
| Losses | Risk seeking | Risk averse |
- High-probability gains: Diminishing sensitivity makes the value function concave; risk aversion.
- Low-probability gains: Overweighting of small probabilities dominates; risk seeking (lottery tickets).
- High-probability losses: Diminishing sensitivity makes the value function convex; risk seeking (gambling to avoid a near-certain loss).
- Low-probability losses: Overweighting of small probabilities dominates; risk aversion (insurance).
EUT with a fixed utility function cannot produce all four patterns simultaneously.
Connections to ML
RLHF and reward modeling. Human evaluators in RLHF express preferences between model outputs. These preferences are typically modeled as following a Bradley-Terry model, which assumes a consistent utility scale. But prospect theory predicts that human preferences are reference-dependent, loss-averse, and sensitive to framing. A model output that is a clear improvement over the current response may be judged differently depending on whether the evaluator frames the comparison as "how much better is A?" versus "what is wrong with B?"
Calibration and risk communication. ML models that output calibrated probabilities assume users interpret probabilities linearly. Prospect theory says they do not. A model reporting 2% cancer risk will be perceived as more alarming than a linear model would predict, because users overweight small probabilities.
Reward shaping in RL. If the reward function is symmetric around zero, prospect theory predicts that an agent trained to match human preferences should weight negative rewards more heavily than positive ones. The optimal reward shaping for human-aligned behavior may need a loss-aversion coefficient.
Common Confusions
Prospect theory is descriptive, not normative
EUT says what you should do given rationality axioms. Prospect theory says what you will do given cognitive biases. A well-calibrated Bayesian should not follow prospect theory. But if you are modeling human behavior, predicting human preferences, or designing interfaces for human users, prospect theory is the better model.
Loss aversion is not risk aversion
Risk aversion (concave utility for gains) and loss aversion (steeper value for losses than gains) are different phenomena. A risk-neutral agent can still be loss-averse. Risk aversion is about the curvature of the value function; loss aversion is about the slope difference at the reference point. Prospect theory includes both.
The reference point is not always the status quo
The reference point can be an expectation, a goal, or a social comparison. If you expect a 4000, the reference point is 1000 loss, even though you received $4000. The reference point is context-dependent and can be manipulated by framing.
Exercises
Problem
Under the prospect theory value function for and for , compute , , and the ratio . Verify that the ratio equals the loss aversion coefficient.
Problem
The Allais paradox: A decision-maker with utility function (EUT with square-root utility) is presented with the two choice problems from the Allais paradox. Show that if the decision-maker prefers A over B in Problem 1, then EUT requires preferring C over D in Problem 2. What does this imply about the observed pattern (A over B and D over C)?
Problem
Show that the probability weighting function (Tversky-Kahneman form) satisfies , , and has an inverse-S shape (overweights small , underweights large ) for . Find the fixed point where .
Problem
Consider an RLHF setting where human evaluators compare pairs of model outputs. The standard Bradley-Terry model assumes where is a reward function and is the logistic function. Propose a modification that incorporates prospect-theoretic features (reference dependence and loss aversion). What would the training objective look like, and what empirical signatures would distinguish it from the standard model?
References
Original papers:
- Kahneman and Tversky, "Prospect Theory: An Analysis of Decision under Risk," Econometrica 47(2), 1979, pp. 263-291
- Tversky and Kahneman, "Advances in Prospect Theory: Cumulative Representation of Uncertainty," Journal of Risk and Uncertainty 5(4), 1992, pp. 297-323
Probability weighting:
- Prelec, "The Probability Weighting Function," Econometrica 66(3), 1998, pp. 497-527
Textbook treatments:
- Wakker, Prospect Theory: For Risk and Ambiguity (2010), Chapters 3-9
- Kahneman, Thinking, Fast and Slow (2011), Chapters 25-29
- Mas-Colell, Whinston, and Green, Microeconomic Theory (1995), Chapter 6
Next Topics
Prospect theory connects to several directions:
- Decision theory foundations: the normative framework that prospect theory departs from
- Game theory: strategic interaction where agents may follow prospect-theoretic rather than EUT preferences
Last reviewed: April 2026
Prerequisites
Foundations this topic depends on.
- Expected Utility TheoryLayer 2
- Common Probability DistributionsLayer 0A
- Sets, Functions, and RelationsLayer 0A
- Basic Logic and Proof TechniquesLayer 0A
- Convex Optimization BasicsLayer 1
- Differentiation in RnLayer 0A
- Matrix Operations and PropertiesLayer 0A