
Decision Theory

Game Theory Foundations

Strategic interaction between rational agents. Normal-form games, dominant strategies, Nash equilibrium existence, mixed strategies, and connections to minimax, mechanism design, and multi-agent RL.


Why This Matters

Any setting where multiple agents optimize simultaneously is a game. GAN training is a two-player game between the generator and discriminator. Multi-agent reinforcement learning agents play games against each other. Auction design, federated learning incentive structures, and adversarial robustness all require game-theoretic reasoning. Without this vocabulary, you cannot state what "equilibrium" means for these systems, let alone analyze convergence or efficiency.

Game theory provides the formal language for strategic interaction: what rational agents should do when their payoffs depend on the actions of others.

[Figure: Prisoner's Dilemma payoff matrix, with (Defect, Defect) marked as the Nash equilibrium and (Cooperate, Cooperate) as Pareto optimal. Dominant-strategy reasoning: for each player, Defect beats Cooperate regardless of the opponent's choice, so both defect and both end up worse off than under mutual cooperation. The Nash equilibrium (Defect, Defect) is not Pareto optimal: both players could do better by cooperating.]

Mental Model

A game specifies (1) who the players are, (2) what actions each player can take, and (3) what payoff each player receives for each combination of actions. The central question: given that every player is rational and knows the game structure, what will they do? The answer, in most cases, is a Nash equilibrium.

Normal-Form Games

Definition

Normal-Form Game

A normal-form (strategic-form) game consists of:

  • A finite set of players N = \{1, 2, \ldots, n\}
  • For each player i, a finite set of pure strategies S_i
  • For each player i, a payoff function u_i: S_1 \times S_2 \times \cdots \times S_n \to \mathbb{R}

A strategy profile is a tuple s = (s_1, \ldots, s_n) \in S = S_1 \times \cdots \times S_n. Player i receives payoff u_i(s) when profile s is played. In two-player games, payoffs are often written as a bimatrix (A, B) where A_{jk} is player 1's payoff and B_{jk} is player 2's payoff when player 1 plays row j and player 2 plays column k.

Definition

Dominant Strategy

A strategy s_i^* is a strictly dominant strategy for player i if for every alternative strategy s_i \neq s_i^* and every opponent profile s_{-i}:

u_i(s_i^*, s_{-i}) > u_i(s_i, s_{-i})

If the inequality is weak (\geq), the strategy is weakly dominant. A strictly dominant strategy is uniquely optimal regardless of what others do.

Definition

Best Response

Strategy s_i^* is a best response to the opponent profile s_{-i} if:

s_i^* \in \arg\max_{s_i \in S_i} u_i(s_i, s_{-i})

The best-response correspondence BR_i: S_{-i} \rightrightarrows S_i maps each opponent profile to the set of best responses. This correspondence is central to the existence proof for Nash equilibria.

Definition

Iterated Elimination of Dominated Strategies (IEDS)

Repeatedly remove strictly dominated strategies from every player's strategy set. The order of elimination does not affect the final result for strict dominance. If IEDS leaves a unique strategy profile, the game is dominance solvable.
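IEDS is mechanical enough to sketch in code. Below is a minimal Python sketch, not a library implementation; the names `ieds` and `profile_with` are our own, and payoffs are stored as dicts from pure strategy profiles to numbers.

```python
import itertools

def profile_with(i, s_i, opponents):
    """Build a full profile by inserting player i's strategy into a
    tuple of the other players' strategies (kept in player order)."""
    profile = list(opponents)
    profile.insert(i, s_i)
    return tuple(profile)

def ieds(payoffs, strategies):
    """Iterated elimination of strictly dominated pure strategies.

    payoffs[i] maps a pure strategy profile (tuple) to player i's
    payoff; strategies[i] is player i's list of pure strategies.
    Returns the surviving strategy sets.
    """
    strategies = [list(s) for s in strategies]
    changed = True
    while changed:
        changed = False
        for i in range(len(strategies)):
            others = [strategies[j] for j in range(len(strategies)) if j != i]
            for s in list(strategies[i]):
                # s is eliminated if some t beats it against every
                # surviving opponent profile (strict dominance)
                for t in strategies[i]:
                    if t != s and all(
                        payoffs[i][profile_with(i, t, opp)]
                        > payoffs[i][profile_with(i, s, opp)]
                        for opp in itertools.product(*others)
                    ):
                        strategies[i].remove(s)
                        changed = True
                        break
    return strategies

# Prisoner's Dilemma: Defect ("D") strictly dominates Cooperate ("C").
u1 = {("C", "C"): -1, ("C", "D"): -3, ("D", "C"): 0, ("D", "D"): -2}
u2 = {("C", "C"): -1, ("C", "D"): 0, ("D", "C"): -3, ("D", "D"): -2}
surviving = ieds([u1, u2], [["C", "D"], ["C", "D"]])
# → [['D'], ['D']]: the game is dominance solvable
```

Because only strictly dominated strategies are removed, the outcome does not depend on the order in which the eliminations happen to fire.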

Canonical Examples

Example

Prisoner's Dilemma

Two suspects choose independently to cooperate (stay silent) or defect (betray). Payoff matrix:

            Cooperate    Defect
Cooperate   (-1, -1)     (-3, 0)
Defect      (0, -3)      (-2, -2)

Defect is strictly dominant for both players: regardless of the other's choice, defecting yields a higher payoff. The unique Nash equilibrium is (Defect, Defect) with payoffs (-2, -2). This is Pareto-dominated by (Cooperate, Cooperate) at (-1, -1). Rational self-interest leads to a collectively worse outcome.

Example

Battle of the Sexes

Two players prefer to coordinate but disagree on which activity. Payoff matrix:

           Opera     Football
Opera      (3, 2)    (0, 0)
Football   (0, 0)    (2, 3)

Two pure Nash equilibria: (Opera, Opera) and (Football, Football). There is also a mixed Nash equilibrium where player 1 plays Opera with probability 3/5 and player 2 plays Opera with probability 2/5. The mixed equilibrium gives expected payoff 6/5 to each player, worse than either pure equilibrium. This illustrates that mixed equilibria can be inefficient.
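The indifference calculation behind the mixed equilibrium can be checked mechanically. Below is a minimal sketch for fully mixed equilibria of 2x2 games; the closed-form expressions are the standard indifference solution, written with exact fractions, and the variable names are ours.

```python
from fractions import Fraction

# Battle of the Sexes: rows/columns ordered (Opera, Football).
# A[j][k] is player 1's payoff, B[j][k] is player 2's.
A = [[3, 0], [0, 2]]
B = [[2, 0], [0, 3]]

# In a fully mixed 2x2 equilibrium, each player's mixing probability
# makes the *opponent* indifferent between their two pure strategies.
# Player 2 indifferent: p*B[0][0] + (1-p)*B[1][0] = p*B[0][1] + (1-p)*B[1][1]
p = Fraction(B[1][1] - B[1][0], B[0][0] - B[0][1] - B[1][0] + B[1][1])
# Player 1 indifferent: q*A[0][0] + (1-q)*A[0][1] = q*A[1][0] + (1-q)*A[1][1]
q = Fraction(A[1][1] - A[0][1], A[0][0] - A[0][1] - A[1][0] + A[1][1])

# Player 1 is indifferent, so either pure row yields the equilibrium payoff.
u1 = q * A[0][0] + (1 - q) * A[0][1]
# p = 3/5, q = 2/5, u1 = 6/5
```

Swapping in the payoff matrices of any 2x2 game with a fully mixed equilibrium gives the corresponding probabilities.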

Example

Matching Pennies

A strictly competitive (zero-sum) game with no pure Nash equilibrium:

         Heads      Tails
Heads    (1, -1)    (-1, 1)
Tails    (-1, 1)    (1, -1)

For any pure strategy, the opponent has a profitable deviation. The unique Nash equilibrium is mixed: both players play Heads with probability 1/2. The expected payoff is 0 for both players. This is the simplest example showing why mixed strategies are necessary.

Mixed Strategies

Definition

Mixed Strategy

A mixed strategy for player i is a probability distribution \sigma_i over the pure strategy set S_i. The set of all mixed strategies is the simplex \Delta(S_i) = \{\sigma_i \in \mathbb{R}^{|S_i|}_{\geq 0} : \sum_{s_i} \sigma_i(s_i) = 1\}.

The expected payoff under mixed strategy profile \sigma = (\sigma_1, \ldots, \sigma_n) is:

u_i(\sigma) = \sum_{s \in S} \left(\prod_{j=1}^n \sigma_j(s_j)\right) u_i(s)

A key property: u_i(\sigma) is multilinear in the mixed strategies. It is linear in \sigma_i for fixed \sigma_{-i}.
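The expected-payoff sum translates directly to code by enumerating pure profiles. A minimal sketch (the function name is ours):

```python
import itertools

def expected_payoff(u, sigmas):
    """u_i(sigma): sum over pure profiles s of u(s) times the product
    of each player's probability of playing their part of s.

    u maps a profile of strategy indices to a payoff; sigmas[j] is
    player j's mixed strategy as a list of probabilities.
    """
    total = 0.0
    for profile in itertools.product(*(range(len(s)) for s in sigmas)):
        prob = 1.0
        for j, s_j in enumerate(profile):
            prob *= sigmas[j][s_j]
        total += prob * u[profile]
    return total

# Matching Pennies for player 1 (0 = Heads, 1 = Tails): the uniform
# mixed profile yields expected payoff 0.
u1 = {(0, 0): 1, (0, 1): -1, (1, 0): -1, (1, 1): 1}
value = expected_payoff(u1, [[0.5, 0.5], [0.5, 0.5]])
# → 0.0
```

The enumeration is exponential in the number of players, which is why large-scale equilibrium computation needs structured representations rather than the raw normal form.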

Definition

Nash Equilibrium (Mixed)

A mixed strategy profile \sigma^* = (\sigma_1^*, \ldots, \sigma_n^*) is a Nash equilibrium if no player can improve their expected payoff by unilateral deviation:

u_i(\sigma_i^*, \sigma_{-i}^*) \geq u_i(\sigma_i, \sigma_{-i}^*) \quad \forall \sigma_i \in \Delta(S_i), \; \forall i \in N

Equivalently, every player's mixed strategy is a best response to the others. A pure strategy Nash equilibrium is the special case where all \sigma_i^* are degenerate (point mass on a single strategy).
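Because u_i is linear in \sigma_i, a profile is a Nash equilibrium if and only if no player gains by deviating to a *pure* strategy, which makes verification a finite check. A minimal sketch (function names are ours):

```python
import itertools

def expected_u(i, payoffs, sigma):
    """Player i's expected payoff under the mixed profile sigma."""
    total = 0.0
    for prof in itertools.product(*(range(len(s)) for s in sigma)):
        p = 1.0
        for j, s_j in enumerate(prof):
            p *= sigma[j][s_j]
        total += p * payoffs[i][prof]
    return total

def is_nash(payoffs, sigma, tol=1e-9):
    """Check whether sigma is a Nash equilibrium.

    Since u_i is linear in sigma_i, no mixed deviation can beat the
    best pure deviation, so checking pure deviations suffices.
    """
    for i in range(len(sigma)):
        current = expected_u(i, payoffs, sigma)
        for k in range(len(sigma[i])):
            pure = [0.0] * len(sigma[i])
            pure[k] = 1.0
            deviated = list(sigma)
            deviated[i] = pure
            if expected_u(i, payoffs, deviated) > current + tol:
                return False
    return True

# Matching Pennies: the 50-50 profile is an equilibrium; the pure
# profile (Heads, Heads) is not.
u1 = {(0, 0): 1, (0, 1): -1, (1, 0): -1, (1, 1): 1}
u2 = {k: -val for k, val in u1.items()}
mixed_ok = is_nash([u1, u2], [[0.5, 0.5], [0.5, 0.5]])   # True
pure_ok = is_nash([u1, u2], [[1.0, 0.0], [1.0, 0.0]])    # False
```

Note that verifying a candidate equilibrium is easy; the hard problem is finding one, as the failure-mode discussion below explains.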

Nash's Existence Theorem

Theorem

Nash Existence Theorem

Statement

Every finite normal-form game (finite players, finite strategy sets) has at least one Nash equilibrium in mixed strategies.

Intuition

Each player's best-response correspondence maps the opponents' mixed strategies (a compact convex set) to a convex subset of the player's own mixed strategy simplex. The combined best-response correspondence maps the product of simplices to itself. Kakutani's fixed-point theorem guarantees a fixed point, which is precisely a Nash equilibrium: a profile where every player is simultaneously best-responding.

Proof Sketch

Define the combined strategy space \Sigma = \Delta(S_1) \times \cdots \times \Delta(S_n). This is a nonempty, compact, convex subset of \mathbb{R}^{\sum_i |S_i|}.

Define the best-response correspondence BR: \Sigma \rightrightarrows \Sigma by BR(\sigma) = BR_1(\sigma_{-1}) \times \cdots \times BR_n(\sigma_{-n}).

Verify the conditions of Kakutani's fixed-point theorem:

  1. \Sigma is nonempty, compact, and convex.
  2. BR(\sigma) is nonempty (finite games always have best responses).
  3. BR(\sigma) is convex (if \sigma_i and \sigma_i' are both best responses, any mixture \alpha \sigma_i + (1-\alpha) \sigma_i' is also a best response because u_i is linear in \sigma_i).
  4. BR has a closed graph (by continuity of u_i in all strategies).

By Kakutani's theorem, there exists \sigma^* \in BR(\sigma^*), which is a Nash equilibrium.

Why It Matters

This theorem guarantees that the equilibrium concept is never vacuous for finite games: every finite game has a well-defined prediction of rational play. In ML, it motivates equilibrium analysis of the GAN game, though GAN strategy spaces are continuous rather than finite, so existence there relies on extensions of Nash's theorem to infinite games. It also justifies the study of multi-agent RL equilibria.

Failure Mode

The theorem guarantees existence but says nothing about uniqueness, computational tractability, or stability. A game may have exponentially many Nash equilibria. Computing a single Nash equilibrium of a general game is PPAD-complete, so no known polynomial-time algorithm exists. The equilibrium may also be unstable: small perturbations can cause the dynamics to diverge. See Nash equilibrium for these complications.

Zero-Sum Games and Minimax

Definition

Zero-Sum Game

A two-player game is zero-sum if u_1(s) + u_2(s) = 0 for every strategy profile s. One player's gain is the other's loss. The payoff is fully determined by a single matrix A: player 1 receives A_{jk} and player 2 receives -A_{jk}.

In zero-sum games, the Nash equilibrium has a special structure. Player 1 maximizes the minimum guaranteed payoff (maximin), and player 2 minimizes the maximum payoff player 1 can achieve (minimax). Von Neumann's minimax theorem shows that these values are equal:

\max_{\sigma_1 \in \Delta(S_1)} \min_{\sigma_2 \in \Delta(S_2)} \sigma_1^\top A \sigma_2 = \min_{\sigma_2 \in \Delta(S_2)} \max_{\sigma_1 \in \Delta(S_1)} \sigma_1^\top A \sigma_2

This common value is the value of the game. The minimax theorem is the foundational result of game theory and connects to LP duality: computing the minimax strategies is a linear program.
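The LP connection can be made concrete. The sketch below, assuming SciPy is available, solves the row player's maximin problem: maximize v subject to (A^T x)_k >= v for every column k, with x a probability vector (the function name is ours).

```python
import numpy as np
from scipy.optimize import linprog

def solve_zero_sum(A):
    """Maximin strategy and value for the row player of a zero-sum
    game with payoff matrix A (row player maximizes x^T A y).

    LP: maximize v subject to (A^T x)_k >= v for every column k,
    sum(x) = 1, x >= 0. linprog minimizes, so we minimize -v.
    """
    A = np.asarray(A, dtype=float)
    m, n = A.shape
    # Variables: x (m entries) followed by the scalar v.
    c = np.zeros(m + 1)
    c[-1] = -1.0                                 # minimize -v
    A_ub = np.hstack([-A.T, np.ones((n, 1))])    # v - (A^T x)_k <= 0
    b_ub = np.zeros(n)
    A_eq = np.hstack([np.ones((1, m)), np.zeros((1, 1))])  # sum(x) = 1
    b_eq = np.array([1.0])
    bounds = [(0, None)] * m + [(None, None)]    # v is a free variable
    res = linprog(c, A_ub=A_ub, b_ub=b_ub, A_eq=A_eq, b_eq=b_eq,
                  bounds=bounds)
    return res.x[:m], res.x[-1]

# Matching Pennies: maximin mix (1/2, 1/2), value of the game 0.
x, v = solve_zero_sum([[1, -1], [-1, 1]])
```

By LP duality, the same program run on -A^T recovers the column player's minimax strategy, which is the computational content of the minimax theorem.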

Connections to Machine Learning

GANs. The GAN objective is a two-player zero-sum game. The generator minimizes and the discriminator maximizes \mathbb{E}[\log D(x)] + \mathbb{E}[\log(1 - D(G(z)))]. The optimal discriminator theorem characterizes the Nash equilibrium: p_G = p_{\text{data}}.

Multi-agent RL. When multiple RL agents interact in a shared environment, the joint optimization is a stochastic game (Markov game). Nash equilibria of the stage game determine equilibrium policies. Convergence of independent learners to Nash equilibrium is not guaranteed in general.

Adversarial robustness. Adversarial attacks frame robustness as a game between a classifier and an adversary who perturbs inputs. The robust optimization formulation \min_\theta \max_{\|\delta\| \leq \epsilon} \mathcal{L}(f_\theta(x + \delta), y) is a minimax problem.

Shapley value. The Shapley value from cooperative game theory assigns a fair attribution of value to each player (feature). SHAP values for model interpretability are exactly Shapley values of a cooperative game defined by the model's predictions.
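For small games the Shapley value can be evaluated by direct coalition enumeration. A minimal sketch (the `glove` game below is a standard cooperative-game example, not from the text above); practical SHAP implementations approximate this exponential sum by sampling or by exploiting model structure.

```python
from itertools import combinations
from math import factorial

def shapley(v, players):
    """Shapley values by enumerating all coalitions.

    v maps a frozenset of players to its worth. Exponential in the
    number of players, so only suitable for small games.
    """
    n = len(players)
    phi = {}
    for i in players:
        rest = [p for p in players if p != i]
        total = 0.0
        for r in range(n):
            for S in combinations(rest, r):
                S = frozenset(S)
                weight = (factorial(len(S)) * factorial(n - len(S) - 1)
                          / factorial(n))
                total += weight * (v(S | {i}) - v(S))
        phi[i] = total
    return phi

# Glove game: player 1 holds a left glove, players 2 and 3 right
# gloves; a coalition is worth 1 iff it contains both kinds.
def glove(S):
    return 1.0 if 1 in S and (2 in S or 3 in S) else 0.0

phi = shapley(glove, [1, 2, 3])
# phi[1] = 2/3, phi[2] = phi[3] = 1/6 (up to float rounding)
```

The values sum to v(N) = 1, illustrating the efficiency axiom stated in the exercises below.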

Common Confusions

Watch Out

Nash equilibrium does not mean optimal outcome

The Prisoner's Dilemma demonstrates that Nash equilibria can be Pareto-dominated. Rational play by all agents does not guarantee a collectively good outcome. The Nash equilibrium is a stability concept (no one wants to deviate), not an efficiency concept. The gap between the Nash equilibrium and the social optimum is measured by the price of anarchy.

Watch Out

Mixed strategies are not randomization for its own sake

A mixed strategy equilibrium means that each player is exactly indifferent between all strategies in the support of their mixture. The mixing probabilities are determined by the requirement that the opponent be indifferent. In Matching Pennies, player 1 mixes 50-50 not because randomness helps them directly, but because this is the only way to prevent the opponent from exploiting a predictable pattern.

Watch Out

Dominant strategy equilibrium is much stronger than Nash

A dominant strategy equilibrium means each player has a strategy that is optimal regardless of what others do. Nash equilibrium only requires each strategy to be optimal given what others are doing. Dominant strategy equilibria are rare and do not require players to know or predict opponent strategies. The Prisoner's Dilemma has a dominant strategy equilibrium. Most games do not.

Exercises

ExerciseCore

Problem

In the Battle of the Sexes game above, compute the mixed strategy Nash equilibrium. Let player 1 play Opera with probability p and player 2 play Opera with probability q. Find p and q and the expected payoff to each player.

ExerciseCore

Problem

Consider a two-player zero-sum game with payoff matrix A = \begin{pmatrix} 3 & -1 \\ -2 & 4 \end{pmatrix}. Does either player have a dominant strategy? Find the value of the game and the mixed strategy Nash equilibrium.

ExerciseAdvanced

Problem

Prove that in any two-player zero-sum game with payoff matrix A, the set of Nash equilibrium strategies for player 1 is a convex polytope. Show that if \sigma_1^* and \sigma_1^{**} are both equilibrium strategies for player 1, then any convex combination \alpha \sigma_1^* + (1-\alpha) \sigma_1^{**} for \alpha \in [0, 1] is also an equilibrium strategy for player 1.

ExerciseAdvanced

Problem

The Shapley value \phi_i(v) for player i in a cooperative game with characteristic function v: 2^N \to \mathbb{R} is:

\phi_i(v) = \sum_{S \subseteq N \setminus \{i\}} \frac{|S|! \, (n - |S| - 1)!}{n!} \left[ v(S \cup \{i\}) - v(S) \right]

Prove that the Shapley value satisfies the efficiency axiom: \sum_{i=1}^n \phi_i(v) = v(N) - v(\emptyset).

References

Canonical:

  • Osborne & Rubinstein, A Course in Game Theory (1994), Chapters 1-3
  • Fudenberg & Tirole, Game Theory (1991), Chapters 1-2, 11
  • Myerson, Game Theory: Analysis of Conflict (1991), Chapters 1-3

Current:

  • Shoham & Leyton-Brown, Multiagent Systems (2009), Chapters 3-4, 13
  • Nisan, Roughgarden, Tardos & Vazirani, Algorithmic Game Theory (2007), Chapters 1-2
  • Goodfellow et al., "Generative Adversarial Nets" (2014), Section 4 (game-theoretic analysis)

Next Topics

Natural extensions from game theory foundations:

  • Nash equilibrium: existence proofs, computational complexity, refinements, price of anarchy
  • Mechanism design: inverse game theory, designing rules for self-interested agents
  • Minimax theorem: the fundamental theorem of zero-sum games and its connections to LP duality

Last reviewed: April 2026
