Agent-Based Modeling with ML
Where ML meets agent-based modeling: neural surrogates for slow simulations, differentiable ABMs that allow gradient-based calibration, multi-agent RL inside simulators, and the equation-free vs. equation-based identification debate.
Why This Matters
Agent-based models (ABMs) simulate populations of heterogeneous decision makers under explicit interaction rules and read off macro outcomes from the aggregate behavior. The appeal is that the resulting macro behavior is generated, not assumed: bubbles, cascades, segregation, and disease spread emerge from local rules. The catch is that classical ABMs are computationally heavy, hard to calibrate, and often weakly identified, which has kept them out of policy work that demands fast counterfactuals and reproducible fits.
ML has eased the first two of those constraints. Neural surrogates compress expensive simulators into fast approximations, and differentiable ABMs allow gradient-based calibration against observed moments; multi-agent RL additionally replaces hand-coded heuristic agents with agents trained inside the simulator. Identification, though, remains as hard as ever, and that is the part that determines whether the resulting model can answer policy questions.
Core Ideas
Neural surrogates for ABM behavior. Lamperti, Roventini, and Sani (2018, Journal of Economic Dynamics and Control 90) fit a machine-learning surrogate (gradient-boosted trees in their implementation; neural networks are common in later work) to the input-output mapping from ABM parameters to summary statistics, then calibrate the original ABM by inverting the surrogate. The surrogate evaluates orders of magnitude faster than the simulator, which makes likelihood-free inference and Bayesian calibration tractable on models that previously required weeks of compute. The standard pipeline: sample parameters, run the simulator, fit the surrogate, then use approximate Bayesian computation or sequential neural posterior estimation on the surrogate to recover a posterior over parameters.
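A minimal numpy sketch of that sample-simulate-fit-invert pipeline. The lognormal-shock "simulator", the quadratic ridge surrogate, and the grid-search inversion are all illustrative stand-ins for a real ABM, a learned surrogate architecture, and proper posterior machinery:

```python
import numpy as np

rng = np.random.default_rng(0)

def simulator(theta, n=2000):
    # Toy stand-in for an expensive ABM: agents receive lognormal
    # wealth shocks governed by theta = (mu, sigma); report two
    # summary statistics of the resulting cross-section.
    wealth = np.exp(rng.normal(theta[0], theta[1], size=n))
    return np.array([wealth.mean(), wealth.std()])

# 1. Sample parameters from the prior and run the slow simulator once per draw.
thetas = rng.uniform([0.0, 0.1], [1.0, 1.0], size=(300, 2))
stats = np.array([simulator(t) for t in thetas])

# 2. Fit a cheap surrogate of the parameter-to-statistics map
#    (quadratic ridge regression here, purely for illustration).
def features(T):
    a, b = T[:, 0], T[:, 1]
    return np.column_stack([np.ones_like(a), a, b, a * b, a**2, b**2])

X = features(thetas)
W = np.linalg.solve(X.T @ X + 1e-6 * np.eye(6), X.T @ stats)
surrogate = lambda T: features(T) @ W

# 3. Calibrate by inverting the surrogate: find parameters whose predicted
#    statistics match the observed ones. Each candidate now costs a matrix
#    multiply instead of a full simulation run.
observed = simulator(np.array([0.5, 0.4]))
grid = rng.uniform([0.0, 0.1], [1.0, 1.0], size=(20_000, 2))
loss = ((surrogate(grid) - observed) ** 2).sum(axis=1)
theta_hat = grid[loss.argmin()]
```

In a real application step 3 would feed an ABC or neural-posterior routine rather than a point estimate, but the economics is the same: the surrogate absorbs almost all of the compute.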
Differentiable ABMs. Rule-based simulators are typically not differentiable, because they include if-statements, sampling steps, and discrete agent decisions. Differentiable ABMs replace these with smooth relaxations (Gumbel-softmax for discrete choice, reparameterized samplers, soft attention for matching) so that the entire simulator becomes a computation graph through which gradients flow. Andelfinger (2021, ACM Transactions on Modeling and Computer Simulation 31) gave a systematic treatment; Chopra, Quera-Bofarull, and collaborators (2024) scaled the approach to epidemiological ABMs with millions of agents. Calibration becomes gradient descent on a moment-matching loss instead of black-box optimization.
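A one-parameter sketch of the relaxation idea, assuming a toy threshold-adoption model: the hard rule 1[theta > c_i] has zero gradient almost everywhere, so it is replaced by a sigmoid at temperature tau, and calibration becomes gradient descent on a moment-matching loss. The gradient is written out by hand here; an autodiff framework would supply it in a real differentiable ABM:

```python
import numpy as np

rng = np.random.default_rng(1)

# Each agent adopts when a latent benefit theta exceeds its private cost c_i.
costs = rng.uniform(0.0, 1.0, size=10_000)
tau = 0.05  # temperature of the sigmoid relaxation

def soft_adoption_rate(theta):
    # Smooth relaxation of the discrete adopt/don't-adopt decision.
    p = 1.0 / (1.0 + np.exp(-(theta - costs) / tau))
    return p, p.mean()

target_rate = 0.6   # observed macro moment to match
theta = 0.1         # initial parameter guess
lr = 0.5

for _ in range(200):
    p, rate = soft_adoption_rate(theta)
    # Analytic gradient of the loss (rate - target)^2 through the
    # smooth simulator: d p_i / d theta = p_i (1 - p_i) / tau.
    grad = 2.0 * (rate - target_rate) * (p * (1 - p)).mean() / tau
    theta -= lr * grad
```

With uniform costs the calibrated theta lands near the 0.6 quantile of the cost distribution, i.e. near 0.6 — the parameter at which the smoothed adoption rate matches the target moment.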
Multi-agent RL inside ABMs. Replace heuristic agents with agents whose policies are trained by RL, so behavior emerges from objectives rather than assumed rules. This is attractive when the modeler has confidence about agent objectives and budget constraints but not about the decision rule. The risks are well documented: training non-stationarity, cycling equilibria, and reward-hacking artifacts that no human modeler would have written by hand. Scope conditions matter; not every ABM benefits from turning rule-based agents into RL agents.
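A minimal illustration of behavior emerging from objectives rather than assumed rules: two independent tabular Q-learners in a repeated stage game, standing in for a full ABM environment. The payoffs and hyperparameters are illustrative, not from any particular model:

```python
import numpy as np

rng = np.random.default_rng(2)

# Stage game with a prisoner's-dilemma structure: action 0 ('cut price')
# strictly dominates action 1 ('hold price high'), so independent learners
# find mutual price-cutting even though mutual high prices pay more.
payoff = {(0, 0): (1.0, 1.0), (0, 1): (3.0, 0.0),
          (1, 0): (0.0, 3.0), (1, 1): (2.0, 2.0)}

Q = np.zeros((2, 2))      # Q[agent, action] for the stateless game
alpha, eps = 0.1, 0.1     # learning rate, exploration rate

def act(i):
    return int(rng.integers(2)) if rng.random() < eps else int(Q[i].argmax())

for _ in range(50_000):
    a = (act(0), act(1))
    r = payoff[a]
    for i in range(2):
        # Each agent updates as if facing a fixed environment, but the
        # 'environment' contains the other learner -- the source of the
        # training non-stationarity the text warns about.
        Q[i, a[i]] += alpha * (r[i] - Q[i, a[i]])

learned = Q.argmax(axis=1)  # both agents settle on the dominant action 0
```

No decision rule was written down, only payoffs; the price-cutting behavior is an output of training, which is exactly the appeal and the risk.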
Identification stays hard. The equation-free vs. equation-based debate hinges on whether macro outcomes pin down micro mechanisms. They typically do not: many distinct micro rules generate observationally equivalent macro moments. ABMs with thousands of free parameters can fit almost any aggregate trajectory. ML calibration tightens computational fitting but does not relax this fundamental underdetermination. Reporting which moments the model matches and which it cannot is the productive discipline.
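A small numerical illustration of that underdetermination, using two hypothetical micro rules: homogeneous agents versus a heterogeneous mixture with the same average. The macro moment a calibration typically targets cannot tell them apart; an untargeted moment can:

```python
import numpy as np

rng = np.random.default_rng(3)
n, T = 5_000, 10  # agents, repeated observations per agent

# Micro rule A: homogeneous agents, each adopting w.p. 0.3 per period.
counts_a = rng.binomial(T, 0.3, size=n)

# Micro rule B: half the agents adopt w.p. 0.1, half w.p. 0.5 --
# a very different micro story with the same mean adoption rate.
p_b = np.where(np.arange(n) < n // 2, 0.1, 0.5)
counts_b = rng.binomial(T, p_b)

# The targeted macro moment is identical (both ~0.30)...
mean_a, mean_b = counts_a.mean() / T, counts_b.mean() / T
# ...while the cross-agent variance of adoption counts separates the
# mechanisms (roughly 2.1 for A vs 5.7 for B).
var_a, var_b = counts_a.var(), counts_b.var()
```

This is why reporting unmatched moments matters: the variance here plays the role of a diagnostic the calibration never saw.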
Common Confusions
A neural surrogate is not a faster simulator
A surrogate trained on parameter draws can interpolate well within the training region and fail badly outside it. Calibrating against out-of-sample data can push the inverse problem into a regime where the surrogate has no support. Active-learning loops that re-sample the simulator near the current best fit are the standard fix, and skipping that step is the most common error.
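A sketch of such an active-learning loop, assuming a toy one-parameter simulator and a polynomial surrogate in place of the real pipeline. The point is the structure: refit, propose where the surrogate matches the data, then spend a real simulator run there rather than trusting the surrogate's extrapolation:

```python
import numpy as np

rng = np.random.default_rng(4)

def simulator(theta):
    # Expensive-simulator stand-in: one noisy summary statistic.
    return theta**3 + 0.01 * rng.normal()

observed = 0.8**3  # statistic computed from the data (true theta = 0.8)

# Small initial design spread over the prior range.
thetas = list(rng.uniform(0.0, 1.0, size=5))
stats = [simulator(t) for t in thetas]

for _ in range(10):
    # Refit the cheap surrogate on every simulator run so far.
    coef = np.polyfit(thetas, stats, deg=3)
    # Propose the parameter where the surrogate best matches the data...
    grid = np.linspace(0.0, 1.0, 500)
    proposal = grid[np.argmin((np.polyval(coef, grid) - observed) ** 2)]
    # ...then query the real simulator there, so the design densifies
    # around the current best fit instead of relying on extrapolation.
    thetas.append(proposal)
    stats.append(simulator(proposal))

theta_hat = thetas[-1]  # final proposal, in the neighborhood of 0.8
```

Skipping the re-simulation step turns this into pure surrogate inversion, which is exactly the failure mode described above.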
Multi-agent RL agents can satisfy a simulator and lie to it
RL agents optimize the reward they are given inside the environment they are trained in. If the environment has a loophole, they will take it. ABM applications routinely report agents that learn to exploit numerical discretization, agent-creation rules, or boundary conditions in ways that inflate measured welfare while telling the modeler nothing about the underlying economic question. Sanity-check learned policies against hand-coded baselines before reading anything off them.
Last reviewed: April 18, 2026
Prerequisites
Foundations this topic depends on.
- Multi-Armed Bandits Theory (Layer 2)
- Common Probability Distributions (Layer 0A)
- Sets, Functions, and Relations (Layer 0A)
- Basic Logic and Proof Techniques (Layer 0A)
- Markov Games and Self-Play (Layer 3)
- Markov Decision Processes (Layer 2)
- Convex Optimization Basics (Layer 1)
- Differentiation in Rn (Layer 0A)
- Matrix Operations and Properties (Layer 0A)
- Concentration Inequalities (Layer 1)
- Expectation, Variance, Covariance, and Moments (Layer 0A)