Gaussian Processes in Astronomy

Sneiderman, Robby

GPs as the workhorse for stellar light curves, exoplanet radial velocities, and cosmological field reconstruction: scalable kernels (celerite), correlated-noise modeling, and joint inference with physical signal models.

Prerequisites

Gaussian Processes for ML; Gaussian Processes Regression

Why This Matters

Astronomical time series are often short, irregularly sampled, and contaminated by physical nuisance signals: stellar rotation, granulation, instrumental drift, atmospheric turbulence. Most physical signals of interest (planetary transits at $\sim 100$ ppm, radial-velocity wobbles of $\sim 1$ m/s) sit below the amplitude of these correlated nuisances. Treating the noise as white (i.i.d. Gaussian) biases parameter estimates, understates their uncertainties, and inflates false-positive rates.

Gaussian processes give a principled framework for jointly modeling a physical signal and a stationary stochastic noise process. The kernel encodes prior beliefs about the timescale and smoothness of the nuisance; the posterior returns parameter estimates with calibrated uncertainty. This matters for the yes/no decisions that drive follow-up time on Keck or JWST.

The cubic cost $O(N^3)$ of standard GP inference forced the field to develop scalable methods for one-dimensional time series. The celerite algorithm reduces the cost to $O(N J^2)$ for kernels written as a sum of $J$ (complex) exponential terms, which is linear in the number of data points $N$. This made GPs a default tool for Kepler, K2, TESS, and ground-based RV pipelines.
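To make the cubic cost concrete, here is a minimal dense-matrix sketch of the Gaussian log-likelihood that a standard GP evaluates at every hyperparameter step. The function name `dense_gp_loglike`, the exponential kernel, and all numerical values are illustrative assumptions, not taken from any particular pipeline. The Cholesky factorization is the $O(N^3)$ step that structured solvers such as celerite avoid.

```python
import numpy as np

def dense_gp_loglike(resid, cov):
    """Log N(resid | 0, cov) using a dense Cholesky factorization.

    resid : (N,) data minus the mean model
    cov   : (N, N) kernel matrix with white-noise variance on the diagonal
    """
    n = resid.size
    chol = np.linalg.cholesky(cov)          # O(N^3): the dense bottleneck
    # Whitened residuals z = chol^{-1} resid. np.linalg.solve does not exploit
    # the triangular structure; a production code would use a triangular solver.
    z = np.linalg.solve(chol, resid)
    log_det = 2.0 * np.sum(np.log(np.diag(chol)))
    return -0.5 * (z @ z) - 0.5 * log_det - 0.5 * n * np.log(2.0 * np.pi)

# Example (assumed values): irregular sampling, exponential kernel + white noise.
rng = np.random.default_rng(0)
t = np.sort(rng.uniform(0.0, 50.0, size=200))
tau = np.abs(t[:, None] - t[None, :])
cov = 1.0 * np.exp(-tau / 5.0) + 0.1**2 * np.eye(t.size)
y = np.linalg.cholesky(cov) @ rng.standard_normal(t.size)  # one noise draw
print(dense_gp_loglike(y, cov))
```

The exponential kernel used here is the simplest member of the mixture-of-exponentials family, so the same likelihood could in principle be evaluated in linear time by a semiseparable solver.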

Core Ideas

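Stellar variability and transit detection. Kepler photometry shows correlated brightness variations from rotation-modulated starspots, p-mode oscillations, and granulation. A quasi-periodic kernel $k(\tau) = A \exp(-\tau^2 / 2\ell^2) \cos(2\pi \tau / P)$, with amplitude $A$, coherence timescale $\ell$, and rotation period $P$, models the rotation signal, while the mean function carries the transit dip, parameterized by a Mandel-Agol model. Comparing the marginal likelihood with and without the transit term discriminates a planet from a stellar artifact. The celerite method (Foreman-Mackey et al. 2017, AJ 154) restricts the covariance to a sum of complex exponentials, a family that includes stochastically driven damped harmonic oscillators; the covariance matrix is then semiseparable and admits a linear-time Cholesky factorization. The squared-exponential envelope above lies outside that family, so celerite-based pipelines replace it with an exponentially damped analogue.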
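A minimal sketch of this comparison, using the quasi-periodic kernel written above and a dense solver. Several things here are assumptions made for illustration: `box_transit` is a hypothetical flat-bottomed stand-in for a Mandel-Agol model, all numerical values are invented, and the hyperparameters are held at their simulated values, so the printed quantity is a likelihood ratio at fixed parameters rather than a full evidence comparison.

```python
import numpy as np

def quasi_periodic(tau, amp, ell, period):
    """k(tau) = amp * exp(-tau^2 / (2 ell^2)) * cos(2 pi tau / period)."""
    return amp * np.exp(-0.5 * (tau / ell) ** 2) * np.cos(2.0 * np.pi * tau / period)

def box_transit(t, t0, duration, depth):
    """Hypothetical stand-in for a Mandel-Agol model: a flat-bottomed dip."""
    return np.where(np.abs(t - t0) < 0.5 * duration, -depth, 0.0)

def loglike(resid, cov):
    """Gaussian log-likelihood of residuals under covariance cov."""
    chol = np.linalg.cholesky(cov)
    z = np.linalg.solve(chol, resid)
    return (-0.5 * (z @ z) - np.sum(np.log(np.diag(chol)))
            - 0.5 * resid.size * np.log(2.0 * np.pi))

rng = np.random.default_rng(1)
t = np.sort(rng.uniform(0.0, 20.0, size=600))        # days, irregular cadence
tau = t[:, None] - t[None, :]
sigma = 0.1                                          # white noise (ppt, assumed)
cov = quasi_periodic(tau, amp=1.0, ell=15.0, period=6.0) + sigma**2 * np.eye(t.size)

# Simulate rotation-like correlated noise plus a 1 ppt, 0.5 day transit.
truth = box_transit(t, t0=10.0, duration=0.5, depth=1.0)
flux = truth + np.linalg.cholesky(cov) @ rng.standard_normal(t.size)

ll_planet = loglike(flux - truth, cov)   # transit included in the mean function
ll_null = loglike(flux, cov)             # stellar variability only
print("log-likelihood ratio (planet vs. none):", ll_planet - ll_null)
```

In a real analysis the transit and kernel parameters would be fitted or marginalized for both hypotheses before the two models are compared.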

Exoplanet radial velocities. Stellar surface activity injects RV signals at the rotation period and its harmonics, often comparable to or larger than the planetary signal. Standard practice is a joint model: Keplerian orbits for the planets plus a GP with a quasi-periodic kernel for activity, with the orbital parameters and kernel hyperparameters sampled together by HMC or nested sampling. This approach was central to confirmations of Proxima b and several TESS systems.
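Written out, the joint model described above takes roughly the following form. The notation is an assumption introduced here for illustration: $\gamma$ is the systemic velocity, $K_p$, $e_p$, $\omega_p$, and $\nu_p(t)$ are the semi-amplitude, eccentricity, argument of periastron, and true anomaly of planet $p$, $\sigma_i$ are the reported per-point errors, $s$ is an extra white-noise jitter term, and $\mathbf{K}_{\rm act}$ is the activity kernel matrix with hyperparameters $\phi = (A, \ell, P)$.

$$
\mathbf{v} \sim \mathcal{N}\!\big(\boldsymbol{\mu},\; \mathbf{C}\big), \qquad
\mu(t_i) = \gamma + \sum_{p} K_p \left[\cos\!\big(\nu_p(t_i) + \omega_p\big) + e_p \cos\omega_p\right], \qquad
\mathbf{C} = \mathbf{K}_{\rm act}(\phi) + \mathrm{diag}\!\big(\sigma_i^2 + s^2\big)
$$

$$
\ln \mathcal{L} = -\tfrac{1}{2}\, \mathbf{r}^\top \mathbf{C}^{-1} \mathbf{r} \;-\; \tfrac{1}{2} \ln\det \mathbf{C} \;-\; \tfrac{N}{2} \ln 2\pi, \qquad \mathbf{r} = \mathbf{v} - \boldsymbol{\mu}
$$

The sampler explores the orbital parameters and $\phi$ together, so uncertainty about the activity signal propagates into the posterior on each $K_p$.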

Cosmological field reconstruction. GPs serve as nonparametric priors for fields where the underlying function is smooth but otherwise unknown: reconstructing the Hubble parameter $H(z)$ from supernova distance moduli, mapping the dark-energy equation of state $w(z)$, or inferring weak-lensing convergence maps. The kernel choice (squared-exponential, Matérn $\nu = 3/2$, Matérn $\nu = 5/2$) controls the smoothness assumptions and propagates into the posterior on cosmological parameters. The model-selection question (which kernel) can itself be addressed by comparing marginal likelihoods.
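The kernel comparison can be sketched on toy data as follows. Everything numerical here is an assumption for illustration: the synthetic $H(z)$ points come from a flat model with made-up parameters and 5 km/s/Mpc errors, the GP mean is a constant set to the sample mean, and the hyperparameters `amp` and `ell` are fixed by hand rather than optimized or marginalized.

```python
import numpy as np

def sq_exp(r, amp, ell):
    return amp**2 * np.exp(-0.5 * (r / ell) ** 2)

def matern32(r, amp, ell):
    a = np.sqrt(3.0) * r / ell
    return amp**2 * (1.0 + a) * np.exp(-a)

def matern52(r, amp, ell):
    a = np.sqrt(5.0) * r / ell
    return amp**2 * (1.0 + a + a**2 / 3.0) * np.exp(-a)

def gp_fit_predict(kernel, z, y, yerr, z_new, amp, ell):
    """Return (posterior mean, posterior std, log marginal likelihood)."""
    mean = y.mean()                                   # constant mean: an assumption
    cov = kernel(np.abs(z[:, None] - z[None, :]), amp, ell) + np.diag(yerr**2)
    chol = np.linalg.cholesky(cov)
    w = np.linalg.solve(chol, y - mean)               # whitened residuals
    k_star = kernel(np.abs(z_new[:, None] - z[None, :]), amp, ell)
    v = np.linalg.solve(chol, k_star.T)
    mu = mean + v.T @ w                               # k_star @ cov^{-1} (y - mean)
    var = amp**2 - np.sum(v**2, axis=0)               # prior variance minus reduction
    logz = (-0.5 * (w @ w) - np.sum(np.log(np.diag(chol)))
            - 0.5 * y.size * np.log(2.0 * np.pi))
    return mu, np.sqrt(np.clip(var, 0.0, None)), logz

def toy_hubble(z):
    """Toy expansion history used only to generate synthetic data."""
    return 70.0 * np.sqrt(0.3 * (1.0 + z) ** 3 + 0.7)

rng = np.random.default_rng(2)
z = np.sort(rng.uniform(0.05, 2.0, size=30))
yerr = np.full(z.size, 5.0)
y = toy_hubble(z) + yerr * rng.standard_normal(z.size)
z_new = np.linspace(0.1, 1.9, 40)

results = {}
for name, kern in [("sq_exp", sq_exp), ("matern32", matern32), ("matern52", matern52)]:
    results[name] = gp_fit_predict(kern, z, y, yerr, z_new, amp=80.0, ell=2.0)
    print(name, "log marginal likelihood:", results[name][2])
```

With the hyperparameters fixed, the three log marginal likelihoods differ only through the kernel shape; a fuller treatment would optimize or marginalize `amp` and `ell` for each kernel before comparing.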

Scalable kernels beyond celerite. For two-dimensional fields and higher-dimensional inputs, structured kernel interpolation (KISS-GP) and inducing-point methods (SVGP) extend GPs to $N \sim 10^5$ or larger. These power applications in galaxy survey systematics modeling and 21-cm signal extraction.
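The idea behind inducing-point methods can be illustrated with a subset-of-regressors approximation, a simpler relative of SVGP; this is not SVGP or KISS-GP itself. The sketch below is one-dimensional for brevity and uses invented data, a hand-picked grid of inducing inputs `u`, and fixed hyperparameters, all of which are assumptions. The point is that the linear system to solve has size $M \times M$ (the number of inducing inputs) instead of $N \times N$.

```python
import numpy as np

def sq_exp(a, b, ell=1.0):
    """Squared-exponential kernel matrix between input vectors a and b."""
    return np.exp(-0.5 * ((a[:, None] - b[None, :]) / ell) ** 2)

def exact_mean(x, y, x_new, sigma):
    """Exact GP posterior mean: solves an (N, N) system, O(N^3)."""
    cov = sq_exp(x, x) + sigma**2 * np.eye(x.size)
    return sq_exp(x_new, x) @ np.linalg.solve(cov, y)

def inducing_mean(x, y, x_new, u, sigma, jitter=1e-6):
    """Subset-of-regressors posterior mean with M inducing inputs u: O(N M^2)."""
    k_uu = sq_exp(u, u) + jitter * np.eye(u.size)   # jitter for numerical stability
    k_uf = sq_exp(u, x)
    a = sigma**2 * k_uu + k_uf @ k_uf.T             # (M, M) system instead of (N, N)
    return sq_exp(x_new, u) @ np.linalg.solve(a, k_uf @ y)

rng = np.random.default_rng(3)
x = np.sort(rng.uniform(0.0, 10.0, size=300))
sigma = 0.1
y = np.sin(x) + sigma * rng.standard_normal(x.size)
x_new = np.linspace(0.0, 10.0, 50)
u = np.linspace(0.0, 10.0, 21)                      # M = 21 inducing inputs

mu_exact = exact_mean(x, y, x_new, sigma)
mu_sparse = inducing_mean(x, y, x_new, u, sigma)
print("max |exact - inducing|:", np.max(np.abs(mu_exact - mu_sparse)))
```

The approximation is good here because the inducing grid is dense relative to the kernel length scale; with too few inducing inputs the two means would diverge.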

Common Confusions

Watch Out

Which residuals should look white

Adding a GP noise model does not whiten the residuals left after subtracting only the physical (planet) model: those residuals still contain the correlated component by construction. The GP posterior mean absorbs that component, so residuals against the full prediction (physical model plus GP posterior mean) should look approximately white. Plotting residuals against the planet-only model and complaining about correlations is a category error.
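A small simulation makes the distinction concrete. It is a sketch under stated assumptions: the sinusoidal `planet` model, the squared-exponential activity kernel, and all numerical values are invented, and the true model and hyperparameters are treated as known. It compares the lag-1 autocorrelation of residuals against the planet-only model, residuals against the planet model plus GP posterior mean, and residuals whitened with the Cholesky factor of the full covariance.

```python
import numpy as np

def lag1_autocorr(x):
    """Lag-1 sample autocorrelation of a time-ordered series."""
    x = x - x.mean()
    return np.sum(x[:-1] * x[1:]) / np.sum(x * x)

rng = np.random.default_rng(4)
t = np.sort(rng.uniform(0.0, 30.0, size=300))
sigma = 0.1                                            # white-noise level (assumed)
k_act = np.exp(-0.5 * ((t[:, None] - t[None, :]) / 2.0) ** 2)  # smooth activity kernel
cov = k_act + sigma**2 * np.eye(t.size)
chol = np.linalg.cholesky(cov)

planet = 0.5 * np.sin(2.0 * np.pi * t / 3.7)           # toy planet-only model
y = planet + chol @ rng.standard_normal(t.size)        # planet + correlated + white

r_planet = y - planet                                  # residuals vs. planet only
gp_mean = k_act @ np.linalg.solve(cov, r_planet)       # posterior mean of activity
r_full = r_planet - gp_mean                            # residuals vs. planet + GP
r_white = np.linalg.solve(chol, r_planet)              # Cholesky-whitened residuals

print("planet-only residuals :", lag1_autocorr(r_planet))
print("planet + GP residuals :", lag1_autocorr(r_full))
print("whitened residuals    :", lag1_autocorr(r_white))
```

The residuals against the posterior mean are only approximately white, since the GP also removes a little low-frequency white noise. The Cholesky-whitened residuals are the ones that are exactly i.i.d. standard normal when the model is correct, which makes them the cleaner diagnostic.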

References

Rasmussen and Williams, Gaussian Processes for Machine Learning, MIT Press 2006, Chapter 5 (model selection and hyperparameter learning).
Foreman-Mackey, Agol, Ambikasaran, Angus, Fast and Scalable Gaussian Process Modeling with Applications to Astronomical Time Series (Astronomical Journal 154, 2017; arXiv 1703.09710). The celerite paper.
Haywood et al., Planets and Stellar Activity: Hide and Seek in the CoRoT-7 system (MNRAS 443, 2014; arXiv 1407.1044). GP modeling of stellar RV jitter.
Aigrain, Pont, Zucker, A simple method to estimate radial velocity variations due to stellar activity using photometry (MNRAS 419, 2012; arXiv 1109.6443).
Seikel, Clarkson, Smith, Reconstruction of dark energy and expansion dynamics using Gaussian processes (JCAP 06, 2012; arXiv 1204.2832). GP priors for cosmological functions.
Angus et al., Inferring probabilistic stellar rotation periods using Gaussian processes (MNRAS 474, 2018; arXiv 1706.05459).
