
Mathematical Infrastructure

Non-Euclidean and Hyperbolic Geometry

The geometry that drops the parallel postulate. Hyperbolic and spherical models, sectional curvature, the Poincaré disk, and why hyperbolic spaces embed tree-structured data with low distortion. The grounding for graph embeddings on curved spaces.

Advanced · Tier 2 · Stable · ~30 min

Why This Matters

Hierarchies, taxonomies, and trees are everywhere in ML data: WordNet, biological phylogenies, file systems, citation graphs, knowledge graphs. Embedding them faithfully in $\mathbb{R}^d$ with the Euclidean metric is provably hard, and the reason is geometric. In a tree, the number of nodes within graph distance $r$ of the root grows exponentially in $r$; Euclidean balls in $\mathbb{R}^d$ have only polynomial volume $\sim r^d$; so the embedding must distort distances. In contrast, hyperbolic space has exponential volume growth, matching the combinatorial structure of trees, which is why hyperbolic embeddings (Nickel & Kiela 2017, Sala et al. 2018) achieve much lower distortion than Euclidean ones at the same dimension.

This page is the geometric grounder. It states the parallel postulate, names the three constant-curvature model spaces, defines the Poincaré disk metric, proves the volume-growth gap, and lists the operational consequences for graph and representation learning. It is not a substitute for a Riemannian geometry course; it is the prerequisite chain for hyperbolic-embeddings-for-graphs.

The Parallel Postulate and the Three Geometries

Euclid's fifth postulate, in Playfair's version: through a point not on a given line there is exactly one line parallel to the given one. Dropping this postulate while keeping the others yields two consistent alternatives: zero parallels (spherical / elliptic geometry) or infinitely many parallels (hyperbolic geometry). The three constant-curvature model spaces in dimension $n$ are:

  • $\mathbb{S}^n$, the round $n$-sphere of constant sectional curvature $K = +1$.
  • $\mathbb{R}^n$, Euclidean space, $K = 0$.
  • $\mathbb{H}^n$, hyperbolic $n$-space, $K = -1$.

A general Riemannian manifold has curvature that varies from point to point and from plane to plane in the tangent space. The constant-curvature spaces are the simplest models and serve as comparison spaces for the general case via the Rauch and Toponogov comparison theorems.

Definition

Sectional Curvature

For a Riemannian manifold $M$ and a $2$-plane $\sigma \subset T_p M$ in the tangent space at $p$, the sectional curvature $K(\sigma)$ is the Gaussian curvature at $p$ of the small surface obtained by exponentiating $\sigma$ at $p$. Up to rescaling the metric, the three model spaces above are exactly the simply connected, complete Riemannian manifolds whose sectional curvature is constant in $\sigma$ and $p$.

The Poincaré Disk Model

The cleanest model of $\mathbb{H}^2$ for ML purposes is the Poincaré disk. Take the open unit disk $D = \{(x, y) \in \mathbb{R}^2 : x^2 + y^2 < 1\}$ and equip it with the conformal metric
$$ds^2 = \frac{4\,(dx^2 + dy^2)}{(1 - x^2 - y^2)^2}.$$
The Riemannian distance between two points $u, v \in D$ is
$$d_{\mathbb{H}}(u, v) = \operatorname{arcosh}\!\left(1 + \frac{2\,\|u - v\|^2}{(1 - \|u\|^2)(1 - \|v\|^2)}\right).$$
The boundary of the disk is at infinite hyperbolic distance from any interior point: as $\|u\| \to 1$, $d_{\mathbb{H}}(0, u) \to \infty$. Distances stretch near the boundary. The geodesics (length-minimizing paths) of the model are Euclidean straight lines through the origin and Euclidean circular arcs that meet the boundary orthogonally.
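The distance formula is easy to sanity-check numerically. A minimal sketch in plain Python (the function name `poincare_dist` is ours, not from any library):

```python
import math

def poincare_dist(u, v):
    """Hyperbolic distance in the open unit disk/ball (curvature -1)."""
    duv = sum((a - b) ** 2 for a, b in zip(u, v))
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    return math.acosh(1 + 2 * duv / ((1 - uu) * (1 - vv)))

# Along a radius the formula collapses to d(0, (r, 0)) = log((1+r)/(1-r)),
# which diverges as r -> 1: the boundary is infinitely far away.
for r in (0.5, 0.9, 0.99, 0.999):
    print(r, poincare_dist((0.0, 0.0), (r, 0.0)))
```

Each successive row shows the distance growing without bound even though the Euclidean gap to the boundary shrinks.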

The conformal factor $4 / (1 - \|x\|^2)^2$ is what makes hyperbolic geometry so different from Euclidean geometry near the boundary, and it is also the source of its embedding power. A point whose neighbourhood looks "small" in the Euclidean picture has an enormous hyperbolic neighbourhood, so a polynomial number of dimensions can host an exponential number of well-separated points.

Volume Growth: The Embedding Argument

Theorem

Volume Growth of a Ball

Statement

For the geodesic ball of radius $r$ in $\mathbb{H}^n$,
$$\mathrm{vol}\big(B_r^{\mathbb{H}^n}\big) = \omega_{n-1} \int_0^r \sinh^{n-1}(s)\, ds,$$
where $\omega_{n-1}$ is the surface area of the unit $(n-1)$-sphere. For large $r$, $\mathrm{vol}(B_r^{\mathbb{H}^n}) \sim C_n\, e^{(n-1) r}$.

For comparison, $\mathrm{vol}(B_r^{\mathbb{R}^n}) = c_n r^n$ (polynomial) and $\mathrm{vol}(B_r^{\mathbb{S}^n}) \leq \mathrm{vol}(\mathbb{S}^n) < \infty$ (bounded).
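The theorem can be checked numerically from the stated integral; in $\mathbb{H}^2$ it also has the closed form $2\pi(\cosh r - 1)$. A sketch (the helper names `hyp_ball_volume` and `euc_ball_volume` are ours):

```python
import math

def hyp_ball_volume(n, r, steps=20000):
    """vol of the geodesic r-ball in H^n: omega_{n-1} * integral of sinh^{n-1}."""
    omega = 2 * math.pi ** (n / 2) / math.gamma(n / 2)  # area of the unit (n-1)-sphere
    h = r / steps
    # trapezoid rule; for n >= 2 the left endpoint sinh(0)^(n-1) = 0
    total = 0.5 * math.sinh(r) ** (n - 1)
    for i in range(1, steps):
        total += math.sinh(i * h) ** (n - 1)
    return omega * total * h

def euc_ball_volume(n, r):
    """vol of the r-ball in R^n: polynomial in r."""
    return math.pi ** (n / 2) / math.gamma(n / 2 + 1) * r ** n
```

At $n = 2$ the numeric value matches $2\pi(\cosh r - 1)$, and the hyperbolic-to-Euclidean volume ratio explodes as $r$ grows.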

Intuition

In hyperbolic space the radial metric coefficient $\sinh(s)$ grows exponentially with the radial distance $s$, whereas in Euclidean space the coefficient is the polynomial $s$. Volumes of shells therefore grow exponentially, which gives hyperbolic space "room" for a tree of branching factor $b$ to embed at depth $\approx \log_b(\mathrm{vol})$.

Why It Matters

A complete binary tree of depth $d$ has $2^{d+1} - 1$ nodes and pairwise graph distances up to $2d$. No $\mathbb{R}^k$ embeds it isometrically (this already fails at depth $2$), and any embedding into $\ell_2$ incurs distortion that grows with $d$ (Bourgain). In $\mathbb{H}^2$, by contrast, trees embed with distortion arbitrarily close to $1$: Sala et al. (2018) show that $b$-ary trees embed in $\mathbb{H}^2$ with distortion $1 + O(1/\log d)$. The volume-growth match is the geometric reason. This is the launching pad for hyperbolic-embeddings-for-graphs.
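The root-and-children step of such an embedding can be sketched directly: place $b$ children at hyperbolic distance $L$ from the origin in evenly spread directions, and the embedded child-to-child distance approaches the tree distance $2L$ as $L$ grows. A toy check under those assumptions (the helper `poincare_dist` and the construction are illustrative, not a full Sala et al. implementation):

```python
import math

def poincare_dist(u, v):
    """Poincare-disk distance, curvature -1."""
    duv = sum((a - b) ** 2 for a, b in zip(u, v))
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    return math.acosh(1 + 2 * duv / ((1 - uu) * (1 - vv)))

L = 10.0              # hyperbolic root-to-child edge length
r = math.tanh(L / 2)  # Euclidean radius lying at hyperbolic distance L from 0
b = 3                 # branching factor
children = [(r * math.cos(2 * math.pi * k / b),
             r * math.sin(2 * math.pi * k / b)) for k in range(b)]

# The tree distance between two children through the root is 2L; the embedded
# distance is already within a few percent of it, and the gap shrinks with L.
d01 = poincare_dist(children[0], children[1])
```

Recursing this construction with shrinking cone angles is exactly how Sarkar-style tree embeddings are built.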

Failure Mode

Hyperbolic embeddings help only when the data is approximately tree-like. For grid-structured data and other non-hierarchical metric structures the hyperbolic gain disappears and the Euclidean baseline is at least as good. Gromov's $\delta$-hyperbolicity of the data graph is the right diagnostic: small $\delta$ favours hyperbolic embedding, large $\delta$ does not.
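The four-point condition behind $\delta$-hyperbolicity can be computed exactly on a small graph from its distance matrix. A minimal diagnostic sketch (function name ours):

```python
from itertools import combinations

def delta_hyperbolicity(dist):
    """Gromov four-point delta from a full pairwise distance matrix dist[i][j].

    For every quadruple, the two largest of the three pairings
    d(x,y)+d(z,w), d(x,z)+d(y,w), d(x,w)+d(y,z) differ by at most 2*delta.
    """
    delta = 0.0
    for x, y, z, w in combinations(range(len(dist)), 4):
        s = sorted([dist[x][y] + dist[z][w],
                    dist[x][z] + dist[y][w],
                    dist[x][w] + dist[y][z]])
        delta = max(delta, (s[2] - s[1]) / 2)
    return delta

# Tree metrics give delta = 0; cycles do not.
path = [[abs(i - j) for j in range(6)] for i in range(6)]               # a path is a tree
cycle = [[min(abs(i - j), 6 - abs(i - j)) for j in range(6)] for i in range(6)]  # 6-cycle
```

The $O(n^4)$ loop is fine as a diagnostic on small graphs; for large graphs one estimates $\delta$ on random quadruples.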

Operations in the Poincare Ball

Practical hyperbolic ML works in the Poincaré ball $\mathbb{B}^n = \{x \in \mathbb{R}^n : \|x\| < 1\}$ (the $n$-dimensional analogue of the disk). The basic operations live in the gyrovector formalism (Ungar 2008, Ganea et al. 2018):

  • Möbius addition $u \oplus v$: the analogue of vector addition.
  • Exponential map $\exp_x^c(v)$: maps a tangent vector at $x$ to a point in $\mathbb{B}^n$. Replaces the Euclidean update $x + v$.
  • Logarithmic map $\log_x^c(y)$: the inverse, mapping a target point to the tangent vector that reaches it.
  • Hyperbolic distance $d^c(u, v)$: a curvature-$c$ rescaling of the Poincaré formula above.

A hyperbolic neural-network layer applies a linear map in the tangent space at the origin, then exponentiates back to the ball. Optimization uses Riemannian gradient methods with the hyperbolic metric.
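A minimal sketch of the $c = 1$ versions of these maps at the origin, in the conventions of Ganea et al. (2018) (function names ours; a real implementation would also handle exp/log at arbitrary base points and clamp points drifting to the boundary):

```python
import math

def mobius_add(u, v):
    """Mobius addition in the Poincare ball, curvature -1 (c = 1)."""
    uv = sum(a * b for a, b in zip(u, v))
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    denom = 1 + 2 * uv + uu * vv
    return tuple(((1 + 2 * uv + vv) * a + (1 - uu) * b) / denom
                 for a, b in zip(u, v))

def exp0(v):
    """Exponential map at the origin: tangent vector -> point in the ball."""
    n = math.sqrt(sum(a * a for a in v))
    if n == 0.0:
        return tuple(v)
    return tuple(math.tanh(n) * a / n for a in v)

def log0(y):
    """Logarithmic map at the origin: exact inverse of exp0."""
    n = math.sqrt(sum(a * a for a in y))
    if n == 0.0:
        return tuple(y)
    return tuple(math.atanh(n) * a / n for a in y)
```

A layer in the style described above computes `exp0(W @ log0(x))`: pull the point to the tangent space at the origin, apply the linear map there, push back to the ball.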

Spherical Geometry, Briefly

The sphere $\mathbb{S}^n$ is the constant-positive-curvature counterpart. Key facts: geodesics are great circles; the diameter is $\pi$; volumes are bounded. In ML, spherical geometry shows up via L2-normalized embeddings (face recognition with cosine similarity, normalized text embeddings) and in the von Mises–Fisher distribution for directional data. The closed, finite nature of $\mathbb{S}^n$ is a feature, not a bug, for problems with naturally bounded similarity.
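For L2-normalized embeddings the geodesic distance is just the angle between the vectors. A sketch (function name ours):

```python
import math

def sphere_dist(u, v):
    """Great-circle distance between unit vectors: the angle between them."""
    dot = max(-1.0, min(1.0, sum(a * b for a, b in zip(u, v))))  # clamp fp noise
    return math.acos(dot)

# The diameter pi is attained by antipodal points; no pair is farther apart,
# which is why bounded-similarity problems sit naturally on the sphere.
```
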

Common Confusions

Watch Out

Hyperbolic distance is not Euclidean distance scaled

Two points near the centre of the Poincaré disk have hyperbolic distance close to twice their Euclidean distance, but two points equally close in the Euclidean sense near the boundary can be arbitrarily far apart in the hyperbolic metric. The conformal factor depends on position. Reading off similarity from Euclidean distance in a hyperbolic embedding is a category error.
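The position dependence is easy to see numerically: the same Euclidean gap at two locations (the helper `poincare_dist` implements the distance formula above; names ours):

```python
import math

def poincare_dist(u, v):
    """Poincare-disk distance, curvature -1."""
    duv = sum((a - b) ** 2 for a, b in zip(u, v))
    uu = sum(a * a for a in u)
    vv = sum(b * b for b in v)
    return math.acosh(1 + 2 * duv / ((1 - uu) * (1 - vv)))

gap = 0.0005  # identical Euclidean separation in both cases
near_centre = poincare_dist((0.0, 0.0), (gap, 0.0))            # ~ 2 * gap
near_boundary = poincare_dist((0.999, 0.0), (0.999 + gap, 0.0))
```

`near_centre` comes out around 0.001 while `near_boundary` is around 0.69: the same Euclidean gap is several hundred times longer hyperbolically near the rim.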

Watch Out

Negative curvature is not 'inverse' positive curvature

Spherical geometry is closed, bounded, and self-intersecting (great circles meet); hyperbolic geometry is open, unbounded, and tree-like. They are not each other's mirror image. The sign of the curvature controls volume growth and geodesic spreading; the qualitative behaviour is asymmetric. Concretely, balls in $\mathbb{S}^n$ saturate, balls in $\mathbb{H}^n$ explode.

Exercises

Exercise (Core)

Problem

Compute the Poincaré-disk distance between the origin and the point $u = (r, 0)$ for $r \in (0, 1)$. Show that as $r \to 1$ the distance diverges to infinity, and compute the leading-order rate.

Exercise (Advanced)

Problem

Show that a complete binary tree of depth $d$ requires at least $\Omega(d)$ Euclidean dimensions to embed with constant multiplicative distortion, but embeds in $\mathbb{H}^2$ with constant distortion. Sketch the argument from volume growth.



Last reviewed: April 18, 2026
