Model Timeline
Ineffable Intelligence
British AI lab founded by David Silver (UCL professor, ex-DeepMind reinforcement-learning lead, AlphaGo / AlphaZero / AlphaProof). Announced a $1.1 billion seed round at $5.1 billion valuation on April 27, 2026 — co-led by Sequoia and Lightspeed, with Nvidia, Google, DST Global, Index Ventures, and the UK Sovereign AI Fund participating. Stated mission: build a 'superlearner' that acquires knowledge from its own experience rather than from human-generated data, instantiating the 'Era of Experience' research agenda.
Why This Matters
On April 27, 2026, Ineffable Intelligence announced a $1.1 billion seed round at a $5.1 billion valuation. Multiple outlets reported it as the largest seed round in European history. Two facts make this interesting beyond the headline number:
- The founder is David Silver, who led DeepMind's reinforcement-learning team for over a decade and is the central figure behind AlphaGo, AlphaZero, MuZero, and AlphaProof. The technical pedigree is unusually concentrated.
- The stated thesis is not "scale up the next transformer". It is that the current paradigm of training on curated human-generated data is approaching limits, and the next jump in capability comes from agents that learn from their own experience via reinforcement learning, with rewards grounded in interaction rather than imitation. Silver and Richard Sutton sketched this thesis in the 2025 essay "Welcome to the Era of Experience" (forthcoming in a MIT Press volume edited by George Konidaris); Ineffable Intelligence is the operational instantiation.
This page treats Ineffable Intelligence as a frontier-watch entry, not a recommendation. The company is a few months old at time of writing, has no published research under its own name, and has not shipped a product. Treat every claim about its capability as forward-looking until evaluated artifacts exist.
What is Verified
The verifiable facts as of April 28, 2026, drawn from primary business-press coverage (CNBC, TechCrunch, Bloomberg, SiliconANGLE, PYMNTS, EU-Startups, FoundersToday) and the company's own announcements:
| Field | Value |
|---|---|
| Founder | David Silver — UCL professor; ex-DeepMind RL team lead (~10+ years); principal investigator on AlphaGo, AlphaZero, AlphaProof |
| Headquarters | UK (specific city not disclosed in announcements) |
| Founded | Late 2025 (described as "a few months ago" relative to the April 27, 2026 announcement) |
| Round | $1.1 billion seed at a $5.1 billion post-money valuation |
| Co-leads | Sequoia Capital, Lightspeed Venture Partners |
| Other participants | Nvidia, Google, DST Global, Index Ventures, UK Sovereign AI Fund (also reported as British Business Bank participation) |
| Reported team | "Several former DeepMind staffers" joining the executive team; specific names not disclosed in coverage |
| Stated mission | "Make first contact with superintelligence" via a reinforcement-learning superlearner |
| Founder commitment | Silver has publicly stated that any personal compensation from Ineffable will be donated to high-impact charities |
Direct quotes from David Silver as reported in the coverage:
> Our mission is to make first contact with superintelligence. We are creating a superlearner that discovers all knowledge from its own experience, from elementary motor skills through to profound intellectual breakthroughs.

> If successful, this will represent a scientific breakthrough of comparable magnitude to Darwin: where his law explained all Life, our law will explain and build all Intelligence.
The Darwin analogy is editorial framing from the company, not a technical claim, and is best read as positioning rather than as a falsifiable statement.
The Technical Thesis: Era of Experience
The intellectual scaffold for Ineffable Intelligence is the Silver-Sutton essay Welcome to the Era of Experience (April 26, 2025; forthcoming in Designing an Intelligence, MIT Press, ed. George Konidaris). The essay proposes a periodization of recent AI history:
| Era | Approximate dates | Defining mechanism |
|---|---|---|
| Era of simulation | ~2015–2020 | Self-play and simulation environments (Atari, Go, Dota, StarCraft); RL as the dominant story |
| Era of human data | ~2020–present | Pretraining on internet-scale human-generated text and code; foundation models; instruction tuning; RLHF |
| Era of experience (proposed) | beginning now | Agents learning from their own interaction with environments; rewards grounded in real-world consequences rather than imitation of human behaviour |
The argument for the third era rests on two claims:
- Human-data scaling is approaching its ceiling. In domains where high-quality human text already exists (much of the web, code, math competitions), the supply is large but bounded; pretraining on it has captured most of the compressible signal. Adding more low-quality data degrades rather than improves.
- Continual self-generated experience is unbounded in principle. An agent that interacts with an environment, receives rewards, and updates can in principle generate arbitrarily much new training signal, with the data distribution adapting to the agent's current weakness rather than to whatever humans happened to write.
Both claims have prominent critics. The first is empirically contested (performance has continued to improve under scale; "ceiling" is hard to operationalize). The second is contested on grounds of reward hacking, distribution shift, and the difficulty of specifying useful reward functions in open-ended domains. Treat these as genuine disagreements in the field rather than settled questions.
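The second claim — training signal that adapts to the agent's current weakness — can be made concrete with a toy sketch. This is an illustration invented for this page, not anything from the essay: an agent queries a noisy environment and directs each new interaction at whichever state it has the least experience with, so its "dataset" grows without bound and follows its own gaps rather than a fixed corpus.

```python
import random

# Toy illustration (not from the Silver-Sutton essay): an agent generates
# its own training signal by interacting with a noisy environment, and its
# curriculum targets whatever it currently knows least about.
random.seed(0)

TRUE_VALUES = {s: s * 0.5 for s in range(10)}  # hidden environment values

def interact(state):
    """One step of experience: the environment returns a noisy observation."""
    return TRUE_VALUES[state] + random.gauss(0, 0.1)

estimates = {s: 0.0 for s in TRUE_VALUES}
visits = {s: 0 for s in TRUE_VALUES}

for _ in range(500):
    # Self-directed curriculum: probe the least-experienced state,
    # rather than sampling from a fixed human-written dataset.
    s = min(visits, key=visits.get)
    r = interact(s)
    visits[s] += 1
    # Incremental running mean of observations for this state.
    estimates[s] += (r - estimates[s]) / visits[s]

max_err = max(abs(estimates[s] - TRUE_VALUES[s]) for s in TRUE_VALUES)
print(f"max estimation error after 500 interactions: {max_err:.3f}")
```

The critics' objections map directly onto what this toy glosses over: here the "reward" is honest and the state space is enumerable, which is exactly what open-ended domains lack.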
For TheoremPath readers: the Era-of-Experience thesis is the intellectual cousin of Sutton's Bitter Lesson ("methods that leverage computation tend to dominate methods that leverage human knowledge") and inherits the same caveats. See bitter-lesson for the broader framing and the arguments against treating it as universal.
Continuity with Silver's Prior Work
Ineffable Intelligence's stated approach is recognizably continuous with Silver's published research at DeepMind. The salient line:
| Year | System | Mechanism | Relevance to "superlearner" |
|---|---|---|---|
| 2016 | AlphaGo | Supervised pretraining on human Go games + self-play RL with policy and value networks; Monte Carlo tree search at inference | Showed RL can reach superhuman play in a complex strategic domain, but still bootstrapped from human data |
| 2017 | AlphaGo Zero | Self-play RL only, no human games | First-class evidence that human bootstrapping can be removed without losing capability |
| 2017 | AlphaZero | Same architecture as AlphaGo Zero, generalized to chess and shogi | Demonstrated transfer of the self-play paradigm across rule-based domains |
| 2019 | MuZero | Model-based RL: the system learns the environment dynamics rather than receiving them as oracles | Reduces the dependence on hand-coded simulators; closer to "agent-learns-its-own-world-model" |
| 2024 | AlphaProof | Lean-grounded RL theorem proving; IMO 2024 silver medal (4 of 6 problems) | Extension of self-play / search to formal mathematics; rewards from a verifier instead of a game outcome |
The Ineffable thesis is, charitably, a bet that this trajectory generalizes from rule-based domains (Go, chess, theorem proving) to open-ended ones, and that the right combination of model-based RL, self-generated curriculum, and grounded rewards makes a superlearner viable in domains without a clean evaluator. That is a substantial extrapolation. AlphaZero worked because chess has a known win/loss signal. The hard part of Ineffable's stated agenda is specifying analogous reward signals for science, engineering, and "profound intellectual breakthroughs". See alphaproof-and-ai-theorem-proving for one concrete example of how a verifier-grounded reward looks in practice.
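The "clean win/loss signal" point can be made concrete with a toy: solving a game purely from its own terminal outcome, with no human examples anywhere. The sketch below uses exhaustive negamax search on Nim (take 1-3 stones; taking the last stone wins) — a far simpler relative of AlphaZero's self-play plus search, shown only to illustrate how a grounded terminal reward drives everything.

```python
from functools import lru_cache

# Toy sketch: Nim solved from the game's own win/loss signal alone.
# Not AlphaZero's algorithm; just an illustration of outcome-grounded reward.

@lru_cache(maxsize=None)
def value(stones):
    """+1 if the player to move wins with best play, -1 otherwise."""
    if stones == 0:
        return -1  # previous player took the last stone and won
    # A move is good if it leaves the opponent in a losing position.
    return max(-value(stones - take) for take in (1, 2, 3) if take <= stones)

def best_move(stones):
    """Pick the move whose resulting position is worst for the opponent."""
    moves = [t for t in (1, 2, 3) if t <= stones]
    return max(moves, key=lambda t: -value(stones - t))

# Known theory: positions that are multiples of 4 are lost for the mover.
print([n for n in range(1, 13) if value(n) == -1])  # → [4, 8, 12]
```

The hard part of Ineffable's agenda, as the paragraph above notes, is that science and engineering have no analogue of `value(0)` — no terminal state that unambiguously scores the trajectory.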
What is Not Yet Known
To avoid overreading the announcement, the explicit list of unknowns:
- No published research under the Ineffable name as of April 2026. All technical context comes from Silver's prior work or the Silver-Sutton essay, neither of which was published under the Ineffable Intelligence name.
- No disclosed product, benchmark, or capability claim. The announcement describes intent, not artifacts.
- No disclosed compute partnership specifics. Nvidia and Google appear as investors; specific compute commitments, contracts, or co-development arrangements were not disclosed in the coverage reviewed.
- No disclosed cofounders or executive team beyond Silver. Coverage reports "several former DeepMind staffers" joining; specific names and roles were not in any of the press articles surveyed.
- No disclosed location beyond "UK". No city, no campus, no office details.
- No disclosed timeline for the first publication, paper, or model release.
- No published safety, alignment, or governance commitments. The word "superintelligence" appears in the mission statement; the company has not published a corresponding safety position.
Each item is a thing to look for over the next 6-18 months. They are the natural milestones that would convert an announcement into evaluable research output.
Open Questions
For readers tracking the AI-labs landscape, three questions to keep in mind:
- Reward grounding in open domains. AlphaZero and AlphaProof work because the reward signal is unambiguous (game won, theorem verified). What is the analogous grounding for "scientific breakthrough"? Possibilities discussed in the literature include verifier-augmented domains (formal proofs, symbolic mathematics, compiled code), simulation-grounded domains (physics, biology, chemistry with computational solvers), and human-feedback hybrids (still not pure self-experience). Which path Ineffable takes is a substantive technical choice, not yet disclosed.
- Compute strategy. $1.1 billion seed plus Nvidia/Google as investors signals heavy compute. Whether the lab builds its own cluster, leases from Nvidia/Google partners, or pursues a hybrid strategy will shape its research velocity and partnership incentives.
- Relationship with DeepMind. Several DeepMind alumni joining plus Google as an investor creates a cooperative-yet-competitive structure. The historical precedent (DeepMind itself originated with university and industry collaboration before Google acquired it in 2014) suggests the boundary will need explicit definition, especially around IP, publication norms, and recruitment.
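Of the reward-grounding possibilities listed in the first question, the compiled-code option is the easiest to sketch. The toy below grades a candidate program against a test suite and pays reward only on a full pass; every name here is illustrative — nothing about Ineffable's actual reward design has been disclosed, and a real system would sandbox execution rather than call `exec` directly.

```python
# Toy sketch of a verifier-grounded reward for code generation.
# All names are illustrative; real systems would sandbox execution.

def verifier_reward(candidate_src, tests):
    """Execute candidate code and grade it against (input, expected) pairs."""
    namespace = {}
    try:
        exec(candidate_src, namespace)       # define the candidate function
        fn = namespace["solve"]
        return 1.0 if all(fn(x) == y for x, y in tests) else 0.0
    except Exception:
        return 0.0  # crashes and malformed programs earn nothing

TESTS = [(0, 0), (3, 9), (-2, 4)]  # spec: solve(x) should return x squared

good = "def solve(x):\n    return x * x\n"
bad = "def solve(x):\n    return x + x\n"

print(verifier_reward(good, TESTS))  # → 1.0
print(verifier_reward(bad, TESTS))   # → 0.0
```

The binary, machine-checkable signal is what makes this family of domains tractable for experience-driven RL; it is also why reward hacking (passing the tests without satisfying the spec) is a central critique of the thesis.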
How This Fits in the Path-Network
Ineffable Intelligence sits at an intersection that this site already covers separately:
- The bitter-lesson thesis is in bitter-lesson. The Era-of-Experience thesis is the natural successor argument: even computation that leverages human data eventually has to give way to computation that generates its own data.
- The RLHF and alignment machinery for "experience-driven" agents is covered in reinforcement-learning-from-human-feedback-deep-dive. Note the tension: Ineffable's thesis is that RL without human data is the next regime, which contrasts with the human-feedback paradigm that has dominated the last several years.
- The AlphaProof / formal-verification track of grounded-reward RL is in alphaproof-and-ai-theorem-proving. This is the most concrete published example of the kind of work Ineffable seems to want to scale.
- The AI-labs landscape entry in ai-labs-landscape places Ineffable alongside SSI and other early-stage frontier labs that have not yet shipped products.
Common Confusions
Largest seed in European history is not 'largest seed ever'
Coverage emphasizes the European-record framing. Comparators elsewhere (Inflection AI's raises before its dissolution, SSI's launch round) reached comparable or larger sums, and Mistral's 2023 seed held the previous European record. Ineffable's $1.1B is notable for European venture markets specifically, not unprecedented in global AI fundraising. Useful to read alongside the broader frontier-lab funding landscape, not in isolation.
'Superlearner' is a positioning term, not a defined technical concept
The word superlearner in the Ineffable announcements does not refer to a defined architecture or training paradigm. It is rhetorical positioning consistent with the Era-of-Experience essay. There is no peer-reviewed definition this page can point to. Treat it the way you would treat "AGI" in a press release: a directional claim about what the company wants to build, not a technical specification.
Founder pedigree is necessary, not sufficient
Silver's track record on AlphaGo, AlphaZero, and AlphaProof is real and well-documented. It does not by itself establish that Ineffable Intelligence will deliver on its stated mission. Cofounder-led labs with strong pedigrees have a wide outcome distribution; the relevant evidence will be artifacts, not announcements.
Era-of-Experience is a research thesis, not a settled paradigm
The Silver-Sutton essay is a position paper, not a peer-reviewed empirical demonstration. There is real disagreement in the field about whether human-data scaling is genuinely "running out" and whether self-generated experience produces useful gradient signal in open-ended domains. Read the essay, read the rebuttals (e.g., commentaries on the "grounded rewards" question), and form your own view.
References
Primary announcements (April 27, 2026):
- "Ex-DeepMind David Silver raises $1.1 billion for AI startup Ineffable." CNBC (April 27, 2026).
- "DeepMind's David Silver just raised $1.1B to build an AI that learns without human data." TechCrunch (April 27, 2026).
- "Ineffable Intelligence raises 5.1B valuation to build an AI 'superlearner'." SiliconANGLE (April 27, 2026).
- "Sequoia and Nvidia back David Silver's Ineffable Intelligence at $5.1B." The Next Web (April 27, 2026).
- "Sequoia and Nvidia Back Ex-DeepMind Researcher's New AI Startup at $5.1 Billion Value." Bloomberg (April 27, 2026).
- "DeepMind Vet's Ineffable Intelligence AI Startup Raises $1.1 Billion." PYMNTS (April 27, 2026).
Technical foundation:
- Silver, D., and Sutton, R. S. "Welcome to the Era of Experience." (April 26, 2025; forthcoming in Designing an Intelligence, ed. G. Konidaris, MIT Press). The intellectual scaffold for the Ineffable Intelligence agenda.
- Silver, D., et al. "Mastering the game of Go with deep neural networks and tree search." Nature 529 (2016): 484-489. AlphaGo.
- Silver, D., et al. "Mastering the game of Go without human knowledge." Nature 550 (2017): 354-359. AlphaGo Zero.
- Silver, D., et al. "A general reinforcement learning algorithm that masters chess, shogi, and Go through self-play." Science 362 (2018): 1140-1144. AlphaZero.
- Schrittwieser, J., et al. "Mastering Atari, Go, chess and shogi by planning with a learned model." Nature 588 (2020): 604-609. MuZero.
- DeepMind. "AlphaProof and AlphaGeometry 2 solve advanced reasoning problems in mathematics" (July 2024). AlphaProof / IMO 2024.
Adjacent context on the "Era of Experience" debate:
- "Grounded rewards in the era of experience: A commentary on Silver and Sutton, 'Welcome to the era of experience.'" Noumenal AI (2025). One of the more substantive critiques of the reward-grounding question.
- Sutton, R. S. "The Bitter Lesson." (March 2019). The intellectual ancestor; see bitter-lesson for the TheoremPath treatment.
Next Topics
- AI labs landscape for the broader frontier-lab map.
- AlphaProof and AI-assisted theorem proving for the most concrete published example of verifier-grounded RL, the family Ineffable seems to want to scale.
- Bitter lesson for the intellectual ancestor of the Era-of-Experience thesis.
- Reinforcement learning from human feedback (deep dive) for the human-feedback regime that the Era-of-Experience thesis positions itself against.
Last reviewed: April 28, 2026
Canonical graph
Required before and derived from this topic
These links come from prerequisite edges in the curriculum graph. Editorial suggestions are shown here only when the target page also cites this page as a prerequisite.
Required prerequisites
- Reinforcement Learning from Human Feedback (layer 5 · tier 1)
- AI Labs Landscape (layer 5 · tier 2)
Derived topics
No published topic currently declares this as a prerequisite.