Skip to content
AI-Daily-Builder

2026-05-18 views

Recursive Superintelligence emerges with $650M to build self-improving AI

Read this because NVIDIA and AMD on the same cap table is the buried signal: the bet is on a workload (recursive search over architectures) that burns cycles on whichever silicon is available — not a model family loyal to one vendor.

$650M raise at $4.65B, pre-product, <30 staff. GV/Greycroft led; NVIDIA AND AMD both joined. Thesis: AI that automates its own architecture search.

A new frontier-model lab, Recursive Superintelligence, emerged from stealth on May 13 with $650M raised at a $4.65B valuation — pre-product, pre-revenue, fewer than 30 employees. The thesis is the most concrete bet so far on closing the loop from “AI assists research” to “AI does research.”

The cap table is the story

Lead investorsStrategic participants
GV (Alphabet)NVIDIA
GreycroftAMD

Two leading chip vendors on the same Series A is unusual. The usual pattern is exclusive — Microsoft + OpenAI, Google + Anthropic, Amazon + Anthropic — because the cloud underwrites both compute commit and equity in one line. Here, NVIDIA and AMD are both buying optionality on a workload that will run on whichever silicon delivers FLOPs first.

The founding team

FounderPrior role
Richard Socherex-Chief Scientist, Salesforce
Yuandong Tianex-Director, Meta FAIR
Tim Rocktäschelex-DeepMind
Jeff CluneDeepMind, OpenAI
Josh Tobinex-OpenAI
Tim Shiearly Cresta, OpenAI

A team weighted toward open-ended search and meta-learning (Clune, Rocktäschel) rather than the now-standard “post-train a base model” lineage.

What “recursive self-improvement” actually means in practice

The technical bet, distilled from public statements:

  1. Architecture search. Today, picking a new transformer variant takes a small army of researchers running ablations for months. The lab’s claim: a Level-1 system that proposes architectural changes, runs the experiments, and reads its own loss curves to decide what to keep.
  2. Training-recipe optimization. Learning-rate schedule, data mix, curriculum — the metalanguage of training itself becomes the optimization target, not just the weights.
  3. Evaluation generation. The model generates its own benchmark variants to detect overfitting and capability gaps that human-curated benchmarks miss.

If even part of this works at scale, the time-to-next-model collapses from quarters to weeks.

Roadmap

Why this matters now

Three reasons the timing is not random:

Practitioner note

For builders shipping today, the practical takeaways:

The under-considered angle: the team composition is a leading indicator on the field’s belief about RSI feasibility. When researchers with Clune-Tian-Rocktäschel pedigrees walk away from FAIR / DeepMind / OpenAI to bet a career on it, the field’s median estimate of “is this real?” has quietly moved.


Sources

Tags

Tip