Quanta — Open AI Research

We build open models and publish the parts most labs keep quiet.

What we do

Quanta is an independent research lab studying the foundations of machine intelligence. We train open models, publish negative results, and release the weights, the data provenance, and the eval suite alongside every paper.

Why we exist

Frontier capability is concentrating in a handful of closed systems. We believe the most important ideas in this field — how to align, interpret, and trust a model — should be public knowledge, owned by no one.

14 open models released
63 papers since 2022
1.2T tokens in Q-Corpus v3
0 proprietary training data

Mechanistic Interpretability

How do models actually reason?

We reverse-engineer the circuits inside transformer models — finding the features, heads, and computational motifs that produce behavior. The goal is not description but prediction: a theory of what a model will do before it does it, and a toolchain that lets any researcher inspect a forward pass at the level of individual tokens.

SAEs, Circuits, Probing

Scalable Alignment

Can supervision scale with capability?

As models grow more capable, human feedback becomes a bottleneck and a liability. We study scalable oversight — debate, recursive reward modeling, process-based supervision — and the theoretical question underneath them: under what conditions can a weaker system reliably supervise a stronger one? We release the training stacks, not just the results.

RLHF, Debate, Oversight

Efficient Pretraining

What can a small model know?

Frontier capability should not require frontier compute. We study the scaling laws, data curation, and architectural choices that let a 7B model match a 70B model on the tasks that matter — and the regimes where scale is genuinely irreducible. Every Quanta base model is reproducible from a single H100 node in under 72 hours.

Scaling, Distillation, Curriculum

Capability without transparency is just leverage.

Three threads, woven into one fabric.

Selected work, last twelve months.

We are hiring researchers who want to be wrong in public.