Independent AI research lab · Est. 2022 · San Francisco
We build open models
and publish the partsmost labs keep quiet.
What we do
Quanta is an independent research lab studying the foundations of machine intelligence. We train open models, publish negative results, and release the weights, the data provenance, and the eval suite alongside every paper.
Why we exist
Frontier capability is concentrating in a handful of closed systems. We believe the most important ideas in this field — how to align, interpret, and trust a model — should be public knowledge, owned by no one.
14 open models released63 papers since 20221.2T tokens in Q-Corpus v30 proprietary training data
§ 01 / Mission
Capability without transparency is just leverage.
The dominant approach to building AI treats the model as a product: opaque, optimized for benchmark scores, and shipped without the engineering context that produced it. We think this is the wrong default for a technology that is reshaping how millions of people reason, write, and decide.
So we do the opposite. Every Quanta release ships with training traces, failure analyses, and the exact code that produced every number in the paper. When we are wrong, we say so in print. When a method does not scale, we publish the negative result. When a model behaves unexpectedly, we publish the activations.
This is slower. It is also, we think, the only honest way to do this work — and the fastest path to actually understanding these systems rather than merely operating them.
§ 02 / Research directions
Three threads, woven into one fabric.
01
Mechanistic Interpretability
How do models actually reason?
We reverse-engineer the circuits inside transformer models — finding the features, heads, and computational motifs that produce behavior. The goal is not description but prediction: a theory of what a model will do before it does it, and a toolchain that lets any researcher inspect a forward pass at the level of individual tokens.
SAEsCircuitsProbing
02
Scalable Alignment
Can supervision scale with capability?
As models grow more capable, human feedback becomes a bottleneck and a liability. We study scalable oversight — debate, recursive reward modeling, process-based supervision — and the theoretical question underneath them: under what conditions can a weaker system reliably supervise a stronger one? We release the training stacks, not just the results.
RLHFDebateOversight
03
Efficient Pretraining
What can a small model know?
Frontier capability should not require frontier compute. We study the scaling laws, data curation, and architectural choices that let a 7B model match a 70B model on the tasks that matter — and the regimes where scale is genuinely irreducible. Every Quanta base model is reproducible from a single H100 node in under 72 hours.
Recursive oversight under distribution shift: a negative result and a path forward
S. Park, A. Volkov, J. Mwangi, T. Halvorsen
arXiv:2502.10411 Preprint · 41pp
→
2025 · 01
Q-Base 14B: training a frontier-class model on 1.2T curated tokens
K. Iyer, M. Vasquez, R. Chen, L. Okafor, the Quanta team
Technical report Weights + code · 38pp
→
2024 · 11
In-context learning as gradient descent: a circuit-level analysis
R. Chen, M. Vasquez, A. Volkov
NeurIPS 2024 Oral · 18pp
→
2024 · 09
Process supervision does not transfer: evidence from six reasoning domains
J. Mwangi, T. Halvorsen, S. Park
arXiv:2409.08812 Preprint · 27pp
→
2024 · 06
Curriculum effects in large-scale pretraining: a controlled study
M. Vasquez, K. Iyer, R. Chen
ICML 2024 Poster · 22pp
→
§ 04 / Careers
We are hiring researchers who want to be wrong in public.
We are a small team — eighteen people, one floor, no managers between you and the work. We pay top of market, publish everything, and measure success in ideas that change how the field thinks, not press releases.
If you are a researcher or engineer who reads a paper and immediately wants to find its breaking point, we would like to meet you. We sponsor visas, hire without credentials, and run paid two-week research residencies for people considering a switch from industry.