We develop methods, benchmarks, and training systems that turn expert data into frontier AI

building benchmarks and collaborating with

Image
Image
Image
Image
Image
Image
Image
Image
Image
key research areas

Vision and impact

We help labs advance frontier models by working with domain experts to design and build complex, realistic datasets that drive model performance.

initiatives

Community and open science

Open benchmarks, conversations, and research for real-world AI performance.

Image

Open Benchmarks Grants

Backed by a $3M commitment, the program funds
open-source datasets, benchmarks, and evaluation artifacts that shape how frontier AI systems are built
and evaluated.

Image

Benchtalks

Our podcast series at the intersection of AI evaluation, data quality, and real-world impact.
Image

Reading Group

A recurring forum for researchers and practitioners to explore the latest frontier developments in AI while building meaningful connections within the community.

DEEP RESEARCH Expertise

Technical advisors and distinguished affiliates

Stephen Bach headshot

Stephen Bach

Brown University
Eliot Horowitz Assistant Professor, Computer Science Department
Jason Fries headshot

Jason Fries

Stanford University
Assistant Professor of Biomedical Data Science and of Medicine
Jared Dunnmon headshot

Jared Dunnmon

Co-Founder & Chief Scientist, Stealth Startup
Prev. Dir. of AI at DIU
Fred Sala headshot

Fred Sala

Chief Scientist
,
Snorkel AI
Assistant Professor @ University of Wisconsin-Madison
Chris Ré headshot

Chris Ré

Co-Founder
,
Snorkel AI
Professor @ Stanford University
Ludwig Schmidt headshot

Ludwig Schmidt

Stanford University · LAION
Stanford researcher and LAION collaborator
Karthik Narasimhan headshot

Karthik Narasimhan

Princeton University
Professor of Computer Science
Yu Su headshot

Yu Su

Ohio State University
Associate Professor of Computer Science and Engineering
Lewis Tunstall headshot

Lewis Tunstall

Hugging Face
Machine Learning Engineer
PUBLICATIONS

Browse research blogs
and academic papers

Type: All Types
Sort: Newest
Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Presenting Snorkel MeTal, an end-to-end system for multi-task learning.
Research Paper
Snorkel MeTaL: Weak Supervision for Multi-Task Learning

Presenting Snorkel MeTal, an end-to-end system for multi-task learning.

Dec 18, 2018
A. Ratner, et al, 2018
Learn more about Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Fonduer: Knowledge Base Construction From Richly Formatted Data
Introducing Fonduer, a machine-learning-based KBC system for richly formatted data.
Research Paper
Fonduer: Knowledge Base Construction From Richly Formatted Data

Introducing Fonduer, a machine-learning-based KBC system for richly formatted data.

Dec 17, 2018
S. Wu, et al, 2018
Learn more about Fonduer: Knowledge Base Construction From Richly Formatted Data
Deep Text Mining of Instagram Data Without Strong Supervision
This paper showcases methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain.
Research Paper
Deep Text Mining of Instagram Data Without Strong Supervision

This paper showcases methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain.

Dec 16, 2018
K. Hammar, et al, 2018
Learn more about Deep Text Mining of Instagram Data Without Strong Supervision
Snorkel: Fast Training Set Generation for Information Extraction
Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.
Research Paper
Snorkel: Fast Training Set Generation for Information Extraction

Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.

Dec 20, 2017
A. Ratner, et al, 2017
Learn more about Snorkel: Fast Training Set Generation for Information Extraction
Learning to Compose Domain-Specific Transformations for Data Augmentation
Automating data augmentation by learning a generative sequence model over user-specified transformation functions.
Research Paper
Learning to Compose Domain-Specific Transformations for Data Augmentation

Automating data augmentation by learning a generative sequence model over user-specified transformation functions.

Dec 19, 2017
A. Ratner, et al, 2017
Learn more about Learning to Compose Domain-Specific Transformations for Data Augmentation
Learning the Structure of Generative Models Without Labeled Data
Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.
Research Paper
Learning the Structure of Generative Models Without Labeled Data

Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.

Dec 18, 2017
S. Bach, et al, 2017
Learn more about Learning the Structure of Generative Models Without Labeled Data
Inferring Generative Model Structure With Static Analysis
Obtaining enough labeled data to robustly train complex discriminative models is a major bottleneck in the machine learning pipeline. A popular solution is combining multiple sources of weak supervision using generative models. The structure of these models affects the quality of the training labels, but is difficult to learn without any ground truth labels. We instead rely on weak supervision sources having some structure by virtue of being encoded programmatically. We present Coral, a paradigm that infers generative model structure by statically analyzing the code for these heuristics, thus significantly reducing the amount of data required to learn structure. We...
Research Paper
Inferring Generative Model Structure With Static Analysis

Obtaining enough labeled data to robustly train complex discriminative models is a major bottleneck in the machine learning pipeline. A popular solution is combining multiple sources of weak supervision using generative models. The structure of these models affects the quality of the training labels, but is difficult to learn without any ground truth labels. We instead rely on weak supervision…

Dec 17, 2017
P. Varma, et al, 2017
Learn more about Inferring Generative Model Structure With Static Analysis
Swellshark: A Generative Model for Biomedical Named Entity Recognition Without Labeled Data
Introducing SwellShark, a framework for building biomedical named entity recognition (NER) systems quickly.
Research Paper
Swellshark: A Generative Model for Biomedical Named Entity Recognition Without Labeled Data

Introducing SwellShark, a framework for building biomedical named entity recognition (NER) systems quickly.

Nov 13, 2017
J. Fries, et al, 2017
Learn more about Swellshark: A Generative Model for Biomedical Named Entity Recognition Without Labeled Data
Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data
A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to label training data. Prior work has explored learning accuracies for these sources even without ground truth labels, but they assume that a single accuracy parameter is sufficient to model the behavior of these sources over the entire training set. In particular, they fail to model latent subsets in the training data in which the supervision sources perform differently than on average. We present Socratic learning, a paradigm that...
Research Paper
Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data

A challenge in training discriminative models like neural networks is obtaining enough labeled training data. Recent approaches use generative models to combine weak supervision sources, like user-defined heuristics or knowledge bases, to label training data. Prior work has explored learning accuracies for these sources even without ground truth labels, but they assume that a single accuracy parameter is sufficient to…

Nov 13, 2017
P. Varma, et al, 2017
Learn more about Socratic Learning: Augmenting Generative Models to Incorporate Latent Subsets in Training Data
1 2 35 36

Let’s research together

Join our team of leading researchers and help shape the future of AI.