We develop methods, benchmarks, and training systems that turn expert data into frontier AI

building benchmarks and collaborating with

Image
Image
Image
Image
Image
Image
Image
Image
Image
key research areas

Vision and impact

We help labs advance frontier models by working with domain experts to design and build complex, realistic datasets that drive model performance.

initiatives

Community and open science

Open benchmarks, conversations, and research for real-world AI performance.

Image

Open Benchmarks Grants

Backed by a $3M commitment, the program funds
open-source datasets, benchmarks, and evaluation artifacts that shape how frontier AI systems are built
and evaluated.

Image

Benchtalks

Our podcast series at the intersection of AI evaluation, data quality, and real-world impact.
Image

Reading Group

A recurring forum for researchers and practitioners to explore the latest frontier developments in AI while building meaningful connections within the community.

DEEP RESEARCH Expertise

Technical advisors and distinguished affiliates

Stephen Bach headshot

Stephen Bach

Brown University
Eliot Horowitz Assistant Professor, Computer Science Department
Jason Fries headshot

Jason Fries

Stanford University
Assistant Professor of Biomedical Data Science and of Medicine
Jared Dunnmon headshot

Jared Dunnmon

Co-Founder & Chief Scientist, Stealth Startup
Prev. Dir. of AI at DIU
Fred Sala headshot

Fred Sala

Chief Scientist, Snorkel AI
Assistant Professor @ University of Wisconsin-Madison
Chris Ré headshot

Chris Ré

Co-Founder, Snorkel AI
Professor @ Stanford University
Ludwig Schmidt headshot

Ludwig Schmidt

Stanford University · LAION
Stanford researcher and LAION collaborator
Karthik Narasimhan headshot

Karthik Narasimhan

Princeton University
Professor of Computer Science
Yu Su headshot

Yu Su

Ohio State University
Associate Professor of Computer Science and Engineering
Lewis Tunstall headshot

Lewis Tunstall

Hugging Face
Machine Learning Engineer
PUBLICATIONS

Browse research blogs
and academic papers

Type: All Types
Sort: Newest
Interactive Programmatic Labeling for Weak Supervision
Research Paper
Interactive Programmatic Labeling for Weak Supervision

Demonstrating in synthetic and real-world experiments how two simple labeling function acquisition strategies outperform a random baseline.

Dec 08, 2019
B. Cohen-Wang, et al, 2019
Learn more about Interactive Programmatic Labeling for Weak Supervision
Bootstrapping Conversational Agents with Weak Supervision
Research Paper
Bootstrapping Conversational Agents with Weak Supervision

This paper presents a framework called search, label, and propagate (SLP) for bootstrapping intents from existing chat logs using weak supervision.

Dec 07, 2019
N. Mallinar, et al, 2019
Learn more about Bootstrapping Conversational Agents with Weak Supervision
A Machine-Compiled Database of Genome-Wide Association Studies
Research Paper
A Machine-Compiled Database of Genome-Wide Association Studies

Describing GWASkb, a machine-compiled knowledge base of genetic associations collected from the scientific literature using automated information extraction algorithms.

Dec 06, 2019
V. Kuleshov, et al, 2019
Learn more about A Machine-Compiled Database of Genome-Wide Association Studies
A Clinical Text Classification Paradigm Using Weak Supervision…
Research Paper
A Clinical Text Classification Paradigm Using Weak Supervision…

This work develops a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models.

Dec 05, 2019
Y. Wang, et al, 2019
Learn more about A Clinical Text Classification Paradigm Using Weak Supervision…
Training Classifiers with Natural Language Explanations
Research Paper
Training Classifiers with Natural Language Explanations

Introducing BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision.

Dec 20, 2018
B. Hancock, et al, 2018
Learn more about Training Classifiers with Natural Language Explanations
Software 2.0 and Snorkel: Beyond Hand-Labeled Data
Research Paper
Software 2.0 and Snorkel: Beyond Hand-Labeled Data

This paper describes Snorkel, a system that enables users to help shape, create, and manage training data for Software 2.0 stacks.

Dec 19, 2018
C. Ré, 2018 (invited)
Learn more about Software 2.0 and Snorkel: Beyond Hand-Labeled Data
Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Research Paper
Snorkel MeTaL: Weak Supervision for Multi-Task Learning

Presenting Snorkel MeTal, an end-to-end system for multi-task learning.

Dec 18, 2018
A. Ratner, et al, 2018
Learn more about Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Fonduer: Knowledge Base Construction From Richly Formatted Data
Research Paper
Fonduer: Knowledge Base Construction From Richly Formatted Data

Introducing Fonduer, a machine-learning-based KBC system for richly formatted data.

Dec 17, 2018
S. Wu, et al, 2018
Learn more about Fonduer: Knowledge Base Construction From Richly Formatted Data
Deep Text Mining of Instagram Data Without Strong Supervision
Research Paper
Deep Text Mining of Instagram Data Without Strong Supervision

This paper showcases methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain.

Dec 16, 2018
K. Hammar, et al, 2018
Learn more about Deep Text Mining of Instagram Data Without Strong Supervision
1 2 34 35

Let’s research together

Join our team of leading researchers and help shape the future of AI.