resources

Resource library

Explore our complete library of resources including blogs, benchmarks, research papers, and more.

Image for Evaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark
Blog

Evaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark

Announcing a $3M commitment to launch Open Benchmarks Grants
September 30, 2025
Image for Closing the Evaluation Gap in Agentic AI
Blog

Closing the Evaluation Gap in Agentic AI

Announcing a $3M commitment to launch Open Benchmarks Grants

February 11, 2026
Image for Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory
Blog

Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory

Announcing a $3M commitment to launch Open Benchmarks Grants
March 31, 2026
Image for Building FinQA: An Open RL Environment for Financial Reasoning Agents
Blog

Building FinQA: An Open RL Environment for Financial Reasoning Agents

Announcing a $3M commitment to launch Open Benchmarks Grants
March 30, 2026
Image for The science of rubric design
Blog

The science of rubric design

Announcing a $3M commitment to launch Open Benchmarks Grants
September 11, 2025
of
Type: All Types
Sort: Newest
Interactive Programmatic Labeling for Weak Supervision
Demonstrating in synthetic and real-world experiments how two simple labeling function acquisition strategies outperform a random baseline.
Research Paper
Interactive Programmatic Labeling for Weak Supervision

Demonstrating in synthetic and real-world experiments how two simple labeling function acquisition strategies outperform a random baseline.

Dec 08, 2019
B. Cohen-Wang, et al, 2019
Learn more about Interactive Programmatic Labeling for Weak Supervision
Bootstrapping Conversational Agents with Weak Supervision
This paper presents a framework called search, label, and propagate (SLP) for bootstrapping intents from existing chat logs using weak supervision.
Research Paper
Bootstrapping Conversational Agents with Weak Supervision

This paper presents a framework called search, label, and propagate (SLP) for bootstrapping intents from existing chat logs using weak supervision.

Dec 07, 2019
N. Mallinar, et al, 2019
Learn more about Bootstrapping Conversational Agents with Weak Supervision
A Machine-Compiled Database of Genome-Wide Association Studies
Describing GWASkb, a machine-compiled knowledge base of genetic associations collected from the scientific literature using automated information extraction algorithms.
Research Paper
A Machine-Compiled Database of Genome-Wide Association Studies

Describing GWASkb, a machine-compiled knowledge base of genetic associations collected from the scientific literature using automated information extraction algorithms.

Dec 06, 2019
V. Kuleshov, et al, 2019
Learn more about A Machine-Compiled Database of Genome-Wide Association Studies
A Clinical Text Classification Paradigm Using Weak Supervision…
This work develops a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models.
Research Paper
A Clinical Text Classification Paradigm Using Weak Supervision…

This work develops a rule-based NLP algorithm to automatically generate labels for the training data, and then use the pre-trained word embeddings as deep representation features for training machine learning models.

Dec 05, 2019
Y. Wang, et al, 2019
Learn more about A Clinical Text Classification Paradigm Using Weak Supervision…
Training Classifiers with Natural Language Explanations
Training accurate classifiers requires many labels, but each label provides only limited information (one bit for binary classification). In this work, we propose BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision. A semantic parser converts these explanations into programmatic labeling functions that generate noisy labels for an arbitrary amount of unlabeled data, which is used to train a classifier. On three relation extraction tasks, we find that users are able to train classifiers with comparable F1 scores from 5–100× faster by providing explanations instead of just labels. Furthermore, given...
Research Paper
Training Classifiers with Natural Language Explanations

Training accurate classifiers requires many labels, but each label provides only limited information (one bit for binary classification). In this work, we propose BabbleLabble, a framework for training classifiers in which an annotator provides a natural language explanation for each labeling decision. A semantic parser converts these explanations into programmatic labeling functions that generate noisy labels for an arbitrary amount…

Dec 20, 2018
B. Hancock, et al, 2018
Learn more about Training Classifiers with Natural Language Explanations
Software 2.0 and Snorkel: Beyond Hand-Labeled Data
This paper describes Snorkel, a system that enables users to help shape, create, and manage training data for Software 2.0 stacks.
Research Paper
Software 2.0 and Snorkel: Beyond Hand-Labeled Data

This paper describes Snorkel, a system that enables users to help shape, create, and manage training data for Software 2.0 stacks.

Dec 19, 2018
C. Ré, 2018 (invited)
Learn more about Software 2.0 and Snorkel: Beyond Hand-Labeled Data
Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Presenting Snorkel MeTal, an end-to-end system for multi-task learning.
Research Paper
Snorkel MeTaL: Weak Supervision for Multi-Task Learning

Presenting Snorkel MeTal, an end-to-end system for multi-task learning.

Dec 18, 2018
A. Ratner, et al, 2018
Learn more about Snorkel MeTaL: Weak Supervision for Multi-Task Learning
Fonduer: Knowledge Base Construction From Richly Formatted Data
Introducing Fonduer, a machine-learning-based KBC system for richly formatted data.
Research Paper
Fonduer: Knowledge Base Construction From Richly Formatted Data

Introducing Fonduer, a machine-learning-based KBC system for richly formatted data.

Dec 17, 2018
S. Wu, et al, 2018
Learn more about Fonduer: Knowledge Base Construction From Richly Formatted Data
Deep Text Mining of Instagram Data Without Strong Supervision
This paper showcases methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain.
Research Paper
Deep Text Mining of Instagram Data Without Strong Supervision

This paper showcases methods for unsupervised mining of fashion attributes from Instagram text, which can enable a new kind of user recommendation in the fashion domain.

Dec 16, 2018
K. Hammar, et al, 2018
Learn more about Deep Text Mining of Instagram Data Without Strong Supervision
1 2 63 64
Image

Join our newsletter

For expert advice, the latest research, and exclusive events.
By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.