stephen bach (steve bach)
author

Stephen Bach

Applied Research Scientist
,
Brown University
Eliot Horowitz Assistant Professor, Computer Science Department

Stephen Bach is the Eliot Horowitz Assistant Professor in the Computer Science Department at Brown University. Previously, he was a visiting scholar at Google, and a postdoctoral scholar in the computer science department at Stanford University advised by Christopher Ré.

He received his Ph.D. in computer science from the University of Maryland, where he was advised by Lise Getoor. His research focuses on weakly supervised, zero-shot, and few-shot machine learning. The goal of his work is to create methods and systems that drive down the labor cost of AI. He was a core contributor to the Snorkel framework, which was recognized with a Best of VLDB 2018 award. He also co-led the team that developed the T0 family of large language models. The team was also one of the proposers of instruction tuning, which is the process of fine-tuning language models with supervised training to follow instructions. Instruction tuning is now a standard part of training large language models. Stephen is also an advisor to Snorkel AI.

The latest from Stephen

Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees
This paper presents a rigorous approach for using a set of arbitrarily correlated weak supervision sources in order to solve a multiclass classification task when only a very small set of labeled data is available
Research Paper
Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees

This paper presents a rigorous approach for using a set of arbitrarily correlated weak supervision sources in order to solve a multiclass classification task when only a very small set of labeled data is available

May 11, 2021
Snorkel Team
Learn more about Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees
Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels
In this paper, we propose a learning algorithm for training deep neural networks when there is not sufficient labeled data. To improve the generalization capabilities of the deep model, we adopt a learning scheme to train two related tasks simultaneously. One is the original task (target), and the other is an auxiliary task (source). In order to create a related auxiliary task, we leverage an available knowledge graph to query for semantically related concepts that are grounded in labeled images; hence we call our method KGAuxLearn. We jointly train the target and source tasks in a multi-task architecture. We evaluate...
Research Paper
Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels

In this paper, we propose a learning algorithm for training deep neural networks when there is not sufficient labeled data. To improve the generalization capabilities of the deep model, we adopt a learning scheme to train two related tasks simultaneously. One is the original task (target), and the other is an auxiliary task (source). In order to create a related…

Jul 28, 2020
Snorkel Team
Learn more about Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels
Weakly Supervised Sequence Tagging from Noisy Rules
We propose a framework for training sequence tagging models with weak supervision consisting of multiple heuristic rules of unknown accuracy. In addition to supporting rules that vote on tags in the output sequence, we introduce a new type of weak supervision, called linking rules, that vote on how sequence elements should be grouped into spans with the same tag. These rules are an alternative to candidate span generators that require significantly more human effort. To estimate the accuracies of the rules and combine their conflicting outputs into training data, we introduce a new type of generative model, linked hidden Markov...
Research Paper
Weakly Supervised Sequence Tagging from Noisy Rules

We propose a framework for training sequence tagging models with weak supervision consisting of multiple heuristic rules of unknown accuracy. In addition to supporting rules that vote on tags in the output sequence, we introduce a new type of weak supervision, called linking rules, that vote on how sequence elements should be grouped into spans with the same tag. These…

Apr 03, 2020
E. Safranchik, et al.
Learn more about Weakly Supervised Sequence Tagging from Noisy Rules
Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale
This is first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for this setting
Research Paper
Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

This is first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for this setting

Dec 15, 2019
S. Bach, et al, 2019
Learn more about Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale
Snorkel: Fast Training Set Generation for Information Extraction
Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.
Research Paper
Snorkel: Fast Training Set Generation for Information Extraction

Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.

Dec 20, 2017
A. Ratner, et al, 2017
Learn more about Snorkel: Fast Training Set Generation for Information Extraction
Learning the Structure of Generative Models Without Labeled Data
Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.
Research Paper
Learning the Structure of Generative Models Without Labeled Data

Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.

Dec 18, 2017
S. Bach, et al, 2017
Learn more about Learning the Structure of Generative Models Without Labeled Data
Snorkel: Rapid Training Data Creation With Weak Supervision
This paper presents a flexible interface layer to write labeling functions based on experience.
Research Paper
Snorkel: Rapid Training Data Creation With Weak Supervision

This paper presents a flexible interface layer to write labeling functions based on experience.

Oct 04, 2017
Alexander Ratner, Stephen H Bach, Henry Ehrenberg, Jason Fries, Sen Wu, Christopher Ré
Learn more about Snorkel: Rapid Training Data Creation With Weak Supervision
1 2 3 4

For models that need to be right. Not just good enough.