Stephen Bach

Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees

This paper presents a rigorous approach for using a set of arbitrarily correlated weak supervision sources in order to solve a multiclass classification task when only a very small set of labeled data is available

Research Paper

Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees

This paper presents a rigorous approach for using a set of arbitrarily correlated weak supervision sources in order to solve a multiclass classification task when only a very small set of labeled data is available

May 11, 2021 •

Snorkel Team

Learn more about Adversarial Multiclass Learning under Weak Supervision with Performance Guarantees

Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels

In this paper, we propose a learning algorithm for training deep neural networks when there is not sufficient labeled data. To improve the generalization capabilities of the deep model, we adopt a learning scheme to train two related tasks simultaneously. One is the original task (target), and the other is an auxiliary task (source). In order to create a related auxiliary task, we leverage an available knowledge graph to query for semantically related concepts that are grounded in labeled images; hence we call our method KGAuxLearn. We jointly train the target and source tasks in a multi-task architecture. We evaluate...

Research Paper

Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels

In this paper, we propose a learning algorithm for training deep neural networks when there is not sufficient labeled data. To improve the generalization capabilities of the deep model, we adopt a learning scheme to train two related tasks simultaneously. One is the original task (target), and the other is an auxiliary task (source). In order to create a related…

Jul 28, 2020 •

Snorkel Team

Learn more about Selecting Auxiliary Data Using Knowledge Graphs for Image Classification with Limited Labels

Weakly Supervised Sequence Tagging from Noisy Rules

We propose a framework for training sequence tagging models with weak supervision consisting of multiple heuristic rules of unknown accuracy. In addition to supporting rules that vote on tags in the output sequence, we introduce a new type of weak supervision, called linking rules, that vote on how sequence elements should be grouped into spans with the same tag. These rules are an alternative to candidate span generators that require significantly more human effort. To estimate the accuracies of the rules and combine their conflicting outputs into training data, we introduce a new type of generative model, linked hidden Markov...

Research Paper

Weakly Supervised Sequence Tagging from Noisy Rules

We propose a framework for training sequence tagging models with weak supervision consisting of multiple heuristic rules of unknown accuracy. In addition to supporting rules that vote on tags in the output sequence, we introduce a new type of weak supervision, called linking rules, that vote on how sequence elements should be grouped into spans with the same tag. These…

Apr 03, 2020 •

E. Safranchik, et al.

Learn more about Weakly Supervised Sequence Tagging from Noisy Rules

Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

This is first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for this setting

Research Paper

Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

This is first-of-its-kind study showing how existing knowledge resources from across an organization can be used as weak supervision in order to bring development time and cost down by an order of magnitude, and introduce Snorkel DryBell, a new weak supervision management system for this setting

Dec 15, 2019 •

S. Bach, et al, 2019

Learn more about Snorkel DryBell: A Case Study in Deploying Weak Supervision at Industrial Scale

Snorkel: Fast Training Set Generation for Information Extraction

Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.

Research Paper

Snorkel: Fast Training Set Generation for Information Extraction

Introducing Snorkel, a new system for quickly creating, managing, and modeling training datasets.

Dec 20, 2017 •

A. Ratner, et al, 2017

Learn more about Snorkel: Fast Training Set Generation for Information Extraction

Learning the Structure of Generative Models Without Labeled Data

Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.

Research Paper

Learning the Structure of Generative Models Without Labeled Data

Proposing a structure estimation method that is 100x faster than a maximum likelihood approach for training data.

Dec 18, 2017 •

S. Bach, et al, 2017

Learn more about Learning the Structure of Generative Models Without Labeled Data

Snorkel: Rapid Training Data Creation With Weak Supervision

This paper presents a flexible interface layer to write labeling functions based on experience.

Research Paper

Snorkel: Rapid Training Data Creation With Weak Supervision

This paper presents a flexible interface layer to write labeling functions based on experience.

Oct 04, 2017 •

Alexander Ratner, Stephen H Bach, Henry Ehrenberg, Jason Fries, Sen Wu, Christopher Ré

Learn more about Snorkel: Rapid Training Data Creation With Weak Supervision

Stephen Bach

The latest from Stephen

For models that need to be right. Not just good enough.

How do you want to work with Snorkel?