Fred Sala

Anomaly Detection with Multiple Reference Datasets

This paper proposes generalizations of CWOLA and SALAD, which exploit multiple reference datasets to improve performance in resonant anomaly detection, and provides finite-sample guarantees to go beyond existing asymptotic analyses.

Research Paper

Anomaly Detection with Multiple Reference Datasets

This paper proposes generalizations of CWOLA and SALAD, which exploit multiple reference datasets to improve performance in resonant anomaly detection, and provides finite-sample guarantees to go beyond existing asymptotic analyses.

Mar 15, 2023 •

Snorkel Team

Learn more about Anomaly Detection with Multiple Reference Datasets

Ask Me Anything: A simple strategy for prompting language models.

This paper proposes "Ask Me Anything" (AMA), a prompting method that uses weak supervision to combine noisy predictions from multiple prompts generated from an LLM, resulting in an average 10.2% performance lift over the few-shot baseline across a variety of different open-source models.

Research Paper

Ask Me Anything: A simple strategy for prompting language models.

This paper proposes “Ask Me Anything” (AMA), a prompting method that uses weak supervision to combine noisy predictions from multiple prompts generated from an LLM, resulting in an average 10.2% performance lift over the few-shot baseline across a variety of different open-source models.

Mar 15, 2023 •

S. Arora, et al.

Learn more about Ask Me Anything: A simple strategy for prompting language models.

AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

AutoWS-Bench-101 is a framework for evaluating automated weak supervision techniques compared to other baseline methods such as zero-shot foundation models and supervised learning, in order to help practitioners choose the best method to generate additional labels.

Research Paper

AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

AutoWS-Bench-101 is a framework for evaluating automated weak supervision techniques compared to other baseline methods such as zero-shot foundation models and supervised learning, in order to help practitioners choose the best method to generate additional labels.

Mar 15, 2023 •

Snorkel Team

Learn more about AutoWS-Bench-101: Benchmarking Automated Weak Supervision with 100 Labels

Lifting Weak Supervision To Structured Prediction

This paper finds that weak supervision can be used beyond classification applications, including rankings, graphs, and manifolds, and can provide generalization guarantees nearly identical to models trained on clean data.

Research Paper

Lifting Weak Supervision To Structured Prediction

This paper finds that weak supervision can be used beyond classification applications, including rankings, graphs, and manifolds, and can provide generalization guarantees nearly identical to models trained on clean data.

Mar 15, 2023 •

Vishwakarma, et al

Learn more about Lifting Weak Supervision To Structured Prediction

Generative Modeling Helps Weak Supervision (and Vice Versa)

This work proposes and theoretically justifies a model that fuses weak supervision and generative adversarial networks to improve the estimate of unobserved labels and data augmentation, outperforming baseline weak supervision models on multiclass image classification datasets.

Research Paper

Generative Modeling Helps Weak Supervision (and Vice Versa)

This work proposes and theoretically justifies a model that fuses weak supervision and generative adversarial networks to improve the estimate of unobserved labels and data augmentation, outperforming baseline weak supervision models on multiclass image classification datasets.

Mar 15, 2023 •

B. Boecking, et al

Learn more about Generative Modeling Helps Weak Supervision (and Vice Versa)

Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision

Liger, a combination of foundation models and weak supervision frameworks, improves existing weak supervision techniques by partitioning the embedding space and extending source votes in embedding space, resulting in improved performance on six benchmark NLP and video tasks.

Research Paper

Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision

Liger, a combination of foundation models and weak supervision frameworks, improves existing weak supervision techniques by partitioning the embedding space and extending source votes in embedding space, resulting in improved performance on six benchmark NLP and video tasks.

Mar 15, 2023 •

M. Chen, et al

Learn more about Shoring Up the Foundations: Fusing Model Embeddings and Weak Supervision

Blog

Auto LF generation: Lots of little models, big benefits

Constructing labeling functions (LFs) is at the heart of using weak supervision. We often think of these labeling functions as programmatic expressions of domain expertise or heuristics. Indeed, much of the advantage of weak supervision is that we can save time—writing labeling functions and applying them to data at scale is much more efficient compared to hand-labeling huge numbers of…

May 31, 2022 •

Fred Sala

Learn more about Auto LF generation: Lots of little models, big benefits

Universalizing Weak Supervision

This paper proposes a universal technique that enables weak supervision over any label type while still offering desirable properties, including practical flexibility, computational efficiency, and theoretical guarantees.

Research Paper

Universalizing Weak Supervision

This paper proposes a universal technique that enables weak supervision over any label type while still offering desirable properties, including practical flexibility, computational efficiency, and theoretical guarantees.

Apr 04, 2022 •

C. Shin, et al

Learn more about Universalizing Weak Supervision

Hidden network generating rules from partially observed complex networks

Complex biological, neuroscience, geoscience and social networks exhibit heterogeneous self-similar higher order topological structures that are usually characterized as being multifractal in nature. However, describing their topological complexity through a compact mathematical description and deciphering their topological governing rules has remained elusive and prevented a comprehensive understanding of networks. To overcome this challenge, we propose a weighted multifractal graph model capable of capturing the underlying generating rules of complex systems and characterizing their node heterogeneity and pairwise interactions. To infer the generating measure with hidden information, we introduce a variational expectation maximization framework. We demonstrate the robustness of the network...

Research Paper

Hidden network generating rules from partially observed complex networks

Complex biological, neuroscience, geoscience and social networks exhibit heterogeneous self-similar higher order topological structures that are usually characterized as being multifractal in nature. However, describing their topological complexity through a compact mathematical description and deciphering their topological governing rules has remained elusive and prevented a comprehensive understanding of networks. To overcome this challenge, we propose a weighted multifractal graph model…

Sep 01, 2021 •

R. Yang, et al.

Learn more about Hidden network generating rules from partially observed complex networks

Fred Sala

The latest from Fred

For models that need to be right. Not just good enough.

How do you want to work with Snorkel?