RESOURCES

Blog

Ideas, updates, and practical guidance from the Snorkel team.

Image for Closing the Evaluation Gap in Agentic AI

Closing the Evaluation Gap in Agentic AI

Announcing a $3M commitment to launch Open Benchmarks Grants

February 11, 2026
All articles
Sort: Newest
Improving upon Precision, Recall, and F1 with Gain metrics
Improving upon Precision, Recall, and F1 with Gain metrics

This blog post introduces variants of Precision, Recall, and F1 metrics called Precision Gain, Recall Gain, and F1 Gain. The gain variants have desirable properties such as meaningful linear interpolation of PR curves and a universal baseline across tasks. This post explains what these benefits mean for you, how the gain metrics are calculated and outline some examples for intuitive comparison. 

Sep 08, 2022
Learn more about Improving upon Precision, Recall, and F1 with Gain metrics
Summer 2022 Snorkel Flow release roundup
Summer 2022 Snorkel Flow release roundup

On the heels of the second annual Future of Data-Centric AI event, we’re energized by what we learned from data scientists, machine learning engineers, and AI leaders who are adopting data-centric approaches to accelerate AI success. The Snorkel Flow platform provides these teams with a seamless workflow across training data creation, model training, and analysis—the scaffolding to make data-centric AI…

Aug 30, 2022
Learn more about Summer 2022 Snorkel Flow release roundup
Introducing Continuous Model Feedback to drive rapid data quality improvement
Introducing Continuous Model Feedback to drive rapid data quality improvement

Continuous Model Feedback, available in beta as part of the new Studio experience, is Snorkel Flow’s latest capabilities to make training data creation and model development more integrated, automated, and guided.

Aug 29, 2022
Learn more about Introducing Continuous Model Feedback to drive rapid data quality improvement
The Future of Data-Centric AI 2022 day 2 highlights
The Future of Data-Centric AI 2022 day 2 highlights

Snorkel AI just hosted the second day of The Future of Data-Centric AI conference 2022. Across 40+ sessions, 50+ Data scientists, ML engineers, and AI leaders came together to share insights, best practices, and research on adopting data-centric approaches with thousands of attendees from all around the world. Aarti Bagul, a Snorkel AI ML Solutions Engineer and one of the…

Aug 05, 2022
Learn more about The Future of Data-Centric AI 2022 day 2 highlights
The Future of Data-Centric AI 2022 day 1 highlights
The Future of Data-Centric AI 2022 day 1 highlights

Snorkel AI just hosted the first day of The Future of Data-Centric AI conference 2022. This conference brings together data scientists, ML engineers, and AI leaders to share insights, best practices, and research on how to evolve the ML lifecycle from model-centric to data-centric approaches. This conference takes place over two days with 40+ sessions, 50+ speakers, and thousands of…

Aug 04, 2022
Learn more about The Future of Data-Centric AI 2022 day 1 highlights
10-Ks information extraction case studies
10-Ks information extraction case studies

Building NLP techniques to understand 10-Ks is time-consuming, costly, and challenging. In this post, Machine Learning Engineer, Aarti Bagul discusses three information extraction case studies on how banks around the world are building highly accurate NLP applications using Snorkel Flow’s AI platform. From retail banking to hedge fund investing, NLP is used across the financial industry. By processing and extracting…

Jul 06, 2022
Learn more about 10-Ks information extraction case studies
Introducing Cluster View: Instant data insight made actionable to speed AI development
Introducing Cluster View: Instant data insight made actionable to speed AI development

Programmatic labeling moves a classic technique from interesting to high-impact So much of real-world AI development entails working with text data that’s messy — in fact, 80%+ of enterprise data is unstructured. And while state-of-the-art models get a lot of the glory, creating the training data that conveys what your model needs to learn is more often the biggest determiner of AI…

Jun 30, 2022
Learn more about Introducing Cluster View: Instant data insight made actionable to speed AI development
Data-centric approaches to multi-label classification
Data-centric approaches to multi-label classification

AI systems are well-suited to tasks involving recognizing and predicting data patterns. Supervised classification systems categorize unseen data into a finite set of discrete classes by learning from millions of hand-labeled labeled sample points. These classifiers are powerful business tools – they automate document sorting, customer sentiment analysis, sales performance, and other distinct business problems. However, they also require an…

Jun 29, 2022
Learn more about Data-centric approaches to multi-label classification
Data annotation guidelines and best practices
Data annotation guidelines and best practices

What is data annotation? Data annotation refers to the process of categorizing and labeling data for training datasets. This process plays a critical role in preparing data for machine learning models, as high-quality training data enables more accurate predictions and insights. In order for a training dataset to be usable, it must be categorized appropriately and annotated for a specific…

Jun 28, 2022
Learn more about Data annotation guidelines and best practices
1 27 28 29 37
Image

Join our newsletter

For expert advice, the latest research, and exclusive events.
By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.