Constructing labeling functions (LFs) is at the heart of using weak supervision. We often think of these labeling functions as programmatic expressions of domain expertise or heuristics. Indeed, much of the advantage of weak supervision is that it saves time—writing labeling functions and applying them to data at scale is far more efficient than hand-labeling huge numbers of…
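To make this concrete, here is a minimal sketch of a keyword-based labeling function written with the open-source Snorkel `@labeling_function` decorator; the label values and the `text` field are illustrative assumptions, not details from the original post.

```python
# Minimal sketch of a heuristic labeling function (open-source Snorkel).
# The label scheme and the `text` attribute are assumptions for illustration.
from snorkel.labeling import labeling_function

ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1

@labeling_function()
def lf_mentions_refund(x):
    # Heuristic: reviews that mention a refund are likely negative; otherwise abstain.
    return NEGATIVE if "refund" in x.text.lower() else ABSTAIN
```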
Powerful resources to leverage as labeling functions In this post, we’ll use the COVID-FACT dataset to demonstrate how to use existing resources as labeling functions (LFs) to build a fact-checking system. The COVID-FACT dataset contains 4,086 claims about the COVID-19 pandemic, along with evidence for the claims and contradictory claims refuted by that evidence. The evidence retrieval is formulated…
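As a rough illustration of the pattern, an existing resource such as a small lexicon of contradiction cues can be wrapped directly as a labeling function. The phrase list, field names, and label values below are hypothetical and only sketch the idea.

```python
# Sketch: wrapping an existing resource (a hypothetical lexicon of refuting
# phrases) as a labeling function for claim verification.
from snorkel.labeling import labeling_function

ABSTAIN, REFUTED, SUPPORTED = -1, 0, 1

# Hypothetical external resource: contradiction cues collected elsewhere.
REFUTING_PHRASES = ["no evidence that", "contrary to", "has been debunked"]

@labeling_function()
def lf_refuting_language(x):
    # Vote REFUTED when the evidence contains a contradiction cue; otherwise abstain.
    evidence = x.evidence.lower()
    return REFUTED if any(p in evidence for p in REFUTING_PHRASES) else ABSTAIN
```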
This post showcases a panel discussion on the academic and industry perspectives of ethical AI, moderated by Alexis Zumwalt, Director of Federal Strategy and Growth. Panelists included Swati Gupta, Fouts Family Early Career Professor and Lead of Ethical AI (NSF AI Institute AI4OPT), Georgia Institute of Technology; Thomas Sasala, Chief Data Officer, Department of the Navy; and the Senior Manager of Responsible…
The founding team of Snorkel AI has spent over half a decade—first at the Stanford AI Lab and now at Snorkel AI—researching programmatic labeling and other techniques for breaking through the biggest bottleneck in AI: the lack of labeled training data. This research has resulted in the Snorkel research project and 150+ peer-reviewed publications. Snorkel’s programmatic labeling technology has been…
The founding team of Snorkel AI has spent over half a decade—first at the Stanford AI Lab and now at Snorkel AI—researching data-centric techniques to overcome the biggest bottleneck in AI: the lack of labeled training data. In this video, Snorkel AI co-founder Paroma Varma gives an overview of the key principles of data-centric AI development. What is data-centric AI?…
Showcasing Liger, which combines foundation model embeddings with weak supervision to improve current techniques. Machine learning whiteboard (MLW) open-source series In this talk, Mayee Chen, a PhD student in Computer Science at Stanford University, focuses on her work combining weak supervision with foundation model embeddings to improve two essential aspects of current weak supervision techniques. Check out the full episode here or…
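The sketch below captures only the general intuition of extending a labeling function's votes to nearby points in embedding space; it is not the Liger algorithm itself, and all names and inputs are assumptions.

```python
# Intuition-only sketch (not the Liger algorithm): copy a labeling function's
# votes onto nearby points in foundation-model embedding space.
# `embeddings` is (n, d); `lf_votes` has length n, with -1 meaning abstain.
import numpy as np
from sklearn.neighbors import NearestNeighbors

def extend_votes(embeddings: np.ndarray, lf_votes: np.ndarray) -> np.ndarray:
    covered, abstained = lf_votes != -1, lf_votes == -1
    if not covered.any() or not abstained.any():
        return lf_votes
    # For each abstained point, borrow the vote of its nearest covered neighbor.
    nn = NearestNeighbors(n_neighbors=1).fit(embeddings[covered])
    _, idx = nn.kneighbors(embeddings[abstained])
    extended = lf_votes.copy()
    extended[abstained] = lf_votes[covered][idx[:, 0]]
    return extended
```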
A primer on active learning presented by Josh McGrath. Machine learning whiteboard (MLW) open-source series This video defines active learning, explores variants and design decisions made within active learning pipelines, and compares it to related methods. It contains references to some seminal papers in machine learning that we find instructive. Check out the full video below or on YouTube. Additionally, a…
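For readers who want a concrete anchor, the snippet below sketches the most common variant covered in such primers, pool-based uncertainty sampling; the classifier and data names are assumptions for illustration.

```python
# Minimal sketch of pool-based active learning via uncertainty sampling.
# The classifier, labeled set, and unlabeled pool are illustrative assumptions.
import numpy as np
from sklearn.linear_model import LogisticRegression

def least_confident(model, X_pool: np.ndarray, batch_size: int = 10) -> np.ndarray:
    """Return indices of the pool points the model is least confident about."""
    probs = model.predict_proba(X_pool)
    confidence = probs.max(axis=1)              # probability of the top class
    return np.argsort(confidence)[:batch_size]  # lowest confidence first

# Typical loop: fit on the labeled set, query the least-confident points,
# send them to an annotator, add the new labels, and repeat.
# model = LogisticRegression().fit(X_labeled, y_labeled)
# query_indices = least_confident(model, X_pool)
```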
Utilizing large language models as zero-shot and few-shot learners with Snorkel for better quality and more flexibility Large language models (LLMs) such as BERT, T5, GPT-3, and others are exceptional resources for applying general knowledge to your specific problem. Being able to frame a new task as a question for a language model (zero-shot learning), or to show it a few…
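One way to picture this is to treat a zero-shot model's prediction as one more labeling function. The sketch below uses the Hugging Face zero-shot classification pipeline; the candidate labels and the 0.8 confidence threshold are assumptions, not settings from the post.

```python
# Sketch: an off-the-shelf zero-shot model used as a labeling function.
# Candidate labels and the confidence threshold are illustrative assumptions.
from snorkel.labeling import labeling_function
from transformers import pipeline

ABSTAIN, NEGATIVE, POSITIVE = -1, 0, 1
zero_shot = pipeline("zero-shot-classification", model="facebook/bart-large-mnli")

@labeling_function()
def lf_zero_shot(x):
    # Ask the model to pick a label, but abstain when it is not confident.
    result = zero_shot(x.text, candidate_labels=["positive", "negative"])
    if result["scores"][0] < 0.8:
        return ABSTAIN
    return POSITIVE if result["labels"][0] == "positive" else NEGATIVE
```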
We are honored to be part of the International Conference on Learning Representations (ICLR) 2022, where Snorkel AI founders and researchers will be presenting five papers on data-centric AI topics. The field of artificial intelligence moves fast! This is a world we are intimately familiar with at Snorkel AI, having spun out of academia in 2019. For over half a…
The Future of Data-Centric AI Talk Series Background Chelsea Finn is an assistant professor of computer science and electrical engineering at Stanford University, whose research has been widely recognized, including in the New York Times and MIT Technology Review. In this talk, Chelsea discusses algorithms that use data both from the tasks you are interested in and from other tasks…
The future of data-centric AI talk series Background Anima Anandkumar holds dual positions in academia and industry. She is a Bren Professor at Caltech and the director of machine learning research at NVIDIA. Anima also has a long list of accomplishments, ranging from the Alfred P. Sloan Research Fellowship to the prestigious NSF CAREER Award, among many others. She recently joined…
Understanding the label model. Machine learning whiteboard (MLW) open-source series Background Frederic Sala is an assistant professor at the University of Wisconsin-Madison and a research scientist at Snorkel AI. Previously, he was a postdoc in Chris Re’s lab at Stanford. His research focuses on data-driven systems and weak supervision. In this talk, Fred focuses on weak supervision modeling. This machine…
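For orientation, the label model in the open-source Snorkel library is fit on the matrix of labeling-function votes and produces probabilistic labels; below is a small sketch with a toy vote matrix (the data are made up for illustration).

```python
# Sketch: fitting the open-source Snorkel label model to a matrix of LF votes.
# Rows are examples, columns are labeling functions, and -1 means abstain.
import numpy as np
from snorkel.labeling.model import LabelModel

L_train = np.array([[1, -1,  0],
                    [1,  1, -1],
                    [0,  0,  0]])  # toy vote matrix for illustration

label_model = LabelModel(cardinality=2, verbose=False)
label_model.fit(L_train=L_train, n_epochs=500, seed=123)
probs = label_model.predict_proba(L=L_train)  # probabilistic training labels
```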
Moving from Manual to Programmatic Labeling Labeling training data by hand is exhausting. It’s tedious, slow, and expensive—the de facto bottleneck most AI/ML teams face today [1]. Eager to alleviate this pain point of AI development, machine learning practitioners have long sought ways to automate this labor-intensive labeling process (i.e., “automated data labeling”) [2], and have reached for classic approaches…
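In programmatic labeling, the manual step is replaced by applying a set of labeling functions across the dataset in one pass. A minimal sketch with the open-source Snorkel applier is below; `df_train` and the `lfs` list are assumed to exist already.

```python
# Sketch: applying labeling functions to a pandas DataFrame in one pass and
# inspecting their coverage, overlaps, and conflicts.
# `lfs` (a list of labeling functions) and `df_train` are assumed inputs.
from snorkel.labeling import PandasLFApplier, LFAnalysis

applier = PandasLFApplier(lfs=lfs)
L_train = applier.apply(df=df_train)                # (n_examples x n_LFs) vote matrix
print(LFAnalysis(L=L_train, lfs=lfs).lf_summary())  # per-LF coverage and conflict stats
```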
The Future of Data-Centric AI Talk Series Background Alex Ratner is CEO and co-founder of Snorkel AI and an Assistant Professor of Computer Science at the University of Washington. He recently joined the Future of Data-Centric AI event, where he presented the principles of data-centric AI and where it’s headed. If you would like to watch his presentation in full,…
Machine Learning Whiteboard (MLW) Open-source Series Today, Ryan Smith, machine learning research engineer at Snorkel AI, talks about prompting methods with language models and some of their applications to weak supervision. In this talk, we’re essentially going to be using this paper as a template—it’s a great survey of prompting methods from the last few years…
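As a tiny illustration of the prompting idea, classification can be recast as filling in a blank in a template; the template, verbalizer words, and model choice below are assumptions for illustration.

```python
# Sketch: a cloze-style prompt that recasts sentiment classification as
# predicting a masked word. The template and verbalizer are assumptions.
from transformers import pipeline

fill_mask = pipeline("fill-mask", model="bert-base-uncased")

def prompt_label(review: str) -> str:
    # Wrap the input in a template and read off the model's predicted word.
    prompt = f"{review} Overall, the movie was [MASK]."
    predictions = fill_mask(prompt)
    return predictions[0]["token_str"]  # e.g. "great" or "terrible"
```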