The Snorkel AI Blog

Google’s Dr. Arsanjani on Enterprise Foundation Model Promise and Challenges

Ali Arsanjani, director of cloud partner engineering at Google Cloud, presented a talk entitled “Challenges and Ethics of DLM and LLM Adoption in the Enterprise” at Snorkel AI’s recent Foundation Model Virtual Summit.

Fine-Tuning, Foundation Models, LLMs, MLOps, NLP

Team Snorkel

March 2, 2023

Foundation Models 101: a guide with essential FAQs

Foundation Models (FMs), such as GPT-3 and Stable Diffusion, mark the beginning of a new era in machine learning and artificial intelligence. What are they and how will they impact your business? Find out in our guide.

Alignment, Annotation, Data Development, Data Labeling, Data-Centric AI, Evaluation, Fine-Tuning, Foundation Models, NLP

Matt Casey

March 1, 2023

Combining foundation models with weak supervision

Combining foundation model outputs with weak supervision yields faster model development and requires fewer ground truth labels.

Data Development, Data Labeling, Data-Centric AI, Foundation Models, NLP

Matt Hoffman

March 1, 2023

Operationalizing knowledge for data-centric AI

Snorkel AI CEO and Co-Founder Alex Ratner’s introduction to data-centric AI from the 2022 Future of Data-Centric AI virtual conference.

Data Development, Data Labeling, Data-Centric AI, Fine-Tuning, Foundation Models, LLMs, NLP, Partners

Team Snorkel

February 27, 2023

Credo AI DS head on operationalizing responsible AI

Credo AI’s head of data science explains at Snorkel’s FDCAI 2022 how his team works to operationalize responsible AI assessment tools.

Data Development, Data-Centric AI, Evaluation, NLP, Product Releases

Team Snorkel

February 22, 2023

How a Brown professor sharpened and shrunk GPT-3

Brown professor Stephen Bach tells Snorkel CEO Alex Ratner about his research into improving foundation models like GPT-3 with curated data.

Data Development, Data-Centric AI, Foundation Models, NLP

Team Snorkel

February 21, 2023

Cleanlab CEO shows automatic data-cleansing tools

Cleanlab Co-Founder and CEO Curtis Northcutt presents his company’s automatic, universal and open-source tools to quickly clean data sets.

Annotation, Computer vision, Data Labeling, Data-Centric AI, Evaluation

Team Snorkel

February 17, 2023

Aspect-based sentiment analysis in Snorkel Flow

Understanding and quantifying people’s opinions has become increasingly important to businesses, but the way people can express multiple thoughts in the same sentence has frustrated practitioners’ efforts to extract those opinions cleanly—a problem we can solve through aspect-based sentiment analysis (ABSA).

Annotation, Data Development, Data Labeling, Fine-Tuning, Foundation Models, MLOps, NLP

Lia Chin-Purcell

February 15, 2023

Comcast’s data-centric approach to speech interfaces

Jan Neumann, Vice President of Machine Learning for Comcast Applied AI and Discovery, describes Comcast’s data-centric AI approach to speech.

Annotation, Data Labeling, Data-Centric AI, NLP, Synthetic Data

Team Snorkel

February 14, 2023

Meta research manager talks speech and search

Meta senior applied research manager Anoop Sinha and Snorkel AI co-founder Braden Hancock discuss mastering speech and search with TWIML host Sam Charrington.

Data-Centric AI, Fine-Tuning, Foundation Models, LLMs, NLP

Team Snorkel

February 10, 2023

Using Snowflake Connector in Snorkel Flow

As part of Snorkel AI’s partnership with Snowflake, users can now upload millions of rows of data seamlessly from their Snowflake warehouse into Snorkel Flow via the natively-integrated Snowflake connector. With a few clicks, a user can upload massive amounts of Snowflake data and quickly develop high-quality ML models using Snorkel Flow’s Data-Centric AI platform.

Data Development, Data Labeling, Data-Centric AI, Partners, Product Releases

Vashisht Madhavan

February 8, 2023

NASA ML Lead on its WorldView citizen scientist no-code tool

Anirudh Koul is Machine Learning Lead for the NASA Frontier Development Lab and the Head of Machine Learning Sciences at Pinterest. He presented at Snorkel AI’s 2022 Future of Data Centric AI (FDCAI) Conference.

Computer vision, Data Labeling, Data-Centric AI, Evaluation, Fine-Tuning

Team Snorkel

February 6, 2023

Snorkel AI and Google Cloud accelerate AI innovation

Snorkel AI is teaming up with Google Cloud to help F500 companies and AI innovators solve their most difficult problems.

Annotation, Data Development, Data Labeling, Data-Centric AI, MLOps, Partners, Product Releases

Friea Berg

February 2, 2023

Building better datasets with Snorkel Flow error analysis

As machine learning practitioners, few of us would expect the first version of a new model to achieve our objective. We plan for multiple rounds of iteration to address errors and improve performance, and the Snorkel Flow platform provides tools to enable this kind of iteration within the data-centric AI framework.

Data Labeling, Data-Centric AI, Evaluation, Fine-Tuning, MLOps, Product Releases

Josh McGrath

February 2, 2023

Seldon and Snorkel AI partner to advance data-centric AI

Together, Snorkel AI and Seldon enable enterprises to adopt AI across the business at scale by dramatically accelerating development and deployment and tightening the feedback loop to rapidly respond to data drift or changing business requirements.

Data Development, Data Labeling, Data-Centric AI, Foundation Models, MLOps, Partners

Friea Berg

February 1, 2023

Accuracy top concern for Foundation Model adoption—Poll

Most poll respondents at Snorkel AI’s recent Foundation Model Virtual Summit named questionable accuracy as the biggest barrier preventing them from getting organizational value from Foundation Models.

Evaluation, Foundation Models, NLP, Product Releases

Matt Casey

January 31, 2023

How Foundation Models bolster programmatic labeling

Snorkel CEO Alex Ratner interviews Mayee Chen about how Liger improves the effectiveness of programmatic labeling through foundation model embeddings.

Alignment, Annotation, Data Development, Data Labeling, Data-Centric AI, Evaluation, Fine-Tuning, Foundation Models, NLP

Team Snorkel

January 26, 2023

Snorkel AI partners with Snowflake to bring data-centric AI to the Snowflake Data Cloud

Snorkel AI has teamed with Snowflake to help our shared customers transform raw, unstructured data into actionable, AI-powered insights.

Data Development, Data Labeling, Data-Centric AI, Partners

Friea Berg

January 25, 2023

Unmasking Trafficking Risk in Commercial Sex Supply Chains with Machine Learning

Hamsa Bastani presented a summary of her and her co-authors’ ongoing work using machine learning and Snorkel AI’s tools to detect and track activities that are associated with a high risk for global sex trafficking.

Data Development, Data Labeling, NLP, Partners

Team Snorkel

January 20, 2023

Prompting and weak supervision to build better, smaller models

Snorkel AI co-founder and CEO Alex Ratner recently interviewed several Snorkel researchers about their published academic papers. In this video, Alex talks with Ryan Smith, Senior Applied Scientist at Snorkel, about the work he did on using foundation models to build compact, deployable, and effective models.

Annotation, Data-Centric AI, Evaluation, Foundation Models, NLP

Team Snorkel

January 19, 2023

FM Summit shows Foundation Model hurdles and potential

Snorkel AI held its Foundation Model Summit Jan 17, bringing together 12 presenters and over 600 attendees at 10 virtual sessions. The event drew registrants from across many sectors, including the tech industry, healthcare, and financial services.

Alignment, Data Development, Data Labeling, Data-Centric AI, Evaluation, Fine-Tuning, Foundation Models, LLMs, NLP, Product Releases

Matt Casey

January 18, 2023

Contrastive Learning boosts Foundation Model specialization

Snorkel AI co-founder and CEO Alex Ratner talks with Ananya Kumar about the work he did on improving the effectiveness of foundation models by using contrastive learning, image augmentations, and labeled subsamples.

Computer vision, Data-Centric AI, Fine-Tuning, Foundation Models, NLP

Team Snorkel

January 13, 2023

Adapting language-based models beyond English

While a majority of Natural Language Processing (NLP) models focus on English, the real world requires solutions that work with languages across the globe. This demo shows how effectively users can build cross-language models in Snorkel Flow.

Data Development, Data Labeling, Data-Centric AI, Fine-Tuning, Foundation Models, LLMs, NLP

Anastassia Kornilova, April Guo

January 12, 2023

How Pixability uses foundation models to accelerate NLP application development by months

Using Snorkel Flow, Pixability has created a way to build classifiers for massive amounts of YouTube data quickly—that was previously out of reach.

Data Development, Data Labeling, Data-Centric AI, Foundation Models, NLP

Nick Harvey

January 11, 2023

Speech AI Demystified | FDCAI Lightning Talk

Sirisha Rella, Technical Product Marketing Manager at Nvidia, recently gave a Lightning Talk presentation on “demystifying” speech AI at Snorkel AI’s Future of Data-Centric AI virtual conference.

Data-Centric AI, Fine-Tuning, NLP, Product Releases

Team Snorkel

January 10, 2023

Snorkel AI to host Foundation Model Virtual Summit, registration now open

Snorkel AI will hold a free Foundation Model Virtual Summit on Tuesday, January 17 where speakers from across the technology industry, including some from Google and Stanford University, will discuss the enterprise use of Foundation Models.

Data-Centric AI, Fine-Tuning, Foundation Models, NLP, Partners

Team Snorkel

January 5, 2023

Demo: Using Snorkel Flow to train Microsoft Azure Form Recognizer models

Snorkel Flow debuts a new integration with Microsoft Azure Form Recognizer to help organizations leverage Azure AI services.

Computer vision, Data-Centric AI, Fine-Tuning, Partners, Product Releases

Team Snorkel

January 5, 2023

Ask Me Anything approach bolsters foundation models

Researcher Simran Arora tells Snorkel CEO Alex Ratner how she improved foundation model effectiveness by using “Ask Me Anything”-style questions.

Data-Centric AI, Evaluation, Fine-Tuning, Foundation Models, NLP

Team Snorkel

January 4, 2023

Snorkel Flow 2022 year-end release roundup

See what’s in our latest Snorkel Flow release and how we’re accelerating data-centric AI development further.

Data Development, Data Labeling, Data-Centric AI, Foundation Models, LLMs, NLP, Product Releases

Aparna Lakshmiratan

January 3, 2023

Combining human and artificial intelligence with human-in-the-loop ML | FDCAI

More components in an ML lifecycle are designed to run on autopilot, but some tasks require human-in-the-loop ML, an active research topic that has seen an increasing number of publications in the last 10 years.

Annotation, Computer vision, Data-Centric AI, Evaluation, NLP

Team Snorkel

December 28, 2022

Latest posts

Google’s Dr. Arsanjani on Enterprise Foundation Model Promise and Challenges

Foundation Models 101: a guide with essential FAQs

Combining foundation models with weak supervision

Operationalizing knowledge for data-centric AI

Credo AI DS head on operationalizing responsible AI

How a Brown professor sharpened and shrunk GPT-3

Cleanlab CEO shows automatic data-cleansing tools

Aspect-based sentiment analysis in Snorkel Flow

Comcast’s data-centric approach to speech interfaces

Meta research manager talks speech and search

Using Snowflake Connector in Snorkel Flow

NASA ML Lead on its WorldView citizen scientist no-code tool

Snorkel AI and Google Cloud accelerate AI innovation

Building better datasets with Snorkel Flow error analysis

Seldon and Snorkel AI partner to advance data-centric AI

Accuracy top concern for Foundation Model adoption—Poll

How Foundation Models bolster programmatic labeling

Snorkel AI partners with Snowflake to bring data-centric AI to the Snowflake Data Cloud

Unmasking Trafficking Risk in Commercial Sex Supply Chains with Machine Learning

Prompting and weak supervision to build better, smaller models

FM Summit shows Foundation Model hurdles and potential

Contrastive Learning boosts Foundation Model specialization

Adapting language-based models beyond English

How Pixability uses foundation models to accelerate NLP application development by months

Speech AI Demystified | FDCAI Lightning Talk

Snorkel AI to host Foundation Model Virtual Summit, registration now open

Demo: Using Snorkel Flow to train Microsoft Azure Form Recognizer models

Ask Me Anything approach bolsters foundation models

Snorkel Flow 2022 year-end release roundup

Combining human and artificial intelligence with human-in-the-loop ML | FDCAI

Join our newsletter for expert advice, the latest research, and exclusive events.