All articles on
Research

Two approaches to distill LLMs for better enterprise value

Distillation techniques allow enterprises to access the full predictive power of large language models at a tiny fraction of their cost.

Jason Fries Headshot
October 31, 2023

Bloomberg’s Gideon Mann on the power of domain specialist LLMs

Gideon Mann, head of ML Product and Research at Bloomberg LP, chatted with Snorkel CEO Alex Ratner about building BloombergGPT.

Dr. Bubbles, Snorkel AI's mascot
October 17, 2023

Which is better, retrieval augmentation (RAG) or fine-tuning? Both.

Professionals in the data science space often debate whether RAG or fine-tuning yields the better result. The answer is “both.”

Hoang Tran portrayed.
September 20, 2023

Former U.S. Chief Data Scientist on past and future of data science

Past U.S. Chief Data Scientist DJ Patil talked with Snorkel AI CEO Alex Ratner on topics including the origin of the title “data scientist.”

Dr. Bubbles, Snorkel AI's mascot
September 12, 2023

4 new papers show foundation models can build on themselves

The surest way to improve foundation models is through more and better data, but Snorkel researchers showed FMs can learn from themselves.

August 31, 2023

Accelerating predictive task time to value with generative AI

Generative AI can write poems, recite common knowledge, and extract information. GenAI can also help quickly build predictive pipelines.

August 17, 2023
August 4, 2023

Data fuels enterprise AI value: 6 takeaways from the Gartner Hype Cycle for Artificial Intelligence, 2023

GenAI may be the most transformative technology of the past decade but data is where enterprises are able to realize real value from AI today.

August 2, 2023

How we built better GenAI with programmatic data development

We used weak supervision to programmatically curate instruction tuning data for open-source LLMs to build a better GenAI.

July 19, 2023

The future of large language models is faster and more robust

Snorkel and affiliated academic labs have been hard at work reducing how computationally expensive large language models are.

June 29, 2023

LLMs high priority for enterprise data science, but concerns remain

Enterprises—especially the world’s largest—are excited to use large language models, but they want to fine-tune them on proprietary data.

June 23, 2023

How MLCommons is democratizing data with public datasets

Peter Mattson, Google senior staff engineer and president of MLCommons.org, explained MLCommons at The Future of Data-Centric AI in 2022.

Dr. Bubbles, Snorkel AI's mascot
May 31, 2023

Large language models: their history, capabilities and limitations

Large language models have enormous potential. But what are they? Where did they come from? And how can you make them work better?

May 25, 2023

Stanford professor on data-centric AI for healthcare and medicine

Stanford assistant professor James Zou, presents “Responsible Data-Centric AI for Healthcare and Medicine” at The Future of Data-Centric AI.

Dr. Bubbles, Snorkel AI's mascot
May 18, 2023

Poster presenters compete to win desktop GPU

Snorkel AI has accepted the first batch of applications for its first annual virtual poster competition. But there’s still time to add yours to the mix.

May 9, 2023
1 2 3 4 7
Image
See how Snorkel can help you get up to:
100x

Faster Data Curation

40x
Faster Model Delivery
99%
Model Accuracy