Generative AI can write poems, recite common knowledge, and extract information. GenAI can also help quickly build predictive pipelines.
Getting better performance from foundation models (with less data)
GenAI may be the most transformative technology of the past decade but data is where enterprises are able to realize real value from AI today.
We used weak supervision to programmatically curate instruction tuning data for open-source LLMs to build a better GenAI.
Snorkel and affiliated academic labs have been hard at work reducing how computationally expensive large language models are.
Enterprises—especially the world’s largest—are excited to use large language models, but they want to fine-tune them on proprietary data.
Peter Mattson, Google senior staff engineer and president of MLCommons.org, explained MLCommons at The Future of Data-Centric AI in 2022.
Large language models have enormous potential. But what are they? Where did they come from? And how can you make them work better?
Stanford assistant professor James Zou, presents “Responsible Data-Centric AI for Healthcare and Medicine” at The Future of Data-Centric AI.
Snorkel AI has accepted the first batch of applications for its first annual virtual poster competition. But there’s still time to add yours to the mix.
Join us on June 7-8 to learn how to use your data to build your AI moat at The Future of Data-Centric AI 2023 free virtual conference.
Sharon Li is an assistant professor at the University of Wisconsin-Madison. She presented “Detecting Data Distributional Shift: Challenges and Opportunities” at Snorkel AI’s The Future of Data-Centric AI Summit in 2022. The talk covered a novel approach for handling out-of-distribution objects.
Harvard Professor Vijay Janapa Reddi’s presentation: “DataPerf: Benchmarks for data” from Snorkel AI’s 2022 Future of Data-Centric AI event.
Prasanna Balaprakash, research and development lead from Argonne National Laboratory gave a presentation entitled “Extracting the Impact of Climate Change from Scientific Literature using Snorkel-Enabled NLP” at Snorkel AI’s Future of Data-Centric AI Workshop in August, 2022.
Simran Arora is a machine learning researcher at Stanford University. She presented “Ask Me Anything: How are Foundation Models Changing the Way We Build Software” at Snorkel AI’s Foundation Model Virtual Summit 2023.