Professionals in the data science space often debate whether RAG or fine-tuning yields the better result. The answer is “both.”
Past U.S. Chief Data Scientist DJ Patil talked with Snorkel AI CEO Alex Ratner on topics including the origin of the title “data scientist.”
We designed, implemented, and rolled out a multi-faceted autoscaling solution that expands our ML capabilities while saving on cloud costs.
The surest way to improve foundation models is through more and better data, but Snorkel researchers showed FMs can learn from themselves.
GPT-3 unlocked additional capacity by automating first drafts of internal updates—including blog summaries and sample tweets.
Handling complaints effectively and efficiently with AI is essential to maintain customer satisfaction and protect the bank’s reputation.
The following was originally published on Wayfair’s tech blog. We have cross-posted it here, edited only to fit Snorkel’s formatting guidelines. — One of our missions at Wayfair is to help our 22 million customers find the products they are looking for. For example, when a customer searches for a “modern yellow sofa” on Wayfair, we want to show the most…
Generative AI can write poems, recite common knowledge, and extract information. GenAI can also help quickly build predictive pipelines.
Experts named generative AI as the most transformative technology of the decade. What is genAI, how does it work and why does it matter?
As enterprises look toward deploying LLM-powered, business-critical applications, they’re learning to use strategies beyond prompting.
Recent developments in AI tools have made email surveillance for banks better than ever. See how foundation models and Snorkel Flow can help.
Getting better performance from foundation models (with less data)
GenAI may be the most transformative technology of the past decade but data is where enterprises are able to realize real value from AI today.
Generative AI is at peak hype and poised to dive into the “trough of despair,” according to the 2023 Gartner® Hype Cycle™ for AI.
We used weak supervision to programmatically curate instruction tuning data for open-source LLMs to build a better GenAI.
Snorkel AI announced a strategic partnership with Together AI to enable organizations to build their own proprietary LLMs on their data.
This release eases Snorkel Flow application creation process and tightens the iteration loop. It also upgrades our security certifications.
NVIDIA’s Nyla Worker presented “Leveraging Synthetic Data to Train Perception Models Using NVIDIA Omniverse Replicator” in 2022.
Google experts Abhishek Ratna and Robert Crowe discuss practical paths to data-centricity in applied AI at The Future of Data-Centric AI ’22.
State Farm senior data scientist Jason Goldfarb presented “Reusable Data Cleaning Pipelines in Python” at the Future of Data-Centric AI 2022.
Jack Zhou, product manager at Arize, on “How to Apply Machine Learning Observability to Your ML System” from The Future of Data-Centric AI
Snorkel and affiliated academic labs have been hard at work reducing how computationally expensive large language models are.
Claypot AI CEO Chip Huyen presented “Platform for Real-Time Machine Learning” at Snorkel AI’s Future of Data-Centric AI 2022.
Enterprises—especially the world’s largest—are excited to use large language models, but they want to fine-tune them on proprietary data.
Jacomo Corbo and Bryan Richardson with QuantumBlack present “Automating Data Quality Remediation With AI” at The Future of Data-Centric AI.
Stefano Lindt presents “Leveraging NLP to Extract Value From Business Data” at Snorkel AI’s The Future of Data-Centric AI Summit in 2022.
Peter Davio, CTO at Black Swan Data, presented “Petabyte-Level Learning” at Snorkel AI’s The Future of Data-Centric AI Summit in 2022.
Grammarly’s Timo Mertens presents “Toward Superhuman Communication Assistance” at Snorkel AI’s The Future of Data-Centric AI Summit in 2022.
The Future of Data-Centric AI showcased customer to successes, took a deep look at Snorkel Flow, and announced two new solutions.
Day 1 of The Future of Data-Centric AI virtual conference 2023 featured the creator of Spark, the first U.S. Chief Data Scientist, and others.