

Getting better performance from foundation models (with less data)


GenAI may be the most transformative technology of the past decade but data is where enterprises are able to realize real value from AI today.


Generative AI is at peak hype and poised to dive into the “trough of despair,” according to the 2023 Gartner® Hype Cycle™ for AI.


We used weak supervision to programmatically curate instruction tuning data for open-source LLMs to build a better GenAI.


Snorkel AI announced a strategic partnership with Together AI to enable organizations to build their own proprietary LLMs on their data.


This release eases Snorkel Flow application creation process and tightens the iteration loop. It also upgrades our security certifications.


NVIDIA’s Nyla Worker presented “Leveraging Synthetic Data to Train Perception Models Using NVIDIA Omniverse Replicator” in 2022.


Google experts Abhishek Ratna and Robert Crowe discuss practical paths to data-centricity in applied AI at The Future of Data-Centric AI ’22.


State Farm senior data scientist Jason Goldfarb presented “Reusable Data Cleaning Pipelines in Python” at the Future of Data-Centric AI 2022.





