Applied AI

Snorkel AI signs strategic collaboration agreement with AWS to help enterprises cross the demo-to-production chasm

June 27, 2024
3 min read

With AI teams facing overwhelming demand for new generative AI use cases, Snorkel launches AI model accelerator program to address the biggest stumbling block: unstructured data

SAN FRANCISCO (June 27, 2024) Snorkel AI today announced a multi-year Strategic Collaboration Agreement (SCA) with Amazon Web Services (AWS) to help enterprises build custom, production-ready artificial intelligence (AI) models. As part of the SCA, the companies have launched an accelerator program for Amazon SageMaker customers that is designed to deliver private fine-tuned generative AI models along with co-developed benchmarks that evaluate model performance against an organization’s unique goals and objectives. 

Large Language Models (LLMs) are trained on massive public data sets, and their responses are developed off that training data. Most high-value enterprise AI applications need to provide answers that incorporate private business information while complying with organizational and regulatory guidelines. To deliver production-quality results, generic LLMs must be privately tuned using an enterprise’s carefully curated data. 

Snorkel AI has spent the last decade pioneering the practice of AI data development and helps some of the world’s most sophisticated enterprises curate their data to build custom AI services. AWS provides a robust framework for responsible AI development with Amazon SageMaker, a fully managed service that brings together a broad set of tools to build, train, and deploy generative AI and machine learning (ML) models. The collaboration between AWS and Snorkel AI will make it easier and faster for enterprises to build AI applications tailored to their unique requirements.

Snorkel AI, AWS collaborate to accelerate enterprise AI.

“Generative AI quality is entirely dependent on the data used to tune and align models,” said Henry Ehrenberg, Snorkel AI co-founder. “Enterprises working on custom use cases need to quickly identify the best off-the-shelf model, then tune it with scalable and adaptable approaches to developing their data, and deploy these models with enterprise-grade security and privacy. Our relationship with AWS helps organizations around the globe accelerate the demo-to-production pipeline. This is fantastic news for our shared customers and the many AWS customers looking to transform their business with generative AI use cases.” 

As part of the Snorkel and AWS Custom Model Accelerator Program, participants will work directly with Snorkel researchers and AWS experts and get early access to cutting-edge techniques from published and proprietary research. Program components include:

  • Evaluation and benchmark workshops conducted collaboratively with Snorkel and AWS experts to create custom benchmarks, combining Snorkel’s experience in LLM evaluation and data operations with customers’ requirements and domain knowledge–leading to more insightful evaluations of LLM performance.
  • Snorkel-led data & LLM development to support end-to-end delivery of LLMs that are fine-tuned and aligned using Amazon SageMaker JumpStart and other tools, like Amazon Bedrock in the future, to meet production-level performance. 
  • Model cost optimization and serving to optionally distill LLMs into specialized “small language models” (SLMs) that improve enterprise task-specific accuracy, while dramatically reducing cost.
  • Exclusive access to a production-ready model optimized for a specific use case.

Ready to accelerate AI development?

Deploy production AI and ML applications 10-100x faster with Snorkel’s experts, using our proprietary technology.

Request a demo

Share this article

Recommended articles

View all articles
Image
Building AI-Native Systems for Federal Infrastructure: A Conversation with Rezaur Rahman
Christopher Sniffen recently sat down with Rezaur Rahman — CIO / CISO / CAIO at the Advisory Council on Historic Preservation — for a conversation on what it actually takes to build frontier AI for federal infrastructure. They get into the limits of frontier models on geospatial reasoning, mechanistic interpretability for applied AI, the trick that makes vision models useful
May 14, 2026
Snorkel Team
Image
Code World Models and AutoHarness for LLM Agents
At our latest Snorkel AI Reading Group, Carter Wendelken of Google DeepMind walked us through two related papers he presented at ICLR: Code World Models for General Game Playing and AutoHarness: Improving LLM Agents by Automatically Synthesizing a Code Harness. Both ask the same question from opposite ends: when you want an LLM to act reliably in a complex, possibly
May 14, 2026
David Burch
coding-agents-eval
Why coding agents need better data, evals, and environments
Coding agents have moved from tab-complete to teammate. They autonomously inspect repositories, edit files, run commands, diagnose failures, and work through multi-step engineering tasks. That creates a harder reliability problem. A model that only suggests code is easy for a human to evaluate. A coding agent refactoring your repository and testing its own changes is much harder to supervise –
May 11, 2026
Justin Bauer
Image

Join our newsletter

For expert advice, the latest research, and exclusive events.
By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.