The Snorkel AI Blog

What is specialized GenAI evaluation, and why is it so critical to enterprise AI?

Specialized GenAI evaluation ensures AI assistants meet business requirements, SME expertise, and industry regulations—critical for production-ready AI.

Shane Johnson

Why enterprise GenAI evaluation requires fine-grained metrics to be insightful

GenAI needs fine-grained evaluation for AI teams to gain actionable insights.

Shane Johnson

Why GenAI evaluation requires SME-in-the-loop for validation and trust

Shane Johnson

How a global financial services company built a specialized AI copilot accurate enough for production

Team Snorkel

Download and read "The Guide to Data Labeling & Annotation for Enterprise AI Teams"

Latest posts

LLM-as-judge for enterprises: evaluate model alignment at scale

Discover how enterprises can leverage LLM-as-Judge systems to evaluate generative AI outputs at scale, improve model alignment, reduce costs, and tackle challenges like bias and interpretability.

Matt Casey, Tom Walshe

March 26, 2025

Why GenAI evaluation requires SME-in-the-loop for validation and trust

It’s critical enterprises can trust and rely on GenAI evaluation results, and for that, SME-in-the-loop workflows are needed. In my first blog post on enterprise GenAI evaluation, I discussed the importance of specialized evaluators as a scalable proxy for SMEs. It simply isn’t practical to task SMEs with performing manual evaluations – it can take weeks if not longer, unnecessarily…

Shane Johnson

March 20, 2025

Applied AI, Research

Research spotlight: is long chain-of-thought structure all that matters when it comes to LLM reasoning distillation?

We’re taking a look at the research paper, LLMs can easily learn to reason from demonstration (Li et al., 2025), in this week’s community research spotlight. It focuses on how the structure of reasoning traces impacts distillation from models such as DeepSeek R1. What’s the big idea regarding LLM reasoning distillation? The reasoning capabilities of powerful models such as DeepSeek…

Shane Johnson

March 19, 2025

Why enterprise GenAI evaluation requires fine-grained metrics to be insightful

GenAI needs fine-grained evaluation for AI teams to gain actionable insights.

Shane Johnson

March 18, 2025

Latest videos

Browse our YouTube channel

Improving the accuracy of domain specific tasks with LLM distillation

How to Build RAG Applications with Snorkel AI and AWS

Optimizing GenAI systems with AWS and Snorkel

How to Evaluate LLM Performance for Domain-Specific Use Cases

Understand the basics of LLM training in under four minutes!

RAG Optimization: A Practical Overview for Improving Retrieval Augmented Generation

Applied AI

Why GenAI evaluation requires SME-in-the-loop for validation and trust

Shane Johnson

March 20, 2025

Applied AI, Research

Research spotlight: is long chain-of-thought structure all that matters when it comes to LLM reasoning distillation?

Shane Johnson

March 19, 2025

Why enterprise GenAI evaluation requires fine-grained metrics to be insightful

GenAI needs fine-grained evaluation for AI teams to gain actionable insights.

Shane Johnson

March 18, 2025

What is specialized GenAI evaluation, and why is it so critical to enterprise AI?

Specialized GenAI evaluation ensures AI assistants meet business requirements, SME expertise, and industry regulations—critical for production-ready AI.

Shane Johnson

March 5, 2025

Customers

Customers

Call center AI for customer experience management: a case study

How one large financial institution used call center AI to inform customer experience management with real-time data.

Maxwell Williams

August 14, 2024

Customers

How we achieved 89% accuracy on contract question answering

A customer wanted an llm system for complex contract question answering tasks. We helped them build it—beating the baseline by 64 points.

Minhajul Hoque

April 2, 2024

Applied AI, Customers

Content filtering breakthrough: Snorkel client reaches 96% recall in 3 days

Snorkel AI helped a client solve the challenge of social media content filtering quickly and sustainably. Here’s how.

Gabe Smith

March 26, 2024

Customers

First cohort of Snorkel GenAI customers sees gains up to 54 points

In its first six months, Snorkel Foundry collaborated on high-value projects with notable companies and produced impressive results.

Marty Moesta

December 20, 2023

Data development

LLM-as-judge for enterprises: evaluate model alignment at scale

Discover how enterprises can leverage LLM-as-Judge systems to evaluate generative AI outputs at scale, improve model alignment, reduce costs, and tackle challenges like bias and interpretability.

Matt Casey, Tom Walshe

March 26, 2025

Why enterprises should embrace LLM distillation

Unlock possibilities for your enterprise with LLM distillation. Learn how distilled, task-specific models boost performance and shrink costs.

Shane Johnson

February 18, 2025

LLM evaluation in enterprise applications: a new era in ML

Learn about the obstacles faced by data scientists in LLM evaluation and discover effective strategies for overcoming them.

Matt Casey, Venkatesh Rao

November 25, 2024

AI data development: a guide for data science projects

What is AI data development? AI data development includes any action taken to convert raw information into a format useful to AI.

Matt Casey, Minhajul Hoque

November 13, 2024

Product

Databricks + Snorkel Flow: integrated, streamlined AI development

Discover the power of integrating Databricks and Snorkel Flow for efficient data ingestion, labeling, model development, and AI deployment.

Bryan Wood

January 8, 2025

How LLM evaluation drives better models in Snorkel Flow

Discover how Snorkel AI’s methodical workflow can simplify the evaluation of LLM systems. Achieve better model performance in less time.

Rebekah Westerlind

December 17, 2024

Unlock proprietary data with Snorkel Flow and Amazon SageMaker

Accelerate LLM development with Snorkel Flow and SageMaker. Automate dataset curation, accelerate training, and gain a competitive advantage.

Chris Borg, Jennifer Casey

December 2, 2024

Snorkel AI joins the AWS ISV Accelerate Program and launches Snorkel Flow Availability in AWS Marketplace

Snorkel AI and AWS are partnering to help enterprises build, deploy, and evaluate custom, production-ready AI models. Learn how.

Jennifer Casey

November 19, 2024

Research