Snorkel helps build Terminal-Bench 2.0.
Learn more
Capabilities
overview
Our technology
The engine powering our solutions
EXPERT DATA SERVICES
Overview
Expert-curated datasets for frontier AI
Use cases
From agentic systems to coding, explore data applications
Join our expert community
Get paid to shape safer, smarter AI
Enterprise AI Solutions
Overview
Custom AI systems built to unlock ROI fast
Customer stories
Real-world results from enterprise deployments
WATCH NOW
Expert Data-as-a-Service
Our co-founder and CEO Alex Ratner shares how Snorkel helps organizations scale expert data to accelerate safer, more reliable AI development.
Research
RESEARCH
Research hub
Leaderboards
featured benchmark
Introducing SnorkelSpatial
A procedurally generated benchmark for evaluating allocentric and egocentric spatial reasoning capabilities in LLMs.
See the benchmark
Resources
RESOURCES
Resource library
Events
Blog
Docs
Featured blog
Terminal-Bench 2.0 is here.
Developed by Stanford and Laude Institute with contributions from Snorkel AI, it’s a major leap forward in evaluating AI coding agents.
Read more
Company
company
About
Careers
Press
Partners
Security
Contact us
Get started
Get started
Get a demo
Search result for:
Search
Submit
Clear
GenAI
Our best content on GenAI
Applied AI
What is specialized GenAI evaluation, and why is it so critical to enterprise AI?
Learn More
Applied AI
Why enterprise GenAI evaluation requires fine-grained metrics to be insightful
Learn More
Applied AI
Why GenAI evaluation requires SME-in-the-loop for validation and trust
Learn More
All articles and resources on GenAI
Content Type
Blog
Case Study
eBook
Event
Research Paper
Webinars