Snorkel Logo
Snorkel Logo
  • Capabilities
    How it works
    Research-led data and environment development for the frontier's hardest problems
    Learn more
    Data development
    Overview
    Expert-curated datasets for frontier AI
    Use cases
    See how our data improves frontier models
    Specialized agents
    Overview
    Custom AI systems built to unlock ROI fast
    Customer stories
    Real-world results from enterprise deployments
  • Research
    Research
    Research hub
    Our latest papers and data-centric AI findings
    Leaderboards
    Compare model performance across benchmarks
    Open Benchmarks Grants
    Funding for open-source AI research
    Featured BENCHMARK
    Image-Agentic Coding benchmark
    Agentic Coding benchmark
    A benchmark for evaluating AI models on complex, real-world coding tasks that require multi-step reasoning, tool use, and autonomous problem-solving.
  • Resources
    Resources
    Resource library
    Guides, papers, and tools for data-centric AI
    Events
    Upcoming talks, workshops, and conferences
    Reading Group
    AI discussions for researchers and practitioners
    Blog
    News, updates, and perspectives from our team
    featured BLOG
    Image-Terminal-Bench 2.0
    Terminal-Bench 2.0
    Developed by Stanford and Laude Institute with contributions from Snorkel AI, it’s a major leap forward in evaluating AI coding agents.
  • Company
    Company
    About
    Our mission, story, and values
    Careers
    Open roles and life at our company
    Press
    Media resources and announcements
    Partners
    Organizations we work with
    Security
    How we keep data safe
    Contact us
    Get in touch with our team
    Image-Join our expert community
    Join our expert community
    Get paid to shape safer, smarter AI
    Learn more
  • Get started
Get started
See all articles
Snorkel AI Named a 2022 Gartner® Cool Vendor
Awards

Snorkel AI Named a 2022 Gartner® Cool Vendor

Date: June 16, 2022
Share this article

Recommended
articles

See all articles
Research

Benchmarks should shape the frontier, not just measure it

Since launching the Open Benchmarks Grants, we’ve received more than 100 applications from academic groups and industry labs spanning a wide range of domains and capabilities. As the best benchmarks drive how the field allocates research effort, the bar for benchmarks has risen as well. Here, we share what’s now table stakes for useful benchmarks, and what separates the ones…

Vincent Sunn Chen
April 7, 2026
Research

Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory

To kick off our inaugural Benchtalks, a series dedicated to the researchers building these measurement toolkits, Snorkel AI co-founder Vincent Sunn Chen sat down with Alex Shaw, Founding MTS at Laude Institute and co-creator of Terminal-Bench and Harbor. Highlights More on Terminal-Bench: See the leaderboard and the catalog of tasks at tbench.ai. Explore Harbor: Learn how to scale your agent…

Vincent Sunn Chen
March 31, 2026
Research

Building FinQA: An Open RL Environment for Financial Reasoning Agents

TL;DR: We built FinQA — a financial question-answering environment with 290 expert-curated questions across 22 public companies, now available on OpenEnv. Agents use MCP tools to discover schemas, write constrained SQL queries, and answer multi-step questions from real SEC 10-K filings. Most open-source models struggle with this kind of multi-step tool use, and even frontier closed-source models, while more accurate,…

Bhavishya Pohani
March 30, 2026

Join our newsletter for expert advice, the latest research, and exclusive events.

By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.
Get started

How do you want to work with Snorkel?

Image
AI data development services
  • Accelerate the development of frontier AI models with expert-curated, enterprise-grade data.
  • Learn how Snorkel’s Data-as-a-Service helps teams label, refine, and evaluate high-quality, domain-specific datasets for your projects.
Talk to a data researcher
Image
Build specialized agents
  • Explore how Snorkel can collaborate with your product and development teams to build and deploy custom AI and agentic systems.
  • Solutions undergo rigorous testing based on your business criteria to ensure positive ROI, faster.
Talk to a strategist
Image
Become an expert contributor
  • Join the Snorkel Expert Contributor community and help shape the future of AI with your expertise.
  • Contribute to groundbreaking projects, share domain-specific insights, and get rewarded for your impact.
Join now
Image

Capabilities
How it works

Data development

Specialized agents

Use cases
Customer stories
Research

Research hub

Leaderboards
Open Benchmarks Grants
Resources

Resource library

Events

Reading Group

Blog
Company
About
Careers
Press
Partners
Security
Contact us
Contact
Get started
Join expert network
Compliance
ImageImage

Copyright © 2026 Snorkel AI, Inc. All rights reserved.
Terms of Use
Privacy
Cookie Policy
Image
Image
Image