Image
  • Capabilities
    overview
    Our technology
    The engine powering our solutions
    EXPERT DATA SERVICES
    Overview
    Expert-curated datasets for frontier AI
    Use cases
    From agentic systems to coding, explore 
data applications
    Join our expert community
    Get paid to shape safer, smarter AI
    Enterprise AI Solutions
    Overview
    Custom AI systems built to unlock ROI fast
    Customer stories
    Real-world results from enterprise deployments
  • Research
    RESEARCH
    Research hub
    Leaderboards
    Open Benchmarks Grants
    featured benchmark
    Introducing Agentic Coding
    A benchmark for evaluating AI models on complex, real-world coding tasks that require multi-step reasoning, tool use, and autonomous problem-solving.
    See the benchmark
  • Resources
    RESOURCES
    Resource library
    Events
    Reading Group
    Blog
    Docs
    Featured blog
    Terminal-Bench 2.0 is here.
    Developed by Stanford and Laude Institute with contributions from Snorkel AI, it’s a major leap forward in evaluating AI coding agents.
    Read more
  • Company
    company
    About
    Careers
    Press
    Partners
    Security
    Contact us
  • Get started
Get started
Get a demo
Search result for:
See all articles
The AI 50 2023
Awards

The AI 50 2023

Date: April 11, 2023
Share this article

Recommended
articles

See all articles
Research

Benchmarks should shape the frontier, not just measure it

Since launching the Open Benchmarks Grants, we’ve received more than 100 applications from academic groups and industry labs spanning a wide range of domains and capabilities. As the best benchmarks drive how the field allocates research effort, the bar for benchmarks has risen as well. Here, we share what’s now table stakes for useful benchmarks, and what separates the ones…

Vincent Sunn Chen
April 7, 2026
Research

Benchtalks #1: Alex Shaw (Terminal-Bench, Harbor) – Building the Benchmark Factory

To kick off our inaugural Benchtalks, a series dedicated to the researchers building these measurement toolkits, Snorkel AI co-founder Vincent Sunn Chen sat down with Alex Shaw, Founding MTS at Laude Institute and co-creator of Terminal-Bench and Harbor. Highlights More on Terminal-Bench: See the leaderboard and the catalog of tasks at tbench.ai. Explore Harbor: Learn how to scale your agent…

Vincent Sunn Chen
March 31, 2026
Research

Building FinQA: An Open RL Environment for Financial Reasoning Agents

TL;DR: We built FinQA — a financial question-answering environment with 290 expert-curated questions across 22 public companies, now available on OpenEnv. Agents use MCP tools to discover schemas, write constrained SQL queries, and answer multi-step questions from real SEC 10-K filings. Most open-source models struggle with this kind of multi-step tool use, and even frontier closed-source models, while more accurate,…

Bhavishya Pohani
March 30, 2026

Join our newsletter for expert advice, the latest research, and exclusive events.

By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.
CONNECT WITH US

How do you want to work 
with Snorkel?

Image
AI data development services
  • Accelerate the development of frontier AI models with expert-curated, enterprise-grade data.
  • Learn how Snorkel’s Data-as-a-Service helps teams label, refine, and evaluate high-quality, domain-specific datasets for your projects.
Talk to a data researcher
Image
AI solutions engineering
  • Explore how Snorkel can collaborate with your product and development teams to build and deploy custom AI and agentic systems.
  • Solutions undergo rigorous testing based on your business criteria to ensure positive ROI, faster.
Talk to a strategist
Image
Become an expert contributor
  • Join the Snorkel Expert Contributor community and help shape the future of AI with your expertise.
  • Contribute to groundbreaking projects, share domain-specific insights, and get rewarded for your impact.
Join now
Image
Capabilities
Overview
Our technology
Enterprise AI solutions
Overview
Customer stories
Expert data services
Overview
Use cases
Join our expert community
Solutions
Industries
Banking & finance
Healthcare
Insurance
Public sector
Resources
Resource library
Events
Blog
Docs
AI Primers
Data-centric AI
Data labeling
Generative AI
Large language models
LLM evaluation
Research
Research hub
Leaderboards
Open Benchmarks Grants
Company
About
Careers
Press
Partners
Security
Contact us
Contact
Get started
Apply to join the expert network
Become a Data Development Partner
Compliance
ImageImage

Copyright © 2026 Snorkel AI, Inc. All rights reserved.

Terms of Use Privacy Cookie Policy
Image
Image
Image