Image

Snorkel at CAIS

Join Snorkel at CAIS, connecting leaders building safe, reliable AI systems.

San Jose, CA

May 26-29, 2026

Accepted conference paper

Benchmarking Agents in Insurance Underwriting Environments

As AI agents integrate into enterprise applications, their evaluation demands benchmarks that reflect the complexity of real-world operations. Instead, existing benchmarks overemphasize open-domains such as code, use narrow accuracy metrics, and lack authentic complexity.

We present UNDERWRITE, an expert-first, multi-turn insurance underwriting benchmark designed in close collaboration with domain experts to capture real-world enterprise challenges. 

 

↳ Read the paper
benchmarking-insurance-underwriting

Meet our team on-site

Paroma Varma headshot

Paroma Varma

Co-Founder and Head of Research
Chris Glaze headshot

Chris Glaze

Applied Research Scientist
Vincent Sunn Chen headshot

Vincent Sunn Chen

Research Fellow & Founding Team
Charles Dickens headshot

Charles Dickens

Senior Applied Research Scientist
Zhengyang (Jason) Qi headshot

Zhengyang (Jason) Qi

Research Scientist
Michael Ramirez headshot

Michael Ramirez

Principal Product Marketing Manager, Solutions

Partner with Snorkel Data Research Lab to build and evaluate AI that performs in the real world