Snorkel at CAIS
Join Snorkel at CAIS, connecting leaders building safe, reliable AI systems.
San Jose, CA
May 26-29, 2026
Accepted conference paper
Benchmarking Agents in Insurance Underwriting Environments
As AI agents integrate into enterprise applications, their evaluation demands benchmarks that reflect the complexity of real-world operations. Instead, existing benchmarks overemphasize open-domains such as code, use narrow accuracy metrics, and lack authentic complexity.
We present UNDERWRITE, an expert-first, multi-turn insurance underwriting benchmark designed in close collaboration with domain experts to capture real-world enterprise challenges.
↳ Read the paper






