Snorkel at CAIS

Join Snorkel at CAIS, connecting leaders building safe, reliable AI systems.

San Jose, CA

May 26-29, 2026

Accepted conference paper

Benchmarking Agents in Insurance Underwriting Environments

By Amanda Dsouza, Ramya Ramakrishnan, Charles Dickens, Bhavishya Pohani, Christopher M Glaze

As AI agents integrate into enterprise applications, their evaluation demands benchmarks that reflect the complexity of real-world operations. Yet existing benchmarks overemphasize open domains such as code, rely on narrow accuracy metrics, and lack authentic complexity.

We present UNDERWRITE, an expert-first, multi-turn insurance underwriting benchmark designed in close collaboration with domain experts to capture real-world enterprise challenges.

↳ Read the paper