Image
  • Product
      • SNORKEL AI DATA DEVELOPMENT PLATFORM
      • Snorkel Expert Data-as-a-Service
      • Platform Overview
      • Snorkel Evaluate
      • Snorkel Develop
      • Snorkel Predictive ML
  • Expert Data
    • CUSTOM EXPERT-LEVEL DATA
    • Expert Data-as-a-Service
    • Expert Data-as-a-Service Leaderboard
    • Expert Network
  • Leaderboards
  • Solutions
      • SERVICES
      • Snorkel Expert Data-as-a-Service – Learn more about Snorkel’s white-glove service for creating expert training and evaluation data.
      • INDUSTRIES
      • Banking & Finance
      • Healthcare
      • Insurance
      • Public Sector
      • Customers
      • Customer Stories – See how Snorkel is powering innovation in the Fortune 500 and beyond.
  • Research
  • Resources
      • LEARN
      • Blog
      • Resource Library
      • Docs
      • ENGAGE
      • Events & Conferences
      • Webinars
      • Weekly Demos
      • AI PRIMERS
      • Data-centric AI
      • Data Labeling
      • Generative AI
      • Large Language Models
      • LLM evaluation
  • Company
    • About Us
    • Careers
    • Partners
    • Press & News
    • Contact Us
  • Docs
    • Welcome to Snorkel
    • Installation Overview
    • SDK Reference
    • Glossary
    • Full Documentation
  • Talk to an AI expert
  • Get a demo
Talk to an AI expert
Get a demo
Search result for:
See all articles
Awards

The AI 50 2023

Date: April 11, 2023
Updated: September 27, 2024

Recommended
articles

See all articles
Data development

Building the Benchmark: Inside Our Agentic Insurance Underwriting Dataset

In this post, we unpack how Snorkel built a realistic benchmark dataset to evaluate AI agents in commercial insurance underwriting. From expert-driven data design to multi-tool reasoning tasks, see how our approach surfaces actionable failure modes that generic benchmarks miss—revealing what it really takes to deploy AI in enterprise workflows.

Snorkel Team
July 10, 2025
Data development

Evaluating AI Agents for Insurance Underwriting

In this post, we will show you a specialized benchmark dataset we developed with our expert network of Chartered Property and Casualty Underwriters (CPCUs). The benchmark uncovers several model-specific and actionable error modes, including basic tool use errors and a surprising number of insidious hallucinations from one provider. This is part of an ongoing series of benchmarks we are releasing across verticals…

Chris Glaze
June 26, 2025
Data development

LLM Observability: Key Practices, Tools, and Challenges

LLM observability is crucial for monitoring, debugging, and improving large language models. Learn key practices, tools, and strategies of LLM observability.

Snorkel Team
June 23, 2025

Join our newsletter for expert advice, the latest research, and exclusive events.

By submitting this form, I acknowledge I will receive email updates from Snorkel AI, and I agree to the Terms of Use and acknowledge that my information will be used in accordance with the Privacy Policy.
Image

Product

  • Platform Overview
  • Snorkel Evaluate
  • Snorkel Develop
  • Snorkel Expert Data-as-a-Service
  • Predictive ML

Solutions

Services

  • Snorkel Expert Data-as-a-Service

Industries

  • Banking & finance
  • Healthcare
  • Insurance
  • Public sector

Customers

  • Customer stories

Resources

Learn

  • Blog
  • Resource library
  • Docs

Engage

  • Events & conferences
  • Webinars
  • Weekly demos

AI Primers

  • Data-centric AI
  • Data labeling
  • Generative AI
  • Large language models
  • LLM evaluation

Docs

  • Welcome to Snorkel
  • Installation overview
  • SDK reference
  • Glossary
  • Full documentation

AI Research

  • Snorkel research
  • Research papers

Company

  • About
  • Careers
  • Partners
  • Press & news
  • Security

Contact

  • Contact us
  • Request a demo

Compliance

ImageImage

Copyright © 2025 Snorkel AI, Inc. All rights reserved.
Terms of Use Privacy Cookie Policy
Image
Image
Image