dais-logo-2026

Snorkel at Databricks Data + AI Summit

Join Snorkel and thousands of your peers for 800+ sessions, keynotes and training at the world’s largest data, analytics and AI conference.

San Francisco, CA

June 15-18, 2026

Featured session

The Art & Science of Benchmarking Agents

Track: Artificial Intelligence & Agents

Our ability to measure AI has been outpaced by our ability to develop it, and this eval gap is one of the most important problems in AI. We need more enduring benchmarks to close this gap, and consequently advance entire new vectors of capabilities for the field. In this talk, I'll share our learnings evaluating agents, drawing from experience working with nearly all global frontier labs and leading academics. We'll discuss the science (i.e., mechanics that make benchmarks rigorous and effective) and art (i.e., intangibles driving ambitious and enduring benchmarks) of building great benchmarks. I'll close by sharing some of the learnings from Open Benchmarks Grants— a $3M initiative in partnership with Hugging Face, Together AI, Prime Intellect, Factory, and others.

Image
Speaker

Armin Parchami

Sr. Director of Research Engineering, Snorkel AI

Frontier models inforgraphics

Meet our team on-site

Armin Parchami headshot

Armin Parchami

Sr. Director, R&D
Harini Subramanyan headshot

Harini Subramanyan

Senior Applied AI Engineer
Reem Khattab headshot

Reem Khattab

Senior Technical Delivery Manager
Michael Ramirez headshot

Michael Ramirez

Principal Product Marketing Manager, Solutions
Incynthia Truong headshot

Incynthia Truong

Community Manager

Partner with Snorkel Data Research Lab to build and evaluate AI that performs in the real world