author

Jeong Shin

Research Engineer (Intern)

Snorkel AI

Jeong Shin completed her tenure as Research Intern at Snorkel AI in September 2025; her internship focused on building agentic evaluations. Before Snorkel, Jeong completed a master's degree with a focus on computer science at Stanford.

The latest from Jeong

Blog

Evaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark

Terminal-Bench, developed through a collaboration between Stanford University and Laude Institute, has quickly become the gold-standard benchmark for evaluating AI agent capabilities in a command-line environment. This comprehensive evaluation framework measures how effectively AI agents can perform complex, real-world tasks within terminal environments. At Snorkel AI, we’re excited to share that we’re one of the top collaborators contributing…

Sep 30, 2025 • Kobie Crawford, Jeong Shin, Tom Walshe


For models that need to be right. Not just good enough.

