

author
Research Engineer (Intern)
,
Snorkel AI
Jeong Shin completed her tenure as Research Intern at Snorkel AI in September 2025; her internship focused on building agentic evaluations. Before Snorkel, Jeong completed a masters degree with a focus on computer science from Stanford.
The latest from Jeong


Blog
Evaluating coding agent capabilities with Terminal-Bench: Snorkel’s role in building the next generation benchmark
Terminal-Bench, developed through a collaboration between Stanford University and Laude Institute, has quickly become the gold standard benchmark for evaluating AI agent capabilities in a command line environment. This comprehensive evaluation framework measures how effectively AI agents can perform complex, real-world tasks within terminal environments. At Snorkel AI, we’re excited to share that we’re one of the top collaborators contributing…



