Join our inaugural Reading Group in San Francisco on April 29. Register now
Search result for:
AI evaluation systematically measures a model’s performance on tasks. Classically, this applied metrics like accuracy or precision to clear and discrete numerical or categorical targets. Moden evaluation also assesses the output of generative models to ensure they create content within an organization’s standards and guidelines.