European Internet School: Generative AI: From Foundations to Advanced Applications: Evaluating Generative Models: Metrics, Benchmarks, and Human-in-the-Loop Testing