Evals

Perceptual Evaluations: Evals for Aesthetics — Diego Rodriguez, Krea.ai

Krea.ai cofounder Diego Rodriguez argues that current AI evaluation metrics fail to capture human perception and aesthetics, and advocates a new paradigm of personalized, perceptually aware evals.

AI Agent Development Tradeoffs You NEED to Know

Sherwood Callaway of 11X discusses the architecture of "Alice," an AI Sales Development Representative. He covers the practical decision to use LangGraph for its reliability in production, the infrastructure and observability challenges of using hosted agent platforms, and the team's methodology for running evals that mitigate hallucinations by comparing generated content against source data.
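
The idea of checking generated content against source data can be illustrated with a small sketch. This is not 11X's implementation, which the summary does not describe. The function names `fact_tokens`, `vocabulary`, and `unsupported_facts`, the example strings, and the heuristic itself (treating numbers and capitalized words as a crude proxy for checkable facts) are all assumptions made for illustration.

```python
import re


def fact_tokens(text: str) -> set[str]:
    """Numbers and capitalized words, lowercased: a crude proxy for checkable facts."""
    return {t.lower() for t in re.findall(r"\d+(?:\.\d+)?|[A-Z][a-z]+", text)}


def vocabulary(text: str) -> set[str]:
    """Every word and number in the source, lowercased."""
    return {t.lower() for t in re.findall(r"\d+(?:\.\d+)?|[A-Za-z]+", text)}


def unsupported_facts(generated: str, source: str) -> set[str]:
    """Fact-like tokens in the generated text that never appear in the source."""
    return fact_tokens(generated) - vocabulary(source)


# Hypothetical source record and two generated messages.
source = "Acme Corp was founded in 2015 and has 40 employees in Berlin."
grounded = "Acme was founded in 2015 in Berlin."
hallucinated = "Acme was founded in 2012 in Munich."

print(unsupported_facts(grounded, source))      # nothing flagged
print(unsupported_facts(hallucinated, source))  # flags the year and the city
```

A token-overlap check like this only catches facts that are absent from the source outright. It would miss a real number attached to the wrong entity, and it flags any capitalized word the source lacks, so a production eval would likely need something stronger, such as a model-based grader.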