Observability

LLMOps for eval-driven development at scale

LLMOps for eval-driven development at scale

Mercari's engineering team shares their practical, evaluation-centric approach to LLMOps. Learn how they leverage tiered evaluations, strategic tooling for observability, and rapid iteration to productionize LLM features for over 23 million users, emphasizing that good 'evals' are often more critical than model fine-tuning or RAG.

From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents

From DevOps ‘Heart Attacks’ to AI-Powered Diagnostics With Traversal’s AI Agents

Anish Agarwal and Raj Agrawal, co-founders of Traversal, discuss how their AI agents automate root cause analysis (RCA) for critical system failures. They detail their agent's architecture, which leverages causal inference and large-scale computation to systematically find the root cause in minutes, and argue that the rise of AI-generated code makes AI-powered debugging an essential capability for modern software engineering.