Monitoring

Everything You Need To Know About Agent Observability — Danny Gollapalli and Ben Hylak, Raindrop

Everything You Need To Know About Agent Observability — Danny Gollapalli and Ben Hylak, Raindrop

Agent failures are unlike traditional software failures. This workshop provides a practical framework for monitoring production agents, moving beyond evals to real-world observability by using explicit signals (errors, latency) and implicit signals (user frustration, refusals, self-diagnostics) to catch regressions and understand agent behavior.

Build and monitor multi-agent contact centers using Weights & Biases

Build and monitor multi-agent contact centers using Weights & Biases

This post explores the shift from costly legacy contact center software to multi-agent AI systems. It demonstrates how to build, monitor, and evaluate these complex agentic systems using the Weights & Biases AI Developer Platform, with a focus on tracing, quality assessment, and ensuring consistent customer support.

Production monitoring for AI applications using W&B Weave

Production monitoring for AI applications using W&B Weave

Learn how W&B Weave's online evaluations enable real-time monitoring of AI applications in production, allowing teams to track performance, catch failures, and iterate on quality over time using LLM-as-a-judge scores.