Mlops

The Hidden Bottlenecks Slowing Down AI Agents

The Hidden Bottlenecks Slowing Down AI Agents

Paul van der Boor and Bruce Martens from Prosus discuss the real bottlenecks in AI agent development, arguing that the primary challenges are not tools, but rather evaluation, data quality, and feedback loops. They detail their 'buy-first' philosophy, the practical reasons they often build in-house, and how new coding agents like Devon and Cursor are changing their development workflows.

Enterprise AI Adoption Challenges

Enterprise AI Adoption Challenges

Paul van der Boor and Sean Kenny from Prosus detail the journey of Toqan, an internal AI platform that evolved from a Slack experiment into a sophisticated agentic system. They share insights on driving enterprise adoption, key metrics for measuring productivity, and their future vision of an "AI Workforce" where employees architect AI agents to automate complex, cross-system tasks.

Introduction to LLM serving with SGLang - Philip Kiely and Yineng Zhang, Baseten

Introduction to LLM serving with SGLang - Philip Kiely and Yineng Zhang, Baseten

A deep dive into SGLang, an open-source serving framework for LLMs. This summary covers its core features, history, performance optimization techniques like CUDA Graph and Eagle 3 speculative decoding, and how to contribute to the project.

Real-time Feature Generation at Lyft // Rakesh Kumar // MLOps Podcast #334

Real-time Feature Generation at Lyft // Rakesh Kumar // MLOps Podcast #334

Rakesh Kumar from Lyft details the evolution of their real-time feature generation platform, from cron jobs to a sophisticated streaming architecture using Apache Beam and Flink. Key discussions include solving the 'hot shard' problem with geohashes, building a custom geospatial feature store, and optimizing pipelines with YAML-based configurations.

The Quantum Advantage Is Real—But Where's the Infrastructure?

The Quantum Advantage Is Real—But Where's the Infrastructure?

While general-purpose quantum computers are a decade away, specialized quantum accelerators are already tackling high-speed inference for AI problems in finance and pharma. This summary explores the practical use cases, the immense data ops and MLOps challenges due to the 'no-cloning theorem,' and the need for a new modeling paradigm based on topological data analysis.

MLflow 3.0: The Future of AI Agents

MLflow 3.0: The Future of AI Agents

Eric Peter from Databricks outlines the evolution from the traditional MLOps lifecycle to the more complex Agent Ops lifecycle. He details the five essential components of a successful agent development platform and introduces MLflow 3.0, a new release designed to provide a comprehensive, open-standard solution for building, evaluating, and deploying AI agents.