Agents

Build Hour: Agentic Tool Calling

Build Hour: Agentic Tool Calling

A deep dive into building agentic systems using OpenAI's latest APIs. The session covers the core concept of 'agentic tool calling' (reasoning + tools), outlines a four-part framework (Agent, Infrastructure, Product, Evaluation) for designing long-horizon tasks, and provides a hands-on demonstration of building a non-blocking task processing system with a real-time progress UI.

How 80,000 companies build with AI: Products as organisms and the death of org charts | Asha Sharma

How 80,000 companies build with AI: Products as organisms and the death of org charts | Asha Sharma

Asha Sharma, CVP of Product for Microsoft's AI Platform, shares insights from working with over 15,000 companies building AI. She discusses the shift from "product as artifact" to "product as organism," the rise of post-training as the new competitive moat, and how agents are transforming organizational structures from hierarchies ("org charts") into task-based networks ("work charts").

Knowledge is Eventually Consistent // Devin Stein // MLOps Podcast #335

Knowledge is Eventually Consistent // Devin Stein // MLOps Podcast #335

Devin Stein, CEO of Dosu, discusses a new paradigm for knowledge management where an AI agent learns from code, conversations, and tickets to create an 'eventually consistent' knowledge base. The conversation explores the lifecycle of facts, the challenges of agent interaction, and the future of documentation in a world of collaborating AI agents.

Designing AI-Intensive Applications - swyx

Designing AI-Intensive Applications - swyx

The field of AI Engineering is evolving from simple 1:1 applications to complex, AI-intensive systems with high LLM-call ratios. This talk explores the search for a 'Standard Model' for AI engineering, analogous to MVC or ETL in traditional software, proposing several candidates including LLM OS, LLM SDLC, and a new SPADE (Sync, Plan, Analyze, Deliver, Evaluate) model for building robust applications.

The Truth About LLM Training

The Truth About LLM Training

Paul van der Boor and Zulkuf Genc from Prosus discuss the practical realities of deploying AI agents in production. They cover their in-house evaluation framework, strategies for navigating the GPU market, the importance of fine-tuning over building from scratch, and how they use AI to analyze usage patterns in a privacy-preserving manner.

Prompt Engineering for Generative AI • James Phoenix, Mike Taylor & Phil Winder

Prompt Engineering for Generative AI • James Phoenix, Mike Taylor & Phil Winder

Authors James Phoenix and Mike Taylor discuss the evolution of prompt engineering from a creative art to a rigorous engineering discipline. They cover the core principles of prompting, the importance of programmatic evaluation, the role of agents, and how to manage application lifecycles as models evolve.