Llm

What AI Agent Skills Are and How They Work

What AI Agent Skills Are and How They Work

AI agents, powered by LLMs, excel at reasoning but lack the procedural knowledge required for real-world workflows. Martin Keen explains how the 'agent skills' open standard solves this by packaging step-by-step instructions, enabling agents to automate complex tasks efficiently and reliably.

Code Mode - Sunil Pai, Cloudflare

Code Mode - Sunil Pai, Cloudflare

Sunil Pai from Cloudflare introduces "Code Mode," a paradigm where AI agents generate and execute code (like JavaScript) instead of using traditional JSON-based tool calling. This approach enables more efficient, stateful, and complex interactions with large-scale systems by leveraging the inherent capabilities of programming languages.

Building Agentic Applications with Spring AI • Matthew Meckes • GOTO 2025

Building Agentic Applications with Spring AI • Matthew Meckes • GOTO 2025

Matthew Meckes from AWS makes a compelling case for Java's central role in the future of enterprise AI. This talk explores how Spring AI empowers developers to build robust, production-ready agentic applications by integrating LLMs with existing Java services, moving beyond proofs-of-concept to solve real-world business problems.

LLM Compression Explained: Build Faster, Efficient AI Models

LLM Compression Explained: Build Faster, Efficient AI Models

Learn how AI model compression and quantization techniques are essential for optimizing Large Language Model (LLM) performance and significantly reducing inference costs in production. This deep dive covers practical examples, benefits like reduced latency and increased throughput, and strategies for different AI use cases, demonstrating how to deploy scalable AI with minimal accuracy degradation.

A Common-Sense Guide to AI Engineering • Jay Wengrow & Kris Jenkins • GOTO 2026

A Common-Sense Guide to AI Engineering • Jay Wengrow & Kris Jenkins • GOTO 2026

Jay Wengrow, author of “A Common-Sense Guide to AI Engineering,” breaks down how AI agents work, describing the 'clever hack' of intercepting LLM output to trigger functions. The discussion covers multi-agent architectures for complex tasks, implementing guardrails with regex and judge LLMs, and a pragmatic take on when to use frameworks versus building from scratch. Wengrow emphasizes understanding fundamentals over specific tools to create robust, production-ready AI applications.

Everything We Got Wrong About Research-Plan-Implement -  Dexter Horthy

Everything We Got Wrong About Research-Plan-Implement - Dexter Horthy

Dexter Horthy of HumanLayer critiques the initial Research-Plan-Implement (RPI) framework for AI coding agents, revealing its tendency to encourage 'outsourcing thinking'. He introduces CRISPR, a new structured methodology that emphasizes smaller, focused prompts, human-agent alignment through artifacts like Design Discussions, and engineer ownership to combat 'slop' and improve code quality in complex projects.