Agentic ai

MagenticLite is here: A full-stack agentic experience powered by Small Models

MagenticLite is here: A full-stack agentic experience powered by Small Models

Microsoft Research introduces MagenticLite, an agentic framework powered by two new small, open-weight models: Magentic Orchestrator for planning and coding, and Fara-1.5 for browser automation. The talk details the novel synthetic data generation techniques and training strategies used to achieve state-of-the-art performance in small models, enabling them to compete with much larger ones.

Why Agentic AI Fails: Infinite Loops, Planning Errors, and More

Why Agentic AI Fails: Infinite Loops, Planning Errors, and More

Agentic AI failures are often predictable system design flaws, not random hallucinations. This summary explores the top three failure modes—infinite loops, hallucinated planning, and unsafe tool use—and provides practical strategies for designing more reliable and robust AI agents.

Dark Factory: How OpenClaw Ships Faster Than You Can Read the Diff — Vincent Koc

Dark Factory: How OpenClaw Ships Faster Than You Can Read the Diff — Vincent Koc

Vincent Koc argues that static benchmarks are failing in the era of adaptive AI. He proposes a shift from static testing to 'malleable evals,' where agents self-optimize and curate their own test suites based on user intent and production data, treating evaluation as a living, evolving system.

Why AI Agents Shouldn't Replace Your Fraud Models

Why AI Agents Shouldn't Replace Your Fraud Models

Varant Zanoyan, original author of the Chronon feature platform, introduces 'agentic experimentation'—a pattern where AI agents improve high-stakes ML systems without making live decisions. He explains how Chronon solves key challenges like infrastructure sprawl, safety, and reproducibility through a semantic API, branch-based isolation, and compute reuse, enabling agents to safely create production-ready pipelines for human review.

Serverless Agents: Real-World Tooling with Strands SDK, MCP & AWS • Akshatha Laxmi • GOTO 2025

Serverless Agents: Real-World Tooling with Strands SDK, MCP & AWS • Akshatha Laxmi • GOTO 2025

A deep dive into building production-ready, stateless, and scalable LLM agents by leveraging the Model Context Protocol (MCP) and Strands SDK on AWS Lambda. The session demonstrates how to expose real-world functionality to language models, moving beyond mere reasoning to tangible action.

How RAG, GraphRAG, and Context Engineering Improve AI Performance

How RAG, GraphRAG, and Context Engineering Improve AI Performance

Martin Keen explains that context, not model intelligence, is the biggest bottleneck in AI. He introduces Context Engineering, its four pillars (Connected Access, Knowledge Layer, Precision Retrieval, Runtime Governance), and advanced techniques like GraphRAG to build more reliable, context-aware AI systems.