Agentic ai

The Age Of The 40-Year-Old Solo Founder Is Here

The Age Of The 40-Year-Old Solo Founder Is Here

Bryant Chou, co-founder of Webflow, introduces his new AI-powered platform, Ploy. This episode delves into how Ploy transcends traditional website builders by integrating analytics, CRM, and SEO to autonomously optimize marketing. Chou discusses Ploy's 'anti-slop' approach, leveraging curated data and expertise to produce high-quality web designs, and reflects on building a startup in the AI era compared to Webflow's early days. He also explores the competitive moat of purpose-built AI, the concept of 'agents as customers,' and how experienced founders can leverage AI to 'clone themselves' and achieve unprecedented speed and scale.

Sovereign Escape Velocity: Ownership w Open Models — Gus Martins, & Ian Ballantyne, Google DeepMind

Sovereign Escape Velocity: Ownership w Open Models — Gus Martins, & Ian Ballantyne, Google DeepMind

Google DeepMind's Ian Ballantyne and Gus Martins introduce Gemma 4, a family of open models delivering state-of-the-art performance with remarkable size efficiency. They discuss how models like the 31B variant outperform competitors 2-20x its size while running on a single GPU, the shift to an Apache 2.0 license to foster sovereignty and adoption, and the new economics of running powerful agentic workloads on hardware ranging from a Pixel phone to a single enterprise GPU.

Building AI Agent Systems and Scaling Challenges in Agentic AI

Building AI Agent Systems and Scaling Challenges in Agentic AI

Scaling agentic AI systems presents unique challenges beyond traditional software scaling. This summary explains why expanding a single agent's capabilities leads to non-linear increases in cost, latency, and failure propagation. The talk frames this as a systems design problem solved by moving from a monolithic agent to a multi-agent architecture with distributed responsibilities, and it explores the critical architectural trade-offs between horizontal and vertical scaling of agent capabilities.

Task Fidelity Scaling Laws — Kobie Crawdord, Snorkel

Task Fidelity Scaling Laws — Kobie Crawdord, Snorkel

An experiment by Snorkel AI reveals that in agentic AI training, the quality of tasks is paramount. Using the same model and compute, fine-tuning on high-quality tasks yielded a 6% performance improvement, a 5x greater uplift compared to the 1% gain from low-quality tasks. The key difference lies in the nature of the tasks: high-quality tasks are genuinely harder, featuring more tool calls and cleaner failure modes that provide a meaningful learning signal. In contrast, low-quality tasks often fail due to ambiguity and environmental noise, hindering effective model improvement.

How Cursor Trained Composer on Fireworks: Distributed Infrastructure for High-Performance RL

How Cursor Trained Composer on Fireworks: Distributed Infrastructure for High-Performance RL

Cursor's Federico Cassano and Fireworks' Dmytro Dzhulgakov detail their collaboration on Composer 2, a specialized foundation model for software engineering. They discuss their top-down training strategy, the infrastructure challenges of large-scale distributed Reinforcement Learning on sparse models, and how model specialization achieves frontier performance with superior efficiency.

Agentic Evaluations at Scale, For Everybody — Nicholas Kang & Michael Aaron, Google DeepMind

Agentic Evaluations at Scale, For Everybody — Nicholas Kang & Michael Aaron, Google DeepMind

Nicholas Kang and Michael Aaron from Google DeepMind's Kaggle team discuss the broken state of AI evaluations—scattered, non-transparent, and created by a homogenous group. They present their solutions: a community-driven benchmarks platform, a PvP Game Arena for non-saturating ELO ratings, standardized agent exams, and hackathons to crowdsource novel evals and address the limitations of current benchmarking practices.