Ai agents

Scale AI CEO on Meta’s $14B deal, scaling Uber Eats to $80B, & what frontier labs are building next

Scale AI CEO on Meta’s $14B deal, scaling Uber Eats to $80B, & what frontier labs are building next

Jason Droege, CEO of Scale AI, discusses the evolution of AI training from simple labeling to complex, expert-driven tasks. He shares insights on the future of AI agents, the reality of enterprise AI adoption, and crucial business lessons learned from building Uber Eats from zero to a multi-billion dollar business.

Evals in Action: From Frontier Research to Production Applications

Evals in Action: From Frontier Research to Production Applications

An overview of OpenAI's approach to AI evaluation, covering the GDP-val benchmark for frontier models and the practical tools available for developers to evaluate their own custom agents and applications.

Every AI Founder Should Be Asking These Questions

Every AI Founder Should Be Asking These Questions

Jordan Fisher, co-founder of Standard AI and now at Anthropic, poses critical questions for startup founders facing the imminent arrival of AGI. He explores challenges from software commoditization and building trust in automated teams to finding durable moats and the ethical responsibility of building world-changing technology.

Ex-DeepMind: How To Actually Protect Your Data From AI

Ex-DeepMind: How To Actually Protect Your Data From AI

Dr. Ilia Shumailov, former DeepMind AI Security Researcher, explains why traditional security fails for AI agents. He details the unique threat model of agents, the dangers of supply chain attacks and architectural backdoors, and proposes a system-level solution called CAML to enforce security policies by design, separating model reasoning from data execution.

Software is Eating Labor

Software is Eating Labor

Alex Rampell of a16z explains how software is evolving from digitizing records to performing labor, shifting the industry's focus from the $300 billion SaaS market to the $13 trillion labor market. This transition, accelerated by AI, is forcing a change in business models from seat-based pricing to outcome-based pricing, creating new opportunities and expanding the total addressable market.

This week in AI models: Granite 4.0, Claude 4.5, Sora 2

This week in AI models: Granite 4.0, Claude 4.5, Sora 2

A deep dive into the latest AI model releases, including IBM's hyper-efficient Granite 4.0, Anthropic's code-focused Claude 4.5, and OpenAI's consumer-centric Sora 2. The discussion covers the strategic differentiation between major AI labs, the future of open-source, the rise of AI e-commerce agents, and the emerging cybersecurity challenges of social engineering AI.