Foundation models

Health Tech Founders: The Future of Care Is Personalized, Proactive—and AI-Powered

Health Tech Founders: The Future of Care Is Personalized, Proactive—and AI-Powered

Bryan Kim from a16z, Jonathan Swerdlin from Function Health, and Daniel Cahn from Slingshot AI / Ash discuss how AI is revolutionizing healthcare and mental health by focusing on deep human context rather than just digital behavior. They explore building consumer trust, augmenting rather than replacing professionals, and creating new engagement models like "hourly active use" for a more accessible, continuous, and personalized future of care.

Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo

Waymo's EMMA: Teaching Cars to Think - Jyh Jing Hwang, Waymo

An exploration of Waymo's research into EMMA, an End-to-End Multimodal Model for Autonomous Driving. This summary details how foundation models like Gemini are being adapted to create a single, generalizable system that processes raw sensor data directly into driving decisions, aiming to solve the long-tail problem and improve scalability. It also covers the use of generative AI for advanced sensor simulation and model evaluation.

No Priors Ep. 124 | With SurgeAI Founder and CEO Edwin Chen

No Priors Ep. 124 | With SurgeAI Founder and CEO Edwin Chen

Edwin Chen, CEO of Surge AI, discusses the critical role of high-quality human data in training frontier models, the flaws in current evaluation benchmarks like LMSys and IF-Eval, the future of complex RL environments, and why he bootstrapped Surge to over $1 billion in revenue.

Chelsea Finn: Building Robots That Can Do Anything

Chelsea Finn: Building Robots That Can Do Anything

Developing general-purpose robots requires a shift from specialized, single-task systems to broad foundation models. This is achieved through a combination of large-scale, diverse, real-world data collection and a specific training methodology: pre-training on all available data and then fine-tuning on a curated, high-quality subset of demonstrations. This recipe, combined with architectural innovations to preserve the capabilities of Vision-Language Model (VLM) backbones, enables robots to perform complex, long-horizon tasks, generalize to unseen environments, and respond to open-ended human instructions.