Introducing Sora 2
A detailed overview of OpenAI's announcement of Sora 2, a flagship video and audio generation model, and the new Sora app, which introduces novel features like "Cameo" for personalized content creation and a new social experience.
A detailed overview of OpenAI's announcement of Sora 2, a flagship video and audio generation model, and the new Sora app, which introduces novel features like "Cameo" for personalized content creation and a new social experience.
The panel discusses NVIDIA's $100 billion investment in OpenAI, analyzing the trend towards vertically integrated AI 'tribes'. They also explore the rise of specialized open-source models like Tongyi DeepResearch, Google's new AP2 agent protocol for secure e-commerce, the ongoing debate on AI existential risk, and Apple's practical approach to wearable AI with the new real-time translation feature in AirPods.
OpenAI’s Chief Scientist, Jakub Pachocki, and Chief Research Officer, Mark Chen, discuss the research behind GPT-5, the push toward long-horizon reasoning, and the grand vision of an automated researcher. They cover how OpenAI evaluates progress beyond saturated benchmarks, the surprising durability of reinforcement learning, and the culture required to protect fundamental research while shipping world-class products.
Russ d'Sa, CEO of LiveKit, recounts the company's unexpected journey from a pandemic-era open-source WebRTC project to becoming a crucial infrastructure provider for AI voice interfaces, most notably for OpenAI's ChatGPT. He details the serendipitous moments that led to this pivot and shares his vision for LiveKit as the nervous system for a multimodal AI future.
Experts discuss an OpenAI paper that reframes hallucinations as a feature driven by training incentives, not just a bug. The panel also revisits Dario Amodei's prediction on AI coding, explores AI's chaotic impact on the job market, and imagines the future of running LLMs on business-card-sized devices.
OpenAI's Build Hour provides a deep dive into GPT-5, showcasing its advanced coding and agentic capabilities. The session covers the new Responses API, critical for leveraging the model's reasoning, along with new parameters for steerability and practical prompting techniques for building complex, reliable applications.