AI alignment

How to Make AI Forget

Ben Luria, CEO of Hirundo, discusses the critical need for machine unlearning, framing it as a form of "AI neuro-surgery" for enterprise AI. He explains how this technique directly modifies model weights to remove unwanted data and behaviors, addressing core risks that superficial solutions like guardrails cannot solve.

CES 2026 AI highlights: NVIDIA Rubin & wild gadgets

This episode explores the strategic implications of the Disney-OpenAI licensing deal, critiques Time Magazine's "Architects of AI" focus on business over research, analyzes NVIDIA's full-stack ambitions with the Nemotron 3 model release, and delves into Anthropic's unique approach to AI safety with the "Claude Soul Document".

Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering

Emmett Shear, founder of Twitch and former OpenAI interim CEO, presents a new paradigm for AI alignment called "organic alignment." He argues that the prevalent "steering and control" model is fundamentally flawed, potentially leading to disaster. Shear advocates for developing AI systems that learn to genuinely care about humans, treating alignment as a continuous process rather than a fixed state.

Sam, Jakub, and Wojciech on the future of OpenAI with audience Q&A

Sam Altman and Jakub Pachocki present OpenAI's updated strategy, detailing a concrete research roadmap towards an automated AI researcher by 2028, a vision for an open AI platform, and massive infrastructure plans totaling $1.4 trillion. They also introduce a new corporate structure with a non-profit foundation focused on using AI to cure diseases and build AI resilience.

Why AI Needs Culture (Not Just Data) - Prolific [Sponsored]

Sara Saab and Enzo Blindow from Prolific discuss the critical, and growing, need for high-quality human evaluation in the age of non-deterministic AI. They explore the limitations of current benchmarks, the dangers of agentic misalignment as revealed by Anthropic's research, and how Prolific is building a "science of evals" by treating human feedback as a robust infrastructure layer.

No Priors Ep. 135 | With Humans& Founder Eric Zelikman

Eric Zelikman, formerly of Stanford and xAI, discusses his research on AI reasoning (STaR, Quiet-STaR) and introduces his new venture, humans&. He argues for a paradigm shift from building AI with pure IQ to AI with EQ, focusing on long-term memory, human collaboration, and empowering users to achieve their full potential.