AI Alignment

Mar 25, 2026

Episode 15 - Inside the Model Spec

OpenAI researcher Jason Wolfe explains the Model Spec, the public framework defining intended model behavior. This summary covers its core principles like the 'chain of command,' how it handles complex edge cases, its evolution through public feedback, and its future role in an increasingly autonomous AI landscape.

Mar 11, 2026

The Department of War is making a huge mistake.

An analysis of the conflict between Anthropic and the US Department of War, exploring its implications for AI alignment, regulation, and the future of mass surveillance. The author argues that while Anthropic's stance is commendable, the structural nature of AI favors authoritarianism, making societal norms and specific laws—not broad regulatory bodies—the only viable defense for a free society.

Jan 15, 2026

How to Make AI Forget

Ben Luria, CEO of Hirundo, discusses the critical need for machine unlearning, framing it as a form of "AI neuro-surgery" for enterprise AI. He explains how this technique directly modifies model weights to remove unwanted data and behaviors, addressing core risks that superficial solutions like guardrails cannot solve.

Jan 09, 2026

CES 2026 AI highlights: NVIDIA Rubin & wild gadgets

This episode explores the strategic implications of the Disney-OpenAI licensing deal, critiques Time Magazine's "Architects of AI" focus on business over research, analyzes NVIDIA's full-stack ambitions with the Neotron 3 model release, and delves into Anthropic's unique approach to AI safety with the "Claude Soul Document".

Dec 31, 2025

The Algorithm That IS The Scientific Method [Dr. Jeff Beck]

Dr. Jeff Beck argues that the future of AI lies not in scaling up large language models, but in building systems that mirror the brain's approach to understanding the world. He posits that true intelligence is grounded in physics and object-centered models, not language. Beck's vision involves creating AI composed of numerous small, modular models—much like a video game engine—that can be dynamically combined, updated through continual learning, and understand the world through causal relationships and forces. This approach, he claims, will solve key challenges in generalization, robotics, and alignment by enabling machines to 'know what they don't know' and reason about the physical world in a way that is fundamentally similar to humans.

Nov 17, 2025

Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering

Emmett Shear, founder of Twitch and former OpenAI interim CEO, presents a new paradigm for AI alignment called "organic alignment." He argues that the prevalent "steering and control" model is fundamentally flawed, potentially leading to disaster. Shear advocates for developing AI systems that learn to genuinely care about humans, treating alignment as a continuous process rather than a fixed state.