OpenAI researcher Jason Wolfe explains the Model Spec, the public framework defining intended model behavior. This summary covers its core principles like the 'chain of command,' how it handles complex edge cases, its evolution through public feedback, and its future role in an increasingly autonomous AI landscape.
The Department of War is making a huge mistake.
An analysis of the conflict between Anthropic and the US Department of War, exploring its implications for AI alignment, regulation, and the future of mass surveillance. The author argues that while Anthropic's stance is commendable, the structural nature of AI favors authoritarianism, making societal norms and specific laws—not broad regulatory bodies—the only viable defense for a free society.
How to Make AI Forget
Ben Luria, CEO of Hirundo, discusses the critical need for machine unlearning, framing it as a form of "AI neuro-surgery" for enterprise AI. He explains how this technique directly modifies model weights to remove unwanted data and behaviors, addressing core risks that superficial solutions like guardrails cannot solve.
CES 2026 AI highlights: NVIDIA Rubin & wild gadgets
This episode explores the strategic implications of the Disney-OpenAI licensing deal, critiques Time Magazine's "Architects of AI" focus on business over research, analyzes NVIDIA's full-stack ambitions with the Neotron 3 model release, and delves into Anthropic's unique approach to AI safety with the "Claude Soul Document".
The Algorithm That IS The Scientific Method [Dr. Jeff Beck]
Dr. Jeff Beck argues that the future of AI lies not in scaling up large language models, but in building systems that mirror the brain's approach to understanding the world. He posits that true intelligence is grounded in physics and object-centered models, not language. Beck's vision involves creating AI composed of numerous small, modular models—much like a video game engine—that can be dynamically combined, updated through continual learning, and understand the world through causal relationships and forces. This approach, he claims, will solve key challenges in generalization, robotics, and alignment by enabling machines to 'know what they don't know' and reason about the physical world in a way that is fundamentally similar to humans.
Emmett Shear on Building AI That Actually Cares: Beyond Control and Steering
Emmett Shear, founder of Twitch and former OpenAI interim CEO, presents a new paradigm for AI alignment called "organic alignment." He argues that the prevalent "steering and control" model is fundamentally flawed, potentially leading to disaster. Shear advocates for developing AI systems that learn to genuinely care about humans, treating alignment as a continuous process rather than a fixed state.