Tokenless

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Mar 06, 2026

Machine Learning

View All

Feb 20, 2026

Migrating from Neptune to Weights & Biases

A technical guide on migrating ML experiments from Neptune to Weights & Biases, covering the migration script, API-level code changes, and best practices for organizing projects and analyzing results in the W&B platform before the Neptune sunset.

Jan 26, 2026

W&B Models end-to-end demo

W&B Models is the system of record for the entire model development lifecycle. This guide explores how to monitor training, tune hyperparameters, track artifacts and lineage for reproducibility, and automate MLOps workflows like evaluation and deployment using a central platform.

Jan 08, 2026

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Artificial Intelligence

View All

Mar 06, 2026

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Kwangjun Ahn from Microsoft Research provides a technical overview of orthonormal optimizers (like Muon and Dion2), a new class of algorithms for large-scale AI model training that are emerging as powerful successors to AdamW. The talk covers their theoretical foundations, empirical benefits, distributed implementation strategies, and practical guidelines for integration into modern training pipelines.

Mar 06, 2026

Inside Perplexity Computer’s agent platform

Experts on the Mixture of Experts podcast analyze Perplexity Computer's pivot to agent orchestration and debate its closed-system approach versus open alternatives like OpenClaw. They also discuss Anthropic's new memory import feature for Claude, questioning if memory is still a competitive moat, and explore NullClaw, a minimalist agent framework that sparks a conversation about the future of edge-based agent swarms. Finally, they tackle the controversial debut of Tilly Norwood, the world's first AI actor, and debate the implications for the entertainment industry and the personification of AI.

Mar 06, 2026

Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor

Cursor's team discusses their latest Cloud Agents launch, which gives agents full cloud VMs to test changes, record demo videos, and provide remote access. We explore parallel model swarms, bug reproduction workflows, and the future of agentic coding where throughput and new bottlenecks in review and CI/CD take center stage.

Technology

View All

Mar 02, 2026

Platform Engineering • Ajay Chankramath & Nic Cheneweth • GOTO 2026

Ajay Chankramath and Nic Cheneweth discuss the critical elements of effective platform engineering, emphasizing a product mindset, the foundational role of control planes and API-first design, the common pitfalls of implementing Backstage, and the emerging impact of AI and agents on the platform landscape.

Feb 27, 2026

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer

Experts Sam Newman, Jacqui Read, and Simon Rohrer explore the nuances of software design, its intersection with architecture, and the critical role of communication in scaling technical clarity. The discussion covers practical advice on implementing Architectural Decision Records (ADRs), the evolving role of the architect as a facilitator, and strategies for creating agile enterprise architectures.

Feb 26, 2026

Learn Docker in a Month of Lunches • Elton Stoneman & Bret Fisher • GOTO 2026

Docker educators Bret Fisher and Elton Stoneman discuss the second edition of Stoneman's book, "Learn Docker in a Month of Lunches". They explore why Docker fundamentals remain crucial in a Kubernetes-dominated world, the evolution of the container ecosystem over the past five years, and the key skills that differentiate a Docker expert from a beginner, such as multi-platform builds, security, and configuration management.

Recent Post

Jan 15, 2026

How to Make AI Forget

Ben Luria, CEO of Hirundo, discusses the critical need for machine unlearning, framing it as a form of "AI neuro-surgery" for enterprise AI. He explains how this technique directly modifies model weights to remove unwanted data and behaviors, addressing core risks that superficial solutions like guardrails cannot solve.

Jan 15, 2026

What are State Space Models? Redefining AI & Machine Learning with Data

State Space Models (SSMs) are emerging as a powerful and efficient alternative to Transformers for handling sequential data. Aaron Baughman explains the core concepts of SSMs, their mathematical foundations, and how architectures like S4 and Mamba address the memory and scalability challenges inherent in Transformers, leading to a new generation of faster, more intelligent hybrid AI models.

Jan 15, 2026

AI and the Future of Warfare with US Under Secretary of War Emil Michael

Emil Michael, the Under Secretary of War for Research and Engineering, details the radical technological transformation of the US military. He discusses the architecture and rapid launch of GenAI.mil, an internal AI platform powered by Gemini that reached over one million users in 30 days. He also outlines critical technology priorities, including scaled hypersonics and autonomous drone swarms, and the urgent need to rebuild the American defense industrial base for a new era of global competition.

Jan 14, 2026

How Ricursive Intelligence’s Founders are Using AI to Shape The Future of Chip Design

Anna Goldie and Azalia Mirhoseini of Ricursive Intelligence discuss how their work on Google's AlphaChip, which used AI to design TPUs, is now being extended to automate the entire chip design process. They explain their vision for a 'designless' industry and a recursive self-improvement loop where AI designs better chips, which in turn accelerates AI development.

Jan 14, 2026

Identity for AI Agents - Patrick Riley & Carlos Galan, Auth0

This session from Okta and Auth0 introduces a comprehensive framework for securing AI agents, covering identity establishment, delegated API access via Token Vault, user consent for risky operations using Asynchronous Authorization (CIBA), and integration with MCP servers.

Jan 14, 2026

Ransomware whack-a-mole, AI agents as insider threats and how to hack a humanoid robot

A discussion on the evolving cybersecurity landscape, covering the persistent threat of ransomware gangs adapting with AI, the critical failures in identity security highlighted by the Zestix case, the emergence of AI agents as a new class of insider threats, and the physical-world risks demonstrated by hacking humanoid robots.

← Previous Next →

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.