Tokenless

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Mar 06, 2026

Machine Learning

Feb 20, 2026

Migrating from Neptune to Weights & Biases

A technical guide on migrating ML experiments from Neptune to Weights & Biases, covering the migration script, API-level code changes, and best practices for organizing projects and analyzing results in the W&B platform before the Neptune sunset.

Jan 26, 2026

W&B Models end-to-end demo

W&B Models is the system of record for the entire model development lifecycle. This guide explores how to monitor training, tune hyperparameters, track artifacts and lineage for reproducibility, and automate MLOps workflows like evaluation and deployment using a central platform.

Jan 08, 2026

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Artificial Intelligence

Mar 06, 2026

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Kwangjun Ahn from Microsoft Research provides a technical overview of orthonormal optimizers (like Muon and Dion2), a new class of algorithms for large-scale AI model training that are emerging as powerful successors to AdamW. The talk covers their theoretical foundations, empirical benefits, distributed implementation strategies, and practical guidelines for integration into modern training pipelines.

Mar 06, 2026

Inside Perplexity Computer’s agent platform

Experts on the Mixture of Experts podcast analyze Perplexity Computer's pivot to agent orchestration and debate its closed-system approach versus open alternatives like OpenClaw. They also discuss Anthropic's new memory import feature for Claude, questioning whether memory is still a competitive moat, and explore NullClaw, a minimalist agent framework that sparks a conversation about the future of edge-based agent swarms. Finally, they tackle the controversial debut of Tilly Norwood, the world's first AI actor, and debate the implications for the entertainment industry and the personification of AI.

Mar 06, 2026

Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor

Cursor's team discusses their latest Cloud Agents launch, which gives agents full cloud VMs to test changes, record demo videos, and provide remote access. We explore parallel model swarms, bug reproduction workflows, and the future of agentic coding where throughput and new bottlenecks in review and CI/CD take center stage.

Technology

Mar 02, 2026

Platform Engineering • Ajay Chankramath & Nic Cheneweth • GOTO 2026

Ajay Chankramath and Nic Cheneweth discuss the critical elements of effective platform engineering, emphasizing a product mindset, the foundational role of control planes and API-first design, the common pitfalls of implementing Backstage, and the emerging impact of AI and agents on the platform landscape.

Feb 27, 2026

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer

Experts Sam Newman, Jacqui Read, and Simon Rohrer explore the nuances of software design, its intersection with architecture, and the critical role of communication in scaling technical clarity. The discussion covers practical advice on implementing Architectural Decision Records (ADRs), the evolving role of the architect as a facilitator, and strategies for creating agile enterprise architectures.

Feb 26, 2026

Learn Docker in a Month of Lunches • Elton Stoneman & Bret Fisher • GOTO 2026

Docker educators Bret Fisher and Elton Stoneman discuss the second edition of Stoneman's book, "Learn Docker in a Month of Lunches". They explore why Docker fundamentals remain crucial in a Kubernetes-dominated world, the evolution of the container ecosystem over the past five years, and the key skills that differentiate a Docker expert from a beginner, such as multi-platform builds, security, and configuration management.

Recent Post

Jan 22, 2026

The ML Technique Every Founder Should Know

YC Visiting Partner Francois Chaubard and YC General Partner Ankit Gupta break down diffusion, the machine learning framework behind generative AI models like Sora and Midjourney. They discuss its core principles, trace its evolution from complex KL-divergence methods to the elegant simplicity of flow matching, and explore its vast applications beyond images, from protein folding to robotics, arguing it's a key component for future AI systems.

Jan 22, 2026

Effect Oriented Programming • Bill Frasure, Bruce Eckel, James Ward & Andrew Harmel-Law • GOTO 2026

Authors Bill Frasure, Bruce Eckel, and James Ward discuss the core concepts of Effect-Oriented Programming. They explain how effects are composable operations that encapsulate side effects and defer execution, allowing developers to manage unpredictability with compiler-checked types. The conversation covers ZIO, the expansion of effect systems into languages like TypeScript and Kotlin, and their unique, constraint-driven writing process.

Jan 22, 2026

No Priors Live: Building Durable Software in the AI Age with MongoDB President & CEO CJ Desai

CJ Desai, CEO of MongoDB, discusses why platforms, not products, are the key to long-term success in the software industry, especially in the age of AI. He explores the shifting landscape of enterprise software, the reality of AI adoption in Fortune 500 companies, and what truly constitutes a "moat" when software can be generated on demand.

Jan 22, 2026

What is Agent Observability?

Lior Gavish, CTO and co-founder of Monte Carlo Data, discusses the critical transition from data observability to agent observability. He covers the widespread adoption of AI agents in data teams, the new challenges they introduce for monitoring, and why traditional tools fall short in providing the necessary insights into agent performance, security, and governance.

Jan 21, 2026

Context Engineering Our Way to Long-Horizon Agents: LangChain’s Harrison Chase

Harrison Chase, co-founder of LangChain, explains the evolution of AI agents from early, rigid scaffolding to modern, flexible "harnesses." He argues that "context engineering"—managing what an LLM sees—is the key to building effective long-horizon agents. Chase also explores how agent development differs from traditional software, highlighting the critical role of traces as the new source of truth and memory systems that enable agents to improve themselves over time.

Jan 21, 2026

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer • GOTO 2025

A panel discussion with Sam Newman, Jacqui Read, and Simon Rohrer exploring the intersection of software design and architecture. The conversation delves into the critical role of communication, the practical application of Architecture Decision Records (ADRs), strategies for bridging the gap between architects and developers, and modern approaches to standardization through platform engineering and creating agile enterprise architectures.
