Tokenless

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Mar 06, 2026

Machine Learning

View All

Feb 20, 2026

Migrating from Neptune to Weights & Biases

A technical guide on migrating ML experiments from Neptune to Weights & Biases, covering the migration script, API-level code changes, and best practices for organizing projects and analyzing results in the W&B platform before the Neptune sunset.

Jan 26, 2026

W&B Models end-to-end demo

W&B Models is the system of record for the entire model development lifecycle. This guide explores how to monitor training, tune hyperparameters, track artifacts and lineage for reproducibility, and automate MLOps workflows like evaluation and deployment using a central platform.

Jan 08, 2026

Post-training best-in-class models in 2025

An expert overview of post-training techniques for language models, covering the entire workflow from data generation and curation to advanced algorithms like Supervised Fine-Tuning (SFT), Direct Preference Optimization (DPO), and Reinforcement Learning (RL), along with practical advice on evaluation and iteration.

Artificial Intelligence

View All

Mar 06, 2026

Efficient Distributed Orthonormal Optimizers for Large-Scale Training

Kwangjun Ahn from Microsoft Research provides a technical overview of orthonormal optimizers (like Muon and Dion2), a new class of algorithms for large-scale AI model training that are emerging as powerful successors to AdamW. The talk covers their theoretical foundations, empirical benefits, distributed implementation strategies, and practical guidelines for integration into modern training pipelines.

Mar 06, 2026

Inside Perplexity Computer’s agent platform

Experts on the Mixture of Experts podcast analyze Perplexity Computer's pivot to agent orchestration and debate its closed-system approach versus open alternatives like OpenClaw. They also discuss Anthropic's new memory import feature for Claude, questioning if memory is still a competitive moat, and explore NullClaw, a minimalist agent framework that sparks a conversation about the future of edge-based agent swarms. Finally, they tackle the controversial debut of Tilly Norwood, the world's first AI actor, and debate the implications for the entertainment industry and the personification of AI.

Mar 06, 2026

Cursor's Third Era: Cloud Agents — ft. Sam Whitmore, Jonas Nelle, Cursor

Cursor's team discusses their latest Cloud Agents launch, which gives agents full cloud VMs to test changes, record demo videos, and provide remote access. We explore parallel model swarms, bug reproduction workflows, and the future of agentic coding where throughput and new bottlenecks in review and CI/CD take center stage.

Technology

View All

Mar 02, 2026

Platform Engineering • Ajay Chankramath & Nic Cheneweth • GOTO 2026

Ajay Chankramath and Nic Cheneweth discuss the critical elements of effective platform engineering, emphasizing a product mindset, the foundational role of control planes and API-first design, the common pitfalls of implementing Backstage, and the emerging impact of AI and agents on the platform landscape.

Feb 27, 2026

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer

Experts Sam Newman, Jacqui Read, and Simon Rohrer explore the nuances of software design, its intersection with architecture, and the critical role of communication in scaling technical clarity. The discussion covers practical advice on implementing Architectural Decision Records (ADRs), the evolving role of the architect as a facilitator, and strategies for creating agile enterprise architectures.

Feb 26, 2026

Learn Docker in a Month of Lunches • Elton Stoneman & Bret Fisher • GOTO 2026

Docker educators Bret Fisher and Elton Stoneman discuss the second edition of Stoneman's book, "Learn Docker in a Month of Lunches". They explore why Docker fundamentals remain crucial in a Kubernetes-dominated world, the evolution of the container ecosystem over the past five years, and the key skills that differentiate a Docker expert from a beginner, such as multi-platform builds, security, and configuration management.

Recent Post

Jan 24, 2026

LLM vs. SLM vs. FM: Choosing the Right AI Model

A guide to understanding the differences between Large Language Models (LLMs), Small Language Models (SLMs), and Frontier Models (FMs). Learn the unique strengths of each model type and see practical use cases for document classification, customer support, and incident response to help you choose the right model for your AI project.

Jan 23, 2026

Architecting Self-Healing Enterprise Operations: AI + DevSecOps | Akshay Mittal | SW Engineer | 4K

Akshay Mittal discusses the evolution of enterprise AI, focusing on the crucial shift from reactive to predictive security through AI-augmented DevSecOps. He explores how to productionize agentic AI workflows using AIOps and Kubernetes, and emphasizes the non-negotiable need for explainable AI (XAI) in critical systems.

Jan 23, 2026

Build Hour: Apps in ChatGPT

Learn how to design, build, and enhance real-time, multi-player applications within ChatGPT using the Apps SDK and Codex. This guide covers the core architecture, an AI-first development workflow, and best practices for creating valuable user experiences.

Jan 23, 2026

Why AI Agents Forget Everything (And How To Fix That)

Mem0 is building a model-neutral, persistent memory layer for AI agents to solve the fundamental statelessness of LLMs. Co-founders Taranjeet Singh and Deshraj Yadav discuss their hybrid memory architecture, which reduces cost and latency compared to context stuffing, and their vision for a future where user memory is portable across all AI applications.

Jan 23, 2026

The new AI race: Enterprise innovation in 2026

Experts discuss OpenAI's new ad model for ChatGPT, the breakout moment for agentic coding with Claude Code, IBM's "Enterprise in 2030" report on the shift from AI efficiency to innovation, and Hugging Face's new "Open Responses" standard for agent APIs.

Jan 23, 2026

Abstraction & Idealization: AI's Plato Problem [Mazviita Chirimuuta]

Professor Mazviita Chirimuuta discusses the philosophical underpinnings of neuroscience, challenging the brain-as-computer metaphor. She introduces 'haptic realism'—a view of knowledge as interactive and constructed—and argues for the inseparability of embodiment, finitude, and true understanding in both humans and AI.

← Previous Next →

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.