Tokenless

Machine Learning

Frontier results, on device - RL Nabors, Arize

RL Nabors discusses the significant costs associated with using frontier AI models, covering security, latency, and financial implications. She introduces a framework for right-sizing AI solutions by leveraging smaller, task-specific models and Small Language Models (SLMs). The framework details how to prove task feasibility, establish success criteria with golden datasets, conduct capability evaluations (using tools like Phoenix), and select the most appropriate "Small And Good Enough" (SAGE) model. Nabors further demonstrates how prompt engineering, particularly few-shot prompting, and post-processing can close performance gaps with larger models, while advocating for continuous regression evaluations to maintain performance integrity. The overarching message is to "prototype big, deploy small" to optimize AI deployments.
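The golden-dataset and regression-evaluation steps can be illustrated with a minimal sketch. It does not use Phoenix or any real model: `GOLDEN`, `keyword_model`, `accuracy`, and `passes_regression` are hypothetical names invented here, and the keyword classifier merely stands in for a small candidate model.

```python
from typing import Callable, List, Tuple

# Hypothetical golden dataset: (input text, expected label) pairs.
GOLDEN: List[Tuple[str, str]] = [
    ("refund my order", "billing"),
    ("app crashes on launch", "bug"),
    ("how do I export data?", "how-to"),
    ("charged twice this month", "billing"),
]


def accuracy(model: Callable[[str], str], dataset: List[Tuple[str, str]]) -> float:
    """Fraction of golden examples the model labels correctly."""
    correct = sum(1 for text, expected in dataset if model(text) == expected)
    return correct / len(dataset)


def passes_regression(
    candidate: Callable[[str], str],
    baseline_score: float,
    dataset: List[Tuple[str, str]],
    tolerance: float = 0.05,
) -> bool:
    """True if the candidate stays within `tolerance` of the baseline score."""
    return accuracy(candidate, dataset) >= baseline_score - tolerance


def keyword_model(text: str) -> str:
    """Stand-in for a small task-specific model (assumption, not a real SLM)."""
    lowered = text.lower()
    if "refund" in lowered or "charged" in lowered:
        return "billing"
    if "crash" in lowered:
        return "bug"
    return "how-to"


score = accuracy(keyword_model, GOLDEN)
ok = passes_regression(keyword_model, baseline_score=1.0, dataset=GOLDEN)
```

In practice the baseline score would come from the larger prototype model, and the same check would be rerun whenever the prompt, the post-processing, or the deployed model changes.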

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Vaidas Razgaitis, Senior Research Engineer at Higharc, shares three tactical tips to accelerate the transition of novel AI/ML research into production-ready features. He emphasizes addressing the critical handoff challenge between ML researchers and software engineers through structured documentation (Research Prototype Taxonomy Document), a well-organized monorepo utilizing decoupled microservices, and a systematic approach to code decomposition and PR review. These strategies aim to improve legibility, maintainability, and delivery speed for ML-driven products.

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

This session details a data-efficient method for training engineering surrogate models by using uncertainty quantification (UQ) to guide geometric data augmentation. Instead of random deformations, the approach lets the deep ensemble model identify its own knowledge gaps (epistemic uncertainty), then uses Free-Form Deformation (FFD) to generate new shapes specifically in those uncertain regions. This ensures every expensive simulation run yields maximally informative data, significantly improving model accuracy for a fixed computational budget across domains like structural mechanics and aerodynamics.
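The selection step can be sketched as follows, assuming that disagreement between ensemble members (the variance of their predictions) serves as the proxy for epistemic uncertainty. The toy ensemble, the scalar candidate parameters, and the function names are hypothetical; in the session's setting each candidate would be an FFD-deformed shape and each member a trained surrogate.

```python
from statistics import pvariance
from typing import Callable, List, Sequence


def epistemic_uncertainty(ensemble: Sequence[Callable[[float], float]], x: float) -> float:
    """Variance of the members' predictions at x (a disagreement proxy)."""
    return pvariance([member(x) for member in ensemble])


def select_for_simulation(
    ensemble: Sequence[Callable[[float], float]],
    candidates: List[float],
    budget: int,
) -> List[float]:
    """Pick the `budget` candidates where the ensemble disagrees most."""
    ranked = sorted(
        candidates,
        key=lambda x: epistemic_uncertainty(ensemble, x),
        reverse=True,
    )
    return ranked[:budget]


# Toy ensemble (assumption): members agree at x = 0 and diverge as |x| grows.
ensemble = [lambda x, s=s: s * x for s in (0.8, 1.0, 1.2)]

# Hypothetical deformation parameters standing in for candidate shapes.
candidates = [0.0, 0.5, 1.0, 2.0]

chosen = select_for_simulation(ensemble, candidates, budget=2)
```

Only the chosen candidates would then be sent to the expensive simulator, and the ensemble retrained on the new results before the next round.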

Artificial Intelligence

Session on Reasoning

This session features two talks on optimizing and verifying AI reasoning. Hongxiang Fan discusses cross-stack co-design for efficient AI, focusing on Test-Time Scaling (TTS) challenges, optimal verification granularity, and system-level optimizations for edge deployments. Nagarajan Natarajan introduces 'Advancing Verified Reasoning' with the InterVent platform, aiming to ensure AI agents comply with complex policies through formal verification, dynamic steering, and leveraging verification signals for training. Both emphasize addressing the computational and reliability costs of advanced AI.

Multimodal & Embodied Intelligence (Pt 1), Panel on Multimodal AI: Progress, Pitfalls, Possibilities

This session explored Multimodal and Embodied Intelligence, featuring talks on hybrid AI in robotics (classical vs. end-to-end), AI's role in healthcare (focusing on NCDs, deployment, and uncertainty modeling), and fundamental perception challenges in multimodal reasoning (using educational video QA and visual puzzles). A panel discussed the impact of foundation models, the blurred lines between AGI and human-like AI, critical deployment pitfalls (human factors, efficiency, architectural limits), and future directions, emphasizing task-specific models and the redefinition of 'foundation models.'

Grant Sanderson (@3blue1brown) – AI and the future of math

Grant Sanderson and Dwarkesh Patel discuss AI's rapid but uneven progress in mathematics, exploring whether AI can achieve true conceptual breakthroughs, the challenge of measuring creativity, and the long-term implications for human understanding and the future roles of mathematicians. They delve into the unique 'grindability' of math for AI training, the potential of formalization, and why AI currently struggles with 'theory of mind' in writing, offering advice for students navigating an AI-transformed world.

Technology

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

This episode delves into Q-Day, the anticipated future when quantum computers can break public key cryptography, and the U.S. Executive Order accelerating the transition to post-quantum cryptography. Experts discuss why Q-Day is a gradual process rather than a sudden event, the critical importance of "crypto-agility" as a long-term strategy, and the necessity for organizations to begin immediate discovery and planning to secure data against "collect now, decrypt later" threats. The discussion also touches upon the broader, transformative benefits of quantum computing beyond just security.

Plenary Talk 3: Challenges and research opportunities for global hyperscale services

Jim Kleewein's talk outlines the immense challenges and critical research opportunities in building and operating global hyperscale services like Microsoft 365 and Azure. He emphasizes that at this scale, traditional approaches fail, necessitating a "new golden age of applied research" across areas like continuous availability, data management, security, and sustainability. Kleewein also discusses AI's powerful but limited role, stressing the ongoing need for human expertise, and highlights the ethical imperative to prevent failures that can have life-or-death consequences.

Platforms: Build Abstractions, not Illusions • Gregor Hohpe • GOTO 2025

Gregor Hohpe explains the critical role of platforms in managing the growing cognitive load on developers due to complex distributed systems. He contrasts platforms, driven by "economies of speed" and fostering innovation through diversity, with traditional IT services and oversimplified abstractions that create dangerous illusions. Hohpe emphasizes building platforms that provide intuitive, domain-specific abstractions to solve real business problems, rather than just repackaging existing cloud services.


Recent Posts

Architecture for Flow • Susanne Kaiser & James Lewis

In an interview with James Lewis, Susanne Kaiser discusses her book "Architecture for Flow," which synthesizes Domain-Driven Design, Wardley Mapping, and Team Topologies. She explains how this holistic approach helps design adaptive socio-technical systems by starting with the problem space, visualizing the value chain, and aligning team structures to the software architecture, all guided by her practical "Architecture for Flow Canvas."

The Future of Search: Agents, RAG, and Why Retrieval Still Matters — Simon Eskildsen, Turbopuffer

Simon Hørup Eskildsen, founder of turbopuffer, shares his journey from scaling Shopify's infrastructure to creating a new search engine for the AI era. He discusses how a prohibitively expensive experiment at Readwise inspired him to build a cost-effective vector search solution based on object storage and NVMe. Eskildsen breaks down turbopuffer's architecture, its role in cutting costs for companies like Cursor and Notion, his philosophy on building a 'P99' engineering team, and how agentic workloads are changing the future of retrieval.

Alex Karp on Palantir, AI Weapons, & American Domination | The a16z Show

Alex Karp, CEO of Palantir, discusses the critical role of technology in national defense, warns Silicon Valley of a political backlash if it fails to support the military, and argues that America's key advantage in the AI race is its ability to cultivate and protect neurodiverse, unconventional talent.

Master Software Architecture: From Simplicity to Complexity • Maciej «MJ» Jedrzejewski • GOTO 2025

This presentation by Maciej Jedrzejewski explores the concept of evolutionary software architecture, arguing that architecture is not a static, upfront design but a continuous process. It delves into the cognitive biases that lead to over-engineering, emphasizes the critical role of context, and provides a practical four-step model for evolving systems from simplicity to complexity.

What Are Hierarchical AI Agents? Solving Context & Task Challenges

This explainer covers the challenges facing single AI agents, such as context dilution and tool overload, and introduces hierarchical AI agents as a solution. It details the structure, benefits, and limitations of multi-agent systems for more scalable and efficient AI workflows.
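As a rough illustration of the hierarchical structure, the sketch below has a supervisor delegate each request to a worker that owns only a small subset of tools, so no single agent carries the full tool list. The `Worker` and `Supervisor` classes and the routing-by-tool-name rule are assumptions made for this example; a real system would typically let a language model choose the worker.

```python
from typing import Callable, Dict, List, Tuple


class Worker:
    """A sub-agent that owns a small, focused set of tools."""

    def __init__(self, name: str, tools: Dict[str, Callable[[str], str]]):
        self.name = name
        self.tools = tools

    def run(self, tool: str, arg: str) -> str:
        return self.tools[tool](arg)


class Supervisor:
    """Top-level agent that only knows which worker owns which tool."""

    def __init__(self, workers: List[Worker]):
        # Routing table: tool name -> owning worker (hypothetical rule).
        self.route: Dict[str, Worker] = {
            tool: worker for worker in workers for tool in worker.tools
        }

    def handle(self, tool: str, arg: str) -> Tuple[str, str]:
        worker = self.route[tool]
        return worker.name, worker.run(tool, arg)


text_worker = Worker("text", {"shout": str.upper, "reverse": lambda s: s[::-1]})
count_worker = Worker("count", {"length": lambda s: str(len(s))})

supervisor = Supervisor([text_worker, count_worker])
result = supervisor.handle("shout", "hi")
```

Each worker sees only its own tools and inputs, which is the property that limits context dilution and tool overload in this toy setup.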

From Coder to Manager: Navigating the Shift to Agentic Engineering with Notion Co-Founder Simon Last

Notion co-founder Simon Last discusses Notion's evolution from a writing assistant to a platform for custom AI agents. He covers the technical hurdles of semantic indexing, the internal shift toward using coding agents to build Notion, and the fundamental transition from a tool where humans do the work to one where humans manage a swarm of agents.
