Tokenless

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

Jul 01, 2026

Interactive discovery

Explore the topic map

Follow the connections between themes, people, and ideas across the Tokenless archive in an interactive topic modeling map.

Open the map Browse posts

Machine Learning

View All

Jun 29, 2026

Frontier results, on device - RL Nabors, Arize

RL Nabors discusses the significant costs associated with using frontier AI models, covering security, latency, and financial implications. She introduces a framework for right-sizing AI solutions by leveraging smaller, task-specific models and Small Language Models (SLMs). The framework details how to prove task feasibility, establish success criteria with golden datasets, conduct capability evaluations (using tools like Phoenix), and select the most appropriate "Small And Good Enough" (SAGE) model. Nabors further demonstrates how prompt engineering, particularly few-shot prompting, and post-processing can close performance gaps with larger models, while advocating for continuous regression evaluations to maintain performance integrity. The overarching message is to "prototype big, deploy small" to optimize AI deployments.

Jun 28, 2026

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Vaidas Razgaitis, Senior Research Engineer at Higharc, shares three tactical tips to accelerate the transition of novel AI/ML research into production-ready features. He emphasizes addressing the critical handoff challenge between ML researchers and software engineers through structured documentation (Research Prototype Taxonomy Document), a well-organized monorepo utilizing decoupled microservices, and a systematic approach to code decomposition and PR review. These strategies aim to improve legibility, maintainability, and delivery speed for ML-driven products.

Jun 12, 2026

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

This session details a data-efficient method for training engineering surrogate models by using uncertainty quantification (UQ) to guide geometric data augmentation. Instead of random deformations, the approach lets the deep ensemble model identify its own knowledge gaps (epistemic uncertainty), then uses Free-Form Deformation (FFD) to generate new shapes specifically in those uncertain regions. This ensures every expensive simulation run yields maximally informative data, significantly improving model accuracy for a fixed computational budget across domains like structural mechanics and aerodynamics.

Artificial Intelligence

View All

Jul 01, 2026

The Benchmark With No Instructions — Tufa Labs (ARC-AGI-3)

Tim Scarfe visits Tufa Labs to explore their top-ranking ARC-AGI-3 system, a benchmark for agentic intelligence that challenges LLMs in goal discovery and action efficiency. The team delves into the complexities of fractured representations, the role of human priors, and whether LLMs truly plan or merely simulate it effectively, all while balancing the bitter lesson with AI safety concerns.

Jun 30, 2026

Session on Reasoning

This session features two talks on optimizing and verifying AI reasoning. Hongxiang Fan discusses cross-stack co-design for efficient AI, focusing on Test-Time Scaling (TTS) challenges, optimal verification granularity, and system-level optimizations for edge deployments. Nagarajan Natarajan introduces 'Advancing Verified Reasoning' with the InterVent platform, aiming to ensure AI agents comply with complex policies through formal verification, dynamic steering, and leveraging verification signals for training. Both emphasize addressing the computational and reliability costs of advanced AI.

Jun 30, 2026

Multimodal & Embodied Intelligence (Pt 1), Panel on Multimodal AI: Progress, Pitfalls, Possibilities

This session explored Multimodal and Embodied Intelligence, featuring talks on hybrid AI in robotics (classical vs. end-to-end), AI's role in healthcare (focusing on NCDs, deployment, and uncertainty modeling), and fundamental perception challenges in multimodal reasoning (using educational video QA and visual puzzles). A panel discussed the impact of foundation models, the blurred lines between AGI and human-like AI, critical deployment pitfalls (human factors, efficiency, architectural limits), and future directions, emphasizing task-specific models and the redefinition of 'foundation models.'

Technology

View All

Jul 01, 2026

Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

Trisha Gee delivers a compelling talk on Developer Productivity Engineering (DPE) for testing, dissecting common pain points in writing, troubleshooting, and running tests. She advocates for strategic use of IDEs, advanced tooling like build caches and predictive test selection (leveraging ML), and a disciplined approach to test design to overcome these challenges, emphasizing that good tests serve as crucial living documentation.

Jul 01, 2026

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

This episode delves into Q-Day, the anticipated future when quantum computers can break public key cryptography, and the U.S. Executive Order accelerating the transition to post-quantum cryptography. Experts discuss why Q-Day is a gradual process rather than a sudden event, the critical importance of "crypto-agility" as a long-term strategy, and the necessity for organizations to begin immediate discovery and planning to secure data against "collect now, decrypt later" threats. The discussion also touches upon the broader, transformative benefits of quantum computing beyond just security.

Jun 30, 2026

Plenary Talk 3: Challenges and research opportunities for global hyperscale services

Jim Kleewein's talk outlines the immense challenges and critical research opportunities in building and operating global hyperscale services like Microsoft 365 and Azure. He emphasizes that at this scale, traditional approaches fail, necessitating a "new golden age of applied research" across areas like continuous availability, data management, security, and sustainability. Kleewein also discusses AI's powerful but limited role, stressing the ongoing need for human expertise, and highlights the ethical imperative to prevent failures that can have life-or-death consequences.

Recent Post

Feb 20, 2026

Migrating from Neptune to Weights & Biases

A technical guide on migrating ML experiments from Neptune to Weights & Biases, covering the migration script, API-level code changes, and best practices for organizing projects and analyzing results in the W&B platform before the Neptune sunset.

Feb 20, 2026

Spring Then & Now: What’s Next? • Rod Johnson, Arjen Poutsma & Trisha Gee

A panel discussion with Spring Framework creator Rod Johnson and veteran Arjen Poutsma, moderated by Trisha Gee. They discuss the evolution of Spring, the future of reactive programming in the age of virtual threads, their new AI agent framework Embabel, and the essential AI skills modern Java developers need to acquire.

Feb 20, 2026

India's USD $200B AI hub & Claude builds C compiler

Experts from IBM discuss Google's $200B AI investment in India, Claude's autonomous C compiler creation, the significant security risks in AI agent skills, and the looming AI ROI problem facing IT leaders, debating the shift from per-token to value-based pricing.

Feb 19, 2026

Fast & Asynchronous: Drift Your AI, Not Your GPU Bill // Artem Yushkovskiy

Delivery Hero presents "Asya", an open-source framework that replaces traditional AI pipelines with a distributed, asynchronous actor model. This paradigm shift dramatically lowers GPU costs and improves scalability by treating each processing step as an independent, auto-scaling microservice on Kubernetes.

Feb 19, 2026

Beyond the Gold Standard: Evaluating and Trusting Agents in the Wild // Sanjana Sharma

A deep dive into the challenges of deploying AI agents in production, arguing that reliability stems not from model intelligence but from a "system-first" approach. The talk introduces a new architecture that separates the LLM's reasoning from a versioned, auditable "Context Layer" containing business logic and expert knowledge, which is continuously updated through a "Living Ground Truth" loop driven by expert feedback.

Feb 19, 2026

Rethinking Notebooks Powered by AI

Vincent Warmerdam from marimo discusses the recent acquisition by Weights & Biases and the future of Python notebooks. He argues that notebooks should evolve from static scratchpads into dynamic, AI-powered applications, highlighting marimo's features for LLM integration, agentic workflows, and creating interactive, reproducible development environments.

← Previous Next →

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

Explore the topic map

Machine Learning

Frontier results, on device - RL Nabors, Arize

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

Artificial Intelligence

The Benchmark With No Instructions — Tufa Labs (ARC-AGI-3)

Session on Reasoning

Multimodal & Embodied Intelligence (Pt 1), Panel on Multimodal AI: Progress, Pitfalls, Possibilities

Technology

Are Your Tests Slowing You Down? • Trisha Gee • GOTO 2025

The new post-quantum cryptography executive order. Plus: What is Q-Day, really?

Plenary Talk 3​: Challenges and research opportunities for global hyperscale services

Recent Post

Migrating from Neptune to Weights & Biases

Spring Then & Now: What’s Next? • Rod Johnson, Arjen Poutsma & Trisha Gee

India's USD $200B AI hub & Claude builds C compiler

Fast & Asynchronous: Drift Your AI, Not Your GPU Bill // Artem Yushkovskiy

Beyond the Gold Standard: Evaluating and Trusting Agents in the Wild // Sanjana Sharma

Rethinking Notebooks Powered by AI

Stay In The Loop! Subscribe to Our Newsletter.

Plenary Talk 3: Challenges and research opportunities for global hyperscale services