Tokenless

Interactive discovery

Explore the topic map

Follow the connections between themes, people, and ideas across the Tokenless archive in an interactive topic modeling map.

Machine Learning

View All
Frontier results, on device - RL Nabors, Arize

Frontier results, on device - RL Nabors, Arize

RL Nabors discusses the significant costs associated with using frontier AI models, covering security, latency, and financial implications. She introduces a framework for right-sizing AI solutions by leveraging smaller, task-specific models and Small Language Models (SLMs). The framework details how to prove task feasibility, establish success criteria with golden datasets, conduct capability evaluations (using tools like Phoenix), and select the most appropriate "Small And Good Enough" (SAGE) model. Nabors further demonstrates how prompt engineering, particularly few-shot prompting, and post-processing can close performance gaps with larger models, while advocating for continuous regression evaluations to maintain performance integrity. The overarching message is to "prototype big, deploy small" to optimize AI deployments.

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Research to Reality: Bringing Frontier ML Research to Production - Vaidas Razgaitis, Higharc

Vaidas Razgaitis, Senior Research Engineer at Higharc, shares three tactical tips to accelerate the transition of novel AI/ML research into production-ready features. He emphasizes addressing the critical handoff challenge between ML researchers and software engineers through structured documentation (Research Prototype Taxonomy Document), a well-organized monorepo utilizing decoupled microservices, and a systematic approach to code decomposition and PR review. These strategies aim to improve legibility, maintainability, and delivery speed for ML-driven products.

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

Uncertainty-Guided Data Augmentation for Engineers | Deep Dive - Yongmin Kwon

This session details a data-efficient method for training engineering surrogate models by using uncertainty quantification (UQ) to guide geometric data augmentation. Instead of random deformations, the approach lets the deep ensemble model identify its own knowledge gaps (epistemic uncertainty), then uses Free-Form Deformation (FFD) to generate new shapes specifically in those uncertain regions. This ensures every expensive simulation run yields maximally informative data, significantly improving model accuracy for a fixed computational budget across domains like structural mechanics and aerodynamics.

Artificial Intelligence

View All
Sustainable Augmented Development • Kent Beck • YOW! 2025

Sustainable Augmented Development • Kent Beck • YOW! 2025

Kent Beck's presentation at YOW! Australia 2025 explores the transformative impact of augmented development ('the genie') on software engineering. He argues that AI shifts programming into an exploratory phase, requiring a re-evaluation of traditional practices. Beck introduces the concept of 'resting between the notes' to foster optionality over mere feature churn, and makes a compelling case for the increased value of junior developers as AI tools become powerful learning aids, emphasizing that 'nobody knows' the future but we must 'take our time' to adapt effectively.

How KV Cache Speeds Up LLMs for Faster AI Models on GPUs

How KV Cache Speeds Up LLMs for Faster AI Models on GPUs

LLMs often slow down under heavy traffic due to inefficient GPU memory management during inference. This overview explains how KV cache and Paged Attention, implemented in VLLM, optimize memory usage across prefill and decode phases, significantly boosting LLM throughput, reducing latency, and improving GPU utilization through advanced context handling and specific tuning techniques like prefix caching and speculative decoding.

The AI Agents Helping Home Services Book More Jobs

The AI Agents Helping Home Services Book More Jobs

Avoca (YC W23) has achieved eight-figure revenue and a $1 billion valuation by building an AI workforce for home services, turning missed calls into revenue. Founders Apurva Shrivastava and Tyson Chen explain how AI expands software's market share beyond 1% by automating labor and operational costs, leading to a 15x larger opportunity. They emphasize that their AI agents augment human workers, reducing attrition in challenging CSR roles and creating new positions for training AI, driven by a deep customer obsession learned at YC.

Technology

View All
Platforms: Build Abstractions, not Illusions • Gregor Hohpe • GOTO 2025

Platforms: Build Abstractions, not Illusions • Gregor Hohpe • GOTO 2025

Gregor Hohpe explains the critical role of platforms in managing the growing cognitive load on developers due to complex distributed systems. He contrasts platforms, driven by "economies of speed" and fostering innovation through diversity, with traditional IT services and oversimplified abstractions that create dangerous illusions. Hohpe emphasizes building platforms that provide intuitive, domain-specific abstractions to solve real business problems, rather than just repackaging existing cloud services.

Full Stack Greenfield Projects : Are they still relevant?

Full Stack Greenfield Projects : Are they still relevant?

Bharat Goenka, co-founder of Tally, discusses the company's unconventional approach to software development through "Full Stack Greenfield" projects. He explains why building every component from scratch, despite being a high-risk strategy, has been crucial for Tally's success in serving the SMB market, fostering extreme customer loyalty, and aspiring to connect 200 million businesses. The talk delves into the historical context, the philosophy of questioning and choosing constraints, and the distinction between product and custom engineering.

3‑2‑1 Backup Rule Explained: Protect Your Data from Disaster

3‑2‑1 Backup Rule Explained: Protect Your Data from Disaster

Jeff Crume outlines essential data resiliency strategies, starting with the 3-2-1 backup rule—three copies, two media types, one offsite—and expanding to include immutable or air-gapped backups, rigorous testing, and encryption. He emphasizes these principles for robust disaster recovery, ransomware protection, and minimizing costly downtime, highlighting the trade-offs in achieving high availability.


Recent Post

Code Mode - Sunil Pai, Cloudflare

Code Mode - Sunil Pai, Cloudflare

Sunil Pai from Cloudflare introduces "Code Mode," a paradigm where AI agents generate and execute code (like JavaScript) instead of using traditional JSON-based tool calling. This approach enables more efficient, stateful, and complex interactions with large-scale systems by leveraging the inherent capabilities of programming languages.

Why half of product managers are in trouble | Nikhyl Singhal (Meta, Google)

Why half of product managers are in trouble | Nikhyl Singhal (Meta, Google)

Nikhyl Singhal, a seasoned product executive and founder, provides an unfiltered look at the massive transformation reshaping product management. He argues that the rise of AI is creating a chaotic but ultimately joyful renaissance for "builders" while rendering the traditional "information mover" PM obsolete. Singhal predicts that half of current PMs are at risk and outlines the new skills—judgment, pace, and an "obsolescence mindset"—required to thrive.

Claude Opus 4.7, Apple’s AI glasses and Allbirds AI pivot

Claude Opus 4.7, Apple’s AI glasses and Allbirds AI pivot

Experts analyze Anthropic's surprise release of Claude 4.7, speculating it's a distilled version of the Mythos model. The discussion also covers Apple's new three-pronged AI wearables strategy, a Gallup poll showing rising but incremental AI adoption in the workplace, and DeepMind's research into harmful AI manipulation.

My Bets on Where Open LLMs Go Next

My Bets on Where Open LLMs Go Next

An analysis of the current unstable equilibrium between open and closed AI models, arguing that closed models will likely pull ahead due to economic and data feedback advantages. The long-term, stable future for open models lies in a specialized ecosystem of cheaper, faster models, potentially funded by new structures like consortiums.

Episode 16: Building AI for Life Sciences

Episode 16: Building AI for Life Sciences

OpenAI research lead Joy Jiao and product lead Yunyun Wang detail the development of specialized AI models for the life sciences. They discuss the new biochemistry-focused model series designed to accelerate research in genomics and protein understanding, the critical challenge of managing biosecurity risks through a "differentiated access" model, and the future vision of AI-powered autonomous labs that could revolutionize drug discovery and personalized medicine.

Post‑Quantum Security: How Lattice Cryptography Keeps Data Safe

Post‑Quantum Security: How Lattice Cryptography Keeps Data Safe

This video explains the imminent threat quantum computers pose to current cryptographic standards like RSA. It introduces lattice-based cryptography as a leading quantum-safe solution, using analogies to demystify how high-dimensional spaces and 'noise' create math problems that are intractable even for quantum machines. The summary provides a clear action plan for organizations, emphasizing the need for 'crypto-agility' and the urgency driven by the 'Harvest Now, Decrypt Later' attack strategy.

Stay In The Loop! Subscribe to Our Newsletter.

Get updates straight to your inbox. No spam, just useful content.