Posts

AI Models as a Service: Powering Agentic AI, Privacy, & RAG

AI Models as a Service: Powering Agentic AI, Privacy, & RAG

Cedric Clyburn explains the Models-as-a-Service (MaaS) pattern, detailing how organizations can build their own private AI infrastructure to deploy models like LLMs securely and at scale. He covers the benefits over public APIs, including cost control, data sovereignty, and lifecycle management, and outlines a technical architecture using Kubernetes, API gateways, and observability tools.

Why Every Satellite Needs Earth | Northwood CEO on a16z

Why Every Satellite Needs Earth | Northwood CEO on a16z

Bridgit Mendler, CEO of Northwood, details the critical bottleneck in the space economy: ground infrastructure. She explains how Northwood's vertically integrated approach is reducing deployment times from years to months, aiming to create a foundational data layer for space, much like cloud computing did for the internet.

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Chris Fregly discusses his new book, "AI Systems Performance Engineering", covering the co-design and optimization of hardware, software, and algorithms across PyTorch, CUDA, and NVIDIA GPUs. The talk explores GPU architecture, system-level reliability challenges, and the use of modern coding agents for low-level kernel optimization.

Will machines ever be intelligent?

Will machines ever be intelligent?

Doug Burger, Nicolò Fusi, and Subutai Ahmad explore the intelligence of AI, contrasting transformer-based LLMs with the human brain's distributed, continuously learning architecture. They delve into differences in efficiency, representation, and sensory-motor grounding, debating what intelligence truly means and how future AI might bridge the gap.

The Q/A Layer for the AI Coding Era

The Q/A Layer for the AI Coding Era

Weiwei Wu and Jeff An, co-founders of Momentic, discuss their AI-powered testing platform that acts as a verification layer for software. They explore how the rise of AI-generated code makes robust testing more critical than ever and share their vision for a future of "truth-driven development" where engineers write specs, not code.

Kubernetes at the Edge • Charles Humble & Hannah Foxwell • GOTO 2026

Kubernetes at the Edge • Charles Humble & Hannah Foxwell • GOTO 2026

Charles Humble discusses his e-book "Kubernetes at the Edge," exploring the definition of edge computing, its practical applications in industries like agriculture and healthcare, vendor selection strategies, and the critical importance of Day-2 operations. The conversation also delves into how edge computing promotes sustainability and concludes with a thoughtful examination of the tech industry's ethical responsibilities in the age of generative AI.