Co design

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Performance Optimization and Software/Hardware Co-design across PyTorch, CUDA, and NVIDIA GPUs

Chris Fregly discusses his new book, "AI Systems Performance Engineering", covering the co-design and optimization of hardware, software, and algorithms across PyTorch, CUDA, and NVIDIA GPUs. The talk explores GPU architecture, system-level reliability challenges, and the use of modern coding agents for low-level kernel optimization.

Building the Real-World Infrastructure for AI, with Google, Cisco & a16z

Building the Real-World Infrastructure for AI, with Google, Cisco & a16z

AI is driving an unprecedented buildout of physical infrastructure. Experts from Google and Cisco discuss the "AI industrial revolution," where power, compute, and networking are the new scarce resources, demanding a complete reinvention of the technology stack from silicon to software.