Platform engineering

The GPU Uptime Battle

The GPU Uptime Battle

Andy Pernsteiner, Field CTO of VAST Data, discusses the immense challenges of transitioning AI projects from prototype to production. He highlights the critical role of data infrastructure, the high cost of GPU downtime, and the necessity of building resilient, scalable platforms that can withstand real-world failures like power outages in massive data centers. The conversation emphasizes a shift in mindset towards empathy, better requirement gathering, and closer collaboration between data scientists and platform engineers to bridge the gap between development and operations.

Fundamentals of DevOps & Software Delivery • Yevgeniy "Jim" Brikman & Kief Morris • GOTO 2025

Fundamentals of DevOps & Software Delivery • Yevgeniy "Jim" Brikman & Kief Morris • GOTO 2025

Yevgeniy (Jim) Brikman, author and co-founder of Gruntwork, discusses the pragmatic definition of DevOps as a methodology for efficient software delivery, born from his experience with a deployment crisis at LinkedIn. The conversation with Kief Morris explores the modern infrastructure stack, the critical role of frameworks over custom scripts, and emerging paradigms like Infrastructure from Code and interactive runbooks.

Platform Engineering: From Theory to Practice • Liz Fong-Jones & Lesley Cordero

Platform Engineering: From Theory to Practice • Liz Fong-Jones & Lesley Cordero

Liz Fong-Jones and Lesley Cordero explore the evolution of platform engineering from its DevOps and SRE roots, discussing the challenges of building effective developer platforms, the importance of psychological safety, the complexities of open source sustainability, and the delicate balance between centralized platform teams and developer autonomy.

Infrastructure as Code • Kief Morris & Abby Bangser

Infrastructure as Code • Kief Morris & Abby Bangser

Kief Morris, author of 'Infrastructure as Code', and Abby Bangser discuss the evolution of IaC over the past decade. They explore the move from server configuration to complex cloud architectures, the limitations of current tooling, and the need for higher-level abstractions, while also looking ahead to the potential impact of AI and the critical role of platform engineering in connecting infrastructure to specific business needs.

The Blind Spots of Platform Engineering • Matt McLarty & Erik Wilde

The Blind Spots of Platform Engineering • Matt McLarty & Erik Wilde

Matt McLarty and Erik Wilde explore the blind spots in platform engineering, arguing that a narrow focus on developer velocity and toolchains overlooks the critical need for creating reusable, API-driven business capabilities that deliver tangible value and organizational optionality.

Platform Engineering: A Deep Dive Conversation • Russ Miles & Kevlin Henney

Platform Engineering: A Deep Dive Conversation • Russ Miles & Kevlin Henney

In this interview from GOTO Copenhagen 2024, Russ Miles, interviewed by Kevlin Henney, explores a human-centric approach to platform engineering, encapsulated by the phrase "Don't feed the pigeons." He advocates for focusing on desired behavioral changes and empowering creative work over doubling down on existing, suboptimal tools and processes. The discussion delves into using OODA loops, creating a developer "habitat," and the critical role of empathy and storytelling in understanding and improving complex sociotechnical systems.