Prompt engineering

How to Future-Proof Your Career in the Age of AI (with Sheamus McGovern)

Sheamus McGovern outlines a multi-tiered skills hierarchy that AI and data professionals can use to navigate the future of work. Rather than fear-mongering, he offers a practical roadmap that progresses from foundational GenAI prompting and advanced engineering to orchestration, human-centered skills, and the meta-skill of continuous learning. He also emphasizes the need to sunset old skills and build a personal brand.

Evals Aren't Useful? Really?

A deep dive into why robust evaluation is critical for building reliable AI agents. It covers bootstrapping evaluation sets, advanced testing techniques such as multi-turn simulations and red teaming, and the need to integrate traditional software engineering and MLOps practices into the agent development lifecycle.

Building with MCP and the Claude API

A discussion with Anthropic engineers Alex Albert, John Welsh, and Michael Cohen about the Model Context Protocol (MCP). They cover its origins as an open standard, best practices for tool design and prompt engineering, and the future of the ecosystem where high-quality MCP servers will become a key competitive advantage.

Evals in Action: From Frontier Research to Production Applications

An overview of OpenAI's approach to AI evaluation, covering the GDPval benchmark for frontier models and the practical tools developers can use to evaluate their own custom agents and applications.

How to Optimize AI Agents in Production

Engineers building AI agents face a combinatorial explosion of configuration choices (prompts, models, parameters), leading to guesswork and suboptimal results. This talk introduces a structured, data-driven approach using multi-objective optimization to systematically explore this vast design space. Learn how the Traigent SDK helps engineers efficiently identify optimal tradeoffs between cost, latency, and accuracy, yielding significant quality improvements and cost reductions without manual trial-and-error.
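
To make the idea of "optimal tradeoffs" concrete, here is a minimal sketch of Pareto-front selection over agent configurations. It is a generic illustration of multi-objective comparison, not the Traigent SDK's API; the `Candidate` class, the configuration names, and all numbers are hypothetical and assumed for the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """One hypothetical agent configuration and its measured metrics."""
    name: str
    cost: float      # cost per request; lower is better
    latency: float   # seconds per request; lower is better
    accuracy: float  # fraction correct on an eval set; higher is better


def dominates(a: Candidate, b: Candidate) -> bool:
    """True if `a` is at least as good as `b` on every objective
    and strictly better on at least one."""
    no_worse = (
        a.cost <= b.cost
        and a.latency <= b.latency
        and a.accuracy >= b.accuracy
    )
    strictly_better = (
        a.cost < b.cost
        or a.latency < b.latency
        or a.accuracy > b.accuracy
    )
    return no_worse and strictly_better


def pareto_front(candidates: list[Candidate]) -> list[Candidate]:
    """Keep only the candidates that no other candidate dominates."""
    return [
        c for c in candidates
        if not any(dominates(other, c) for other in candidates)
    ]


# Made-up measurements, purely for illustration.
CANDIDATES = [
    Candidate("small-model-short-prompt", cost=0.5, latency=1.0, accuracy=0.78),
    Candidate("mid-model-short-prompt", cost=1.0, latency=2.0, accuracy=0.80),
    Candidate("mid-model-long-prompt", cost=1.2, latency=2.5, accuracy=0.79),
    Candidate("large-model-long-prompt", cost=3.0, latency=4.0, accuracy=0.90),
]

for candidate in pareto_front(CANDIDATES):
    print(candidate.name)
```

In this made-up data, `mid-model-long-prompt` drops out because `mid-model-short-prompt` is cheaper, faster, and more accurate. The remaining three configurations each represent a different tradeoff, and choosing among them depends on the application's budget and quality requirements.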

Build Hour: GPT-5

OpenAI's Build Hour provides a deep dive into GPT-5, showcasing its advanced coding and agentic capabilities. The session covers the new Responses API, critical for leveraging the model's reasoning, along with new parameters for steerability and practical prompting techniques for building complex, reliable applications.