Adversarial AI

Jan 08, 2026

Hacking AI Systems: How to (Still) Trick Artificial Intelligence • Katharine Jarmul • GOTO 2025

To build secure AI systems, we must first learn to break them. Katharine Jarmul explores the landscape of adversarial AI, detailing how attackers exploit fundamental weaknesses in deep learning models—from poisoned training data and overparameterization to the attention mechanism itself. This talk provides a practical taxonomy of attacks and a primer on building robust defenses.

Aug 11, 2025

AI Agents for Cybersecurity: Enhancing Automation & Threat Detection

An exploration of how LLM-powered AI agents are transforming cybersecurity by moving beyond traditional static rules to provide dynamic, adaptive security operations. The summary covers key applications in threat detection and incident response, while also addressing critical risks like hallucinations and adversarial manipulation, emphasizing a "human-in-the-loop" approach.