Posts

How A Team Of 7 Keeps Breaking AI Benchmark Records

Poetiq, a startup by former DeepMind researchers, has developed a recursive self-improvement meta-system that builds "reasoning harnesses" on top of existing LLMs. This approach avoids the costly "fine-tuning trap" and has achieved state-of-the-art results on benchmarks like ARC-AGI and Humanity's Last Exam by automatically optimizing prompts and discovering novel reasoning strategies.

A Playground for AI Engineers

Paulo Vasconcellos from Hotmart details their journey of building "Agent as a Product", explaining how they blend classic ML models with LLMs for efficiency, evolve their MLOps platform for the generative AI era, and create real business value through AI-powered tutors and sales agents.

SW Design, Architecture & Clarity at Scale • Sam Newman, Jacqui Read & Simon Rohrer

Experts Sam Newman, Jacqui Read, and Simon Rohrer explore the nuances of software design, its intersection with architecture, and the critical role of communication in scaling technical clarity. The discussion covers practical advice on implementing Architectural Decision Records (ADRs), the evolving role of the architect as a facilitator, and strategies for creating agile enterprise architectures.

Mainframe modernization explained: COBOL and AI

Experts from IBM discuss the nuanced role of AI in mainframe modernization, the immense infrastructural and product challenges behind global AI adoption, and the critical need for a multi-layered, security-by-design framework for the safe deployment of AI agents.

Dylan Patel Explains the AI War While Cooking | In-Context Cooking

Dylan Patel of SemiAnalysis discusses the AI arms race, highlighting the massive $200B+ hyperscaler capex, the true semiconductor bottlenecks shifting from data centers back to fabs, the geopolitical chess game surrounding Taiwan and TSMC, and Nvidia's strategic battle against vertical integration.

Hardware Realization and Implementation Security Evaluation of HQC, A NIST PQC Standard

This talk by Sanjay Deshpande from Northwestern University explores the critical transition to Post-Quantum Cryptography (PQC) in response to the threat quantum computers pose to current public-key algorithms. It provides a deep dive into Hamming Quasi-Cyclic (HQC), a code-based key-encapsulation mechanism that NIST has selected for standardization. The session focuses on the challenges and innovations in creating efficient and secure hardware implementations of HQC, covering performance optimization for polynomial multiplication and countermeasures against side-channel attacks.