Enterprise ai

Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

A deep dive into why most enterprise GenAI pilots are failing, the debate around hidden system prompts in models like GPT-5, new research questioning the reliability of "chain of thought" reasoning, and the controversy over Anthropic's "AI welfare" justification for shutting down conversations.

AI Changed Stack Overflow for the Better

AI Changed Stack Overflow for the Better

Stack Overflow CEO Prashanth Chandrashekar discusses the platform's evolution in the AI era, focusing on licensing its trusted Q&A corpus to major AI labs, expanding beyond Q&A to include discussions and live chat, and the critical role of its enterprise solution in powering internal AI agents. A key insight from their upcoming developer survey reveals that while AI adoption for coding is rising, developer trust in AI-generated output is declining, reinforcing Stack Overflow's position as a vital source of human-curated, reliable knowledge.

Perplexity’s bid for Chrome, Grok Imagine and GPT-5 check-in

Perplexity’s bid for Chrome, Grok Imagine and GPT-5 check-in

Experts from IBM discuss Perplexity's audacious bid for Google Chrome, analyzing it as a strategic marketing move and debating the browser's future as a key platform for enterprise AI. They also explore the future of generative video, questioning if it's a consumer or enterprise feature while highlighting critical IP and ethical hurdles. Finally, they assess the GPT-5 release, countering claims of an AI plateau and discussing user attachment to older models and the shift towards software-defined AI systems.

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

2025 is the Year of Evals! Just like 2024, and 2023, and … — John Dickerson, CEO Mozilla AI

A deep dive into why 2025 is poised to be the 'Year of Evals' for AI. The speaker argues that a confluence of factors—the C-suite's post-ChatGPT awakening, budget dynamics, and the rise of autonomous agentic systems—has finally made AI evaluation a critical, top-of-mind issue for enterprise leaders.

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

AI Automation that actually works: $100M, messy data, zero surprises - Tanmai Gopal, Hasura/PromptQL

Tanmai Gopal, CEO of Hasura, discusses a Gen AI-driven automation strategy that addresses the "Automation Paradox" by empowering non-technical users. This approach uses a domain-specific language (DSL) to translate natural language into deterministic, executable plans, aiming to drive over $100M in annual impact for a healthcare partner.

Arvind Jain on building Glean and the future of enterprise AI

Arvind Jain on building Glean and the future of enterprise AI

Arvind Jain, CEO of Glean, details the company's journey from a pre-LLM enterprise search innovator to a leading AI agent platform. He covers their hybrid model strategy, the critical role of permission-aware RAG for security, and how AI agents are creating 'evergreen' documentation and reshaping enterprise workflows.