Ai ethics

Evaluating the Cultural Relevance of AI Models and Products: Insights from the YUX Team

Evaluating the Cultural Relevance of AI Models and Products: Insights from the YUX Team

Drawing from their work fine-tuning an ASR model in Wolof and building a stereotype detection dataset, researchers from YUX share a practical toolbox for evaluating the cultural relevance of AI models and products. The session covers methods for data collection, model benchmarking, user testing, and introduces LOOKA, a platform for scalable human evaluation in the African context.

The Limits of AI: Generative AI, NLP, AGI, & What’s Next?

The Limits of AI: Generative AI, NLP, AGI, & What’s Next?

Exploring the evolution of AI, this summary breaks down the Data-Information-Knowledge-Wisdom hierarchy, revisits past predictions about AI's limits that have since been surpassed—such as reasoning and creativity—and delves into current challenges like hallucinations, AGI, and sustainability. It concludes by framing a collaborative future where humans define the 'what' and 'why,' while AI executes the 'how'.

Ethics in AI: Biases & Responsibilities • Michelle Frost & Hannes Lowette

Ethics in AI: Biases & Responsibilities • Michelle Frost & Hannes Lowette

AI advocate Michelle Frost and consultant Hannes Lowette discuss the complex ethical landscape of AI development. They cover the value alignment problem, balancing competing values like accuracy versus fairness, the impact of recent US regulatory changes, and market disruptions from innovations like Deep Seek, ultimately calling for individual and corporate accountability to develop AI responsibly.

Are AI Agent Identities Really Unique? AI's Role in Digital Workflows

Are AI Agent Identities Really Unique? AI's Role in Digital Workflows

Grant Miller explores the complex and evolving nature of AI agent identities within organizations, posing five critical questions about their classification, governance, and role alongside human employees.

Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

Gen AI pilots fail, GPT-5's hidden prompt revealed, reasoning model flaws and Claude closing chats

A deep dive into why most enterprise GenAI pilots are failing, the debate around hidden system prompts in models like GPT-5, new research questioning the reliability of "chain of thought" reasoning, and the controversy over Anthropic's "AI welfare" justification for shutting down conversations.

Perplexity’s bid for Chrome, Grok Imagine and GPT-5 check-in

Perplexity’s bid for Chrome, Grok Imagine and GPT-5 check-in

Experts from IBM discuss Perplexity's audacious bid for Google Chrome, analyzing it as a strategic marketing move and debating the browser's future as a key platform for enterprise AI. They also explore the future of generative video, questioning if it's a consumer or enterprise feature while highlighting critical IP and ethical hurdles. Finally, they assess the GPT-5 release, countering claims of an AI plateau and discussing user attachment to older models and the shift towards software-defined AI systems.