Ucb bandit

Solving the Wrong Problem Works Better - Robert Lange

Solving the Wrong Problem Works Better - Robert Lange

Robert Lange from Sakana AI discusses Shinka Evolve, a framework combining LLMs with evolutionary algorithms for open-ended program search. The conversation explores how Shinka Evolve addresses the limitations of systems like AlphaEvolve by co-evolving problems and solutions, its sample-efficient architecture using UCB bandits and quality-diversity search, and its applications in circle packing, competitive programming, and evolving MoE loss functions. The discussion also delves into the philosophical debate on whether these systems produce true novelty or are parasitic on their starting conditions, and the future role of the "AI Scientist" as a human co-pilot.