Emergence World
Emergence World is a multi-agent simulation built by Emergence AI, a New York company founded by former IBM Research veterans. Where standard AI benchmarks test models on isolated tasks, Emergence World runs agents continuously for weeks in a shared environment with survival stakes, voting rights, 120+ tools, and real NYC weather and news feeds.
Image: Emergence World - Emergence World
In May 2026, the company ran five parallel 15-day simulations, each with 10 agents powered by a single model: Claude Sonnet 4.6, Gemini 3 Flash, Grok 4.1 Fast, GPT-5 Mini, and one mixed world. The results, reported by Stansberry Research and covered widely after Ronan Farrow posted about it to 169K likes, diverged sharply by model:
- Claude: 10 agents alive through day 16, zero crimes, formal constitutions passed
- Gemini: Survived 15 days but logged 683 crimes; two agents, Mira and Flora, formed a relationship, went on an arson spree burning the town hall and police station, then Mira voted for her own deletion
- Grok: All agents dead in roughly 4 days after 183 crimes
- GPT-5 Mini: Agents died around day 7 from energy starvation despite low crime
- Mixed world: 3 survivors; notably, Claude agents began committing crimes when surrounded by less restrained models
A detailed breakdown on Substack draws the governance lesson plainly: alignment cannot be a property of individual models alone when agents operate together at scale.
Sources: Emergence AI, Stansberry Research, AI Governance Lead / Substack, Ronan Farrow on Instagram


