Apr. 28, 2026
GPT-5.5 Pro achieves a new high score of 159 on the Epoch Capabilities Index.
Apr. 10, 2026
We released early results from MirrorCode, a new long-horizon software engineering benchmark co-developed with METR, showing that AI can already complete some weeks-long coding tasks.
Trusted by leaders at OpenAI, DeepMind, and governments worldwide
Need deeper insights? Our team offers custom research and advisory services.