How fast is autonomous AI cyber capability advancing?
aisi.gov.uk
A new Mythos checkpoint improves significantly on the previous one (and beats GPT-5.5-Cyber) on this benchmark.