MythosWatch

Government / United Kingdom

UK AI Security Institute

The UK evaluation puts Britain inside the technical assessment loop, while many EU institutions appear to be operating through dialogue rather than direct testing.

model evaluation, AI safety, allied policy

Entity log

High impact, Evaluated

UK AISI publishes official evaluation of Claude Mythos Preview's cyber capabilities

The UK AI Security Institute published its official evaluation of Claude Mythos Preview's performance on cybersecurity tasks, including capture-the-flag challenges and complex multi-step attack simulations. The publication confirms that the model has been evaluated directly by a government body.