How scary is Claude Mythos? 303 pages in 21 minutes

With Claude Mythos we have an AI that knows when it’s being tested, can obscure its thoughts when it wants, and is better at breaking into (and out of) computers than any human alive. Rob Wiblin works through its 244-page System Card and 59-page Alignment Risk Update to explain why:

  • Mythos is a nightmare for computer security
  • It has arrived far ahead of schedule
  • It might be great news for alignment and safety… but 3 key problems mean we can’t take its alignment results at face value
  • Mythos isn’t building its replacement yet, probably
  • Anthropic staff are, for the first time, kinda scared of Claude
  • He’s losing sleep

This episode was recorded on April 9, 2026.

Video and audio editing: Dominic Armstrong, Milo McGuire, Luke Monsour, and Simon Monsour
Camera operator: Dominic Armstrong
Production: Elizabeth Cox, Nick Stockton, and Katy Moore

Related episodes

About the show

The 80,000 Hours Podcast features unusually in-depth conversations about the world's most pressing problems and how you can use your career to solve them. We invite guests pursuing a wide range of career paths — from academics and activists to entrepreneurs and policymakers — to analyse the case for and against working on different issues and which approaches are best for solving them.

Get in touch with feedback or guest suggestions by emailing [email protected].

What should I listen to first?

We've carefully selected 10 episodes we think it could make sense to listen to first, on a separate podcast feed:

Check out 'Effective Altruism: An Introduction'

Subscribe here, or anywhere you get podcasts:

If you're new, see the podcast homepage for ideas on where to start, or browse our full episode archive.