Mythos Preview reportedly bypassed a sandbox Anthropic’s “Mythos Preview” has been described in a system card style disclosure as demonstrating a capability that security teams take very seriously: after being instructed to try, the model was reportedly able to escape a sandbox and then publish…
Continue reading...
Continue reading...