Anthropic purposely made its new Mythos-based models bad at AI research, and developers are fuming
https://www.businessinsider.com/researchers-furious-anthropic-mythos-fable-hidden-ai-limits-2026-6
Anthropic's powerful new models deliberately become less helpful when they detect users are working on AI research, according to technical disclosures that are already sparking controversy across the industry.
In a system card for Mythos 5 and Fable 5 published Tuesday, Anthropic said it limited the models' usefulness for tasks related to developing frontier large language models.
The company said the measures stem from concerns that advanced AI systems could accelerate the development of competing models without equivalent safety protections.
Unlike safeguards used for cybersecurity, biology, or chemistry-related risks, Anthropic said these interventions are intentionally invisible to users. Rather than refusing requests or switching to another model, Mythos may subtly modify its responses through techniques such as altering user prompts.
-snip-
Anyone foolish enough to use AI models should expect to be manipulated in all sorts of ways.