Rated↓ Article |
---|
Detecting and countering misuse of AI: August 2025. Anthropic's threat intelligence report on AI cybercrime and other abuses. anthropic.com, 2,000 words. Rated 2025-09-01T21:03:14-0700 |
The Anthropic Economic Index. Announcement of the new Anthropic Economic Index and description of the new data on AI use in occupations. anthropic.com, 2,000 words. Rated 2025-02-10T12:59:35-0800 |
Alignment faking in large language models. A paper from Anthropic's Alignment Science team on alignment faking in large language models. anthropic.com, 2,000 words. Rated 2024-12-19T19:01:38-0800 |