Ratings by sethherr

3 Matching Ratings

Rated Article

Detecting and countering misuse of AI: August 2025

Anthropic's threat intelligence report on AI cybercrime and other abuses

anthropic.com 2,000 words

Rated 2025-09-01T21:03:14-0700

The Anthropic Economic Index

Announcement of the new Anthropic Economic Index and description of the new data on AI use in occupations

anthropic.com 2,000 words

Rated 2025-02-10T12:59:35-0800

Alignment faking in large language models

A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models

anthropic.com 2,000 words

Rated 2024-12-19T19:01:38-0800