Ratings by sethherr

4 Matching Ratings

Rated Article

The Anthropic Economic Index

Announcement of the new Anthropic Economic Index and description of the new data on AI use in occupations

anthropic.com 2,000 words

Rated 2025-02-10T12:59:35-0800

Detecting and countering misuse of AI: August 2025

Anthropic's threat intelligence report on AI cybercrime and other abuses

anthropic.com 2,000 words

Rated 2025-09-01T21:03:14-0700

Alignment faking in large language models

A paper from Anthropic's Alignment Science team on Alignment Faking in AI large language models

anthropic.com 2,000 words

Rated 2024-12-19T19:01:38-0800

A small number of samples can poison LLMs of any size

Anthropic research on data-poisoning attacks in large language models

anthropic.com 2,000 words

Rated 2025-10-09T21:39:42-0700