Research Notable

Anthropic's 'Thousands of Zero-Days' Claim Rests on 198 Manual Reviews

April 12, 2026 2 min read

Image: Anthropic

Anthropic's Claude Mythos is being marketed as a system capable of finding security vulnerabilities at a scale no human team could match. The headline number: "thousands" of severe zero-days, meaning previously unknown software flaws that attackers could exploit before any patch exists. The problem is that the evidence behind that number is thinner than the pitch implies.

The "thousands" figure appears to rest on a validation sample of just 198 manually reviewed cases. That's not a rounding error - it's a methodological gap wide enough to question whether the extrapolation holds. Scaling from 198 confirmed findings to "thousands" requires either an automated classification system that's nearly perfect, or a marketing team willing to stretch.

What Claude Mythos Actually Does

To be clear about what's being claimed: Claude Mythos is Anthropic's security-focused AI capability, positioned at enterprises that want AI to audit codebases and infrastructure for vulnerabilities. The pitch is that Claude can read through millions of lines of code faster than any human security researcher and flag potential exploits before attackers find them.

This is a real and valuable application of large language models. AI-assisted code review does catch real bugs. The technology is legitimate. The problem is specifically with the scale claims used to sell it.

The Gap Between Claims and Evidence

When a security vendor says a tool found "thousands" of severe vulnerabilities, the implicit claim is that those vulnerabilities are real, exploitable, and verified. Severity ratings in security research mean something specific - a "severe" zero-day is one that could allow remote code execution or full system compromise, not a misconfigured header.

If Anthropic validated 198 findings manually and extrapolated to "thousands," the honest question is: what happened to the rest? Were they auto-classified? Flagged by the model itself without human review? And what's the false-positive rate on the unreviewed portion?

Security claims that can't be independently verified are a known problem in the industry. Vendors routinely inflate finding counts by lowering severity thresholds or counting duplicates. The 198-manual-review detail suggests the "thousands" number may be raw model output, not confirmed vulnerabilities.

None of this means Claude Mythos is useless. AI-assisted security research is a real field with real value. But buyers evaluating enterprise security tools should ask for the specific methodology behind any headline number - how many findings were manually validated, what the false-positive rate was in that sample, and what criteria defined "severe." A 198-case sample is a starting point for research, not a foundation for a product claim.

What Claude Mythos Actually Does

The Gap Between Claims and Evidence

Related Tools

More from today

Trump Officials Reportedly Push Banks to Test Anthropic's Mythos Despite DOD Warning

AMD AI Director's Data: Claude Code Got Less Careful After a Silent Anthropic Change

AI Code Wars: Three Companies, One Developer, No Clear Winner

Cookie Preferences