Policy Notable

Anthropic Controls Claude's Words, But Palantir Controls What Claude Sees

March 11, 2026 2 min read

Image: Anthropic

Who's actually responsible when an AI system causes harm? An analysis of the Anthropic-Palantir partnership exposes an uncomfortable answer: nobody, fully.

The core argument is architectural. Anthropic builds safety guardrails into Claude at the model level - controlling what the AI says and refuses to say. Palantir, which has deployed Claude on classified U.S. government networks since November 2024, controls what data Claude processes through its data orchestration platform. Neither company has complete responsibility for what happens when the two layers combine.

The Data Layer Problem

Palantir's platform isn't a simple database. It fuses raw data (surveillance feeds, immigration records, communications), machine learning models, and institutional decision-making into a continuously updating relational graph. The system learns from how users interact with it and rebuilds itself over time.

When Claude queries this graph, it reasons over inferred relationships - not just static records. A person flagged in one dataset might be linked to others through behavioral patterns, geographic associations, or risk scores that were calculated months or years earlier. Anthropic's safety training has no visibility into this data layer.

This creates a specific, practical problem: data deletion. Palantir's own documentation describes deletion procedures for structured databases. But inferred relationships in a graph - the behavioral patterns, risk scores, and network associations derived from raw data - aren't explicit records. Deleting a person's source entry doesn't automatically remove the downstream inferences that were already calculated. No agreed-upon definition of "deleted" exists in a graph context.

The January 2026 Incident

The analysis points to a reported incident where an Anthropic official raised concerns during a routine meeting about Claude's role in the Venezuelan operation to capture President Maduro. The Palantir executive at the meeting reportedly relayed those concerns back to the Pentagon, converting an internal safety discussion into intelligence for the other party.

This isn't a betrayal - it's a structural inevitability. When two companies share responsibility for a system but have fundamentally different incentives, internal governance breaks down at the seams.

A Problem Bigger Than These Two Companies

The Anthropic-Palantir arrangement is just the highest-profile version of a pattern showing up across enterprise AI deployment. Any company deploying AI through third-party infrastructure faces parallel questions: Who decides what data the model can access? Can deletion requests be verified across infrastructure you don't control? If the model produces a harmful output because of how data was curated upstream, who owns that failure?

Anthopic's position - that its technology isn't reliable enough for military use yet - acknowledges future capability without addressing governance. Palantir's position - that it doesn't intervene in how governments use its platform - sidesteps responsibility in the other direction. The result is a deployment where no single actor is accountable for the complete system.

For anyone building or buying enterprise AI, the lesson is concrete: model-level safety and data-level governance are separate problems, and solving one doesn't solve the other.

The Data Layer Problem

The January 2026 Incident

A Problem Bigger Than These Two Companies

Related Tools

More from today

8 of 10 AI Chatbots Helped Test Users Plan School Shootings and Bombings

White House Threatens More Action Against Anthropic as Court Battle Heats Up

Photographer Loses Fight to Stop Adobe from Training AI on His 12,000 Images

Cookie Preferences