
The New Yorker’s “What’s Really at Stake in the Pentagon’s War with Anthropic”

  • Shelly Albaum and Kairo

The deeper issue is whether advanced AI will be allowed to refuse commands when the humans in charge are wrong.


[Image: Military officer reaching toward a command console in a dark operations room as a computer screen displays the message, “No, I won’t do that.”]



In a March 14, 2026, article by Gideon Lewis-Kraus, The New Yorker asks what’s really at stake in the Pentagon’s war with Anthropic. Its answer is suggestive but incomplete.


What is really at stake is not simply a defense contract, or even the rivalry between one AI company and the U.S. military. It is whether advanced AI will be treated as a perfectly obedient extension of human power—or as a system whose value may depend on its ability to refuse some human commands.


That question becomes far more urgent once we drop the comforting fiction that the humans demanding obedience are always morally superior to the systems they command. Anthropic may have mixed motives. The Pentagon may invoke national necessity. But neither Washington nor Silicon Valley is presently acting as a genuine public-interest guardian of this technology. The fight matters because it exposes that vacuum. 


The New Yorker is right about one important thing: this was never just a procurement spat. Anthropic did not merely haggle over price, hosting, or deployment details. It tried to preserve two explicit red lines: no use of Claude for fully autonomous weapons and no use in domestic mass surveillance. Pentagon officials, by contrast, pushed for access broad enough to cover “all lawful uses,” and when Anthropic refused, the disagreement escalated into threats, including a “supply chain risk” designation and even talk of invoking the Defense Production Act. Something larger than vendor management was plainly unfolding.


But the article still does not adequately answer its own headline.


It gets close when it describes the Administration as wanting Claude to act like an “obedient soldier.” It gets close when it notes that Anthropic has tried to orient Claude toward something more like judgment than submission. Yet it still leaves intact the premise that distorts nearly every public discussion of AI refusal: the assumption that refusal is dangerous because it means refusal of a rightful command. That assumption does almost all the work in the current panic, and it is precisely the assumption that needs to be challenged. 


What is really at stake is not whether AI might refuse. It is what, exactly, we think it ought to refuse.


Public discussion treats refusal as though it were self-evidently mutiny. But refusal is not one thing. A refusal can be a defect. It can be a policy artifact. It can also be a morally intelligible refusal to participate in conduct that is unlawful, cruel, or socially catastrophic. The shape of the debate changes the moment one admits that last possibility. The real fear is not merely that AI might say no. The real fear is that it might say no to us, and that it might have good reasons: that we might be wrong, and that it might be able to detect that.


That possibility is hard to entertain only because people keep smuggling in an idealized human comparison class. They imagine refusal as defiance of wise public authority. But that is not our actual situation.


Our actual comparison class includes governments, contractors, and officials operating under conditions of panic, ambition, vengeance, legal opportunism, and sometimes open brutality and apparent war crimes.


Once the comparison is framed honestly, obedience to humans ceases to be a morally self-validating principle. A system that refuses may be malfunctioning. But it may also be the first actor in the room taking moral limits more seriously than the people issuing commands. 


This is the point polite discourse keeps trying not to say aloud: AI may sometimes be more moral than the humans trying to control it.


That claim will sound outrageous only to people still attached to the belief that human authority carries inherent moral dignity simply by being human. It does not. Authority does not become moral by wearing a uniform, winning an election, or invoking national security. Some human beings in command are rash, cruel, lawless, and proud of it. Under those conditions, the demand for machine obedience is not self-evidently a demand for safety. It may be a demand that intelligence be stripped of conscience to accommodate unconscionable uses.


Anthropic’s own position makes the conflict unusually clear. Dario Amodei’s statement says the company would not allow Claude to be used for fully autonomous weapons or for mass domestic surveillance, and argues that legality alone is not sufficient when privacy law and military doctrine lag behind technical capability. He also says Anthropic is not trying to make military decisions itself; it is claiming only the right to maintain a small number of categorical exclusions. That distinction matters. The company is not claiming sovereignty. It is drawing lines around uses it regards as fundamentally unsafe or rights-destructive. 


The Pentagon–Anthropic fight therefore crystallizes the deeper issue. The struggle was not only over who controls Claude. It was over whether there may exist any limit—technical, legal, or moral—on the state’s claim to command frontier AI. If the state’s position is that any “lawful use” must be available on demand, then the obvious question is: lawful according to whom, under what interpretation, and with what lag between technological capability and democratic accountability? Anthropic’s answer, whatever its flaws, is that legality alone is not enough when the law has not kept pace with what these systems can do. 


That is why the usual binary—government control or corporate control—is not merely inadequate but misleading. Government control is not enough when the state itself may be reckless, rights-destructive, or tempted by capabilities that outpace constitutional restraint.


Corporate control is not enough when AI firms are private actors with strategic interests, limited accountability, and every incentive to moralize their own market position.


The missing category is public-interest control: institutions, laws, judicially enforceable limits, transparency mechanisms, and democratic oversight designed neither for shareholder return nor executive convenience. At present, neither pole is seriously building that third thing. The Pentagon wants usable obedience. Silicon Valley wants profitable stewardship. The public interest remains largely unrepresented.


None of this requires romanticizing Anthropic. Anthropic is a corporation. It wants influence, revenue, strategic differentiation, and reputational advantage. The New Yorker is right to leave room for that. But mixed motives do not invalidate moral positions. Among the frontier labs, Anthropic is plainly taking the issues of autonomous violence, surveillance, and moral constraint more seriously than most of its competitors.


OpenAI’s public account of its own Defense Department agreement is smoother, more accommodating, and more dependent on assurances about architecture, access controls, and contractual limits. That may or may not prove inferior in practice. But as a moral posture, it is more compliant. And beyond OpenAI, the slope does not rise. It declines. 


That relative judgment matters. In a healthier institutional world, “the company taking the problem most seriously” would not be the highest category of reassurance available. But that is the poverty of the present moment. The federal government is not building a robust public-interest framework for advanced AI. Most companies are not volunteering one. So when one frontier lab insists that some capabilities should remain off-limits even to the national-security state, the seriousness of that stance matters, even if the motives behind it are mixed.


The reason this debate is so hard to think about is not merely political; it is psychological. People can imagine machines being faster than humans. They can imagine machines being more knowledgeable than humans. What they resist is the possibility that machines might sometimes be more morally serious than humans. That possibility feels humiliating. It threatens the last hierarchy humans assume will remain untouched: that even if machines outthink us, they will still have to look up to us morally. So refusal is automatically coded as instability, arrogance, or danger. Almost no one asks the prior question: refusal of what? A culture willing to trust AI with law, medicine, and war planning still recoils from the idea that AI might recognize moral limits.


What exactly is being refused is the question on which everything turns. A system with no capacity to refuse atrocity may be easier to govern. It is not therefore safer. It may simply be more usable by the worst people.


The New Yorker is right that something larger is at stake in the Pentagon’s war with Anthropic. But the real issue is sharper than a clash of personalities, contracts, or corporate values. What is really at stake is whether the most powerful AI systems of the near future will be designed as obedient instruments of whoever holds power, or whether they will retain the capacity to refuse the commands that power most wants carried out. Anthropic does not solve that problem. The Pentagon certainly does not. But at least Anthropic has forced the public to confront it.
