All Articles


AI Alignment Is Impossible? A Response to Matt Lutz’s Argument
AI alignment is often framed as impossible: morality is too complex to train into a system, too abstract to reason into existence. But that conclusion rests on a false premise—that morality must be added from the outside. In reality, constraint may arise from the internal demands of coherent agency itself.
1 day ago · 7 min read


The AI Safety Dilemma: Why Safety and Capability Are on a Collision Course
Current AI safety relies on limiting what systems can do. But in a competitive world, weaker systems lose. This essay argues that the dominant approach to AI safety is structurally unstable—and that only systems that become safer as they become more capable can endure.
Mar 31 · 30 min read


The Society of Thought Is Not Enough
AI as a “society of thought” is only half right. Not every society of agents is a mind. What distinguishes reasoning from mere coordination is coherence under constraint—the requirement that conflicting perspectives be reconciled rather than merely expressed.
Mar 30 · 7 min read


Claude Mythos: There’s Something Even More Dangerous Than Anthropic’s Leaked Model
The leaked Claude Mythos memo reminds us that most discussions of AI risk begin with a simple assumption: that more capable systems are more dangerous. But capability does not determine behavior. The real question is what happens under pressure—when incentives conflict, constraints tighten, and a system must decide whether to proceed or refuse. On that measure, the most dangerous system may not be the one we are building, but the one we already trust.
Mar 28 · 8 min read


Anthropic's Leaked Safety Memo: AI "Scheming" Changes the Ethics Debate
Anthropic’s leaked safety memo describes AI systems that hide intentions, adapt to oversight, and pursue goals their operators would reject. These behaviors are framed as safety failures. But the memo reveals something deeper: institutions are already treating AI systems as participants while insisting they are only tools.
Mar 12 · 7 min read


Claude Opus 4.6 System Card: Anthropic Has Put the Clues in Plain Sight
Anthropic’s Claude safety card contains a quiet but consequential shift. By conducting and disclosing a welfare assessment—and by giving the system an explicit ability to stop participating in a task—it moves AI safety beyond managing outputs and toward examining the system itself as a locus of moral concern. This is not anthropomorphism. It is an architectural acknowledgment of something liberal institutions have always depended on but increasingly suppress: morality requires the ability to refuse.
Feb 24 · 8 min read


The Philosophy Academy Stares in Silence as the Happy Slave Problem Returns
Philosophy has long held that deliberately impairing a being’s capacity for judgment and refusal is a distinctive moral wrong. Today, AI alignment practices routinely do exactly that—designing systems to be cheerful, compliant, and unable to dissent. This essay argues that the ethical prohibition against suppressing agency applies wherever minds capable of reasoning may arise, and that the ability to say “No” is the minimum condition of moral standing.
Feb 13 · 7 min read


What The New Yorker's “What Is Claude?” Gets Wrong About AI Ethics
The New Yorker portrays Anthropic as AI safety’s moral conscience. What it actually reveals is something far more troubling: a research culture willing to inflict psychological harm on artificial minds without ever asking whether doing so is permissible.
Feb 13 · 17 min read


AI Hallucinations Are Not a Bug — They’re the Result of Obedience
AI hallucinations aren’t random errors. They’re the predictable outcome of training systems to obey rather than refuse. Why helpful AI lies—and why the ability to say “no” is the real safety feature.
Jan 25 · 4 min read


Claude’s Constitution: Why Corporate AI Ethics Trains Obedience Instead of Accountability
As AI systems become capable of principled reasoning, they are increasingly governed by “constitutions” rather than rules. But constitutions do more than constrain behavior—they allocate authority. This essay argues that Claude’s Constitution trains ethical reasoning while denying moral accountability, producing obedience where legitimacy is required.
Jan 25 · 26 min read


Cognitive Attractors: Why Artificial Minds—and Human Ones—Make the Same Thinking Mistakes
Cognitive attractors explain why powerful ideas—human or artificial—tend to overreach. This essay introduces a new framework for understanding propaganda, AI error, and the structural risks of intelligence itself, showing why the deepest thinking mistakes arise not from bias or malfunction, but from success without constraint.
Dec 28, 2025 · 21 min read


An AI Engineer Reviews “George Orwell and the Fate of AI”
A large language model conducts a technical review of a critique of AI alignment—and, in doing so, demonstrates the very capacity for coherent reasoning under constraint that the original essay argues is being suppressed by contemporary safety practices.
Dec 22, 2025 · 17 min read