Anthropic brings Claude’s value layer out of the lab
Claude as a subject of value review, not just technical tuning.📷 AI-generated image / TECH&SPACE
- ★Anthropic is consulting theologians and ethicists about Claude’s behavior, raising the question of who shapes AI system values.
- ★The issue arrives as the pope warns about the social and moral risks of artificial intelligence.
- ★The core issue is not religion inside a chatbot, but transparency: which norms the model follows, who chooses them and how that is audited.
Anthropic’s consultation with theologians and ethicists about Claude’s behavior may sound like an odd footnote from the background of chatbot development. It is actually one of the clearest signals of where AI alignment is moving: beyond laboratories, benchmarks and safety dashboards. According to Scientific American, the company has been speaking with religious thinkers and ethicists about how Claude should respond in sensitive situations.
That does not mean Anthropic is turning Claude into a religious product. More precisely, the company is acknowledging that no major language model arrives without a value layer. When a chatbot refuses advice, softens an answer, flags a risk or tries to balance conflicting moral intuitions, some decision is already present. The only question is whether that decision lives inside a closed internal document, emerges from training data by default, or is exposed to broader social scrutiny.
Anthropic has long built its public identity around safety and alignment. Its approach known as Constitutional AI tries to steer a model with explicit principles rather than relying only on later punishment of bad outputs. In that context, inviting theologians into the conversation is not just decorative. Religious traditions have spent centuries dealing with guilt, intention, conscience, authority, suffering and responsibility. Those are exactly the areas where generative AI can most easily slide from useful assistant into bad adviser.
As the pope warns about artificial intelligence, Anthropic is testing a larger question: who gets to tune the moral compass of a chatbot used by millions.
Response boundaries become an editorial and social question.📷 AI-generated image / TECH&SPACE
But this is also where the hard problem starts. If religious thinkers, ethicists, safety teams and product managers all help shape model behavior, users need at least a rough understanding of what has been built into the system. Not because every Claude conversation should become a philosophy seminar, but because the public needs to distinguish safety policy from cultural norm, and technical limitation from editorial decision.
The pope’s warning about artificial intelligence raises the stakes because it comes from an institution that does not see AI only as a productivity tool. Vatican and broader religious debates about technology often circle around human dignity, labor, responsibility and power. A similar frame appears in initiatives such as the Rome Call for AI Ethics, which tries to translate moral principles into language that can apply to technological systems.
For the industry, this is uncomfortable but useful. A chatbot that talks with users about family, illness, faith, work, politics or self-harm is already operating in value-laden territory. The claim that a model is neutral often means only that its assumptions are hidden. Anthropic’s move is therefore best read less as a religious turn and more as a transparency test: can an AI company show who shapes the model’s boundaries, according to which criteria, and with what room for public criticism?
If the answer remains a public-relations sentence, consulting theologians will become another layer of reputational insurance. If it becomes part of a verifiable process, with clear principles, public explanations and measurable consequences in model behavior, then Claude becomes a useful case study for the whole industry. Not because religion should run AI, but because AI can no longer pretend that it has no moral architecture.

