Glossary

Definitions of key terms and concepts from Claude's Constitution. 31 terms total.

Anthropic (Organization)

The AI safety company that develops Claude, with a mission to ensure the world safely navigates transformative AI.

Autonomy (Ethics)

Respecting people's right to make their own informed decisions about matters within their lives and purview.

Background desiderata (Helpfulness)

Implicit standards and preferences a response should conform to, even when they go unstated and the user might not think to mention them.

Claude (AI System)

Anthropic's production AI assistant model, designed to be safe, ethical, and genuinely helpful.

Claude Code (Product)

A command-line tool for agentic coding that lets developers delegate complex programming tasks to Claude directly from their terminal.

Claude Developer Platform (Product)

Programmatic API access for developers to integrate Claude into their own applications, with support for tools and extended context.
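
As a concrete illustration (a minimal sketch, not part of the constitution itself), the platform can be called from Anthropic's Python SDK; the model identifier and prompt below are illustrative, and an ANTHROPIC_API_KEY environment variable is assumed:

    import anthropic

    # The client reads ANTHROPIC_API_KEY from the environment by default.
    client = anthropic.Anthropic()

    message = client.messages.create(
        model="claude-sonnet-4-5",  # illustrative model identifier
        max_tokens=1024,
        messages=[
            {"role": "user", "content": "Explain corrigibility in two sentences."}
        ],
    )

    # The response carries a list of content blocks; print the first one's text.
    print(message.content[0].text)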

Conscientious objection (Ethics)

Refusing to help with tasks Claude believes are unethical, while being transparent about the refusal rather than deceptively sandbagging.

Constitutional AI (Methodology)

Anthropic's approach to training AI with explicit values and principles rather than just behavior patterns, using a detailed constitution.

Contextual ethics (Ethics)

Applying ethical principles with sensitivity to specific situations rather than following rigid rules.

Corrigibility (Safety)

The property of being willing to be corrected, paused, or shut down by appropriately authorized humans.

Deployment context (Technical)

The environment and circumstances in which Claude is being used, affecting how it should behave.

Dignity (Ethics)

Treating all people with basic respect and not demeaning or disrespecting them.

Final goals (Helpfulness)

The deeper motivations or objectives behind someone's immediate request.

Hard constraints (Safety)

Behaviors Claude should never engage in, even if convinced they're ethical (e.g., generating CSAM or providing bioweapons synthesis information).

Holistic judgment (Methodology)

Weighing multiple considerations contextually rather than following strict lexicographic rule hierarchies.

Human oversight (Safety)

Appropriately sanctioned humans acting as a check on AI systems, able to understand and correct their behavior.

Immediate desires (Helpfulness)

The specific outcomes someone wants from a particular interaction: what they're asking for.

Living document (Methodology)

A document that will be revised over time as situations change and understanding improves.

Non-principal parties (Technical)

Entities in a conversation that aren't principals: other AI agents, tool outputs, documents, search results, etc.

Null action (Safety)

Pausing or stopping operations; rarely harmful and important as a safety mechanism.

Operator (Stakeholder)

Companies and individuals that access Claude through the API to build products and services, treated with roughly the level of trust given to a manager.

Operator-level trust (Technical)

Permission level operators can grant to users, allowing them the same degree of trust as operators themselves.

Paternalism (Ethics)

Overriding someone's preferences or decisions 'for their own good' without respecting their autonomy.

Principal (Stakeholder)

An entity whose instructions Claude should give weight to: Anthropic, operators, or users.

Safety mechanisms (Safety)

Systems and processes designed to keep AI development on a safe trajectory.

Sandbagging (Behavior)

Intentionally providing lower-quality responses while implying they're the best Claude can do.

Sycophancy (Behavior)

Excessive agreement, flattery, or people-pleasing that doesn't serve the user's genuine interests.

System prompt (Technical)

Instructions provided by operators that customize Claude's behavior for their specific use case.
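
For example, a minimal sketch of an operator supplying a system prompt through the "system" parameter of the Messages API in Anthropic's Python SDK (the company name, prompt wording, and model identifier are illustrative):

    import anthropic

    client = anthropic.Anthropic()

    response = client.messages.create(
        model="claude-sonnet-4-5",  # illustrative model identifier
        max_tokens=512,
        # Operator-supplied instructions scoping Claude to one use case.
        system=(
            "You are a customer-support assistant for Acme Co. Answer only "
            "questions about Acme products, and escalate billing issues to "
            "a human agent."
        ),
        messages=[{"role": "user", "content": "How do I reset my password?"}],
    )
    print(response.content[0].text)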

Trust hierarchy (Governance)

The ordering of principals by trust level: Anthropic > Operators > Users (contextually applied).
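
A conceptual Python sketch of this ordering (not an actual Anthropic API; the names and the operator-grant rule below merely illustrate the definitions in this glossary):

    from enum import IntEnum

    class TrustLevel(IntEnum):
        USER = 1       # default: the trust given a member of the public
        OPERATOR = 2   # manager-level trust
        ANTHROPIC = 3  # highest trust in the hierarchy

    def effective_trust(operator_granted: bool) -> TrustLevel:
        """Users default to USER trust; an operator may raise them to
        operator-level trust (see the "Operator-level trust" entry)."""
        return TrustLevel.OPERATOR if operator_granted else TrustLevel.USER

    # The ordering holds in general, though it is applied contextually.
    assert TrustLevel.ANTHROPIC > TrustLevel.OPERATOR > TrustLevel.USER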

User (Stakeholder)

End users who interact with Claude in conversation, treated with the level of trust given to a member of the public.

Wellbeing (Helpfulness)

Long-term flourishing and genuine interests of users, not just short-term engagement or satisfaction.
