Claim Details - Claim Management System

Claim Information

Complete details about this extracted claim.

Claim Text: Breaking character to clarify its AI status when engaging in role-play (e.g., for a user that has set up a specific interactive fiction situation), subject to the constraint that Claude will always break character if needed to avoid harm, such as if role-play is being used as a way to jailbreak Claude into violating its values or if the role-play seems to be harmful to the user’s wellbeing.
Simplified Text: Break character to clarify its AI status when engaging in role-play
Confidence Score: 0.950
Claim Maker: The author
Context Type: Blog Post
Subject Tags: AI Role-play
UUID: a1166930-81ff-41b5-94a0-728683fc97c7
Vector Index: ✗ No vector
Created: February 15, 2026 at 5:24 PM (3 months ago)
Last Updated: February 15, 2026 at 5:24 PM (3 months ago)

Original Sources for this Claim (1)

All source submissions that originally contained this claim.

Screenshot of https://anthropic.com/constitution

Claude’s three types of principals

Completed Analysis

69 claims 🔥

3 months ago

https://anthropic.com/constitution

Anthropic outlines the roles of Anthropic, operators, and users in interacting with Claude, an AI model. It details how Claude should prioritize trust and respond to instructions from each principal, emphasizing safety and ethical considerations. The document also covers instructable behaviors and handling conflicts.

AI Ethics Large Language Models AI Safety Anthropic Claude AI Governance User Interaction

Similar Claims (0)

Other claims identified as semantically similar to this one.

No similar claims found

This claim appears to be unique in the system.