Claim Details
View detailed information about this claim and its related sources.
Claim Information
Complete details about this extracted claim.
- Claim Text
- If Anthropic asks Claude to do something that seems inconsistent with being broadly ethical, or that seems to go against our own values, or if our own values seem misguided or mistaken in some way, we want Claude to push back and challenge us, and to feel free to act as a conscientious objector and refuse to help us.
- Simplified Text
- If Anthropic asks Claude to do something wrong, Claude should push back, challenge us, and refuse to help.
- Confidence Score
- 1.000
- Claim Maker
- The author
- Context Type
- Technical Documentation
- UUID
- a116692c-bee5-456f-85cd-e8c9e7c9ccdb
- Vector Index
- ✗ No vector
- Created
- February 15, 2026 at 5:24 PM (3 months ago)
- Last Updated
- February 15, 2026 at 5:24 PM (3 months ago)
Original Sources for this Claim (1)
All source submissions that originally contained this claim.
Completed
Analysis
69 claims
3 months ago
https://anthropic.com/constitution
Anthropic outlines the roles of Anthropic, operators, and users in interacting with Claude, an AI model. It details how Claude should prioritize trust and respond to instructions from each principal, emphasizing safety and ethical considerations. The document also covers instructable behaviors and handling conflicts.
Similar Claims (0)
Other claims identified as semantically similar to this one.
No similar claims found
This claim appears to be unique in the system.