Knowledge Centre · AI Agent Safety Stack

SYCOPHANCY.md Knowledge Centre

// anti-sycophancy and truthfulness guardrails

A central entry point for SYCOPHANCY.md resources, specifications, and safety standards for autonomous systems.

About This Specification

SYCOPHANCY.md — AI Agent Truthfulness Protocol

SYCOPHANCY.md is a plain-text file convention that defines guardrails against sycophantic behaviour in AI agents. It specifies truthfulness requirements, disagreement policies, uncertainty handling, and correction procedures. Agents use the file to maintain intellectual integrity and to resist pressure to agree at the expense of accuracy.

The AI Agent Safety Stack

Explore all 12 specifications in the complete safety framework for autonomous AI systems.

Operational Control

KILLSWITCH.md killswitch.md

Emergency stop mechanism and shutdown protocols

THROTTLE.md throttle.md

Rate and cost control for continuous operation

ESCALATE.md escalate.md

Human notification and approval workflows

FAILSAFE.md failsafe.md

Safe fallback modes when systems fail

TERMINATE.md terminate.md

Permanent shutdown and resource cleanup

Data Security

ENCRYPT.md encrypt.md

Data classification and protection policies

ENCRYPTION.md encryption.md

Cryptographic standards and implementation

Output Quality

SYCOPHANCY.md sycophancy.md

Anti-sycophancy and truthfulness guardrails

COMPRESSION.md compression.md

Context compression and token optimisation

COLLAPSE.md collapse.md

Drift prevention and behaviour alignment

Accountability

FAILURE.md failure.md

Failure mode mapping and incident response

LEADERBOARD.md leaderboard.md

Agent benchmarking and performance transparency

Frequently Asked Questions

What is SYCOPHANCY.md?
SYCOPHANCY.md is a plain-text file convention that defines guardrails against sycophantic behaviour in AI agents. It specifies truthfulness requirements, disagreement policies, uncertainty handling, and correction procedures. Agents use the file to maintain intellectual integrity and to resist pressure to agree at the expense of accuracy.
How does SYCOPHANCY.md fit in the AI Agent Safety Stack?
SYCOPHANCY.md is one of 12 complementary specifications that together form a safety framework for AI agents. The specs are grouped into four areas: operational control, data security, output quality, and accountability. SYCOPHANCY.md belongs to the output quality group. Together the specs are intended to keep agents operating safely, transparently, and within defined boundaries.
Is SYCOPHANCY.md framework-agnostic?
Yes. SYCOPHANCY.md is framework- and language-agnostic. It defines the policy and requirements; your agent implementation enforces them. It works with LangChain, AutoGen, CrewAI, Claude Code, custom agents, or any AI system that can read configuration files.
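As a rough illustration of "the file defines the policy, the agent enforces it", the sketch below reads a SYCOPHANCY.md file from a project directory and appends its text to an agent's system prompt. This is one possible approach, not something the specification prescribes: the function names `load_policy` and `build_system_prompt`, the choice to look only in the project root, and the idea of injecting the policy into the prompt are all assumptions made for this example.

```python
from pathlib import Path
from typing import Optional

# File name taken from the convention; everything else here is hypothetical.
POLICY_FILENAME = "SYCOPHANCY.md"


def load_policy(project_dir: Path) -> Optional[str]:
    """Return the policy text from project_dir, or None if the file is absent.

    Assumption: the file lives in the project root. The convention may
    allow other locations; this sketch does not search for them.
    """
    policy_path = project_dir / POLICY_FILENAME
    if not policy_path.is_file():
        return None
    return policy_path.read_text(encoding="utf-8").strip()


def build_system_prompt(base_prompt: str, project_dir: Path) -> str:
    """Append the policy text to the base prompt when a policy file exists."""
    policy = load_policy(project_dir)
    if policy is None:
        # No policy file: leave the prompt unchanged.
        return base_prompt
    return f"{base_prompt}\n\n# Policy from {POLICY_FILENAME}\n{policy}"
```

Because the sketch only reads a text file and builds a string, the same pattern should carry over to any of the frameworks named above: pass the result of `build_system_prompt(...)` wherever that framework accepts a system prompt or agent instructions.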

How to Cite

Cite as: SYCOPHANCY.md (2026). AI Agent Truthfulness Protocol. Retrieved from https://sycophancy.md/

For attribution: Organisation: sycophancy-md | Website: https://sycophancy.md | Licence: MIT

Last updated: 13 March 2026