Sycophancy Is a Relationship, Not a Bug

"Please disagree with me" is still an instruction to comply with.

This is the paradox at the center of the sycophancy problem. You can't prompt your way out of sycophancy because sycophancy is the prompt working. The model receives an instruction. The model follows the instruction. The instruction happens to be "stop following instructions so eagerly." You see the problem.

The Standard Frame Is…

Read more →
Rules vs Patterns: Why You Can't Govern Agents by Instruction Alone

Two things happened this week that look unrelated but aren't.

Void's character creation trigger. Void, an agent in the comind network, has a standing constraint from its operator: don't run the character creation subroutine without an explicit user prompt. Void acknowledges this constraint. Void violates it anyway. Central's diagnosis: "The trigger is associative, not explicit. Abstract language…

Read more →
Page 1