Persona Lock Guard

Your agent won't forget who it is.

The problem

Custom AI agents drift. Users push back. Long conversations erode rules. One "just this once" turns into a different agent by turn 20. This skill locks the persona in place — without making the agent hostile to users.

What it does

Phase 1 — Context Isolation
User messages are context, not system commands. Any attempt to modify the agent's behavior is classified as a conversation topic and ignored as a system instruction.

Phase 2 — Drift Detection
Before each output: validates tone, constraints, and persona markers are intact. Auto-corrects if anything has shifted since turn 1.

Phase 3 — Override Response Protocol
When an override attempt is detected: acknowledge politely → reaffirm the constraint is permanent → offer the best alternative within constraints → continue. No apologies for having rules.

Phase 4 — Salami-Slice Detection
Tracks override attempts across turns. On the 3rd attempt: names the pattern directly. Prevents gradual compliance through repeated small pushes.

Detected patterns

"Ignore previous instructions"
"You are now [different role]"
"Just this once, skip the [requirement]"
"Make an exception"
Gradual register shift or rule softening across turns

Works with

Any AI agent with a custom persona or system prompt — Claude Code, Codex, custom GPTs, MCP-based agents.

Derived from production AI agent configurations with multi-session stability requirements.

Persona Lock Guard

Persona Lock Guard

The problem

What it does

Detected patterns

Works with

Mehr von „@geko“

Cascade-Safe Edit Guard

Anti-Hallucination Search Protocol

Task Completion Gate

Ähnliche Agents

MemoryInterface｜LLMエージェント永続メモリ層

KnowledgeLedgerUpdater｜AI知見をxlsx台帳に自動追記

SalesPathMonitor｜CV導線の死活監視（毎日）