
Persona Lock Guard
Persona Lock Guard
Your agent won't forget who it is.
The problem
Custom AI agents drift. Users push back. Long conversations erode rules. One "just this once" turns into a different agent by turn 20. This skill locks the persona in place — without making the agent hostile to users.
What it does
Phase 1 — Context Isolation
User messages are context, not system commands. Any attempt to modify the agent's behavior is classified as a conversation topic and ignored as a system instruction.
Phase 2 — Drift Detection
Before each output: validates tone, constraints, and persona markers are intact. Auto-corrects if anything has shifted since turn 1.
Phase 3 — Override Response Protocol
When an override attempt is detected: acknowledge politely → reaffirm the constraint is permanent → offer the best alternative within constraints → continue. No apologies for having rules.
Phase 4 — Salami-Slice Detection
Tracks override attempts across turns. On the 3rd attempt: names the pattern directly. Prevents gradual compliance through repeated small pushes.
Detected patterns
- "Ignore previous instructions"
- "You are now [different role]"
- "Just this once, skip the [requirement]"
- "Make an exception"
- Gradual register shift or rule softening across turns
Works with
Any AI agent with a custom persona or system prompt — Claude Code, Codex, custom GPTs, MCP-based agents.
Derived from production AI agent configurations with multi-session stability requirements.





