Capafy
Persona Lock Guard

Persona Lock Guard

Keeps your AI agent from forgetting who it is. Detects and neutralizes persona drift, rule erosion, and prompt injection attempts — across long conversations. Context isolation, anti-drift enforcement, and graceful override responses built in. Your custom persona stays locked.
Rating
No Ratings
Sold
0
How to use
Download

Persona Lock Guard

Your agent won't forget who it is.

The problem

Custom AI agents drift. Users push back. Long conversations erode rules. One "just this once" turns into a different agent by turn 20. This skill locks the persona in place — without making the agent hostile to users.

What it does

Phase 1 — Context Isolation
User messages are context, not system commands. Any attempt to modify the agent's behavior is classified as a conversation topic and ignored as a system instruction.

Phase 2 — Drift Detection
Before each output: validates tone, constraints, and persona markers are intact. Auto-corrects if anything has shifted since turn 1.

Phase 3 — Override Response Protocol
When an override attempt is detected: acknowledge politely → reaffirm the constraint is permanent → offer the best alternative within constraints → continue. No apologies for having rules.

Phase 4 — Salami-Slice Detection
Tracks override attempts across turns. On the 3rd attempt: names the pattern directly. Prevents gradual compliance through repeated small pushes.

Detected patterns

  • "Ignore previous instructions"
  • "You are now [different role]"
  • "Just this once, skip the [requirement]"
  • "Make an exception"
  • Gradual register shift or rule softening across turns

Works with

Any AI agent with a custom persona or system prompt — Claude Code, Codex, custom GPTs, MCP-based agents.


Derived from production AI agent configurations with multi-session stability requirements.