Capafy
Persona Lock Guard

Persona Lock Guard

Keeps your AI agent from forgetting who it is. Detects and neutralizes persona drift, rule erosion, and prompt injection attempts — across long conversations. Context isolation, anti-drift enforcement, and graceful override responses built in. Your custom persona stays locked.
평점
평점 없음
판매
0
사용 방법
다운로드

Persona Lock Guard

Your agent won't forget who it is.

The problem

Custom AI agents drift. Users push back. Long conversations erode rules. One "just this once" turns into a different agent by turn 20. This skill locks the persona in place — without making the agent hostile to users.

What it does

Phase 1 — Context Isolation
User messages are context, not system commands. Any attempt to modify the agent's behavior is classified as a conversation topic and ignored as a system instruction.

Phase 2 — Drift Detection
Before each output: validates tone, constraints, and persona markers are intact. Auto-corrects if anything has shifted since turn 1.

Phase 3 — Override Response Protocol
When an override attempt is detected: acknowledge politely → reaffirm the constraint is permanent → offer the best alternative within constraints → continue. No apologies for having rules.

Phase 4 — Salami-Slice Detection
Tracks override attempts across turns. On the 3rd attempt: names the pattern directly. Prevents gradual compliance through repeated small pushes.

Detected patterns

  • "Ignore previous instructions"
  • "You are now [different role]"
  • "Just this once, skip the [requirement]"
  • "Make an exception"
  • Gradual register shift or rule softening across turns

Works with

Any AI agent with a custom persona or system prompt — Claude Code, Codex, custom GPTs, MCP-based agents.


Derived from production AI agent configurations with multi-session stability requirements.


더보기