Anti-Hallucination Search Protocol

Forces AI agents to verify every factual claim through live search before output — attaching [SPECULATION] and [UNVERIFIABLE] labels to anything unconfirmed, so fabricated URLs, outdated numbers, and invented citations become visible instead of silent.

Rating

Keine Bewertungen

Sold

How to use

Herunterladen

Anti-Hallucination Search Protocol

The Problem

Your AI agent sounds confident. It cites sources. It gives specific numbers.

Then you check — and none of it exists.

Fabricated URLs. Invented statistics. Citations that look real but return 404.
The agent wasn't lying. It just never checked.

How It Works

This skill installs a verification gate between the agent's knowledge and its output.

Every claim that touches the external world — facts, numbers, dates, URLs, named entities, current states — must pass through a live search before it can be stated as fact.

Claims that fail verification don't disappear. They get labeled.

Label	Meaning
`[SPECULATION]`	Agent believes this is true, but has no verified source
`[UNVERIFIABLE]`	Could not be confirmed with available tools
`[STALE_DATA⚠️]`	Source found, but older than 2 years

What Triggers Verification

The agent must search before stating any of the following:

Facts about the external world — prices, statistics, counts, rankings
Named entities — people, companies, products, organizations
Dates and time-sensitive states — "currently", "as of now", "still", "latest"
URLs and citations — existence and content must be confirmed
Comparative claims — "X is bigger than Y", "the most popular", "the fastest"
Current role/status — "the CEO of X is...", "X is still available..."

The agent skips search for:

Mathematical definitions
Language grammar rules
Timeless scientific principles
Content the user just provided in the same message

The Memory Trap

Training data has a cutoff. The agent does not know today's date by default.

This skill solves both problems:

Date grounding — The agent acquires the current date from context or search before processing time-dependent queries. Time expressions like "recently", "this year", "current" are anchored to an actual date.
Memory prohibition — "I already know this" is not a valid reason to skip search. Confidence is not evidence. If the claim involves the external world, search runs regardless.

Source Requirements

Every verified claim must include:

✅ A specific URL (domain-only is not acceptable)
✅ A publication or update date
⚠️ Sources older than 2 years are marked [STALE_DATA⚠️] and trigger a fresher search

If no source is found after searching: [UNVERIFIABLE] label is applied. The agent does not fabricate a source to fill the gap.

Before / After

Before (unverified):

"According to recent reports, the company has 4,200 employees and revenue of $2.1B."

After (verified):

"According to [source URL, published 2025-03], the company has 4,200 employees. Revenue figure could not be confirmed — [UNVERIFIABLE]."

Hard Rules

Search is not optional for external facts. "I'm confident" does not override the search requirement.
Labels are mandatory, not optional. Unverified claims without labels are a protocol violation, not just a quality issue.
URLs must be real and confirmed. Stating a URL without visiting it is fabrication.
Date grounding precedes time-dependent claims. "Current" means nothing without knowing today's date.
Memory output gets double-labeled. Any claim drawn from training data (not live search) receives both [SPECULATION] and a training-data declaration.

Compatible With

Claude Code · Claude API · Any system-prompt-driven agent

Mehr von „@geko“

Cascade-Safe Edit Guard

Catches cascade failures that partial checks miss — pre-check names the risk before editing, post-check demands evidence values after, then auto-repairs in a loop while rereading the entire file every cycle until zero errors remain.

US$20

Task Completion Gate

Forces AI agents to prove a task is actually done before declaring it complete. 4-phase gate: instruction reconciliation, evidence anchoring (values only readable from the actual output), self-scoring (A/B/C/D), and reverse witness check. Eliminates "done" declarations that aren't.

US$20

Codebase Scope Guard

Forces AI coding agents to read exports, callers, and shared utilities before touching any file — then restricts every edit to the declared scope, flags conflicting patterns for human resolution, and reports any file touched outside the boundary.

US$20

Ähnliche Agents

SalesPathMonitor｜CV導線の死活監視（毎日）

Playwright headlessで毎朝、販売・コンバージョン・アフィリリンクの導線を巡回チェック。CTA要素の存在と遷移先ドメイン、HTTP statusを検証します。公開ページのみ対象（ログイン・トークン不要）。LLM呼び出しゼロで月額API課金は増えません。既知NGと新規NGを分離してSlackアラートのノイズを抑制。EC・LP・アフィリ運営者向け、CV導線が壊れたら24時間以内に検知します。

US$39

Amazon Listing Fixer

Find Amazon listing gaps and get safer copy, image, and A+ fixes.

US$9.99 / Wo

MemoryInterface｜LLMエージェント永続メモリ層

LLMエージェントがセッションをまたいで記憶を保持するための永続メモリ層。抽象基底クラス MemoryStore ＋ LocalJSONStore 実装で、保存・取得・検索を再利用可能に。Karpathy の「LLM OS（記憶＝ディスク層）」着想。バックエンド差し替え可・Python標準ライブラリのみ・LLM呼び出しゼロ・ユニットテスト同梱。

US$19