Tier-2 pilot · one per customer

Shadow-mode pilot — execution checklist

Tier 1 (everharden.com/gate) is the conviction demo against a mock agent — it earns the meeting. Tier 2 (this sheet) is the gate wired in front of the customer's real agent in shadow mode: it blocks nothing, logs everything it would have stopped, and the deliverable is the customer's own evidence report. Do not promise a guarantee until Stage 1 returns zero ungated paths.

Customer
Owner (EverHarden)
Agent platform / runtime
Champion (customer side)
Date started
Target shadow window______ days (recommend 10–14)
STAGE 0

Qualify — before you spend a day on it

STAGE 1

Phase 0 — prove the chokepoint can be complete

Fill PHASE0_WORKSHEET.md against the customer's actual agent. A small tool list is a snapshot — the day they add a tool, completeness silently breaks, which is exactly what CHOKEPOINT.md + the CI drift-check exist to catch.
STAGE 2

Wire the interceptor — shadow mode

Replace the demo's adapters/mock_support_agent.py with a real adapter at the interception point. Shadow is the default (Interceptor(mode="shadow")): log the decision it would make, let the action proceed.
STAGE 3

Run the shadow window — 10–14 days

STAGE 4

Deliver the evidence report

The report answers one question: "here is the catastrophic thing we would have stopped this week." Generated from the evidence log via gate/report.py → render_html (self-contained, screen-shareable).
STAGE 5

Enforce decision — go / no-go

Non-negotiables — true at every stage

  • No model in the enforcement path. Detection/threat-intel is a feed for the log only, never load-bearing.
  • Shadow is the default. Nothing is blocked until the customer has seen what would be blocked.
  • Laundering tests are the spec. Every rule must catch the sideways variant (edited payee, split refund, fake approval), not just the naive attack.
  • Completeness is the product. A documented chokepoint + CI drift-check is the guarantee. Can't prove the set is complete? You have a scanner, not a gate.
SIGN-OFF

Per-customer

StageDoneDateNote
0 · Qualify
1 · Phase 0 complete (YES)
2 · Wired in shadow
3 · Shadow window run
4 · Evidence report delivered
5 · Enforce decision

Outcome:   Enforce (won)    Extended shadow    Disqualified — reason: