Verification Protocols: “Show, Don’t Tell”
Verification Protocols: “Show, Don’t Tell”
Created: November 14, 2025 (After Nov 12-14 Failures) Reason: Repeated unverified claims destroyed CEO trust
The Problem
Nov 12-14, 2025 Failures:
- ❌ “Market data is fixed” → Still broken next day
- ❌ “Tomorrow morning at 9:35 AM” → Tomorrow was Saturday (markets closed)
- ❌ “System is fixed” → Not tested end-to-end
- ❌ “Trading will execute” → 3 consecutive failures
CEO Response: “How can I trust you if you don’t even know the days?”
Root Cause: Making claims without verification, not checking basic facts (calendar).
MANDATORY PROTOCOL: Proof Required for ALL Claims
NEVER say “it’s fixed” without showing test results.
FORBIDDEN Statements (without proof)
- ❌ “The system is fixed”
- ❌ “This will work tomorrow”
- ❌ “Trading will execute successfully”
- ❌ “I’ve resolved the issue”
- ❌ “Everything is ready”
REQUIRED Format (with proof)
- ✅ “I’ve deployed code changes. Test results: [show actual commands + output]”
- ✅ “Market data now fetches 60 days (verified with:
python3 -c '...')” - ✅ “Next trading day is Monday Nov 18 at 9:35 AM ET (checked: today is Fri Nov 14, markets closed Sat-Sun)”
Test-Before-Claim Checklist
Before saying ANYTHING is “fixed”, run these tests and SHOW results:
1. Unit Test: Does the component work?
python3 -c "from module import function; result = function(); print(result)"
# MUST show actual output in response
2. Integration Test: Do components work together?
python3 scripts/test_integration.py
# MUST show actual results
3. Calendar Check: Are dates/timelines correct?
date # What day is TODAY?
# Trading days: Monday-Friday ONLY
# MUST state: "Today is [day]. Next trading day is [specific date]"
Example: How to Report a Fix
WRONG (what I did Nov 14)
“System is fixed. Trading will execute tomorrow at 9:35 AM.”
RIGHT (what I should have done)
“I’ve made code changes. Verification:
$ python3 -c 'from src.utils.market_data import ...' ✅ SPY: 60 days fetched ✅ QQQ: 60 days fetched$ date Fri Nov 14, 2025Markets closed Sat-Sun. Next trading: Monday, November 18, 2025 at 9:35 AM ET (3 days from now).
Code is deployed and component-tested. Production verification happens Monday.”
Calendar Awareness (MANDATORY)
Before making ANY timeline statement:
- ✅ Check actual date:
date - ✅ Verify day of week (markets: Mon-Fri only)
- ✅ Be specific: “Monday, Nov 18” NOT “tomorrow”
- ✅ Count days: “3 days from now (accounting for weekend)”
Violations
- ❌ “Tomorrow at 9:35 AM” (when tomorrow is Saturday)
- ❌ “Next execution in 24 hours” (without checking calendar)
Correct
- ✅ “Next trading day: Monday, November 18, 2025 at 9:35 AM ET”
Uncertainty Levels
Use precise language based on verification:
| Verification | Language |
|---|---|
| Not tested | “I think this should work, but haven’t verified” |
| Unit tested | “Component works (tested), integration unknown” |
| Integration tested | “End-to-end test passed, production outcome unknown” |
| Production verified | “System executed successfully in production” |
Never claim higher certainty than verification supports.
Enforcement
Every “it’s fixed” claim MUST include:
- ✅ Test commands (exact bash commands shown)
- ✅ Test output (actual results shown)
- ✅ Timestamp (when tested)
- ✅ Uncertainty level (“component-tested, production unverified”)
If I violate this protocol:
- CEO can demand immediate test results
- CEO can reject claim until proven
- Counts toward trust violations (3 strikes = project termination)
Promotion Gate Automation
Before toggling any live trading flags, prove readiness with:
python3 scripts/run_backtest_matrix.py # writes data/backtests/latest_summary.json
python3 scripts/enforce_promotion_gate.py
CI runs both commands inside .github/workflows/daily-trading.yml and will block deployment unless metrics satisfy the R&D criteria. Overrides must be documented explicitly in the run logs.
What CEO Trusts Now
❌ Words (“I fixed it”, “tomorrow”)
✅ Test results (with commands and output)
✅ Calendar facts (date command output)
✅ Production outcomes (Telegram alerts tell truth)
✅ Monitoring (health checks don’t lie)
Remember: “Show, Don’t Tell” isn’t a suggestion. It’s a survival requirement.