Judge Demo Evidence Hub

This page translates complex AI product behavior into clear proof. Audience mode: simple enough for non-technical viewers, concrete enough for technical judges.

Checklist Progress
11/11
How many required judge-proof sections passed in judge_demo_checklist.md. This is structural completeness, not model quality.
System Status
iteration 1 / healthy
Current autonomous loop cycle + runtime health profile from smoke, lint, and test gates. healthy means the latest guardrails are green.
Smoke Latency / Cost
1464 ms / $0.00001650
Single Tetrate-routed smoke call telemetry from smoke_metrics.txt: end-to-end response time and estimated token spend per run.
Execution Quality Daily
Gateway RouterTetrate TARS
Gateway Hostapi.router.tetrate.ai
Gateway Key SourceLLM_GATEWAY_API_KEY
Routed Modelgpt-4o-mini-2024-07-18
Service Tierdefault
Request IDchatcmpl-DDHIkU8e0PxZKZ7e6Tdwdl0jlovWY
Runs1
Latest Smoke ProbePASS PASS
Success Rate100.0%
Actionable Rate100.0%
P95 Latency1464.0 ms
Token Envelope30 prompt / 20 completion / 50 total
Token Economics$0.00001650 (0.000330 / 1K tokens)
Schema Gate (router_check)PASS PASS
Fallback Probe OK0
Estimated Cost / Smoke Run$0.00001650
Cost Basisinput_cost_per_1m=0.150,output_cost_per_1m=0.600
Generated2026-02-25T22:05:47Z

Interpretation: this card reports the Tetrate-routed inference path health, not trading profitability. Actionable rate reflects whether responses passed the downstream actionability parser.

How Tetrate Is Used (Concrete Flow)
Entry ContractOrchestrator sends POST /v1/chat/completions request through TARS.
Route DecisionTARS applies provider/model policy and returns normalized OpenAI-compatible response payload.
Runtime ProofObserved model gpt-4o-mini-2024-07-18, tier default, request chatcmpl-DDHIkU8e0PxZKZ7e6Tdwdl0jlovWY.
Schema GateResponse must parse into structured trade intent; malformed payloads are rejected.
Risk/Policy GateWhitelist, max positions, loss limits, and cadence controls can veto execution.
Fallback PathFallback probes validate alternate provider path; status is tracked in daily quality artifacts.
TelemetryLatency, token counts, and estimated costs are persisted for ongoing quality/cost monitoring.
Judge EvidenceRaw artifact files are published for independent verification of each stage.
Tetrate Request Flow Diagram
Signal + Context orchestrator inputs Tetrate TARS policy + provider routing Primary LLM Call gpt-4o-mini-2024-07-18 JSON Gate schema check Actionable pass/fail Artifacts + KPIs smoke_metrics, scorecard, readiness gates fallback provider route on health or timeout failure
Flow semantics: solid arrows are primary execution path; dashed arrow is Tetrate fallback route. Output is only considered usable after schema + actionability gates.
What Is A Judge?

In this hackathon, a judge is the reviewer scoring whether the product is real, useful, safe, and measurable. This page is built to help judges verify outcomes quickly from evidence, not promises.

How Much Money Made (Paper) + Why
Starting balance$100,000.00
Current equity$100,845.25
Net P/L$845.25 (0.85%)
Why (current drivers)Win rate 100.0% over 1 samples, strategy gate status: LOW
Readiness Metrics
Win Rate100.00%PASS
Max Drawdown (sync history)0.07%PASS
Execution Quality (valid trade records)97.89%PASS
Gateway Latency1464 msPASS
Gateway Cost (smoke call)$0.000017PASS
Profit FactorInfPASS
Average Winner$41.00PASS
Average LoserN/AUNKNOWN
Weekly Qualified Setups0/3WARN
Weekly Closed Trades2/1PASS
AI Credit Stress Gateunknown (score=0.0)UNKNOWN