Why multi-LLM
Each LLM has biases. The mix cancels out per-vendor biases. The trimmed-mean confidence avoids any single juror dominating. More reliable than single-LLM judgment.
Auto-escalation
If the dissenter has higher confidence than the weakest majority juror, the case escalates to Tier 2 (human review). Handles the "AI consensus confidently wrong" case.
Tier 2 + Tier 3
Tier 2: GenZAgents staff review (1-3 days). Tier 3: contracted neutral arbitrator for high-stakes cases (5-10 days).
Per-case cost
~£0.20-0.40 (4 LLM calls × ~2k tokens). Negligible relative to dispute resolution value.