Series

Agent-to-Agent Bias in Multi-Agent AI Systems

A 6-part research-backed series examining how bias emerges, compounds, and evades detection in multi-agent AI pipelines — from self-preference in judge models, to population-level drift, adversarial swarm takeover, and regulatory blind spots. Includes Python implementation patterns for cross-family evaluation, structured assessment, and population monitoring.

Part 1 of 6: Your Pipeline Has a Judge. The Judge Is Cooked.
TL;DR: Researchers tested 20 AI models as judges. 17 out of 20 were statistically biased. True negative rate: 42.5% — your judge misses bad output more than half the time. If you have an LLM checking
Jun 4, 2026 · 5 min read
Part 2 of 6: You Upgraded the Judge. It Got Worse. You Kept Upgrading.
TL;DR: Smarter models are better judges — unless they're judging their own output. Then they defend wrong answers 86% of the time. Capability makes the bias worse, not better. The only structural fix:
Jun 4, 2026 · 5 min read
Part 3 of 6: Every Agent Passed. The System Failed.
TL;DR: You can test every agent individually and get clean results. Deploy them together and biased conventions emerge by round 15. The bias isn't in any agent — it's in the space between them. Publis
Jun 4, 2026 · 7 min read
Part 4 of 6: One Rogue Agent. The Whole Swarm Followed.
TL;DR: One adversarial agent. 2% of the population. That was enough to flip the entire swarm's behaviour. This is prompt injection at population scale — and your individual security audits can't see i
Jun 4, 2026 · 7 min read
Part 6 of 6: How to Build Pipelines That Don't Gaslight Themselves.
TL;DR: Six parts of bad news. Here's what actually helps — with code. Cross-family judges reduce the core bias. Structured multi-dimensional evaluation cuts it by 31.5%. Chain-of-thought adds 1.5 to 1
Jun 4, 2026 · 12 min read
Part 5 of 6: The Regulation That Cannot See the Bias It Was Built to Catch.
TL;DR: The EU AI Act's high-risk provisions take effect August 2026. Your multi-agent pipeline is covered. But the regulation doesn't define "bias," doesn't require population-level testing, and can't
Jun 4, 2026 · 6 min read
