Cloudflare built a CI-native AI code review system using OpenCode that orchestrates up to seven specialized AI agents per merge request, covering security, performance, code quality, documentation, release management, and compliance. The architecture uses a composable plugin system, risk-tiered agent selection (trivial/lite/full), circuit breakers with model failback chains, and a coordinator agent that deduplicates and judges findings before posting a single structured review comment. After 30 days across 5,169 repositories and 131,246 review runs, the median review completes in 3m 39s at $0.98, with an 85.7% prompt cache hit rate. Key engineering challenges covered include JSONL streaming, prompt injection sanitization, incremental re-reviews, dynamic model routing via Workers KV, and the limitations of AI review for architectural and cross-system concerns.
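
To make the resilience idea concrete, here is a minimal TypeScript sketch of how a per-model circuit breaker and failback chain could fit together. All names here (`ModelClient`, `reviewWithFailback`, the threshold and cooldown values) are illustrative assumptions, not Cloudflare's actual implementation.

```typescript
// Hypothetical sketch: a failback chain of model clients, each guarded
// by a simple consecutive-failure circuit breaker.

type ReviewRequest = { diff: string; agent: string };
type ReviewResult = { findings: string[] };

interface ModelClient {
  name: string;
  review(req: ReviewRequest): Promise<ReviewResult>;
}

class CircuitBreaker {
  private failures = 0;
  private openedAt = 0;

  constructor(
    private readonly threshold = 3,       // consecutive failures before opening
    private readonly cooldownMs = 60_000, // how long the circuit stays open
  ) {}

  isOpen(): boolean {
    if (this.failures < this.threshold) return false;
    // After the cooldown the circuit is half-open: one trial call gets through.
    return Date.now() - this.openedAt < this.cooldownMs;
  }

  recordSuccess(): void {
    this.failures = 0;
  }

  recordFailure(): void {
    this.failures += 1;
    // (Re)open the circuit whenever we are at or beyond the threshold.
    if (this.failures >= this.threshold) this.openedAt = Date.now();
  }
}

// Try each model in order; skip models whose breaker is open, and fall
// back to the next model in the chain on error.
async function reviewWithFailback(
  chain: { client: ModelClient; breaker: CircuitBreaker }[],
  req: ReviewRequest,
): Promise<ReviewResult> {
  let lastError: unknown;
  for (const { client, breaker } of chain) {
    if (breaker.isOpen()) continue; // don't wait on a known-bad model
    try {
      const result = await client.review(req);
      breaker.recordSuccess();
      return result;
    } catch (err) {
      breaker.recordFailure();
      lastError = err;
    }
  }
  throw new Error(`all models in the failback chain failed: ${lastError}`);
}
```

The key design choice in a setup like this is skipping open circuits rather than retrying them, so a degraded primary model fails over to the next model immediately instead of adding its timeout to every review.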

26 min read · From blog.cloudflare.com
Table of contents
The architecture: plugins all the way to the moon
How we use OpenCode under the hood
Specialised agents instead of one big prompt
The coordinator helps keep things focused
Risk tiers: don't send the dream team to review a typo fix
Diff filtering: getting rid of the noise
The spawn_reviewers tool: concurrent orchestration
Resilience: circuit breakers and failback chains
The control plane: Workers for config and telemetry
Re-reviews: not starting from scratch
Keeping AI context fresh: the AGENTS.md Reviewer
How our teams use it
Show me the numbers!
So, what does a review look like?
Limitations we're honest about
We’re just getting started
