Zed's team details how they built Zeta2, their improved edit prediction model. Key improvements include richer input context (finer-grained edit history, LSP-resolved type/symbol definitions), a switch from Qwen 2.5 Coder (7B) to Seed Coder (8B) as the base model, and a knowledge distillation pipeline using Claude Sonnet as the teacher model. They addressed the 'reversal problem' where the model incorrectly deleted intentional user edits by improving teacher prompting and edit granularity. Training data shifted from synthetic GitHub commit examples to opt-in real user traces from open source repos, yielding ~250-300k training requests per week. The result is a 30% better acceptance rate and faster responses, validated through dogfooding, shadow releases, and gradual rollout.
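To make the distillation step more concrete, below is a minimal sketch, not Zed's actual pipeline, of how a teacher model's completions could be turned into fine-tuning data for a smaller student model. The `EditPredictionRequest`, `build_prompt`, and `teacher_complete` names and the prompt layout are hypothetical; the only grounded details are the kinds of context the post describes (fine-grained edit history and LSP-resolved definitions) and the teacher/student roles.

```python
# Hypothetical distillation data-generation step: a frontier "teacher" model
# labels edit-prediction requests so a smaller "student" model can be
# fine-tuned on its outputs. All names and formats here are illustrative.
from dataclasses import dataclass


@dataclass
class EditPredictionRequest:
    excerpt: str                 # code surrounding the cursor
    edit_history: list[str]      # recent user edits, at fine granularity
    lsp_definitions: list[str]   # type/symbol definitions resolved via LSP


def build_prompt(req: EditPredictionRequest) -> str:
    """Assemble the richer input context described in the post (layout is made up)."""
    return (
        "## Recent edits\n" + "\n".join(req.edit_history) + "\n\n"
        "## Relevant definitions\n" + "\n".join(req.lsp_definitions) + "\n\n"
        "## Current excerpt\n" + req.excerpt + "\n\n"
        "Predict the user's next edit without reverting their recent changes."
    )


def teacher_complete(prompt: str) -> str:
    """Placeholder for a call to the teacher model (e.g. Claude Sonnet)."""
    raise NotImplementedError("wire up your teacher-model API call here")


def make_training_example(req: EditPredictionRequest) -> dict:
    """Produce one prompt/completion pair for supervised fine-tuning of the student."""
    prompt = build_prompt(req)
    completion = teacher_complete(prompt)
    return {"prompt": prompt, "completion": completion}
```

At scale, pairs like these (the post cites roughly 250-300k requests per week from opted-in users) would be collected into a dataset and used to fine-tune the smaller base model, which is the essence of the knowledge distillation approach described above.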

6 min read · From zed.dev
Table of contents:
- Knowledge distillation
- Collecting the right training data
- The reversal problem
- Switching the base model
- How to know when it's time to ship
- What's next