Harness engineering: leveraging Codex in an agent-first world

OpenAI's team built an internal product over five months with zero manually-written code, using Codex agents to generate approximately one million lines across 1,500 pull requests. The experiment revealed that engineering work shifts from writing code to designing environments, creating feedback loops, and building scaffolding that enables agents to work autonomously. Key learnings include treating documentation as a navigable map rather than a manual, enforcing architectural invariants mechanically, making all context repository-local for agent legibility, and implementing continuous automated refactoring to prevent technical debt accumulation. The system now supports end-to-end feature development where agents can reproduce bugs, implement fixes, validate changes, and merge pull requests with minimal human intervention.

#ai-agents

#codex

#productivity

Feb 11•15m read time•From openai.com

Table of contents

We started with an empty git repository Redefining the role of the engineer Increasing application legibility We made repository knowledge the system of record Agent legibility is the goal Enforcing architecture and taste Throughput changes the merge philosophy What “agent-generated” actually means Increasing levels of autonomy Entropy and garbage collection What we’re still learning

Comment

Bookmark

Copy

Sort: