System prompt learning is an iterative approach to improving AI coding agents that uses English-language feedback from evaluations instead of scalar rewards. The technique was tested on Claude and Cline coding agents using the SWE-bench benchmark, where an LLM-as-judge evaluated failures and generated explanations. These explanations were
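The loop described above can be sketched roughly as follows. This is a minimal illustration, not the implementation used in the experiments. It assumes the judge's explanations are folded back into the system prompt, as the technique's name suggests. Every name here (`EvalResult`, `learn_system_prompt`, `run_and_judge`, `revise_prompt`) is hypothetical, and the two `fake_*` functions are stand-ins for a real agent run plus LLM judge and for an LLM that rewrites the prompt.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class EvalResult:
    task_id: str
    passed: bool
    explanation: str  # English feedback from the judge; empty when the task passed


def learn_system_prompt(
    system_prompt: str,
    tasks: List[str],
    run_and_judge: Callable[[str, str], EvalResult],
    revise_prompt: Callable[[str, List[str]], str],
    rounds: int = 3,
) -> str:
    """Iteratively revise a system prompt from textual failure explanations.

    Assumption: feedback is text, not a scalar reward, and is used only to
    rewrite the prompt. No model weights are updated.
    """
    for _ in range(rounds):
        results = [run_and_judge(system_prompt, task) for task in tasks]
        explanations = [r.explanation for r in results if not r.passed]
        if not explanations:
            break  # nothing failed, so there is nothing to learn from
        system_prompt = revise_prompt(system_prompt, explanations)
    return system_prompt


# Hypothetical stand-ins so the sketch runs without any agent or LLM.
def fake_run_and_judge(prompt: str, task: str) -> EvalResult:
    if "run the tests" in prompt:
        return EvalResult(task, True, "")
    return EvalResult(task, False, "Agent did not run the tests before submitting.")


def fake_revise(prompt: str, explanations: List[str]) -> str:
    unique = sorted(set(explanations))  # drop duplicate explanations
    rules = "\n".join(f"- Always run the tests. (Judge: {e})" for e in unique)
    return prompt + "\nRules:\n" + rules


final_prompt = learn_system_prompt(
    "You are a coding agent.", ["task-1", "task-2"], fake_run_and_judge, fake_revise
)
```

In a real setup, `run_and_judge` would run the agent on a benchmark task and ask a judge model to explain any failure, and `revise_prompt` would ask a model to merge those explanations into new prompt rules.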

10m watch time
