rLLM is an open-source framework for training language agents using reinforcement learning. The project has released several high-performing models including DeepSWE (32B software engineering agent achieving 59% on SWEBench-Verified), DeepCoder (14B coding model matching o3-mini performance with 60.6% Pass@1 on LiveCodeBench),
•4m read time• From github.com
Sort: