rLLM is an open-source framework for training language agents with reinforcement learning. The project has released several high-performing models, including DeepSWE (a 32B software engineering agent achieving 59% on SWEBench-Verified) and DeepCoder (a 14B coding model that matches o3-mini performance with 60.6% Pass@1 on LiveCodeBench).
