In this video I entirely reworked my AI locomotion project to use a proper algorithm.

● Support me on Patreon https://www.patreon.com/c/pezzzaswork
● Join the Discord server https://discord.gg/sAXAnxpTcu

Get 40% off Code Crafters https://app.codecrafters.io/join?via=johnBuffer
The GTC event https://nvda.ws/4rfc1SR

00:00 Introduction
01:00 Creating an Editor
03:40 The old way
06:55 PPO
10:20 Conclusion

Pezzza's Work

A developer shares progress on an AI locomotion project inspired by attending Nvidia's GTC conference. The post covers building a graphical editor for physical model assembly, fixing mass/density issues, and most importantly switching from evolutionary algorithms to PPO (Proximal Policy Optimization) reinforcement learning using the MLAC C++ library. The comparison between the two approaches is striking: evolutionary methods plateau quickly and produce inelegant 'vibrating' solutions, while PPO produces smooth, physically convincing locomotion after about 30 minutes of training. The project uses a pogo-stick-like structure with collider constraints to force proper balancing behavior.

AI Learns to Walk, But Less Dumb