The full RL nanodegree, covered with implementation.

Daily Dose of DS offers a daily dose of inspiration, education, and motivation for data scientists and aspiring data professionals. Through bite-sized articles, tutorials, and curated resources, readers embark on a journey to master the art and science of data analysis, machine learning, and artificial intelligence. By staying updated with the latest trends, techniques, and tools in data science, readers can hone their skills and stay ahead in this rapidly evolving field.

Daily Dose of Data Science | Avi Chawla | Substack

Part 5 of an RL series covering function approximation — the technique needed when tabular methods break down for real-world, continuous-state problems. Topics include why lookup tables fail, parameterized function approximators, gradient Monte Carlo, semi-gradient TD, and the deadly triad of function approximation with bootstrapping and off-policy learning. Includes a hands-on implementation training an agent on the Mountain Car problem. The post also contextualizes RL's growing importance given its role in post-training pipelines for LLMs like DeepSeek-R1, ChatGPT, and Claude.

Function Approximation in RL