An Introduction to Reinforcement Learning

Reinforcement learning is a method of engineering intelligence that emulates biological organisms by transducing information from the environment, processing it, and outputting behavior conducive to survival. It borrows from behaviorism and cognitive science to model agent-environment interactions. Dynamic programming is a mathematical optimization method used in reinforcement learning, but it has limitations that are addressed by model-free approaches and the use of artificial neural networks. The combination of reinforcement learning and artificial neural networks, such as Deep Q-Learning, shows promise in improving the capabilities of AI. However, there are still challenges in achieving artificial general intelligence, as it requires a complex internal architecture and reflective awareness.

#reinforcement-learning

#dynamic-programming

May 28, 2024•33m read time•From towardsdatascience.com

Table of contents

A deep dive into the rudiments of reinforcement learning, including model-based and model-free methods What is Reinforcement Learning?Decision Theory & Control Theory States, Actions & Rewards Quantifying Reward Markov Decision Process (MDP)Dynamic Programming & Bellman Optimality State-Value Function Action-Value Function Model Free Methods: Monte Carlo & Temporal Difference Augmenting Reinforcement Learning with ANNs Off-Policy DQN On-Policy Deep TD(𝝀)Reinforcement Learning and Artificial General Intelligence Selected References

Comment

Bookmark

Copy

Sort: