Reinforcement Learning
Reinforcement Learning
Reinforcement Learning
Reinforcement learning is an area of Machine Learning. It is about taking suitable action to maximize
reward in a particular situation. It is employed by various software and machines to find the best possible
behavior or path it should take in a specific situation. Reinforcement learning differs from the supervised
learning in a way that in supervised learning the training data has the answer key with it so the model is
trained with the correct answer itself whereas in reinforcement learning, there is no answer but the
reinforcement agent decides what to do to perform the given task. In the absence of training dataset, it is
bound to learn from its experience.
Example : The problem is as follows: We have an agent and a reward, with many hurdles in between.
The agent is supposed to find the best possible path to reach the reward. The following problem explains
the problem more easily.
The above image shows robot, diamond and fire. The goal of the robot is to get the reward that is the
diamond and avoid the hurdles that is fire. The robot learns by trying all the possible paths and then
choosing the path which gives him the reward with the least hurdles. Each right step will give the robot a
reward and each wrong step will subtract the reward of the robot. The total reward will be calculated
when it reaches the final reward that is the diamond.
sequentially. In simple words we can say that the output depends decision is made on the initial
on the state of the current input and the next input depends on input or the input given at the