In this course, you will be introduced to Reinforcement Learning, an area of Machine Learning. You will learn the Markov Decision Processes, Bandit Algorithms, Dynamic Programming, and Temporal Difference (TD) methods. You will be introduced to Value function, Bellman Equation, and Value iteration. You will also learn Policy Gradient methods. You will learn to make decisions in uncertain environment.