Train Your Lunar-Lander | Reinforcement Learning | OpenAIGYM

Shiva Verma
6 min readApr 20, 2019
Image by AlphaCoders

Lunar Lander is another interesting problem in OpenAIGym. In my previous blog, I solved the classic control environments. In this blog, I will be solving the Lunar Lander environment.

Reinforcement Learning | Brief Intro

Reinforcement learning is an interesting area of Machine learning. The rough idea is that you have an agent and an environment. The agent takes actions and environment gives reward based on those actions, The goal is to teach the agent optimal behavior in order to maximize the reward received by the environment.

Reinforcement Learning Diagram

For example, have a look at the diagram. This maze represents our environment. Our purpose would be to teach the agent an optimal policy so that it can solve this maze. The maze will provide a reward to the agent based on the goodness of each action it takes. Also, each action taken by agent leads it to the new state in the environment.

About Lunar-Lander

As you can see in the picture below, there is one space-ship. The task is to land the space-ship between the flags smoothly. The ship has 3 throttles in it. One…

--

--

Shiva Verma
Shiva Verma

Written by Shiva Verma

Machine Learning Engineer | Available for consultancy | shivajbd@gmail.com

Responses (5)