Introduction - If you have any usage issues, please Google them yourself
Successfully implemented Q-Learning for a simple robot navigation problem of a robot moving on a 5 x 5 grid with one arbitrary goal (reward of +10) and three arbitrary obstacles (reward of -10)