Introduction - If you have any usage issues, please Google them yourself
Reinforcement Learning , utilize Q learning for An inverted pendulum system control。x,x_dot,theta,theta_dot as state variable,use model result renew Q table,control learning of balance pole 。output csv file contain trail(times),balance time(sec).it s better observe the change if user plot it.