Title:
万门大学强化学习GridEvalu模型GridPolicy模型 Download
- Category:
- AI-NN-PR
- Tags:
-
- File Size:
- 3kb
- Update:
- 2018-04-11
- Downloads:
- 0 Times
- Uploaded by:
- lilin123
Description: In the intensive learning of the University of Wan men, the Grid_world_evaluation algorithm, the python implementation, the algorithm formula are as follows.
V(S) = V(S) + A * ( R(S) + r*V(new_S) - V(S) )
the Grid_world_Policy algorithm.
P(S) = P(S) + A * ( R(S) + r*P(new_S) - P(S) )
To Search:
File list (Check if you may need any files):
Filename | Size | Date |
---|
3Grid_world_evaluation.zip | 1356 | 2018-04-11
|
4Grid_world_Policy.zip | 1622 | 2018-04-11 |