- Category:
- AI-NN-PR
- Tags:
-
- File Size:
- 1kb
- Update:
- 2018-04-11
- Downloads:
- 1 Times
- Uploaded by:
- lilin123
Description: In the intensive learning of the University of Wan men, the RW algorithm, the python implementation, the algorithm formula are as follows.
V(CS) = V(CS) + A * ( V(US) * us - V(CS) * cs )
the TD algorithm.
V(s{t}) = V(s{t}) + a[R(t+1) + rV{S(t+1)} - V{S(t)}]
To Search:
File list (Check if you may need any files):
Filename | Size | Date |
---|
1conditioning_RW_Mode.zip | 631 | 2018-04-11
|
2general_TD.zip | 556 | 2018-04-11 |