
Optimal Policy



Patricia Riddle
Fri May 15 13:00:36 NZST 1998