Nondeterministic Rewards and Actions



Patricia Riddle
Fri May 15 13:00:36 NZST 1998