Watkins, C.J.C.H. (1989) Learning from Delayed Rewards. PhD Thesis, University of Cambridge, England.
has been cited by the following article: