- Wikipedia, Creative Commons ShareAlike License
- Watkins, C.J.C.H. (1989), Learning from Delayed Rewards (Ph.D. thesis), Cambridge University
- Online Q-Learning using Connectionist Systems, Rummery & Niranjan (1994)
- Wiering, Marco; Schmidhuber, Jürgen (1998-10-01), Fast Online Q(λ). Machine Learning. 33 (1): 105-115
- Copyright (c) 2009-2017, Accord.NET Authors at: email@example.com
- Kenan Deen, https://kenandeen.wordpress.com/
With Safari, you learn the way you learn best. Get unlimited access to videos, live online training,
learning paths, books, interactive tutorials, and more.