Author: Rémi Coulom
Date: 23:40:03 07/17/04
Go up one level in this thread
On July 17, 2004 at 23:44:55, Stuart Cracraft wrote: >www.cs.ualberta.ca/~jonathan/Papers/Papers/td.ps If you are interested in papers on TDLeaf(lambda) from someone other than Baxter et al, you should read those by Beal and Smith. They invented the technique first, and applied it to chess. I find their papers to be more convincing than those by Baxter. They used self-play instead of online play, and played many more games. They managed to obtain weights that look much better. Two of their papers were published in the ICGA journal. They have papers in _Information Science_ (122, 2000, 3-21) and _Theoretical Computer Science_ (252 (2001) 105-119), also. Unfortunately, I do not think that any of those papers is available online for free. The paper in _Information Science_ is particularly intersting because it introduces a technique called "temporal coherence" to speed-up learning by using an individual adapative learning rate for every weight. Another interesting reference that is available online is this technical report: http://www.cs.bris.ac.uk/Publications/pub_info.jsp?id=2000100 Rémi
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.