Computer Chess Club Archives


Subject: Re: TDleaf from someone other than Baxter et al

Author: Rémi Coulom

Date: 23:40:03 07/17/04


On July 17, 2004 at 23:44:55, Stuart Cracraft wrote:

>www.cs.ualberta.ca/~jonathan/Papers/Papers/td.ps

If you are interested in papers on TDLeaf(lambda) from someone other than Baxter
et al, you should read those by Beal and Smith. They invented the technique
first, and applied it to chess. I find their papers to be more convincing than
those by Baxter. They used self-play instead of online play, and played many
more games. They managed to obtain weights that look much better.

Two of their papers were published in the ICGA Journal. They also have papers in
_Information Sciences_ (122 (2000) 3-21) and _Theoretical Computer Science_ (252
(2001) 105-119). Unfortunately, I do not think that any of those papers is
available online for free. The paper in _Information Sciences_ is particularly
interesting because it introduces a technique called "temporal coherence" to
speed up learning by using an individual adaptive learning rate for every
weight.
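
To make the idea of a per-weight adaptive learning rate concrete, here is a rough
sketch in Python. It is only an illustration under stated assumptions, not the
exact formulation from the paper: it assumes each weight's rate is the ratio of
the magnitude of its net recommended change to the sum of the magnitudes of its
individual recommended changes. The function names, the `base_rate` parameter,
and the way the rate is applied to the accumulated change are all hypothetical.

```python
def coherence_rates(adjustments, default_rate=1.0):
    """Return one learning rate per weight (illustrative sketch).

    adjustments: list of steps, each a list with one recommended
    change per weight (e.g. from temporal-difference errors).
    """
    n_weights = len(adjustments[0])
    net = [0.0] * n_weights       # signed sum of recommended changes
    absolute = [0.0] * n_weights  # sum of their magnitudes
    for step in adjustments:
        for i, delta in enumerate(step):
            net[i] += delta
            absolute[i] += abs(delta)
    # Changes that agree in sign give a rate near 1; changes that
    # cancel each other out give a rate near 0.
    return [abs(n) / a if a > 0 else default_rate
            for n, a in zip(net, absolute)]


def apply_updates(weights, adjustments, base_rate=0.5):
    """Assumed update rule: scale each weight's net change by its own rate."""
    rates = coherence_rates(adjustments)
    net = [sum(step[i] for step in adjustments) for i in range(len(weights))]
    return [w + base_rate * r * n for w, r, n in zip(weights, rates, net)]


if __name__ == "__main__":
    # Weight 0 is pushed the same way twice; weight 1 is pushed back and forth.
    history = [[0.5, 0.5], [0.5, -0.5]]
    print(coherence_rates(history))
    print(apply_updates([1.0, 1.0], history))
```

In this sketch a weight whose recommended changes keep the same sign retains a
high rate, while a weight whose changes mostly cancel (which suggests it is
already near a good value and is only being moved by noise) is slowed down.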

Another interesting reference that is available online is this technical report:
http://www.cs.bris.ac.uk/Publications/pub_info.jsp?id=2000100

Rémi


