Computer Chess Club Archives


Subject: Re: Temporal Difference

Author: James Swafford

Date: 08:48:44 01/06/01


On January 05, 2001 at 17:36:55, Bas Hamstra wrote:

You're a little ahead of me.  I've been wanting to do some work with
TD for some time, but I've still got a couple months of work to do
before I even get started.  I do have a couple questions for you,
though:

1.  Did you start with realistic weights, or did you begin with
random values, or ???

2.  What do you mean by "wrong trend?"  I suppose you mean a term
is "drifting" the wrong way... becoming more negative when it should
be going more positive?

3.  How are you training your evaluator?  With a wide variety of
opponents, or by playing the same programs over and over, or ???
How many games have you played?

4.  Does your engine compete on ICC?

--
James


>I would like to share experience with some that have tried Temporal Difference
>learning. Currently one of the problems I see is that for example BISHOPMOBILITY
>has very large partial derivatives. So the updates for this term swing wildly
>and distort learning. On the other hand, if I reduce the learning factor to
>bring this in proportion, a term like DOUBLEDPAWN won't ever get to a realistic
>value.
>
>One way to do better is to work with derivatives -1 or +1 only, depending on whether
>the partial derivative for a term is above or below zero. This results in a
>tendency to realistic values for most terms.
>
>Still, a few terms refuse to show a "trend" at all, or even the wrong trend. Are
>others having this problem?
>
>(To see what is going on I plotted the development of the weights in a graph, to
>verify it does something useful, best results so far with -1/+1 only, bad
>results when using the real derivatives)
>
>
>Regards,
>Bas.
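
For readers following the thread, here is a minimal sketch of the two update rules Bas contrasts: one using the real partial derivatives, and one using only their sign (-1 or +1). It assumes a plain per-weight TD-style update of the form weight += rate * td_error * derivative. The function names, the learning rate, the example derivative values, and the choice to treat a zero derivative as 0 are all assumptions made for illustration, not details taken from his engine.

```python
def sign(x):
    """Return -1, 0 or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def update_raw(weights, derivs, td_error, rate):
    """Update each weight using the real partial derivative.

    Terms with large derivatives get proportionally large updates.
    """
    return [w + rate * td_error * d for w, d in zip(weights, derivs)]


def update_sign_only(weights, derivs, td_error, rate):
    """Update each weight using only the sign of its partial derivative.

    Every term with a nonzero derivative moves by the same amount,
    so no single term dominates the update.
    """
    return [w + rate * td_error * sign(d) for w, d in zip(weights, derivs)]


# Hypothetical numbers: a large derivative for BISHOPMOBILITY and a
# small one for DOUBLEDPAWN, as in the situation described above.
weights = [0.0, 0.0]      # [BISHOPMOBILITY, DOUBLEDPAWN]
derivs = [12.0, -0.5]     # assumed partial derivatives
td_error = 0.25           # assumed temporal difference
rate = 0.1                # assumed learning rate

raw = update_raw(weights, derivs, td_error, rate)
signed = update_sign_only(weights, derivs, td_error, rate)
print("raw derivatives:", raw)
print("sign only:      ", signed)
```

With the real derivatives, the first term moves 24 times as far as the second in a single step. With signs only, both move by the same magnitude in opposite directions, which matches the behaviour described in the quoted message.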



Last modified: Thu, 15 Apr 21 08:11:13 -0700
