Computer Chess Club Archives

Subject: Temporal Difference

Author: Bas Hamstra

Date: 14:36:55 01/05/01

I would like to compare experiences with others who have tried Temporal Difference
learning. One problem I currently see is that a term like BISHOPMOBILITY
has very large partial derivatives, so the updates for this term swing wildly
and distort learning. On the other hand, if I reduce the learning factor to
bring this into proportion, a term like DOUBLEDPAWN never gets to a realistic
value.
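
To make the scale problem concrete, here is a minimal sketch. It assumes a linear evaluation (eval = sum of weight * feature), so the partial derivative for each weight is simply the feature value. The function name, the feature values and the TD error are made up for illustration and are not taken from any real engine.

```python
def td_update(weights, features, td_error, learning_rate):
    """One gradient step for a linear evaluation, eval = sum(w * f).

    For a linear evaluation the partial derivative d(eval)/d(w_i) is f_i,
    so each weight moves by learning_rate * td_error * f_i.
    """
    return [w + learning_rate * td_error * f
            for w, f in zip(weights, features)]


# Hypothetical feature differences for one position:
# [bishop mobility difference, doubled pawn difference]
features = [9.0, 1.0]
weights = [0.0, 0.0]

new_weights = td_update(weights, features, td_error=0.5, learning_rate=0.1)

# The mobility weight moves nine times as far as the doubled pawn weight,
# purely because its feature value (its derivative) is nine times larger.
step_mobility = new_weights[0] - weights[0]
step_doubled = new_weights[1] - weights[1]
```

With one shared learning factor, any value small enough to calm the first weight also shrinks the step for the second weight by the same factor.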

One way to do better is to use derivatives of -1 or +1 only, depending on whether
the partial derivative for a term is below or above zero. This results in a
tendency toward realistic values for most terms.
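
A rough sketch of the -1/+1 idea follows, again under assumptions: the helper names are hypothetical, the update still scales with the TD error, and a derivative of exactly zero is treated as "no update", which the description above does not specify.

```python
def sign(x):
    """Return -1, 0 or +1 according to the sign of x."""
    return (x > 0) - (x < 0)


def sign_update(weights, derivatives, td_error, step):
    """Update each weight using only the sign of its partial derivative.

    Every term with a nonzero derivative moves by the same amount,
    step * td_error, regardless of how large the derivative is.
    Assumption: a zero derivative leaves the weight unchanged.
    """
    return [w + step * td_error * sign(d)
            for w, d in zip(weights, derivatives)]


# Hypothetical derivatives: one large and positive, one small and negative.
derivatives = [9.0, -1.0]
weights = [0.0, 0.0]

new_weights = sign_update(weights, derivatives, td_error=0.5, step=0.1)
```

The trade-off is that the magnitude information in the derivative is thrown away, so a term that barely influences the evaluation is pushed as hard as one that dominates it.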

Still, a few terms refuse to show a "trend" at all, or even show the wrong trend. Are
others having this problem?

(To see what is going on, I plotted the development of the weights in a graph to
verify that the learning does something useful. Best results so far are with
-1/+1 only; results are bad when using the real derivatives.)


Regards,
Bas.

Re: Temporal Difference Rémi Coulom 12:08:11 01/06/01
- Re: Temporal Difference Bas Hamstra 18:11:04 01/06/01
  - Re: Temporal Difference Rémi Coulom 02:23:53 01/07/01
Re: Temporal Difference James Swafford 08:48:44 01/06/01
- Re: Temporal Difference Bas Hamstra 09:11:32 01/06/01
  - Re: Temporal Difference Jay Scott 10:58:19 01/07/01
  - Re: Temporal Difference David Rasmussen 11:37:28 01/06/01
    - Re: Temporal Difference Bas Hamstra 14:09:08 01/06/01
      - Re: Temporal Difference David Rasmussen 15:16:57 01/06/01
  - Re: Temporal Difference James Swafford 11:34:09 01/06/01
    - Re: Temporal Difference Bas Hamstra 14:17:06 01/06/01
      - Re: Temporal Difference Gian-Carlo Pascutto 02:35:32 01/07/01
        
        - Re: Temporal Difference Jay Scott 10:39:22 01/07/01
        - Re: Temporal Difference Bas Hamstra 08:42:16 01/07/01

Last modified: Thu, 15 Apr 21 08:11:13 -0700
