Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: sliding attacks in three #define

Author: Tord Romstad

Date: 13:44:04 04/12/04

On April 12, 2004 at 14:45:28, Christophe Theron wrote:

>On April 12, 2004 at 07:50:47, Tord Romstad wrote:
>>
>>Assume that you make a change to your engine which improves the playing strength
>>by
>>about 10 Elo points.  How many hours of CPU time do you need before you are sure
>>that
>>the change was an improvement?
>>
>
>I would say approximately one week, and I would not even be really sure it is an
>improvement. We are talking about a 1.5% improvement in winning percentage here,
>it's below the statistical noise of a several hundreds games match if you want
>95% reliability!

Thanks, Christophe!

Reading this is actually a great relief to me.  I wondered if you had invented
some kind
of magic which enabled you to find tiny improvements in much shorter time.

>And unfortunately a 10 elo points improvement is becoming rare for me. Most of
>the changes I try make the program weaker, and many changes do not provide any
>measurable improvement!

I have no difficulties believing this.  My engine is still at least 200 points
weaker than
yours, and I have exactly the same experience.

>That's why not having a strong test methodology is totally out of question if
>you are serious about chess programming.

Yes.  It is extremely difficult to me, because I am a very impatient person.
When I make
a small change to my engine, I rarely have enough time to play enough games to
determine
whether it is an improvement, because I have a dozen new ideas I want to try
before my
first test matches are finished.

>Even with a good test methodology chess programming is still an art: in many
>cases you have to decide with your feelings, because the raw data does not give
>you a definite answer.
>
>Now of course there are small improvements that I do not even need to test for a
>long time: if I find a way to make my program 10% faster without changing the
>shape of the tree, then all I need to do is run some safety tests that will only
>look at the number of nodes searched on a large set of positions and compare it
>to the last stable version.

Yes.  That is the only good reason I see to do any low-level optimization (for
weak
amateur engines like mine, that is -- the situation is of course entirely
different for
authours of top engines).  If I manage to make my engine a few percent faster
without
changing the eval or the shape of the tree, I can be 100% sure that the change
was an
improvement, without doing lots of time-consuming tests.

Tord

Re: sliding attacks in three #define Christophe Theron 16:15:01 04/12/04
- Testing Chess Programs Tom Likens 16:51:29 04/12/04
  - Re: Testing Chess Programs Christophe Theron 20:07:46 04/12/04
    - Re: Testing Chess Programs Robert Hyatt 10:36:08 04/13/04
      - Re: Testing Chess Programs Christophe Theron 19:18:00 04/13/04
    - Re: Testing Chess Programs Tord Romstad 10:14:21 04/13/04
    - Re: Testing Chess Programs Peter Fendrich 02:50:29 04/13/04
    - Re: Testing Chess Programs Russell Reagan 22:29:02 04/12/04
      - Re: Testing Chess Programs Christophe Theron 11:21:07 04/13/04
        
        Crafty Stats Matthew Hull 14:00:24 04/13/04
        
        Re: Crafty Stats martin fierz 07:21:55 04/14/04
        
        Re: Crafty Stats Robert Hyatt 09:32:56 04/14/04
        
        Re: Crafty Stats Christophe Theron 16:29:44 04/14/04
        
        Re: Crafty Stats martin fierz 22:05:38 04/14/04
        
        Re: Crafty Stats Christophe Theron 10:18:04 04/15/04
        
        Re: Crafty Stats Robert Hyatt 18:54:58 04/14/04
        
        Re: Crafty Stats martin fierz 16:06:58 04/14/04
        
        Re: Crafty Stats Robert Hyatt 18:56:58 04/14/04
        
        Re: Crafty Stats martin fierz 22:01:59 04/14/04
        
        Re: Crafty Stats Robert Hyatt 06:08:50 04/15/04
        
        Re: Crafty Stats martin fierz 08:01:06 04/15/04
        
        Re: Crafty Stats Robert Hyatt 11:23:51 04/15/04
        
        Re: Crafty Stats Vincent Lejeune 10:06:14 04/15/04
        
        Re: Crafty Stats Keith Evans 22:25:35 04/14/04
        
        Re: Crafty Stats Matthew Hull 07:33:42 04/14/04
        
        Re: Crafty Stats martin fierz 07:38:52 04/14/04
        
        Re: Crafty Stats Robert Hyatt 10:44:54 04/14/04
        
        Re: Crafty Stats martin fierz 16:05:25 04/14/04
        
        Re: Crafty Stats Robert Hyatt 19:06:53 04/14/04
        
        Re: Crafty Stats Dave Gomboc 16:55:15 04/14/04
        
        Re: Crafty Stats martin fierz 22:03:14 04/14/04
        
        Re: Crafty Stats Tord Romstad 13:47:57 04/14/04
        
        Re: Crafty Stats Robert Hyatt 19:10:10 04/14/04
        
        Re: Crafty Stats Sune Fischer 14:38:34 04/14/04
        
        Re: Crafty Stats Christophe Theron 23:51:29 04/13/04
        
        Re: Crafty Stats Matthew Hull 07:15:37 04/14/04
        
        Re: Crafty Stats Christophe Theron 16:14:22 04/14/04
        
        Re: Testing Chess Programs Tord Romstad 12:09:10 04/13/04
        
        Re: Testing Chess Programs Christophe Theron 23:48:34 04/13/04
    - Re: Testing Chess Programs Tom Likens 21:30:33 04/12/04

This page took 0.02 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.