Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10

Author: Maurizio De Leo

Date: 09:21:20 08/30/05


>Under valid and controlled conditions it still seems logical to me to stop a
>test after a 5-0 result and conclude that the winning program is probably the
>stronger one.

>>I don't put much credence in any result of less than 30 games.
>>After 30 games, then you get a lot more plausibility.

>You didn't give any reason for this, so I don't understand. A 6-0 says more
>about engine strength than the above match result with over 100000 games.

Dann is right, I think.
The confidence interval calculation assumes that the score of a game is a
statistic variable with a mean value between 1 and -1 (function of the Elo
difference between the programs) and a standard deviation. Then if the
experiments are independent, the sum of the points will approximate the product
(mean*number of games) with a smaller standard deviation the more the games are.
With enough games the "confidence" will get to 95% when the performance
difference between the two programs is more than 3 standard deviations.
However this assumes a normal distribution. The assumption can be made for any
repeated statistical variable as long as the experiments are independent and
"enough". This "enough" is indeed expressed in most statistics books as 30.

Maurizio

Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Peter Berger 09:27:52 08/30/05
- Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Vasik Rajlich 01:52:11 08/31/05
  - Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Peter Berger 03:22:49 08/31/05
    - Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Vasik Rajlich 05:53:56 08/31/05
      - Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Peter Berger 10:52:09 09/01/05
        
        Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Vasik Rajlich 06:55:55 09/02/05
        
        Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Drexel,Michael 11:39:12 09/01/05
        
        Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Peter Berger 13:18:37 09/01/05
        
        Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Drexel,Michael 15:48:39 09/01/05
        
        Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Peter Berger 15:57:55 09/01/05
  - Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Uri Blass 01:59:18 08/31/05
    - Re: Spike 1.0 Mainz is too strong for Zappa 1.1 so far 16 to 10 Vasik Rajlich 05:58:10 08/31/05

This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.