Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: Statistics and Test results

Author: Chris Welty

Date: 08:38:41 10/07/04

>However, since the sample isn't random, the entire test is meaningless.

What makes you think the sample isn't random?

>If you could test engines like that, you would use the binomial
>distribution and would need more than 30 random games from those engines to
>properly test the probability of one engine winning over another.

That's just wrong. The number of games you need is dependent on the results of
the games.

> However,
>since it is not really possible to get a "random game", you will need to play,
>as others on this board have suggested, you will need to increase the sample
>size to 1000 or so.

Again, that's wrong. The sample size you need depends on the outcome of the
games. If after 500 games the result is 500-0 would you agree that one engine is
better than another?

Re: Statistics and Test results Rick Bischoff 10:53:23 10/07/04
- Re: Statistics and Test results Chris Welty 14:42:20 10/07/04
  - Re: Statistics and Test results Rick Bischoff 15:40:14 10/07/04
    - Re: Statistics and Test results Rick Bischoff 22:10:59 10/07/04
      - Re: Statistics and Test results Chris Welty 00:53:39 10/08/04
        
        Re: Statistics and Test results Rick Bischoff 05:22:34 10/08/04
      - Re: Statistics and Test results Chris Welty 00:29:51 10/08/04

This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.