Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: How to use a [cough] EPD test suite to estimate ELO

Author: Paulo Soares

Date: 19:36:46 02/12/99

On February 12, 1999 at 00:03:51, Bill McGaugh wrote:

>Paulo,
>
>Yes...the idea is that a program "plays a tournament" against the previously
>rated (or calibrated) problems...and receives a performance rating based
>on its win-loss record.
>
>The method is not without problems...ideally the problems would be
>calibrated using a wide range of programs...and the programs would be
>rated using a very large number of these calibrated problems...
>
>My initial efforts are based on far too few programs for problem ratings.
> The error of the measurement is probably somewhat different from  Elo's
>formulae, since the initial problem ratings, which the program ratings are based
>on, are established from the problem performance of a small number of programs.
>An advantage of this method is that, after you calibrate problems, you can pick
>the ones you want for a test suite...and test the performance of the new
>suite...the real test being how well a suite estimates the performance of
>programs that are completely unrelated to the calibration of the suite...
>
>This method is a lot of fun to play with, but it is probably a pursuit
>of the impossible dream...a set of problems that will quickly rate chess
>programs...


Bill,

PII-300 HT=64Mb

Fritz5.32----  2426
Genius6------  2471

        The positions must be more calibrated, as yourselves says.
	I Same start to find that some positions really are not decided,
in view of that the punctuation given to others moves is similar.
Seems that this is one of the points that Dann is placing.

Best regards,

Paulo Soares, from Brazil.

This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.