Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: Test suites - can they reliably predict ELO?

Author: Bertil Eklund

Date: 16:46:50 12/11/99

Go up one level in this thread


On December 11, 1999 at 17:52:56, Tom King wrote:

>Which of the well known test suites predicts the strength of chess programs most
>accurately?
>
>I ask this, because I recently made some *slight* mods. to the evaluation
>function in my program, Francesca. I ran the LCT-2 suite, and the results
>indicated that it was a wash - the modification gave me about 5 ELO points,
>apparently.
>
>I then ran a series of fast games against another amateur program. I realize
>it's important to play a large number of games, to reduce the margin of error,
>so I ran two matches of 65 games. The result was this:
>
>MATCH 1
>"Normal" Francesca scored 37% against the amateur program.
>
>MATCH 2
>"Modified" Francesca scored 45% against the amateur program.
>
>Quite a difference! It implies that the modification is worth over 50 ELO. I
>guess I need to play more games, against a variety of programs to verify whether
>this improvement is real, or imaginary.
>
>Anyhow, beware of reading too much into ELO predictions of test suites..
>
>Cheers All,
>Tom

Hi!

Mr Irazoquis secret test-suite is very impressing! I think it´s about 111
positions. He can predict a new programs strength better than any other test I
have seen so far. If his predictions remains as good as his previous results, I
hope we can stop publishing our list and just play for fun.

Bertil SSDF



This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.