Computer Chess Club Archives


Subject: Re: CCRL 40/40 Rating list and stats updated

Author: Kirill Kryukov

Date: 07:44:51 02/20/06


Hi Uri,

On February 20, 2006 at 10:26:43, Uri Blass wrote:

>I think that it means that you may not find strong correlation between engines
>if the evaluation of one engine is twice of the evaluation of the second engine.

Yes. An engine may multiply its evaluation by 1.05, or by 2, or by 10, and still
play normal chess. So the "Evaluation difference" table only compares the
evaluations actually reported by the engines.
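
A minimal sketch of this point, with made-up scores. The helper names (`best_move`, `scaled`, `mean_abs_diff`) are hypothetical and not taken from any engine or from the CCRL tools, and the mean absolute difference is only an assumed stand-in for however the table is really computed. Multiplying every score by a positive factor leaves the chosen move unchanged, while the difference between the reported numbers grows:

```python
def best_move(scores):
    """Return the move with the highest score (first one wins on ties)."""
    return max(scores, key=scores.get)


def scaled(scores, factor):
    """Multiply every evaluation by the same factor."""
    return {move: value * factor for move, value in scores.items()}


def mean_abs_diff(a, b):
    """Average absolute difference between two sets of evaluations
    (an assumed measure, not necessarily the one used in the table)."""
    return sum(abs(a[m] - b[m]) for m in a) / len(a)


# Made-up evaluations in pawns for three candidate moves.
scores = {"e4": 0.30, "d4": 0.25, "Nf3": 0.20}
doubled = scaled(scores, 2)

same_choice = best_move(scores) == best_move(doubled)  # move choice is unchanged
diff = mean_abs_diff(scores, doubled)                   # reported numbers differ
```

Here the two "engines" would play identical chess, yet a comparison of raw evaluations reports a sizeable difference between them.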

This is why the ponder hit table is more reliable: it compares only what the
engines actually do on the board. The expected move is much more reliable than
the evaluation. I am only afraid that ponder hit statistics may require a larger
number of games.
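
A rough sketch of how a ponder hit rate could be computed, and why the sample size matters. The function names and the list-of-moves input format are assumptions made for illustration, and the error estimate treats positions as independent trials, which real games only approximate:

```python
import math


def ponder_hit_rate(expected_moves, played_moves):
    """Fraction of positions where the expected reply matched the move played."""
    if len(expected_moves) != len(played_moves):
        raise ValueError("both lists must have the same length")
    if not expected_moves:
        return 0.0
    hits = sum(1 for e, p in zip(expected_moves, played_moves) if e == p)
    return hits / len(expected_moves)


def standard_error(rate, n):
    """Binomial standard error of a hit rate measured over n positions
    (assumes independent positions, which is only an approximation)."""
    return math.sqrt(rate * (1.0 - rate) / n)


# Hypothetical data: moves one engine expected vs. moves the opponent played.
expected = ["e5", "Nf6", "d5", "c5"]
played = ["e5", "Nc6", "d5", "c5"]
rate = ponder_hit_rate(expected, played)  # 3 hits out of 4
```

Under that approximation, the uncertainty shrinks only with the square root of the number of positions, so telling apart two engine pairs with similar hit rates may need many games.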



>I think that it may be better to translate number in evaluation to expected
>result and see correlation between expected results.

How would you translate the evaluation into an expected result? Do you mean just
checking whether it is plus or minus? Hmm..
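
One possible reading of the suggestion, sketched under assumptions: a logistic curve that maps an evaluation in pawns to an expected score between 0 and 1. The function name `expected_score` and the default `scale` of 4 pawns are illustrative choices, not something stated in this thread; a real comparison would need a scale fitted per engine.

```python
def expected_score(eval_pawns, scale=4.0):
    """Map an evaluation in pawns to an expected score in (0, 1).

    The logistic shape and the scale value are assumptions; each engine
    would likely need its own fitted scale.
    """
    return 1.0 / (1.0 + 10.0 ** (-eval_pawns / scale))


# An equal position maps to an expected score of one half.
even = expected_score(0.0)

# An engine that doubles its evaluations gives the same expected score
# once its scale is doubled too, so the multiplier drops out.
normal = expected_score(1.5, scale=4.0)
doubled = expected_score(3.0, scale=8.0)
```

Looking only at plus or minus would be the limiting case of a very small scale, where any advantage maps to nearly 1 and any disadvantage to nearly 0.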


