Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: CCRL 40/40 Rating list and stats updated

Author: Uri Blass

Date: 07:26:43 02/20/06

Go up one level in this thread


On February 20, 2006 at 08:48:10, Kirill Kryukov wrote:

>Hi Mike,
>
>Thanks for comments and interest!
>
>On February 19, 2006 at 12:50:47, Mike S. wrote:
>
>>Thanks! - I see 6 engines, ranks #2 to #7 within only 19 Elos (2830-2849) with
>>the error margins being bigger than +/- 20. (But this includes 2 deeps on
>>duals.) This is remarkable and tells us that these many engines are virtually
>>equally strong, in comp-comp under the CCRL conditions. Or if they are not
>>equally strong (and I don't doubt that some will want to claim this :-)), then
>>it seems to be very hard to prove even with hundred of games. Maybe the gaps
>>will become bigger with 1.000+ games each, or maybe not...
>
>Yes, it's getting crowded at the top of the rating list.. :-)
>
>
>>I also find the engine correlation statistics very interesting. For example, the
>>"most different" engines seem to be Fritz compared to Ktulu and Tiger, 0.93
>>rating difference. Surprising about this is, that we find the 2nd biggest rating
>>diff between Ktulu and Tiger. So, all these 3 are very different.
>
>There can be different explanations of these observations, not necessarily the
>different style of play. For example some engines are known to be "optimistic" -
>as soon as they see slight advantage they over-estimate it. Such things happen,
>and there are "conservative" engines too.

I think that it means that you may not find strong correlation between engines
if the evaluation of one engine is twice of the evaluation of the second engine.

I think that it may be better to translate number in evaluation to expected
result and see correlation between expected results.

Uri



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.