Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: Fruit 2.2.1vsRybka 1.0 beta 32-bit Blitz 10' EngineMatch after 800G

Author: Norm Pollock

Date: 05:11:25 12/14/05

Go up one level in this thread


On December 14, 2005 at 06:16:31, Vincent Lejeune wrote:

>On December 14, 2005 at 06:10:11, Norm Pollock wrote:
>
>>On December 14, 2005 at 05:30:59, Vincent Lejeune wrote:
>>
>>>On December 14, 2005 at 05:22:16, Gian-Carlo Pascutto wrote:
>>>
>>>>On December 14, 2005 at 05:13:48, Joachim Rang wrote:
>>>>
>>>>>On December 14, 2005 at 05:06:41, Mimic wrote:
>>>>>
>>>>>>==============================================================
>>>>>>Rybka 1.0 Beta 32-bit  +282-130=388  59.50%  476.0/800   +66
>>>>>>--------------------------------------------------------------
>>>>>>Fruit 2.2.1            +130-282=388  40.50%  324.0/800   -66
>>>>>>==============================================================
>>>>>>
>>>>>>  Games       Rybka
>>>>>>========================
>>>>>>  100          +31
>>>>>>  200  -----   +33
>>>>>>  300          +50
>>>>>>  400  -----   +50
>>>>>>  500          +53
>>>>>>  600  -----   +60
>>>>>>  700          +63
>>>>>>  800  -----   +66
>>>>>>========================
>>>>>>
>>>>>>Pentium M Dothan 2.52Ghz
>>>>>>32MB Hash
>>>>>>No EGTB
>>>>>>
>>>>>>Mimic
>>>>>
>>>>>has Rybka some hidden learning? The steady increase of the rating difference is
>>>>>surprising.
>>>>
>>>>The rating difference didn't increase, just the plus score. (And it wasn't
>>>>steady, either)
>>>>
>>>>--
>>>>GCP
>>>
>>>
>>>No, +66 is the rating difference, not the score difference.
>>>
>>>Surprisingly it doubled between 200 and 800 games...
>>>Who know it will be after 1200 games ...
>>
>>No, the rating difference takes into account changes in BOTH elos. Therefore the
>>rating difference is +132.
>
>
>I think you're wrong. This table show that Rybka 1.0 Beta 32-bit is 66 Elo
>stronger than Fruit 2.2.1 (and logically Fruit 2.2.1 is 66 Elo weaker than Rybka
>1.0 Beta 32-bit)

A funny thing happened while I was trying to rebut your argument. I tested
things in elostat and this is what happened.

I downloaded the 800 games, stripped away comments etc, and put the games into
elostat. I gave both engines a starting elo of 2500 and here is what I found:

 Program                          Elo    +   -   Games   Score   Av.Op.  Draws

  1 Rybka 1.0 Beta 32-bit          : 2533   17  17   800    59.5 %   2467   48.5
%
  2 Fruit 2.2.1                    : 2467   17  17   800    40.5 %   2533   48.5
%

So it looks like you are correct!
If Rybka was previously rated let's say 400 points higher at the start, then
Rybka would lose points and Fruit would gain. ;-)



This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.