Computer Chess Club Archives


Subject: Re: The big drop in the rating of my Fruit personality

Author: Robert Hyatt

Date: 08:44:36 10/18/05


On October 17, 2005 at 20:44:53, Ryan B. wrote:

>On October 17, 2005 at 17:01:25, Robert Hyatt wrote:
>
>>On October 17, 2005 at 10:10:58, Uri Blass wrote:
>>
>>>I am now surprised by the big drop in the CEGT rating of my Fruit personality.
>>>
>>>It was already 2806 after 92 games and now it is 2748 after 223 games.
>>>
>>>I also remember a possible error of 61 Elo after 92 games, but even if the real
>>>rating is 61 Elo lower than 2806, I still would not expect the rating to change
>>>so fast.
>>>
>>>This is also surprising because earlier results that I read, not from CEGT,
>>>supported my personality.
>>>
>>>I wonder whether the real error is higher than the stated error.
>>>
>>>I wonder what the reason for the big drop is, and whether there was a problem in
>>>the matches against Spike and Jonny, which seem to be the main reason for the
>>>drop in my personality's rating (did the same tester play these matches?).
>>>
>>>Possible sources of error in the results:
>>>
>>>1) Testing on different hardware than the previous Fruit.
>>>
>>>The CEGT claims that they test on hardware equivalent to a 2 GHz Pentium 4,
>>>but the problem is that there is no true equivalence: one program may run
>>>relatively better on one processor than on another.
>>>
>>>2) Testing different positions, not the same positions that were used for the
>>>earlier version.
>>>
>>>3) Testing against different opponents.
>>>
>>>Uri
>>
>>
>>
>>The standard deviation between two computer opponents is often over 200 rating
>>points.  All you have to do is play on ICC for a while to see this happen.
>>Programs cycle up and down without being changed and without their opponents
>>changing at all...
>
>
>Very true.  Why is Crafty only playing unrated games now, by the way?

The eval was ripped apart, with pieces of it not functional.  Everything is now
back to "rated only", although the eval is no longer asymmetric at all, which
will lead to some interesting stuff...
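
As a rough check on the figures quoted at the top of the thread (2806 after 92
games, 2748 after 223 games, a possible error of 61 Elo at 92 games), the sketch
below compares the size of the drop with the stated margin. It also scales that
margin to 223 games under the assumption that the error shrinks in proportion
to 1/sqrt(games). That scaling rule and the helper name scaled_margin are
assumptions made for illustration, not something taken from CEGT's method.

```python
import math


def scaled_margin(margin_at_n0: float, n0: int, n: int) -> float:
    """Scale an error margin from n0 games to n games.

    Assumption: the margin shrinks in proportion to 1/sqrt(number of games).
    """
    return margin_at_n0 * math.sqrt(n0 / n)


# Figures quoted in the thread.
rating_92, rating_223 = 2806, 2748
margin_92 = 61

drop = rating_92 - rating_223
margin_223 = scaled_margin(margin_92, 92, 223)

print(f"drop: {drop} Elo")
print(f"stated margin after 92 games: +/-{margin_92} Elo")
print(f"estimated margin after 223 games: +/-{margin_223:.0f} Elo")
print("drop within 92-game margin:", drop <= margin_92)
```

With these inputs the drop is 58 Elo, which falls just inside the 61 Elo margin
quoted for 92 games, and the scaled margin after 223 games comes out at roughly
39 Elo.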




Last modified: Thu, 15 Apr 21 08:11:13 -0700
