Computer Chess Club Archives



Subject: Re: Something wrong with Fritz...

Author: Kurt Utzinger

Date: 01:10:06 09/20/05



On September 19, 2005 at 21:48:30, George Speight wrote:

>On September 19, 2005 at 10:20:43, Jonas Cohonas wrote:
>
>>>... I am 99,9% sure.
>>>
>>>Jouni
>>
>>I agree and although i am not a Fritz fan i would question the validity of the
>>whole tournament. The chance of this result to be correct is _very_ small imo.
>
> When u say u are questioning the validity of this tournament, you are being too
>kind to him.  I might have chosen stronger words.  Regards, George

      Hi George
      Sedat Canbaz is an experienced and well-known computer
      chess tester. Before writing such things I would carefully
      check the games to find out whether something went wrong.
      You seem to overlook that the number of games is low and
      that such results can happen from time to time. Here again
      is the sample I have already posted many times.
      Kurt

Gandalf 4.32g vs Program X

Games 1-10
3.0-7.0 [win program X]
Total 3.0-7.0 for program X

Games 11-20
6.5-3.5 [win Gandalf]
Total 9.5-10.5 for program X

Games 21-30
5.0-5.0 [draw]
Total 14.5-15.5 for program X

Games 31-40
3.5-6.5 [win program X]
Total 18.0-22.0 for program X

Games 41-50
4.5-5.5 [win program X]
Total 22.5-27.5 for program X

Games 51-60
3.0-7.0 [win program X]
Total 25.5-34.5 for program X

Games 61-70
5.0-5.0 [draw]
Total 30.5-39.5 for program X

Games 71-80
8.0-2.0 [win Gandalf]
Total 38.5-41.5 for program X

Games 81-90
7.0-3.0 [win Gandalf]
Total 45.5-44.5 for Gandalf

Games 91-100
5.5-4.5 [win Gandalf]
Final match result 51.0-49.0 for Gandalf

Can anybody tell me for sure which of these two programs is the stronger one??
And what if I had played only a 20-game match, and those games had been the
ones played as games 71-90? Then the result would have been 15.0-5.0 in
favour of Gandalf 4.32g!! Imagine what some testers would have concluded about
the strength of program X.
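
The arithmetic above can be rechecked with a short script. This is only a sketch: the block scores are copied from the list above, while the names `gandalf_blocks` and `GAMES_PER_BLOCK` are made up for illustration.

```python
from itertools import accumulate

# Gandalf's score in each 10-game block, copied from the list above.
gandalf_blocks = [3.0, 6.5, 5.0, 3.5, 4.5, 3.0, 5.0, 8.0, 7.0, 5.5]
GAMES_PER_BLOCK = 10  # hypothetical name; each block is 10 games

# Running match score for Gandalf after each block.
running = list(accumulate(gandalf_blocks))
for i, g in enumerate(running, start=1):
    played = i * GAMES_PER_BLOCK
    print(f"after {played:3d} games: {g:.1f}-{played - g:.1f}")

# Games 71-90 are blocks 8 and 9 (list indices 7 and 8).
window = sum(gandalf_blocks[7:9])
print(f"games 71-90: {window:.1f}-{20 - window:.1f}")
```

The same 100 games give 51.0-49.0 overall but 15.0-5.0 if only games 71-90 are counted.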

For all these reasons I think that nothing concrete can be said about the
relative strength of two programs until at least 100 games, better 200-300 or
even more, have been played.
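
A rough way to see why the number of games matters is to estimate the statistical margin on a match score. The sketch below is not part of the original test. It assumes the games are independent and uses 0.5 points as an upper bound on the standard deviation of a single game result (draws make the real value smaller). The function name `margin_95` and the normal approximation are illustrative assumptions.

```python
import math

def margin_95(games: int, per_game_sd: float = 0.5) -> float:
    """Approximate 95% margin on the score fraction (0..1).

    Assumes independent games and a normal approximation;
    per_game_sd=0.5 is a worst-case bound, not a measured value.
    """
    return 1.96 * per_game_sd / math.sqrt(games)

for n in (20, 100, 200, 300, 1000):
    print(f"{n:5d} games: score fraction +/- {margin_95(n):.3f}")
```

Under these assumptions the margin for 100 games is about 0.098, so a 51.0-49.0 result (score fraction 0.51) cannot be distinguished from an even match, and the margin for 20 games is more than twice as wide.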




Last modified: Thu, 15 Apr 21 08:11:13 -0700
