Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: Not meaningless - just not absolute

Author: Bob Durrett

Date: 05:43:12 02/14/03

On February 14, 2003 at 08:35:29, Albert Silver wrote:

>On February 14, 2003 at 07:10:40, Rolf Tueschen wrote:
>
>>Just to explain some basics for new readers, I show why the whole List is
>>worthless. The rankings are by chance the way they are presented.
>>
>>Since only a few here have basic knowledge in statistics I explain the most
>>apparet things.
>>
>>We are told that for instance the two first programs are seperated by 8 points.
>>No matter Stefan get all the credits here for his first place. But is true that
>>Shredder is stronger than Fritz?
>>
>>Here I must tell you that we simply don't know it. The SSDF pretend to know it,
>>but it is NOT true. How can I say such things? Easy! Look at the deviations.
>>These numbers with + or -. We see that most programs have an expected Elo number
>>varying plus/mius of about 30 points! Note, that the Elo minus 5 is as probable
>>as the fially given Elo for the ranking!
>>
>>If you then take a look at the Elo of the opponents in the far right you can see
>>that even for the top programs the SSDF was unable to create equal conditions.
>>Also this influence by different opponents makes the 8 numbers difference at the
>>top meaningless.
>>
>>In sum we can say that the SSDF failed to show - exactly what they pretend to
>>show - the differences between the actual top programs. The SSDF presents a new
>>leader, but that is against its own results! So that the conclusion is allowed
>>that SSDF makes deliberately their own new number 1!
>
>Your comment that being number 1 in the list is not an absolute is completely
>correct. The SSDF doesn't claim it is a statistical absolute either, which is
>why they present the data: rating performance, number of games, AND the error
>margin.
>
>
>     THE SSDF RATING LIST 2003-02-13   90961 games played by  251 computers
>                                           Rating   +     -  Games   Won  Oppo
>                                           ------  ---   --- -----   ---  ----
>   1 Shredder 7.0  256MB Athlon 1200 MHz     2768   33   -31   547   72%  2606
>   2 Deep Fritz 7.0  256MB Athlon 1200 MHz   2760   29   -28   654   70%  2612
>   3 Fritz 7.0 256MB Athlon 1200 MHz         2740   30   -29   574   64%  2635
>   4 Chess Tiger 15.0  256MB Athlon 1200 MHz 2726   27   -26   704   64%  2623
>
>
>If they present the error margin, doesn't this *clearly* mean that the result
>may be off by that much? However, so far the current performance is 2768 SSDF
>points. How many games does a human play to get their rating? I won't event
>mention the ridiculously low requirement by FIDE to play only 9 games to get a
>first rating. Suppose I had no rating and played 100 games against a 2000 Elo
>player and I scored 75/100. My performance is 2200 exactly. Is it absolute? No,
>there is a good margin of error, yet no one will question the rating and start
>telling me I'm not rated 2200, I'm just rated anywhere between 2140 and 2260. I
>see no difference. They had Shredder 7 play 547 games against other programs,
>and presented the results PLUS the error margin. It *may* still be a fraction
>weaker than Deep Fritz 7, but already it is clear that it performas better than
>Chess Tiger 15 against other computers. But even if another 200 games changed
>the top ratings to Shredder 7 = 2762 and DF7 = 2763 would anyone be so foolish
>as to claim one program is actually any stronger?? I certainly would never think
>of an opponent rated 10 points more as stronger. The fact that two such
>different playing styles achieve almost identical performances shows how rich
>and flexible chess is.
>
>                                         Albert


Excellent points.  The "bottom line" is that SSDF presented their findings
properly, but the problem is in interpretation.  SSDF cannot be held responsible
for errors in interpretation.

Bob D.

>
>>
>>(Note please that this is not a political speech, however it is what statistics
>>demands. The SSDF got this critic so often in the past but they still did't
>>change their experimental setting.)
>>
>>Rolf Tueschen

Re: Statistical methods and their consequences Rolf Tueschen 06:27:26 02/14/03
- Re: Statistical methods and their consequences Bob Durrett 11:56:38 02/14/03
  - Re: Statistical methods and their consequences Rolf Tueschen 12:44:07 02/14/03
- Re: Statistical methods and their consequences Jonas Cohonas 11:30:36 02/14/03
  - Re: Statistical methods and their consequences Rolf Tueschen 13:08:44 02/14/03
    - Re: Statistical methods and their consequences Jonas Cohonas 13:49:56 02/14/03
      - Re: Statistical methods and their consequences Rolf Tueschen 14:25:16 02/14/03
        
        Re: Statistical methods and their consequences Jonas Cohonas 14:45:51 02/14/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 17:30:26 02/14/03
        
        Re: Statistical methods and their consequences Jonas Cohonas 00:18:31 02/15/03
        
        Re: Psychology Rolf Tueschen 04:13:55 02/15/03
        
        Re: Psychology Jonas Cohonas 05:28:43 02/15/03
        
        Re: Psychology Rolf Tueschen 06:17:50 02/15/03
        
        Re: Psychology Jonas Cohonas 06:28:14 02/15/03
- Re: Statistical methods and their consequences Tony Hedlund 10:32:16 02/14/03
  - Re: Statistical methods and their consequences David Dory 01:52:44 02/15/03
    - Re: Statistical methods and their consequences Albert Silver 04:08:52 02/15/03
      - Re: Statistical methods and their consequences Rolf Tueschen 06:34:41 02/15/03
        
        Re: Statistical methods and their consequences Albert Silver 12:54:15 02/15/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 14:21:10 02/15/03
        
        Re: Statistical methods and their consequences Albert Silver 19:16:24 02/15/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) David Dory 20:08:45 02/15/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) Albert Silver 07:38:01 02/16/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) David Dory 02:54:34 02/17/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) Albert Silver 06:04:27 02/17/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) Mogens Larsen 11:20:20 02/17/03
        
        Re: Statistical methods and their consequences - for Albert (Rolf's Way) Rolf Tueschen 06:21:58 02/17/03
  - Re: Statistical methods and their consequences Rolf Tueschen 13:27:31 02/14/03
    - Re: Statistical methods and their consequences Tony Hedlund 02:24:43 02/15/03
      - Re: Statistical methods and their consequences Rolf Tueschen 04:12:10 02/15/03
        
        Re: Statistical methods and their consequences Tony Hedlund 10:21:39 02/16/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 03:29:23 02/17/03
        
        Re: Statistical methods and their consequences Tony Hedlund 09:53:52 02/18/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 13:22:58 02/18/03
        
        Re: Statistical methods and their consequences Tony Hedlund 06:12:18 02/20/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 06:32:38 02/20/03
        
        Re: Statistical methods and their consequences Tony Hedlund 09:07:18 02/20/03
        
        Re: Statistical methods and their consequences Uri Blass 03:53:14 02/17/03
        
        Re: Statistical methods and their consequences Rolf Tueschen 06:05:31 02/17/03
        
        Re: Statistical methods and their consequences Tony Hedlund 10:36:28 02/17/03
        
        Re: Statistical methods and their consequences (Red=Green) Rolf Tueschen 14:56:02 02/17/03
        
        Re: Statistical methods and their consequences (Red=Green) Tony Hedlund 10:20:19 02/18/03
        
        Re: Statistical methods and their consequences (Red=Green) Rolf Tueschen 13:11:43 02/18/03
        
        Re: Statistical methods and their consequences (Red=Green) Tony Hedlund 07:32:04 02/20/03
        
        Re: Statistical methods and their consequences (Red=Green) Rolf Tueschen 07:41:39 02/20/03
        
        Re: Statistical methods and their consequences (Red=Green) Tony Hedlund 09:04:42 02/20/03
        
        Re: Final Statement for now Rolf Tueschen 15:40:34 02/20/03
        
        Re: Final Statement for now Bertil Eklund 22:36:22 02/20/03
        
        Re: Final Statement for now--ah! resolution at last! (NT) Stephen A. Boak 20:45:44 02/20/03
    - Re: Statistical methods and their consequences Bertil Eklund 14:37:39 02/14/03
  - Re: Statistical methods and their consequences Rolf Tueschen 13:25:29 02/14/03

This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.