Author: pavel
Date: 08:07:53 09/06/02
On September 06, 2002 at 10:25:01, Uri Blass wrote:

>On September 06, 2002 at 08:22:33, pavel wrote:
>
>>On September 05, 2002 at 15:47:24, Dann Corbit wrote:
>>
>>>On September 05, 2002 at 08:26:56, pavel wrote:
>>>>On September 05, 2002 at 07:35:12, David Rasmussen wrote:
>>>>>I don't know how reliable this tournament is. Chezzz is consistently better on
>>>>>ICC against a number of the opponents that are above it in this tournament.
>>>>>
>>>>>/David
>>>>
>>>>
>>>>No offense to SSDF, but I consider this tournament to be more reliable than
>>>>SSDF.
>>>>If you look at the format of the tournament and the rules, and the way they are
>>>>being played out.
>>>>
>>>>Only handicap is that, not alot of games are played by each programs in each
>>>>division.
>>>>But it's still better than differant programs playing "differant numbers of
>>>>games" in a rating list.
>>>
>>>I don't see a problem with that approach, as long as enough games are played.
>>>When the number of games for some program is small, then the error bars will be
>>>large.
>>
>>Yes, but it is a faulty method IMO.
>>
>>From SSDF:
>>                                        Rating    +    -  Games  Won  Av.opp
>>1 Fritz 7.0 256MB Athlon 1200 MHz         2741   30  -29    574  64%    2636
>>2 Shredder 6.0 Paderb 256MB Athlon 1200   2727   34  -32    467  65%    2619
>>3 Chess Tiger 14.0 CB 256MB Athlon 1200   2721   33  -32    487  63%    2627
>>4 Gambit Tiger 2.0 256MB Athlon 1200      2718   31  -30    523  60%    2645
>>5 Shredder 6.0 256MB Athlon 1200 MHz      2717   32  -31    505  64%    2618
>>
>>
>>The games are not played against same opponent (if so, not same number).
>>
>>For examples, if shredder6 plays another 107 (which is the number of games less
>>than Fritz7) games against opponent such as, Crafty and older versions of Fritz
>>and lower rated programs, it probably will not only shorten the gap between the
>>first and second program but shredder6 will most likely top fritz7 easily.
>
>How do you know it?
>
>I agree that there is a problem and it is possible but it is also possible that
>shredder does not like to play against Crafty and older versions of Fritz and it
>is not going to score enough to earn rating.
>
>I have no data to know.

It is most certainly possible. But if you read my second post, you will see that
I suggested, "It probably will". What are the odds that Shredder6 doesn't do
well against Crafty or older versions of Fritz?

I don't have proof that contradicts this. However, when the programs don't play
an even number of games, possibilities like this remain. And thus one cannot
'conclude' anything from a rating list in which each program has played a
different number of games.

It's fun. And it's also wrong.

cheers,
pavs

>
>Uri
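
To put rough numbers on the error-bar point raised in the thread, here is a minimal sketch in Python. It assumes the standard logistic Elo formula and a simple binomial approximation that ignores draws, so the margins it gives are, if anything, too wide. The function names are made up for illustration, and this is not necessarily how SSDF computes its own margins.

```python
import math


def elo_diff(score):
    """Elo difference implied by an average score in (0, 1), logistic model."""
    return 400.0 * math.log10(score / (1.0 - score))


def performance_with_margin(avg_opp, score, games, z=1.96):
    """Return (performance, low, high) ratings for a given result.

    Assumption: each game is treated as a plain win/loss trial, so the
    standard error of the score is sqrt(p * (1 - p) / n). Draws lower the
    real variance, which makes this estimate conservative.
    """
    se = math.sqrt(score * (1.0 - score) / games)
    # Clamp so the logarithm stays defined for very small samples.
    low_score = max(score - z * se, 1e-6)
    high_score = min(score + z * se, 1.0 - 1e-6)
    return (
        avg_opp + elo_diff(score),
        avg_opp + elo_diff(low_score),
        avg_opp + elo_diff(high_score),
    )


if __name__ == "__main__":
    # Same score and opposition (64% against 2636, as in the Fritz 7.0 row),
    # but with different numbers of games played.
    for games in (50, 100, 467, 574):
        perf, low, high = performance_with_margin(2636, 0.64, games)
        print(f"{games:4d} games: {perf:.0f} (range {low:.0f} to {high:.0f})")
```

The interval narrows only with the square root of the number of games, so a gap of about a hundred games between two programs changes the width of their error bars by a few points. It says nothing about which opponents those extra games would be played against, which is the other half of the disagreement above.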
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.