Computer Chess Club Archives


Search

Terms

Messages

Subject: Can someone here enhance the next SSDF list please.

Author: Tina Long

Date: 17:06:31 11/19/99


The results of what I am asking could be badly misinterpreted, & could result in
silly arguements, but if read properly would, for many here, be very
interesting.

In the discussions of "Who's best" there is rarely any consideration of the +/-
in the SSDF list, we get statements such as
"ProgramX is best; it's 5 points ahead of the rest."

Now this is poetic, but wrong, as ProgramX's result is 2680 +/- 70,
From the games played we can be 95% sure ProgramX is rated somewhere between
2610 and 2750.

This is not ELO, this is the progression of computers vs computers since some
computers played some humans about 20 years ago.  The whole list was "deflated"
by 100 points about 10 years ago, and looks like it should be deflated by
another 100 points now.  The only real relationship to ELO we currently have is
Rebel's small sample of Computer Human games, and as Rebel is constantly being
improved we don't know it's current rating as the rating is biased by the
"older" Rebel results- but that's a tangent.... sorry

I'll get to the point:
When the next SSDF is release at the end of November, I'd like one of the
smarter maths whizes here to do the following calculations for me:

Using:
What's the improvement in rating in going from a 200mhz to a 450mhz?
(Looking at the last list, it's about 70 +/- 30)
Ditto from 486/50 and P90 to 200 or 450?

Create a list of estimated ratings on a unified platform, combining (where
applicable) the games of ProgramX on multiple platforms (many programs have been
tested on 2 mhz levels).  The +/- needs to be stated as well as this will
increase dramatically, particularly for ProgramY currently ranked on P90 or a
486/50.

(And where would my favourite oldie
129 Mephisto Polgar  6502 5 MHz             1970   17  1793   41%  2036
rank when upgraded (remembering a P450 is probably 300 - not 100 - times faster)
2600 +- 1000 ?)

Maybe deflating the 450's and using P200 as the unified platform would be best
at this time.

I realise the results would actually mean little due to the very high
statistical variance in the results, but I would still find it an interesting
ranking.

Any volunteers to do the sums?
Thanks guys

Tina Long



This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.