Computer Chess Club Archives



Subject: Re: Judge yourself!

Author: Maurizio De Leo

Date: 14:34:43 07/31/02



On July 31, 2002 at 16:34:47, Gian-Carlo Pascutto wrote:

>>Actually also #7 Junior is in the "confidence range".
>
>It isn't. (You can't simply add the error margins)
>
>sqrt (30^2 + 30^2) = 42
>
>Fritz 7 is with >95% confidence better than Junior 7.

You are right.

Assuming the SSDF ranges are based on a normal distribution:

Fritz
average                       M1 = 2741
standard deviation            s1 = 15.306

Junior
average                       M2 = 2689
standard deviation            s2 = 14.796

so Z = (M1-M2) / sqrt (s1^2 + s2^2) = 52 / 21.29 = 2.44

Since Z is standard normal when the two programs are equally strong, a value
of 2.44 indeed leaves well under 5% probability that Junior is as strong as
Fritz.
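
For anyone who wants to check the arithmetic, here is a minimal sketch in Python. The function names z_score and two_sided_p are only illustrative, and the choice of a two-sided test is an assumption on my part; the inputs are the averages and spreads quoted above.

```python
import math

def z_score(m1, s1, m2, s2):
    """Z statistic for the difference of two independent normal estimates."""
    # Independent errors combine in quadrature, not by simple addition.
    return (m1 - m2) / math.sqrt(s1 ** 2 + s2 ** 2)

def two_sided_p(z):
    """P(|Z| >= |z|) for a standard normal Z, via the complementary error function."""
    return math.erfc(abs(z) / math.sqrt(2))

# Values taken from the post: Fritz (M1, s1) and Junior (M2, s2).
z = z_score(2741, 15.306, 2689, 14.796)
p = two_sided_p(z)
print(f"Z = {z:.2f}, two-sided p = {p:.4f}")
```

The p value comes out at roughly 0.015, comfortably below the 0.05 threshold; a one-sided test would halve it.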

Maurizio

P.S.   Thank you for letting me knock a little rust off my math.
P.S.2  So the SSDF list isn't so useless after all: it rules out two big
pretenders (Junior and Hiarcs) to the throne of best computer program.







Last modified: Thu, 15 Apr 21 08:11:13 -0700
