Computer Chess Club Archives



Subject: Re: CSS WM TEST - a technical view

Author: Sune Fischer

Date: 07:16:55 06/18/04



On June 18, 2004 at 09:47:55, Rolf Tueschen wrote:

>Thanks for the information. Let me add in return something important which would
>also explain the big interest of the journal CSS.
>
>Mikhail G. sees himself at eye-height with the programmers of the chess
>programs. He claims that he has worked on the analyses of "his" positions for
>years! And he sees himself justified because the test results from Manfred
>Meiler do show that the CSS-WM-Test ranking list (for actually some 300 engines)
>is similar to those lists that are made to test playing strength. So, Mikhail
>claims that his test is ALSO a good indicator for the playing strength, and
>more, in case of criticism, Mikhail can make the counter argument that his test
>only deals with "analytical abilities", which are allegedly the basis of all
>strengths. So, this is the real reason for the mass hypnosis of the dear CSS
>readers. So, even those who don't like MG can still have the delusion that
>"their" test is kosher because there is no criticism of it.

For this to be true, it would have to be a measure of strength equivalent to
playing games, IMO (at least ignoring books, time management and stuff).

Imagine everyone spent years developing only to score high on that suite.
Do you think the one scoring the highest (assuming they don't all get 100%)
would also be the strongest?

It's quite obvious that it's not that simple in general: just compare how fast
CM and Fritz find mates and see how that relates to their chess strength.

To make a balanced suite where these two engines end up close to each other
you'd have to find a whole different set of positions where Fritz scores better.

Now you take a third engine, say Junior, which scores badly on the Fritz suite
and the CM suite because it has its own set of strong and weak points. So you
have to add a new suite for Junior. Etc. for a fourth engine....

This kind of reverse-engineered patching of a test suite will by definition
produce the right strength relations for the engines it was fitted to, but it
still does not indicate strength.
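
To make the patching argument concrete, here is a toy sketch in Python. Everything in it is made up for illustration: the engine names, the ratings, the position labels and which engine solves what are all hypothetical, not measurements of any real program. It starts from a mate-only suite, adds positional problems until the suite order agrees with the game ratings of the two engines it was tuned on, and then checks a third engine that was not part of the tuning.

```python
# Hypothetical game ratings: the thing the suite is supposed to predict.
GAME_RATING = {"mate_solver": 2600, "positional": 2650, "endgame": 2700}

# Hypothetical solve sets: which positions each engine finds in time.
# m* = mate problems, p* = positional problems, e* = endgame problems.
SOLVED = {
    "mate_solver": {"m1", "m2", "m3", "m4"},
    "positional": {"m1", "p1", "p2", "p3", "p4"},
    "endgame": {"m1", "m2", "p1", "e1", "e2", "e3", "e4"},
}


def score(engine, suite):
    """Number of suite positions the engine solves."""
    return len(SOLVED[engine] & set(suite))


def reproduces_ratings(engines, suite):
    """True if suite scores are strictly ordered like the game ratings."""
    ordered = sorted(engines, key=GAME_RATING.get, reverse=True)
    scores = [score(e, suite) for e in ordered]
    return all(a > b for a, b in zip(scores, scores[1:]))


def patch_suite(suite, pool, engines):
    """Add pool positions one at a time until the suite order fits `engines`."""
    suite = list(suite)
    for pos in pool:
        if reproduces_ratings(engines, suite):
            break
        suite.append(pos)
    return suite


mate_suite = ["m1", "m2", "m3", "m4"]
known = ["mate_solver", "positional"]

# Patch the mate suite with positional problems until it fits the known pair.
tuned = patch_suite(mate_suite, ["p1", "p2", "p3", "p4"], known)

print("fits tuned engines:", reproduces_ratings(known, tuned))
print("fits with new engine:", reproduces_ratings(known + ["endgame"], tuned))
```

In this made-up data the patched suite orders the two engines it was fitted to correctly, but the third engine, the highest rated in games, scores lowest on it. You would have to patch again for it, and again for every engine after that.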

-S.




Last modified: Thu, 15 Apr 21 08:11:13 -0700
