Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: an example how users - not programmers - use tests

Author: Rolf Tueschen

Date: 09:34:43 06/20/04

On June 20, 2004 at 12:10:57, David Dahlem wrote:

>I've seen numerous examples of one engine solving a test suite position in a few
>seconds, while another engine of known equal game playing strength never finds
>the solution, even after hours of analysis. To me, this makes test suites
>worthless, or at least very difficult to interpret the results.
>
>Regards
>Dave

Yes, correct, this is what is called the lack of reliability of the results, as
Sandro explained. It's a typical wrong with these position tests, but all test
knowies know it, however the question is how to explain that triviality to lays
and motivated users and to a founder with a blind spot? In special who is losing
himself in the circle argument that every critic at first should run the test
suite because they would THEN realize how good it is. You know from the chess
quality of these positions on...! I can only repeat this: a famous CC journal
and a whole team of forum mods who don't want to "hurt" a test founder and so
tolerate that he loses himself in such a circle - is the main responsible for
that mess. Because that someone, even a scientist, _can_ go wrong and can't
realize this, that is not such a seldom event. It doesn't mean that he's bad or
not intelligent or such. Sometimes you have this "wall" in your head. And you
can't find a brick. Later you break out into laughter and you wonder why you
couldn't see it. Here in our case the main founder is a Russian academic doctor
who certainly has learned the basics of scientific reasoning. Therefore he will
understand in the end the difference between testing the end-product or a
prototype. He does also know these two obstacles, namely validity and
reliability. And he should know that statistical calculation could never
"create" significance if it's not in the data.

I do also think that we must change a couple of terms. When the users are
playing with their engines and run them through 100 positions, this can't be
called "testing"! It looks like but it's not testing.

Re: an example how users - not programmers - use tests David Dahlem 12:34:39 06/20/04
- Re: an example how users - not programmers - use tests Steve Glanzfeld 12:56:00 06/20/04
  - Re: an example how users - not programmers - use tests David Dahlem 13:03:33 06/20/04
    - Re: that wasn't an answer (n.t.) Steve Glanzfeld 13:10:08 06/20/04
      - Re: that wasn't an answer (n.t.) David Dahlem 14:27:43 06/20/04
        
        Re: Still no answer. My question was: Steve Glanzfeld 15:55:08 06/20/04
        
        Re: Still no answer. My question was: David Dahlem 16:06:38 06/20/04
        
        Re: Dahlem cannot answer a simple question. Steve Glanzfeld 16:13:22 06/20/04
        
        Re: Dahlem cannot answer a simple question. David Dahlem 16:21:05 06/20/04
  - Re: an example how users - not programmers - use tests Uri Blass 13:00:38 06/20/04
Re: an example how users - not programmers - use tests Uri Blass 09:39:54 06/20/04
- Re: an example how users - not programmers - use tests Rolf Tueschen 09:53:50 06/20/04
  - Re: an example how users - not programmers - use tests Uri Blass 10:10:23 06/20/04
    - Re: an example how users - not programmers - use tests Rolf Tueschen 10:20:56 06/20/04

This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.