Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: "Easy" Positional Test Suite

Author: David Rasmussen

Date: 16:08:45 12/27/01

Go up one level in this thread


On December 27, 2001 at 14:13:01, Robert Basham wrote:

>Greetings!
>Have you seen this link of basis chess test problems with some computor results?
>
>It looks very good to me.
>
>http://www.multimania.com/albillo/cmain.htm#tests
>
>Robert

I know if these tests. They are good, if a little old. But they are really not
what I was thinking of. For at least two reasons:

1. They are "difficult" tests that tests advanced knowledge
2. Many of them isn't a test of knowledge so much as search depth.

I want tests that are like this: If the program doesn't make the prefer the
right move, instantly, you have a problem because it indicated that some basic
evaluation term is over- or underevaluated. Like the program doesn't prefer some
move, because it doesnt  understand that two connected passers are strong, or it
doesn't prefer some move because it doesnt understand that an outpost is strong,
or that owning an open file is strong etc. Really basic evaluation terms. This
would test two things:

1. That the program has a certain (more or less basic) evaluation term necesary
to recognize the advantage of the position obtained by playing the key move
2. That the evaluation terms are more or less correctly calibrated in relation
to eachother, since it doesn't prefer some other move because it overevaluates
some other term or underevaluates the term in question.

That means that the best positions would be ones that had several imbalances,
that could mislead the program.

What I want is positions where you can say "if your program prefers to double
the rooks in this position or move the king to safety or something entirely
different instead of the 'right' move obtaining a strong knight outpost, your
evaluation terms are miscalibrated".

I know that evaluation is subjective in some sense and that such positions and
their key moves would always be subjective in this sense. Even so, a testsuite
of such positions could really get one started on fixing the worst and most
basic evaluation errors of a program.

/David



This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.