Computer Chess Club Archives



Subject: Test suites

Author: David B Weller

Date: 08:14:36 02/01/06


I was just here trying to figure out why my engine doesn't get a certain bm for a
positional test, and it occurred to me ...

Why would I trust that?

Many of the basic terms, e.g., isolated pawn, have a fairly well established
value, representing a statistical average over many, many positions.

If my engine is missing some positional move for no reason that I can tell,
except perhaps that my isolated = 20 should be isolated = 25, then by changing
it I am disregarding the trillions of other positions where it is,
statistically speaking, really 20.

As has been pointed out many times, these test suites are good only for
detecting gross errors.

So if you plan on tweaking the values of your SE metrics by test suites, make
sure the suite has about a million positions ;-)
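
To put rough numbers on the sample-size point, here is a toy simulation. It is only a sketch under made-up assumptions: each position is given an "ideal" isolated-pawn penalty drawn from a normal distribution with mean 20 and a hypothetical spread of 30 centipawns, and a "suite" is just a random sample of positions. None of these figures come from a real engine.

```python
import random
import statistics

TRUE_MEAN = 20.0   # assumed long-run average penalty (centipawns)
SPREAD = 30.0      # assumed position-to-position variation (hypothetical)

def suite_estimate(rng, n_positions):
    """Average 'ideal' penalty over one random suite of n_positions."""
    samples = [rng.gauss(TRUE_MEAN, SPREAD) for _ in range(n_positions)]
    return statistics.fmean(samples)

def estimate_spread(n_positions, trials=100, seed=1):
    """Standard deviation of the suite estimate across many random suites."""
    rng = random.Random(seed)
    estimates = [suite_estimate(rng, n_positions) for _ in range(trials)]
    return statistics.stdev(estimates)

small = estimate_spread(16)     # a typical hand-picked suite size
large = estimate_spread(1600)   # a hundred times more positions

print(f"noise with   16 positions: about {small:.1f} cp")
print(f"noise with 1600 positions: about {large:.1f} cp")
```

Under these assumptions the noise shrinks only with the square root of the suite size, so a small suite cannot reliably tell 20 from 25; the estimate wobbles by more than the 5 points being argued over.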

Maybe this is why 'auto' tuning is hard: if the suite doesn't contain
enough data to be representative of all the features one is trying to tune, it
will just be a waste of time, and make things worse...

It could be that many problems can be easily solved, simply by inflating or
deflating the right term(s). And certainly a 'genetic' algorithm would find the
right ones to inflate/deflate on a small set of positions in order to get more
of them right...

Fact is, the very reason the position got into the test suite could be that
it is a little 'freakish'. Then what? We're tuning our engines to
become worse!
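
The "tuning to become worse" scenario can be sketched with a toy model. Everything here is an assumption for illustration: each position has a hypothetical "ideal" penalty, the engine "solves" a position when its penalty is within 10 of that ideal, the suite is 20 positions picked precisely because they are outliers, and a plain grid search stands in for the 'genetic' tuner.

```python
import random

def solved(penalty, ideal, tolerance=10.0):
    """Toy rule: the bm is found if the penalty is close to the position's ideal."""
    return abs(penalty - ideal) < tolerance

def solve_rate(penalty, ideals):
    """Fraction of positions solved with the given penalty."""
    return sum(solved(penalty, v) for v in ideals) / len(ideals)

rng = random.Random(7)

# Hypothetical population: ideal penalties average 20 with a wide spread.
population = [rng.gauss(20.0, 30.0) for _ in range(20000)]

# 'Freakish' suite: 20 positions chosen because the usual value fails on them.
suite = [v for v in population if v > 40.0][:20]

# Stand-in tuner: try penalties 0, 5, ..., 100 and keep the best on the suite.
candidates = range(0, 101, 5)
tuned = max(candidates, key=lambda p: solve_rate(p, suite))

baseline_rate = solve_rate(20, population)
tuned_rate = solve_rate(tuned, population)

print(f"tuned penalty: {tuned}")
print(f"suite solved:      20 -> {solve_rate(20, suite):.0%}, tuned -> {solve_rate(tuned, suite):.0%}")
print(f"population solved: 20 -> {baseline_rate:.1%}, tuned -> {tuned_rate:.1%}")
```

In this model the tuner happily inflates the term and the suite score goes up, while the score over the whole population goes down, which is the trap described above.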

my $0.02

IMHO

-David




Last modified: Thu, 15 Apr 21 08:11:13 -0700
