Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: Extensions?!

Author: Don Dailey

Date: 12:49:58 01/13/98

Go up one level in this thread


>This is not perfect.  I should probably normalize the numbers somehow so
>that problems in which both of the programs finish very close to the
>maximum allowable times don't get more weight than those in which
>neither of them can quite finish that last ply.  Also, this doesn't take
>into account that one of the versions might be getting closer to the
>real answer, and therefore is taking more time per ply.  And finally, I
>have had a problem with disk caching -- the second run on any given
>night usually goes faster than the first one, so when I run these
>suites, some of the results are a little bogus.

All of this stuff is a mess.  I don't think the way problem sets are
typically scored make much sense.   They should give credit for quicker
solutions in my opinion not just total solved in less than x minutes.

I believe the solution times should be an important factor.  The tests
should be run long enough so that  getting a solution late gives very
little credit and is basically equivalent to not solving it at all.
This is not a perfect solution either but helps with the phenomenon of
solving 1 second later than the specified time.  Ideally you should be
required to solve every problem but this is not a practical solution.
There should also be a minimum
solution time of something like 1 second because of i/o problems.  My
program for instance may solve a simple problem in 0.1 or 0.2 seconds
randomly.  I would normally give a lot of weight to solving something
twice as fast but not in this case.

If everyone agreed on a simple but more sensible method of scoring any
problem set we could talk and compare numbers more meaningfully than
we do now.   I'm not saying there wouldn't still be problems though,
there is the issue of do you wait to see if it keeps the solution,
are there multiple solutions etc.

Another problem with more complex scoring methods is that until people
understand them, the numbers are even more ambiguous to people.  Saying
I solve 240 Win at Chess in less than 2 minutes is at least something
you can understand immediately.  But too much information is thrown
away.

But you get the idea.  If I make the program 10 percent faster with
no other side effects it might not show up at all in some problem
set unless it just happens to pick up a problem or two.

- Don





This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.