Computer Chess Club Archives


Subject: This test is not scientific!

Author: Don Dailey

Date: 09:55:19 01/26/99


Hi Dan,

We appreciate the results you posted.   But none of us is being
very consistent about the way we test, so we cannot draw any solid
conclusions from your test.   For instance, you call it a match
only if the move matches at the 60 second point.

Bruce wants to do a faster version of the test too, but this
is more or less meaningless.  You cannot run a short version
of the test and then say, "See, I only got a 40% match rate, but
Bob's Crafty at 10 minutes on a dual matches 98%."   Yours
is not only too short, but you are using a very strict matching
rule.  I would guess that even Bionic itself (or whatever ran
at the tournament) would get a poor match rate under these
conditions.

So let's do this test correctly.  If time is an issue (it is
certainly a time-consuming test, as Bruce said), then we should
start with 1 game and go from there.

I propose we use Bruce's EPD data from the first game,  run the
test to a very deep level (at least equivalent to a 1000 MHz
machine running at 10 minutes per position) and count a match at ANY
POINT AFTER the first 2 ply.   If we don't do this, we cannot
make any claims about the results, and all the error would be on the
side of hanging Bionic, which is not fair in my opinion.   If the
methodology we use has errors, they should be in favor of Bionic, not
the other way around.
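
To make the difference between the two matching rules concrete, here is
a minimal sketch.  It assumes the engine's search can be logged as a list
of (depth in ply, preferred move) pairs, one per iteration; the function
names, the move strings and the sample history are all hypothetical and
do not come from any program mentioned in this thread.  "After the first
2 ply" is read here as "any iteration deeper than 2 ply", which is only
one possible interpretation.

```python
from typing import List, Tuple

# One entry per search iteration: (depth in ply, move preferred at that depth).
SearchHistory = List[Tuple[int, str]]


def strict_match(history: SearchHistory, played: str) -> bool:
    """Strict rule: count a match only if the move preferred at the
    end of the search is the move that was actually played."""
    return bool(history) and history[-1][1] == played


def lenient_match(history: SearchHistory, played: str, skip_plies: int = 2) -> bool:
    """Lenient rule: count a match if the played move was preferred at
    any iteration deeper than `skip_plies` (assumed reading of
    "any point after the first 2 ply")."""
    return any(move == played for depth, move in history if depth > skip_plies)


# Hypothetical search history for a single position.
history = [(1, "e2e4"), (2, "d2d4"), (3, "g1f3"), (4, "g1f3"), (5, "d2d4")]

print(strict_match(history, "g1f3"))   # False: the final choice was d2d4
print(lenient_match(history, "g1f3"))  # True: g1f3 was preferred at depths 3 and 4
```

Under the lenient rule, any error in the methodology counts in favor of
the program being tested, since a move only has to be preferred at some
point during the search.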

- Don





On January 26, 1999 at 09:19:17, Dan Homan wrote:

>I ran the first 11 EPD files (those for Bionic's side) last night.  I'll
>run the remaining 11 (for Bionic's opponents) tonight.  Here are my
>intermediate results:
>
>Program: EXchess v2.53
>CPU: 400 MHz Celeron
>Hash: 24 MB
>Solution Time: 60 seconds
>
>Moves were counted "right" if they would have been played at the end of the
>60 second search period.
>
>==> rd01b.epd <==
>"The King vs Bionic 4.01"; Right = 34/60             57%
>==> rd02b.epd <==
>"Bionic Impakt vs Nimzo99"; Right = 13/38            34%
>==> rd03b.epd <==
>"Alexs vs Bionic Impakt"; Right = 22/30              73%
>==> rd04b.epd <==
>"Bionic Impakt vs Ant"; Right = 25/40                63%
>==> rd05b.epd <==
>"Diep vs Bionic Impakt"; Right = 10/17               59%
>==> rd06b.epd <==
>"Bionic Impakt vs Kallisto II"; Right = 32/44        73%
>==> rd07b.epd <==
>"Cilkchess vs Bionic Impakt"; Right = 31/53          58%
>==> rd08b.epd <==
>"Arthur vs Bionic Impakt"; Right = 23/47             49%
>==> rd09b.epd <==
>"Bionic Impakt vs BugChess"; Right = 25/32           78%
>==> rd10b.epd <==
>"Bionic Impakt vs Morphy 3.0"; Right = 16/22         73%
>==> rd11b.epd <==
>"Delta vs Bionic Impakt"; Right = 12/22              55%
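
For a single overall figure, the per-game counts quoted above can be
pooled.  This is only a small sketch; the variable names are illustrative
and the (right, total) pairs are copied from the quoted results, which
use the strict 60 second rule.

```python
# (right, total) pairs copied from the quoted results, rd01b through rd11b.
results = [
    (34, 60), (13, 38), (22, 30), (25, 40), (10, 17), (32, 44),
    (31, 53), (23, 47), (25, 32), (16, 22), (12, 22),
]

right = sum(r for r, _ in results)
total = sum(t for _, t in results)
overall = right / total

print(f"{right}/{total} = {overall:.0%}")  # 243/405 = 60%
```

The pooled rate weights each game by its number of positions, so it is
not the same as averaging the eleven percentages.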


