Author: Enrique Irazoqui
Date: 04:50:56 12/31/99
Go up one level in this thread
On December 30, 1999 at 22:51:14, John Warfield wrote:
>
>
> My question is simple curiosity, Is it really possible for this so-called
>hidden Test of Dr enriques to accurately predict how a program will perform on
>the ssdf. I find this difficult to believe, there seems to be alot of viarables
>to deal with, how would a simple test set, perdict precisely how fritz6 or tiger
>will score. I am open to be educated here. If this test really exist I would
>love to get my hands on it, So Dr Enrique if you read this please send me the
>test, or let me know when it will be availble . Thanks
I am open to be educated too. :)
This test exists and by now has 133 positions, all tactical, unambiguous, not
included before in any test, therefore not cooked. The fact that so far it shows
results very similar to the SSDF list came as a complete surprise to me. I don't
trust positional tests, and what I wanted to get out of my tactical suite when I
started building it was the difference between a tactical test and the SSDF
list. I thought that with this I could see the value of non tactical stuff in a
program. After running this test with some 30 programs, I was very, very
surprised to see that ratings obtained with a tactical test and comp-comp games
are basically the same, at least so far.
As I said in other posts, any programmer can come with a version of his program
optimized for tactics and such a program would do better in a test than in
games. But since I test released, commercial programs tuned for real life and
not for tests, my test is nod being fooled.
So far it works, but... I ran this test with Junior 6 and Shredder 4, and in my
opinion both programs scored less well than they should, according to what I see
when they play, and I trust what I see better than any tests, including mine. I
am extremely curious to see what will be the rating of J6 and S4 in the SSDF
list. In case there is a big difference with my test, it will be interesting to
know why these two programs are the only ones so far to do better in games than
in a tactical test. Maybe, after all, my initial purpose will work and we will
be able to see this difference tactical - not tactical (call it positional,
strategic, whatever, but without a direct impact in the speed up of the search).
Explaining this will be difficult, at least for me.
(I hope this post is not too messy. While writing it I am instaling things in
the new computer)
I got the following results of the last programs:
Test SSDF scale
RT 12 2695
T12-dos 0 2683
CM6K -10 2673
N732 -20 2663
F532 -21 2662
F6a -22 2661
H732 -32 2651
J6 -53 2630
J5 -58 2625
S4 -69 2614
Enrique
This page took 0.01 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.