Author: Gian-Carlo Pascutto
Date: 00:17:44 05/07/04
Go up one level in this thread
On May 06, 2004 at 19:03:48, martin fierz wrote:
>point #3 is perhaps most important for the bob vs vincent duel: the standard
>error for a 4 CPU test run is on the order of 0.2. if vincent's tests were with
>a similarly small number of positions, then the differences measured in these
>experiments (2.8 / 3.0 / 3.1) are statistically insignificant, and the whole
>argument is pointless :-)
Not recessarily - disabling nullmove produced a result with the standard errors
halved in my results. That would still allow a significant conclusion.
If I assume your 0.2 is a 2SD number (95%), your results are compatible and
running a non-nullmove test could still produce the same result. If 0.2 is a
1SD number then for some reason your results were much more variable then mine.
n speedup error (1SD)
------------------------------------------
Nullmove 38 2.82 +- 0.101
No-nullmove 39 3.07 +- 0.056
--
GCP
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.