Author: Robert Hyatt
Date: 13:45:37 12/17/02
Go up one level in this thread
On December 17, 2002 at 11:46:57, Vincent Diepeveen wrote: >On December 17, 2002 at 11:29:19, Robert Hyatt wrote: > >hello Bob, > >please do the same tests i did with DIEP too with crafty. > >Of course as you always say that doing a few tests proof nothing, >please repeat them twice. > >For me doing a test twice with crafty is sufficient. I generally say "your doing a few tests proves nothing." "Why?" you ask? Because you seem incapable of understanding simple ideas. Remember your nonsense about "on a dual Crafty runs _no_ faster at all here..."??? So if I remain a bit suspicious of any number(s) you report, there is a reason for it... I ran 24 positions twice and reported the NPS for 1 thread, no SMT, two threads, no SMT, three threads, SMT on, and four threads, SMT on. What more can I run??? > >I am especially interested in the completed logs too so that we all can >see what mainline you took to compare the speedup and absolute speeds >in nps. All I reported was NPS. I'm not going to post such a huge wad of output here. I will take one position from each of the four tests and give you those. I have no idea what you mean by "what mainline you took to compare the speedup and absolute speeds in NPS." I didn't take _any_ mainline. I didn't report _any_ speedup. I only reported the increase in raw NPS numbers. So the rest of your query simply makes no sense to me. This is not about parallel search efficiency. It is about whether SMT speeds things up or not, and the answer (so far) is clearly "yes it does." This is the last position from the 24 I ran. It is one of the Kopec positions but It doesn't say which one. The position is this (FEN): 3rn2k/ppb2rpp/2ppqp2/5N2/2P1P3/1P5Q/PB3PPP/3RR1K1 w Run 1. one thread, no SMT: White(1): move clearing hash tables time surplus 0.00 time limit 166:39 (166:39) depth time score variation (1) 1 0.00 0.81 1. Bd4 1-> 0.00 0.81 1. Bd4 2 0.00 0.73 1. Bd4 Bb6 2-> 0.00 0.73 1. Bd4 Bb6 3 0.00 -- 1. Bd4 3 0.00 0.29 1. Bd4 g6 2. Nh6 Qxh3 3. gxh3 3 0.01 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 3 0.01 0.63 1. Qf3 g6 2. Nh6 3-> 0.01 0.63 1. Qf3 g6 2. Nh6 4 0.01 0.53 1. Qf3 g6 2. Nh6 Re7 4 0.01 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 4-> 0.04 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 5 0.04 ++ 1. Nh6!! 5 0.05 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 5-> 0.06 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 6 0.06 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 6-> 0.10 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 7 0.11 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 7-> 0.16 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 8 0.20 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 8-> 0.33 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 9 0.37 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 9-> 0.54 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 10 0.61 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 10-> 1.49 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 11 1.62 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 11-> 3.93 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 12 4.22 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 12-> 7.90 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 13 8.84 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 13-> 25.57 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 time=25.57 cpu=99% mat=0 n=31129467 fh=94% nps=1217k ext-> chk=917605 cap=59244 pp=3275 1rep=119119 mate=34241 predicted=0 nodes=31129467 evals=3804978 endgame tablebase-> probes done=0 successful=0 SMP-> split=0 stop=0 data=0/64 cpu=25.50 elap=25.57 Run 2: two threads, no SMT: depth time score variation (1) starting thread 1 1 0.00 0.81 1. Bd4 1-> 0.00 0.81 1. Bd4 2 0.00 0.73 1. Bd4 Bb6 2-> 0.00 0.73 1. Bd4 Bb6 3 0.00 -- 1. Bd4 3 0.00 0.29 1. Bd4 g6 2. Nh6 Qxh3 3. gxh3 3 0.00 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 3 0.01 0.63 1. Qf3 g6 2. Nh6 3-> 0.07 0.63 1. Qf3 g6 2. Nh6 4 0.07 0.53 1. Qf3 g6 2. Nh6 Re7 4 0.07 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 4-> 0.10 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 5 0.10 ++ 1. Nh6!! 5 0.11 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 5-> 0.14 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 6 0.15 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 6-> 0.17 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 (s=4) 7 0.18 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=3) 7-> 0.35 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=6) 8 0.37 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=5) 8-> 0.50 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=4) 9 0.57 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=3) 9-> 0.73 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=5) 10 0.78 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=4) 10-> 1.38 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=8) 11 1.49 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=7) 11-> 2.98 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=10) 12 3.19 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=9) 12-> 6.02 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=9) 13 6.70 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=8) 13-> 18.42 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=12) time=18.42 cpu=188% mat=0 n=34809013 fh=94% nps=1889k ext-> chk=992883 cap=70671 pp=5213 1rep=135350 mate=27885 predicted=0 nodes=34809013 evals=4659923 endgame tablebase-> probes done=0 successful=0 SMP-> split=761 stop=38 data=7/64 cpu=34.74 elap=18.42 Run three: three threads, SMT _on_: White(1): move clearing hash tables time surplus 0.00 time limit 166:39 (166:39) depth time score variation (1) starting thread 1 starting thread 2 1 0.00 0.81 1. Bd4 1-> 0.01 0.81 1. Bd4 2 0.01 0.73 1. Bd4 Bb6 2-> 0.01 0.73 1. Bd4 Bb6 3 0.01 -- 1. Bd4 3 0.01 0.29 1. Bd4 g6 2. Nh6 Qxh3 3. gxh3 3 0.01 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 3 0.02 0.63 1. Qf3 g6 2. Nh6 3-> 0.08 0.63 1. Qf3 g6 2. Nh6 4 0.08 0.53 1. Qf3 g6 2. Nh6 Re7 4 0.09 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 4-> 0.10 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 5 0.11 ++ 1. Nh6!! 5 0.11 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 5-> 0.16 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 6 0.17 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 6-> 0.19 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 (s=4) 7 0.20 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=3) 7-> 0.24 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=7) 8 0.26 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=6) 8-> 0.36 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=4) 9 0.38 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=3) 9-> 0.50 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=5) 10 0.54 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=4) 10-> 1.07 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=8) 11 1.16 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=7) 11-> 2.66 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=10) 12 2.86 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=9) 12-> 5.40 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=8) 13 6.02 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=7) 13-> 17.69 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=13) time=17.69 cpu=296% mat=0 n=38066840 fh=94% nps=2151k ext-> chk=1076447 cap=74646 pp=5437 1rep=149438 mate=35093 predicted=0 nodes=38066840 evals=5253700 endgame tablebase-> probes done=0 successful=0 SMP-> split=2994 stop=217 data=15/64 cpu=52.47 elap=17.69 Run four: four threads, SMT _on_: White(1): move clearing hash tables time surplus 0.00 time limit 166:39 (166:39) depth time score variation (1) starting thread 1 starting thread 2 starting thread 3 1 0.00 0.81 1. Bd4 1-> 0.03 0.81 1. Bd4 2 0.06 0.73 1. Bd4 Bb6 2-> 0.10 0.73 1. Bd4 Bb6 3 0.10 -- 1. Bd4 3 0.11 0.29 1. Bd4 g6 2. Nh6 Qxh3 3. gxh3 3 0.11 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 3 0.11 0.63 1. Qf3 g6 2. Nh6 3-> 0.14 0.63 1. Qf3 g6 2. Nh6 4 0.14 0.53 1. Qf3 g6 2. Nh6 Re7 4 0.14 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 4-> 0.15 0.58 1. Nh6 Re7 2. Qxe6 Rxe6 5 0.16 ++ 1. Nh6!! 5 0.19 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 5-> 0.20 2.19 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 6 0.20 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 6-> 0.28 2.40 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 (s=4) 7 0.35 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=3) 7-> 0.43 2.32 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Bd4 Bb6 (s=7) 8 0.45 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=6) 8-> 0.60 2.31 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 b5 5. cxb5 cxb5 (s=4) 9 0.65 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=3) 9-> 0.76 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 Ke6 (s=5) 10 0.83 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=4) 10-> 1.36 2.41 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Ba5 5. Rg1 b5 6. cxb5 cxb5 (s=8) 11 1.46 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=7) 11-> 3.06 2.37 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Rg1 d5 7. cxd5 cxd5 (s=10) 12 3.27 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=9) 12-> 5.66 2.46 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Rd7 5. Kf3 Re7 6. a4 Bb6 7. Rg1 (s=8) 13 6.34 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=7) 13-> 16.09 2.34 1. Nh6 Qxh3 2. Nxf7+ Kg8 3. gxh3 Kxf7 4. Kg2 Bb6 5. Kf3 Nc7 6. Ba3 Ke7 7. Bb4 Ne6 (s=13) time=16.09 cpu=377% mat=0 n=36609790 fh=94% nps=2275k ext-> chk=1035481 cap=72332 pp=4034 1rep=144147 mate=40133 predicted=0 nodes=36609790 evals=4935572 endgame tablebase-> probes done=0 successful=0 SMP-> split=5501 stop=394 data=16/64 cpu=1:00 elap=16.09 >>>>parallel search overhead, you have a problem on _normal_ SMP machines as well. >>> >>>Indeed it is true that the first seconds the HT/SMT gives big problems >>>in speed. Only after a couple of minutes the speed shows. I see only >>>a speedup after a minute or 3 each position. >> >>So? That is _your_ program's results. Mine are just like they have always >>been. I get a reasonable speedup whether it is one second per move or one >>hour per move. No difference. >> >> >>> >>>I need to add however that i could improve a few issues in this version >>>which could get that down to 1 minute but like you i doubt whether the 11.4% >>>of HT is worth it. >> >>11.4% is _always_ worth it IMHO... >> >> >>> >>>I prefer a dual AMD instead for the moment! >>> >> >> >>Nothing wrong with that. I got the dual xeon because I wanted a chance to play >>with the SMT stuff since it is obviously going to be "the future" of >>microprocessor >>architecture...
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.