Author: Jeremiah Penery
Date: 22:45:53 08/29/03
Go up one level in this thread
On August 29, 2003 at 23:45:44, Eugene Nalimov wrote: >On August 29, 2003 at 23:18:43, Jeremiah Penery wrote: > >>On August 29, 2003 at 22:19:59, Eugene Nalimov wrote: >> >>>Sorry, from SPEC Rate I see only that *independent* processes scales well. Any >>>OS with minimal NUMA knowledge will allocate data for each processor in its >>>local memory, thus totally avoiding such problems. >>> >>>And are you sure that NUMA system that for independent processes "scales almost >>>as well" as shared-bus shared-memory system is really good achievement? >> >>I used the words "nearly as well" without checking. In CINT Rate, Opteron (from >>what I could tell) at 1.8GHz scales at somewhere over 95% from 1 to 4 >>processors. Itanium2 scales at 97+% in IntRate. So Opteron is marginally worse >>here. >>In CFPRate, however, the situation is much different. Opteron scales (from >>different sets of numbers) still around 90%, while Itanium2 drops horrifically >>all the way to near 70% efficiency. > >Itanium2 started from much higher initial score, so it saturated available It started from higher initial score because its architecture is almost singularly favorable to FP code. >bandwidth earlier. SpecFP applications like lot of bandwidtdh -- trust me... I That's really one of the drawbacks to SPECFP, IMO. It relies too much on high bandwidth. >believe hypothetical (much) higher-clocked Opteron with FP performance >approaching Itanium's will also have worse scaling. When adding more CPUs, Opteron's aggregate bandwidth increases linearly with the number of processors, while Itanium's remains more or less constant (for the size of systems we're talking about - 2/4 CPUs). >In any case, 4-CPU SpecFP rate Itanium2 scores are still much better than >Opteron scores. And I don't see 16/32/64 Opteron scores at SPEC web site, >probably you can explain why :-) ? Because Opteron is about 2 months old. There were not scores for 16/32/64 CPU Itaniums until Itanium2 came out, which was at least a year after the original. I see 2CPU score on CFPRate of around 13 for first generation Itanium. Should you base the performance of all Itanium(2) chips by that? >Please notice: I don't have anything against Opteron/Athlon64. Very good CPUs >with solid performance. I just strongly react to all the hype... > >>Remember also that this is Itanium2 with good compiler support against first >>generation Opteron without even decent AMD64 compiler (GCC is not horrible, but >>ICC 32-bit is still much faster - PGO's compiler apparently blows goats for >>compiling SPEC for AMD64, see >>http://www.digit-life.com/articles2/insidespeccpu/insidespeccpu2000-opteron2.html >>). > >Till recently everybody was saying "EPIC needs compiler breakthrough that cannot >be achieved, while OOO CPUs don't need that breakthrough". And now it happens >that EPIC compiler already there... I did not say that. But it did take a very large investment in new compiler technology to make Itanium performance competitive. Even now, the 'magical' HP IA64 compiler produces scores that are far greater than other Itanium compilers. I still think it has a long way to go before maximizing the capability, too. x86 compilers are several generations old and still improving, while IPF compilers are more like 2 generations old. IPF compilers today are not bad, but they can probably be much better.
This page took 0.01 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.