Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: [Q] What is Genius' speed?

Author: Moritz Berger

Date: 08:23:15 08/11/98

Go up one level in this thread


Posted by fca on August 11, 1998 at 08:45:55:
>
>In Reply to: Re: [Q] What is Genius' speed? posted by Tom Kerrigan on August 11, 1998 at 08:06:36:
>
>
>On August 11, 1998 at 08:06:36, Tom Kerrigan wrote:
>
>>Aside from software differences, the Pentium MMX/200 has a 66MHz L2 cache
>>(possibly smaller than 512k) whereas the Pentium II/300 has a 512k 150MHz L2
>>cache.
>
>Are you sure, Tom?  I thought the 1/2 speed clock for the L2 only applied to
>P2's of 350MHz or faster (excepting Xeons, which are of course full speed
>L2-ers).  If so, 100 MHz >> 66 MHz

PII 2nd level cache speed frequency = core frequency / 2 for all PII (i.e.
PII-233 2nd level cache at 116.5 Mhz)

>>If a program really bangs on the L2 cache, it will go much faster on the
>>Pentium II.
>
>Surely. Let us test that this is the cause for the differences blass reports.
>
>(a) Does it follow in broad terms that the higher the nps, the more hash
>activity (but what about other tables?), therefore more L2-dependence (L1 deemed
>to be too small to have too much influence)?

Higher NPS -> higher memory throughput -> less cache efficiency (just imagine a
program that uses e.g. 64 KB hash and "lives" completely in the 512 KB 2nd level
cache of a PII in comparison with Fritz which fills up hundreds of megabytes of
hash tables in a couple of minutes).

>(b) Is the P2/300 : P200MMX ratio even higher for F5 (which I take it is
>accepted is significantly higher in nps terms than J?)

Junior 5 peaks out at slightly above 400kN/s on my PII400. So it's not
"significantly" slower in terms of NPS than Fritz 5.

>If answer for (a) is Yes and answer for (b) is No, the cause is liable to be
>something else.

Something else could be:
a) 16 bit code vs. 32 bit code
b) optimization for P5 pipelines (superscalar design, executing multiple
instructions at once)
c) segment boundary alignment (affects P5 and P6 not in the same way)
d) branch prediction logic of the CPU, assumptions about this in hand crafted
assembler code
e) use of certain "string" (i.e. affecting a sequence of bytes) instructions and
preferred primitive data types (32 bit integers, 16 bit integers, etc.) - P5 and
P6 speed optimizations are often mutually exclusive
f) e.g. using the FPU to initialize hash tables - more than 2x faster on a P5
than using integer moves, about as fast on PII, much worse on K6.
g) add your own favourite difference in processor architectures of P5 (Pentium,
Pentium MMX) and P6 (Pentium Pro, PII)

Saludos

Moritz

P.S: Even different steppings (read: releases) of the Pentium MMX exhibit
massive differences in execution speed of certain commands (stepping 4 several
times slower (on some commands) than stepping 3 comes to my mind), so that's
another distracting factor we have to take into account.



This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.