Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: 64-way Parallel FP Chip

Author: Robert Hyatt

Date: 07:51:29 10/15/03


I blew one bit of the previous calculation.  The C90 is a "super-scalar" sort
of vector machine.  Where I said "one floating add per cycle" change that to
two.  A single vector instruction does _two_ operations per cycle, not one, and
I had simply failed to note that.  That was the main change from the older
X-MP and Y-MP, that was introduced on the C90.  Obviously it makes vector
performance 2x faster even without the clock speed improvement.  IE for my
example:

      v0    v1+v2
      v3    v4+v5
      v6    v0*v3

that code will produce _six_ results per cycle, once the chained vector
pipeline is filled.  Not the _three_ I had given.

_that_ is why the Cray buries the PC in _any_ program that can use vectors.
Even though the C90 only runs at 250 mhz.  The T90 runs that up to 500mhz,
and the Cray-3 doubled it again to 1ghz.  But all mhz/ghz are _not_ created
"equal" for those that understand vector operations.

The C90 is a 250mhz machine, not the 100 Vincent pulls from you-know-where.
But no 2500mhz 80x86 can produce 6 64-bit IEEE floating point operations
every 4 nanoseconds.

I don't know how to explain it better to someone that simply doesn't have a
single scintilla of background on understanding the concept of "a vector
machine."

P4? Anthony Cozzie 08:50:10 10/15/03
- Re: P4? Robert Hyatt 10:26:11 10/15/03
  - Re: P4? Eugene Nalimov 10:33:36 10/15/03
    - Re: P4? Gerd Isenberg 11:54:35 10/15/03
      - Re: P4? Gerd Isenberg 06:49:06 10/16/03
        
        Re: P4? Robert Hyatt 08:43:05 10/16/03
        
        Re: P4? Gerd Isenberg 10:23:40 10/16/03
        
        Re: P4? Robert Hyatt 20:02:12 10/16/03
    - Re: P4? Eugene Nalimov 10:37:08 10/15/03
      - Re: P4? Robert Hyatt 13:16:06 10/15/03

This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.