Author: Anthony Cozzie
Date: 07:12:00 05/25/04
Go up one level in this thread
>>>Yes. I have inlined FirstOne()/LastOne() to use the 64 bit AMD BSF/BSR >>>instructions. There were several changes dealing with updating global shared >>>data that were made to cut down on cache-to-cache (MOESI) transactions and >>>overhead. There were changes made to make local data be allocated in a >>>processor's local memory to decrease access time. Etc... I know cache-cache is traditionally slow, but I thought that on the opteron it was fast, what with the hypertransport link and all. I need to review my multiprocessor notes, but it seems logical that CPU -> CPU >= CPU -> CPU -> MEMORY anthony
This page took 0 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.