Computer Chess Club Archives




Subject: Re: SSE2 bit[64] * byte[64] dot product

Author: Tony Werten

Date: 23:36:05 07/19/04


On July 18, 2004 at 15:33:33, Gerd Isenberg wrote:

>>I am guessing something like 50 cycles?  Really not that bad . . . probably
>>close to the speed of a scan over attack tables.
>14.45ns on a 2.2GHz Athlon64, ~32 cycles now.
>Some minor changes, byte vector values (weights) 0..63, therefore only one
>psadbw, no movd but two pextrw, final add with gp. Computed bit masks in two
>xmm-registers (0x02:0x01). Some better instruction scheduling.

Nice, I'll think about it when I start a rewrite for 64-bit :(
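
For anyone following along, here is a rough C sketch of the operation being discussed: the sum of weights[i] over every set bit i of a 64-bit board. It is not Gerd's routine, which is hand-scheduled assembly, only my reading of the description above. The names dot64, dot64_scalar, dot64_sse2 and expand16 are made up for illustration, and the SSE2 path assumes every weight is in 0..63, so four byte vectors can be added without overflow (4 * 63 = 252) and a single psadbw is enough.

```c
#include <stdint.h>

#if defined(__SSE2__) || defined(_M_X64)
#include <emmintrin.h>
#define HAVE_SSE2 1
#endif

/* Portable reference: sum of weights[i] for every set bit i of bb. */
static unsigned dot64_scalar(uint64_t bb, const uint8_t weights[64])
{
    unsigned sum = 0;
    for (int i = 0; i < 64; ++i)
        if ((bb >> i) & 1)
            sum += weights[i];
    return sum;
}

#ifdef HAVE_SSE2
/* Expand 16 bits into 16 bytes: 0xFF where the bit is set, 0x00 otherwise. */
static __m128i expand16(unsigned bits)
{
    /* Byte j of each 64-bit half selects bit j (0x01, 0x02, ..., 0x80). */
    const __m128i sel = _mm_set1_epi64x((long long)0x8040201008040201ULL);
    __m128i lo = _mm_set1_epi8((char)(bits & 0xFF));
    __m128i hi = _mm_set1_epi8((char)((bits >> 8) & 0xFF));
    __m128i v  = _mm_unpacklo_epi64(lo, hi); /* bytes 0..7 = lo, 8..15 = hi */
    return _mm_cmpeq_epi8(_mm_and_si128(v, sel), sel);
}

/* Assumes every weight is in 0..63 so the byte adds below cannot overflow. */
static unsigned dot64_sse2(uint64_t bb, const uint8_t weights[64])
{
    __m128i acc = _mm_setzero_si128();
    for (int k = 0; k < 4; ++k) {
        __m128i w = _mm_loadu_si128((const __m128i *)(weights + 16 * k));
        __m128i m = expand16((unsigned)(bb >> (16 * k)) & 0xFFFFu);
        acc = _mm_add_epi8(acc, _mm_and_si128(w, m));
    }
    /* One psadbw: sums bytes 0..7 into word 0 and bytes 8..15 into word 4. */
    __m128i s = _mm_sad_epu8(acc, _mm_setzero_si128());
    /* Two pextrw and a final add in a general-purpose register. */
    return (unsigned)_mm_extract_epi16(s, 0) + (unsigned)_mm_extract_epi16(s, 4);
}
#endif

/* Uses the SSE2 path when available, else the scalar loop. */
static unsigned dot64(uint64_t bb, const uint8_t weights[64])
{
#ifdef HAVE_SSE2
    return dot64_sse2(bb, weights);
#else
    return dot64_scalar(bb, weights);
#endif
}
```

The scalar loop is handy as a cross-check for the SSE2 path. If the weights can exceed 63, the byte adds can wrap, and each 16-byte chunk would need its own psadbw.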


