Computer Chess Club Archives




Subject: Re: SSE2 bit[64] * byte[64] dot product

Author: Gerd Isenberg

Date: 12:33:33 07/18/04

Go up one level in this thread

>I am guessing something like 50 cycles?  Really not that bad . . . probably
>close to the speed of a scan over attack tables.

14.45ns on a 2.2GHz Athlon64, ~32 cycles now.

Some minor changes, byte vector values (weights) 0..63, therefore only one
psadbw, no movd but two pextrw, final add with gp. Computed bit masks in two
xmm-registers (0x02:0x01). Some better instruction scheduling.


This page took 0.04 seconds to execute

Last modified: Thu, 07 Jul 11 08:48:38 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.