Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: SSE2 bit[64] * byte[64] dot product

Author: Gerd Isenberg

Date: 12:33:33 07/18/04

Go up one level in this thread



>I am guessing something like 50 cycles?  Really not that bad . . . probably
>close to the speed of a scan over attack tables.
>
>anthony

14.45ns on a 2.2GHz Athlon64, ~32 cycles now.

Some minor changes, byte vector values (weights) 0..63, therefore only one
psadbw, no movd but two pextrw, final add with gp. Computed bit masks in two
xmm-registers (0x02:0x01). Some better instruction scheduling.

Gerd




This page took 0.01 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.