Computer Chess Club Archives

Search

Terms

Messages

Subject: Re: Matt Taylor's magic de Bruijn Constant

Author: Andrew Williams

Date: 04:43:26 07/13/03

Probably a stupid question, but if you have a loop which does just LastOne with
this scheme, wouldn't the table get cached and therefore make it go much faster
than in a test where it's embedded in your code?

AW



On July 13, 2003 at 06:57:48, Bas Hamstra wrote:

>Coicidentally I am working a bit on LastOne too right now. I also have a Athlon
>XP (like you I believe). So BSF isn't too fast. The question is: can you
>outperform it? At first it seems so, in a zillion times loop the following code
>appeared to be nearly a factor 3 faster than the BSF routine below:
>
>const char Tabel[64] =
>		{	0, 0, 0,15, 0, 1,28, 0,16, 0, 0, 0, 2,21,29, 0,
>			0, 0,19,17,10, 0,12, 0, 0, 3, 0, 6, 0,22,30, 0,
>			14, 0,27, 0, 0, 0,20, 0,18, 9,11, 0, 5, 0, 0,13,
>			26, 0, 0, 8, 0, 4, 0,25, 0, 7,24, 0,23, 0,31, 0
>		};
>
>
>__forceinline int LastOne2(unsigned __int64 Bits)
>
>{	unsigned int *p = (unsigned int*) &Bits;
>	unsigned int Short;
>
>	if(p[0])
>	{	Short = p[0];
>		Short = Short ^ (Short-1);
>		Short *= 7*255*255*25;
>		Short >>= 26;
>		return Tabel[Short&63];
>	}
>	Short = p[1];
>	Short = Short ^ (Short-1);
>	Short *= 7*255*255*255;
>	Short >>= 26;
>	return Tabel[Short&63]+32;
>}
>
>VC produces very efficient asm code for it, as far as I can judge. However how
>about the overall performance in the program? It seems to be noticably slower
>than plain BSF, like below:
>
>__forceinline int LastOne(BB M)
>{   __asm
>    {       mov EAX, dword ptr [M]
>            XOR EDX, EDX
>            cmp EAX, 0
>            jnz L2
>            mov EAX, dword ptr [M+4]
>            add EDX, 32
>        L2: bsf EAX, EAX
>            ADD EAX, EDX
>    }
>}
>
>Both routines have an extra branch to test the first 32 bits, so that is not the
>problem!?
>
>
>Regards,
>Bas.
>
>
>
>
>
>
>
>
>On July 13, 2003 at 06:03:00, Gerd Isenberg wrote:
>
>>Hi all,
>>
>>Matt Taylor's invention of the super magic de Bruijn Constant 0x78291ACF is of
>>course usefull for yet another, but fast and portable _firstOne_ Bitscan routine
>>with only one 32bit multiplication!
>>
>>
>>int firstOneKey(BitBoard bb)
>>{
>>	BitBoard lsb = bb ^ (bb - 1);
>>	unsigned int foldedLSB = ((int) lsb) ^ ((int)(lsb>>32));
>>	return (foldedLSB * 0x78291ACF) >> (32-6); // range is 0..63
>>       // to get the bit index, a table lookup is required
>>}
>>
>>
>>Because i'am still looking for a fast key-fuction for two single populated move
>>bitboards, i tried the obvious approach to use Matt's routine twice. It works
>>like firstOneKey(from)*64 + firtsOneKey(to) and therefore maps to a 64*64-range.
>>
>>int getMoveKey(BitBoard frbit, BitBoard tobit)
>>{
>>	frbit -= 1; tobit -= 1;
>>	unsigned int foldedfr = ((int) frbit)  ^ ((int)(frbit>>32));
>>	unsigned int foldedto = ((int) tobit)  ^ ((int)(tobit>>32));
>>	return ((foldedfr*0x78291ACF) ^ ((foldedto*0x78291ACF) >> 6)) >> 20;
>>}
>>
>>x86 assembler output of this routine:
>>
>>00401435 83 C6 FF             add         esi,0FFFFFFFFh
>>00401438 83 D1 FF             adc         ecx,0FFFFFFFFh
>>0040143B 83 C2 FF             add         edx,0FFFFFFFFh
>>0040143E 83 D0 FF             adc         eax,0FFFFFFFFh
>>00401441 33 CE                xor         ecx,esi
>>00401443 33 C2                xor         eax,edx
>>00401445 69 C9 CF 1A 29 78    imul        ecx,ecx,78291ACFh
>>0040144B 69 C0 CF 1A 29 78    imul        eax,eax,78291ACFh
>>00401451 C1 E8 06             shr         eax,6
>>00401454 33 C1                xor         eax,ecx
>>00401456 C1 E8 14             shr         eax,14h
>>
>>only 36 Bytes - but two 32-bit multiplications.
>>
>>Two questions:
>>Any ideas to it better, eg. only by one multiplication?
>>Why are the unsigned multiplications translated to "imul" by the compiler?
>>
>>Thanks in advance,
>>Gerd

Re: Matt Taylor's magic de Bruijn Constant Russell Reagan 09:17:19 07/13/03
- Re: Matt Taylor's magic de Bruijn Constant Andrew Williams 09:42:37 07/13/03
  - Re: Matt Taylor's magic de Bruijn Constant Bas Hamstra 10:17:56 07/13/03
    - Re: Matt Taylor's magic de Bruijn Constant Russell Reagan 14:10:10 07/13/03
      - Re: Matt Taylor's magic de Bruijn Constant Vincent Diepeveen 07:54:49 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Gerd Isenberg 12:33:37 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Vincent Diepeveen 03:20:28 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Robert Hyatt 06:30:36 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Robert Hyatt 13:07:27 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Matt Taylor 14:45:21 07/20/03
        
        Re: Matt Taylor's magic de Bruijn Constant Gerd Isenberg 09:06:30 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Matt Taylor 14:37:26 07/20/03
        
        Re: Matt Taylor's magic de Bruijn Constant Robert Hyatt 10:27:28 07/15/03
        
        Source code to measure it Vincent Diepeveen 03:24:58 07/15/03
        
        Re: Source code to measure it - results Gerd Isenberg 12:24:19 07/15/03
        
        Re: Source code to measure it - results Vincent Diepeveen 17:19:34 07/15/03
        
        Re: Source code to measure it - results Jeremiah Penery 18:13:23 07/15/03
        
        Re: Source code to measure it - results Andrew Dados 22:39:34 07/15/03
        
        Re: Source code to measure it - results Robert Hyatt 09:08:58 07/16/03
        
        Re: Source code to measure it - results Tony Werten 00:43:57 07/17/03
        
        Re: Source code to measure it - results Robert Hyatt 14:58:21 07/17/03
        
        Re: Source code to measure it - results Vincent Diepeveen 19:34:18 07/15/03
        
        Re: Source code to measure it - results Vincent Diepeveen 20:05:37 07/15/03
        
        Re: Source code to measure it - results Robert Hyatt 20:35:30 07/15/03
        
        Re: Source code to measure it - results Keith Evans 21:05:29 07/15/03
        
        Re: Source code to measure it - results Robert Hyatt 21:29:43 07/15/03
        
        Re: Source code to measure it - results Keith Evans 21:44:34 07/15/03
        
        Re: Source code to measure it - results Robert Hyatt 07:29:07 07/16/03
        
        Re: Source code to measure it - results Keith Evans 09:58:33 07/16/03
        
        Re: Source code to measure it - results Robert Hyatt 11:36:56 07/16/03
        
        Re: Source code to measure it - results Vincent Diepeveen 04:20:50 07/16/03
        
        Re: Source code to measure it - results Keith Evans 10:04:40 07/16/03
        
        Precharging at DDR ram Vincent Diepeveen 19:40:10 07/16/03
        
        Re: Precharging at DDR ram Keith Evans 21:26:21 07/16/03
        
        Re: Precharging at DDR ram Robert Hyatt 14:56:00 07/17/03
        
        Re: Precharging at DDR ram Vincent Diepeveen 04:51:05 07/17/03
        
        Re: Precharging at DDR ram Robert Hyatt 14:56:52 07/17/03
        
        Re: Source code to measure it - results Robert Hyatt 11:37:44 07/16/03
        
        Re: Source code to measure it Gerd Isenberg 07:25:25 07/15/03
        
        Re: Source code to measure it Vincent Diepeveen 17:04:40 07/15/03
        
        Re: Source code to measure it Robert Hyatt 06:33:39 07/15/03
        
        Re: Source code to measure it Gerd Isenberg 14:14:45 07/15/03
        
        Re: Source code to measure it Robert Hyatt 17:06:36 07/15/03
        
        RAM properties Vincent Diepeveen 04:13:14 07/16/03
        
        Re: RAM properties Robert Hyatt 07:31:09 07/16/03
        
        Re: RAM properties Vincent Diepeveen 13:46:54 07/16/03
        
        Re: RAM properties Robert Hyatt 15:22:34 07/16/03
        
        Re: RAM properties Vincent Diepeveen 05:12:55 07/17/03
        
        Re: RAM properties Robert Hyatt 14:50:29 07/17/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 14:58:01 07/15/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 17:26:28 07/15/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 05:04:49 07/16/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 05:12:43 07/16/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 11:18:23 07/16/03
        
        Re: Source code to measure it - there is something wrong Dieter Buerssner 11:41:45 07/16/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 13:45:18 07/16/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 14:48:20 07/17/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 12:40:07 07/16/03
        
        Re: Source code to measure it - there is something wrong Dieter Buerssner 13:12:45 07/16/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 15:25:51 07/16/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 05:07:07 07/17/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 14:23:37 07/17/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 23:17:29 07/16/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 14:26:50 07/17/03
        
        Re: Source code to measure it - there is something wrong Keith Evans 14:37:33 07/17/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 15:27:54 07/17/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 13:40:47 07/16/03
        
        Re: Source code to measure it - there is something wrong Gerd Isenberg 12:25:08 07/16/03
        
        oups - sorry Vincent Gerd Isenberg 13:14:16 07/16/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 20:50:01 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 17:08:57 07/15/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 17:30:04 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 20:37:53 07/15/03
        
        Re: Source code to measure it - there is something wrong Keith Evans 17:58:18 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 20:39:22 07/15/03
        
        Re: Source code to measure it - there is something wrong Vincent Diepeveen 19:25:01 07/15/03
        
        Re: Source code to measure it - there is something wrong Keith Evans 21:02:35 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 21:31:28 07/15/03
        
        Re: Source code to measure it - there is something wrong Ricardo Gibert 22:01:55 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 07:33:37 07/16/03
        
        Re: Source code to measure it - there is something wrong Ricardo Gibert 22:34:52 07/15/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 07:33:58 07/16/03
        
        Re: Source code to measure it - there is something wrong Robert Hyatt 20:41:36 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Gerd Isenberg 13:52:50 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Vincent Diepeveen 03:26:54 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Robert Hyatt 06:35:16 07/15/03
        
        Re: Matt Taylor's magic de Bruijn Constant Eugene Nalimov 13:32:20 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Robert Hyatt 21:29:03 07/14/03
        
        Re: Matt Taylor's magic de Bruijn Constant Eugene Nalimov 21:57:11 07/14/03

This page took 0.12 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.