Computer Chess Club Archives


Search

Terms

Messages

Subject: Re: Some thoughts on Dann Corbit's rotated alternative

Author: Dann Corbit

Date: 20:32:54 03/03/06

Go up one level in this thread


On March 03, 2006 at 18:49:39, Gerd Isenberg wrote:

>On March 03, 2006 at 18:29:46, Dann Corbit wrote:
>
>>On March 03, 2006 at 17:14:54, Tony Werten wrote:
>>
>>>On March 03, 2006 at 15:58:21, Steffan Westcott wrote:
>>>
>>>>Gerd,
>>>>
>>>>I may have terribly misunderstood things, but there is direct algorithm for
>>>>mapping the 128 possible attack bitboards to 7 bit numbers. Additionally, the
>>>>algorithm does not need the location of the ray source square, nor any lookup
>>>>tables.
>>>>
>>>>For a ray along a rank or diagonal, it is sufficient to collapse the files:
>>>>
>>>>typedef unsigned __int64 BitBoard;
>>>>
>>>>int collapsed_files_index(BitBoard b)
>>>>{
>>>>    b |= b >> 32;
>>>>    b |= b >> 16;
>>>>    b |= b >>  8;
>>>>    return b & 0xFF;
>>>>}
>>>>
>>>>For a ray along a file, a little more trickery is needed:
>>>>
>>>>int collapsed_ranks_index(BitBoard b)
>>>>{
>>>>    b |= b >> 4;   // No masking needed
>>>>    b |= b >> 2;   //    "         "
>>>>    b |= b >> 1;   //    "         "
>>>>    return ((b & 0x0101010101010101) * 0x0102040810204080) >> 56;
>>>>}
>>>>
>>>>Both of the above routines return 8 bit results. Depending on the left/right
>>>>orientation of the ray in relation to your chosen bitboard bit numbering scheme,
>>>>you may need to shift the result right by 1 bit.
>>>>
>>>>In addition, if you know the co-ordinates of the source square (either at
>>>>runtime or compile time), simpler variants of the above routines are possible.
>>>>
>>>>I don't use techniques like these as I avoid serialisation of moves or features
>>>>as much as possible. However, I offer you and Dann the above as it may be of
>>>>interest.
>>>
>>>Hi Steffan,
>>>
>>>although the algoritm looks simpler (allthough I'm not sure I understand) , (BB
>>>& mask)*magic_number will give a better performance.
>>
>>Why not just: switch (BB) {/*128 cases go here */}
>>?
>>What is the value of the transform into a shorter integer type?
>>Has anyone measured that it makes the switch faster?  For me it is only
>>important if it makes it faster for 64 bit hardware anyway.
>
>Branchless stuff and memory access versus a 7 folded cmp-jxx chain and a lot of
>btb-entries on some good chance of a miss-prediction here and there.
>
>Of course if you index a function pointer table a "save" missprediction" occurs
>as well. So the folding is intented to avoid branches at all and to index
>precaluclated attack bitboards the same way as in a rotated framework.
>
>While your switch approach seems to cover much more items. All precalculated
>raywise properties of a square one may think about, "indexed" somehow by all up
>to seven other man on that ray - not only occupied states.

OK.  So you make an array of 128 function pointers then, and do this:

status = method[hashval(BB)]();

PGO seems to take a lot of the sting out of missed branch predictions, but of
course there will always be some.




This page took 0 seconds to execute

Last modified: Thu, 15 Apr 21 08:11:13 -0700

Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.