Author: Vincent Diepeveen
Date: 17:26:28 07/15/03
Go up one level in this thread
On July 15, 2003 at 17:58:01, Gerd Isenberg wrote: >Ok, i think there is one problem with Vincent's cache benchmark. > >There are two similar functions DoNrng and DoNreads. DoNrng is used to mesure >the time without hashread. But the instructions has the potential of faster >execution due to less dependencies and stalls. It may execute parts of two loop >bodies of DoNrng interlaced or simultaniesly - that is not possible in DoNreads. >Therefore the time for N DoNrng is not the time used inside the N DoNrng loop, >and maybe much faster. If you look to the generated assembly code you will see that nothing is wrong. If something goes wrong in the pairing of the instructions that might at most make a difference of 1 to 2 clockcycles. At 2Ghz that's 0.5 nanoseconds. It is cool if it measures accurate in 0.5 ns of course, but that was never the intention of the test. It was intended to measure latencies at big supercomputers when n processors read in the memory of just 1 poor node. the other test latencyC.c is doing a criss cross reference. A 0.5 ns error in the measurement there (assuming there is no other system stuff disturbing which there is because actually other processes may also use RAM at your node, which can give major differences, but still doesn't give a bug in the test) is not really relevant. If you care for 0.5 ns difference then shoot me, but don't ever say it's 130ns like bob claims for 133Mhz DDR ram. You're at new fresh i bet cl2 150Mhz DDR ram if not faster. the 280 ns was measured for P4 with 533 bus or 133Mhz DDR ram and the 400 ns as you see is from the duals. For me 380 ns goes to 400 ns or something when using bigger hashtables. I see no significant difference between 380 and 400 so to speak. It's simply *huge* random latency. > >int DoNrng(BITBOARD n) { > BITBOARD i=1,dummyres,nents; > int t1,t2; > > nents = nentries; /* hopefully this gets into a register */ > dummyres = globaldummy; > > t1 = GetClock(); > do { > BITBOARD index = RanrotA()%nents; > dummyres ^= index; > } while( i++ < n ); > t2 = GetClock(); > > globaldummy = dummyres; > return(t2-t1); >} > >int DoNreads(BITBOARD n) { > BITBOARD i=1,dummyres,nents; > int t1,t2; > > nents = nentries; /* hopefully this gets into a register */ > dummyres = globaldummy; > > t1 = GetClock(); > do { > BITBOARD index = RanrotA()%nents; > dummyres ^= hashtable[index]; > } while( i++ < n ); > t2 = GetClock(); > > globaldummy = dummyres; > > return(t2-t1); >}
This page took 0.01 seconds to execute
Last modified: Thu, 15 Apr 21 08:11:13 -0700
Current Computer Chess Club Forums at Talkchess. This site by Sean Mintz.