From: Skybuck Flying on 3 Oct 2009 14:58 Ok, I just tested the "streaming" idea for the cpu. Streaming idea: "do many reads, do many writes, repeat". Non streaming idea: "do single read, do single write repeat". The non streaming idea works faster. (Streaming idea requires multiplications and some extra looping, not sure if that slows it down... most likely reason is that streaming idea requires extra memory to hold the reads... cannot directly read into cache ?!?) Possible solution: try doing fetches instead <- nice idea. Going to try version 0.02 with fetches only ;) :) and then some normal read write cycles or so Bye, Skybuck.
From: Skybuck Flying on 3 Oct 2009 15:10 Ok tried it... The prefetching "streaming" version is also slower than the non streaming version... Maybe the pattern of writing/reading wasn't identical for both versions... but it's the best I could do for now... So for now I give up on this idea ! ;) Bye, Skybuck.
|
Pages: 1 Prev: OpenGL performance benchmarking ! ;) Next: american weather |