Gives over 50% faster scope update (from 4.5ms to 2.2ms here with SD shot)! Probably mostly due to more clever usage of thread-local data (which avoids any lock, when OMP code had a rather stupid critical section for minmax)...
Gives over 50% faster scope update (from 4.5ms to 2.2ms here with SD shot)! Probably mostly due to more clever usage of thread-local data (which avoids any lock, when OMP code had a rather stupid critical section for minmax)...