Fixes T40027. This means we get more CPU usage again when using multiple CUDA, but the impact on performance is too big a problem with the current code.
Fixes T40027. This means we get more CPU usage again when using multiple CUDA, but the impact on performance is too big a problem with the current code.