For a real-world distortion the payload is quite uniformly distributed across scanlines. Surely, in the corners more iterations of minimizer is needed, but that happens in threads without scheduling overhead.
For a real-world distortion the payload is quite uniformly distributed across scanlines. Surely, in the corners more iterations of minimizer is needed, but that happens in threads without scheduling overhead.