Apparently, for viewport rendering on the GPU, a tile size of 1024 gives much better performance than using a single tile. Not sure why this doesn't work for background rendering; that needs to be investigated further. In the meantime, use the old debug value of 1024 for the tile size.