The render session is keeping track of the scene update, which includes kernel loading time. This fixes negative render times reported when CUDA kernels are compiled at runtime. A bit fragile logic, can be re-implemented using some user-counted scope utility classes, so that only outer-most time skip is applied.