blender-archive

Archived

Author	SHA1	Message	Date
Sergey Sharybin	7ad21c3876	Fix T66604: Cycles bake crash on specific scene with volume The issue was caused by un-initialized local storage for volume intersection hits which are supposed to be stored in per-thread KernelGlobals. Fix is to make thread_shader() be the same as thread_render() in respect of KernelGlobals. Reviewers: brecht Reviewed By: brecht Differential Revision: https://developer.blender.org/D5230	2019-07-11 15:44:09 +02:00
Sergey Sharybin	fced0f0437	Cycles: Don't advertise BVH8 being supported on 32bit platforms The kernel does not use AVX2 vectorization, and trying to use BVH8 was leading to an empty scenes. Fixes T64624: Ctest : Win32 + AVX2 fails virtually all cycles tests	2019-05-16 11:51:25 +02:00
Brecht Van Lommel	b63ffa8919	Fix Cycles build error after recent changes We need to do aligned alloc of the services instead of globals now since the concurrent map moved there.	2019-05-14 15:06:23 +02:00
Brecht Van Lommel	a5c89574a3	Fix Cycles assert on exit after recent changes	2019-05-03 18:04:47 +02:00
Brecht Van Lommel	fadb6f3466	Cleanup: refactor Cycles OSL texture handling This adds our own OSL texture handle, that has info for OIIO textures or our own custom texture types. A filename to handle hash map is used for lookups. This is efficient because it happens at OSL compile time, because the optimizer can figure out constant strings and replace them with texture handles.	2019-05-03 15:36:20 +02:00
Campbell Barton	e12c08e8d1	ClangFormat: apply to source, most of intern Apply clang format as proposed in T53211. For details on usage and instructions for migrating branches without conflicts, see: https://wiki.blender.org/wiki/Tools/ClangFormat	2019-04-17 06:21:24 +02:00
Brecht Van Lommel	e691929686	Merge branch 'blender2.7'	2019-03-17 12:54:19 +01:00
Brecht Van Lommel	e17f7af0ce	Cleanup: remove Cycles advanced shading features toggle. It's effectively always enabled, only not on some unsupported OpenCL devices. For testing those it's not useful to disable these features. This is replaced by the more fine grained feature toggles that we have now.	2019-03-17 01:58:39 +01:00
Brecht Van Lommel	ebcea3029d	Merge branch 'blender2.7'	2019-03-06 13:45:21 +01:00
Brecht Van Lommel	f08191a459	Fix Cycles build error on non-x86 processors.	2019-03-06 13:37:06 +01:00
Brecht Van Lommel	e21ae0bb26	Merge branch 'blender2.7'	2019-02-06 15:22:53 +01:00
Lukas Stockner	fccf506ed7	Cycles: animation denoising support in the kernel. This is the internal implementation, not available from the API or interface yet. The algorithm takes into account past and future frames, both to get more coherent animation and reduce noise. Ref D3889.	2019-02-06 15:18:42 +01:00
Lukas Stockner	405cacd4cd	Cycles: prefilter feature passes separate from denoising. Prefiltering of feature passes will happen during rendering, which can then be used for denoising immediately or written as a render pass for later (animation) denoising. The number of denoising data passes written is reduced because of this, leaving out the feature variance passes. The passes are now Normal, Albedo, Depth, Shadowing, Variance and Intensity. Ref D3889.	2019-02-06 15:18:29 +01:00
Stefan Werner	5e121c8eab	Cycles: Fixed uninitialized memory Cryptomatte on CPU with accurate mode was hitting uninitialized variables. This is now explicitly initializing them to NULL.	2019-01-18 15:17:21 +01:00
Brecht Van Lommel	a8b8da5567	Fix T58183: crash with CPU + GPU rendering after profiling changes. Multi-device was not passing along profiler to the CPU.	2018-11-29 23:43:27 +01:00
Lukas Stockner	7fa6f72084	Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel This commit adds a sample-based profiler that runs during CPU rendering and collects statistics on time spent in different parts of the kernel (ray intersection, shader evaluation etc.) as well as time spent per material and object. The results are currently not exposed in the user interface or per Python yet, to see the stats on the console pass the "--cycles-print-stats" argument to Cycles (e.g. "./blender -- --cycles-print-stats"). Unfortunately, there is no clear way to extend this functionality to CUDA or OpenCL, so it is CPU-only for now. Reviewers: brecht, sergey, swerner Reviewed By: brecht, swerner Differential Revision: https://developer.blender.org/D3892	2018-11-29 02:45:24 +01:00
Campbell Barton	e742e0934d	Cleanup: trailing space	2018-11-25 08:01:14 +11:00
Sergey Sharybin	203de0bbf0	Cycles: Cleanup, space after (void) It was used in like 95% of places.	2018-11-09 12:08:51 +01:00
Sergey Sharybin	2330cadb0f	Cycles: Cleanup, don't use strict C prototypes Those are more like a legacy of language, which is not needed in C++.	2018-11-09 12:04:41 +01:00
Sergey Sharybin	cb4b5e12ab	Cycles: Cleanup, spacing after preprocessor It is supposed to be two spaces before comment stating which if else/endif statements corresponds to. Was mainly violated in the header guards.	2018-11-09 11:34:54 +01:00
Stefan Werner	2c5531c0a5	Cycles: Added Embree as BVH option for CPU renders. Note that this is turned off by default and must be enabled at build time with the CMake WITH_CYCLES_EMBREE flag. Embree must be built as a static library with ray masking turned on, the `make deps` scripts have been updated accordingly. There, Embree is off by default too and must be enabled with the WITH_EMBREE flag. Using Embree allows for much faster rendering of deformation motion blur while reducing the memory footprint. TODO: GPU implementation, deduplication of data, leveraging more of Embrees features (e.g. tessellation cache). Differential Revision: https://developer.blender.org/D3682	2018-11-07 12:58:12 +01:00
Sergey Sharybin	e0cc3e9809	Cycles: Fix wrong BVH used when disabling AVX2 in debug settings Mainly useful for debugging. Previously, when AVX2 was disabled in the debug panel but BVH layout was kept on BVH8 nothing was rendered. Needed to make it so supported BVH layout mask for devices is queried in "dynamic", so it is possible to use DebugFlags there.	2018-10-31 11:46:52 +01:00
Stefan Werner	e58c6cf0c6	Cycles: Added Cryptomatte output. This allows for extra output passes that encode automatic object and material masks for the entire scene. It is an implementation of the Cryptomatte standard as introduced by Psyop. A good future extension would be to add a manifest to the export and to do plenty of testing to ensure that it is fully compatible with other renderers and compositing programs that use Cryptomatte. Internally, it adds the ability for Cycles to have several passes of the same type that are distinguished by their name. Differential Revision: https://developer.blender.org/D3538	2018-10-28 05:37:41 -04:00
Lukas Stockner	0234de7d85	Cycles: Reuse existing buffer in the NLM denoising kernels on CPU	2018-10-08 22:17:06 +02:00
Lukas Stockner	a0cc7bd961	Cycles: Implement vectorized NLM kernels for faster CPU denoising	2018-10-06 21:49:54 +02:00
Sergey Sharybin	94ea566b5a	Cycles: Cleanup, whitespace after keyword	2018-08-30 17:34:11 +02:00
Sergey Sharybin	73f2056052	Cycles: Add BVH8 and packeted triangle intersection This is an initial implementation of BVH8 optimization structure and packated triangle intersection. The aim is to get faster ray to scene intersection checks. Scene BVH4 BVH8 barbershop_interior 10:24.94 10:10.74 bmw27 02:41.25 02:38.83 classroom 08:16.49 07:56.15 fishy_cat 04:24.56 04:17.29 koro 06:03.06 06:01.45 pavillon_barcelona 09:21.26 09:02.98 victor 23:39.65 22:53.71 As memory goes, peak usage raises by about 4.7% in a complex scenes. Note that BVH8 is disabled when using OSL, this is because OSL kernel does not get per-microarchitecture optimizations and hence always considers BVH3 is used. Original BVH8 patch from Anton Gavrikov. Batched triangles intersection from Victoria Zhislina. Extra work and tests and fixes from Maxym Dmytrychenko.	2018-08-29 15:03:09 +02:00
Lukas Stockner	94efc651d4	Cycles Denoiser: Allocate a single temporary buffer for the entire denoising process With small tiles, the repeated allocations on GPUs can actually slow down the denoising quite a lot. Allocating the buffer just once reduces rendertime for the default cube with 16x16 tiles and denoising on a mobile 1050 from 22.7sec to 14.0sec.	2018-08-25 12:23:52 -07:00
Sergey Sharybin	658a9c6cf5	Cycles: Cleanup, style I wouldn't mind changing style to have space after keyword, but there was no official code style change proposed.	2018-08-24 14:36:18 +02:00
Lukas Stockner	c960804747	Cycles Denoising: Pass tile buffers to every OpenCL kernel to conform to standard and get rid of set_tile_info	2018-07-04 14:38:03 +02:00
Lukas Stockner	9db8bdbc65	Cycles Denoising: Cleanup: Rename tiles to tile_info	2018-07-04 14:37:24 +02:00
Lukas Stockner	97a0d6fcc7	Cycles Denoising: Refactor denoiser tile handling This deduplicates the calls for tile (un)mapping and allows to have a target buffer that is different from the source buffer (needed for baking and animation denoising).	2018-07-04 14:36:01 +02:00
Lukas Stockner	b10c64bd2f	Cycles Denoising: Split main function into logical steps	2018-07-04 14:35:05 +02:00
Stefan Werner	73eb1bfd55	Revert "Turned off clang warnings in third party includes." This reverts commit `d53093953f`.	2018-06-26 10:26:56 +02:00
Stefan Werner	d53093953f	Turned off clang warnings in third party includes. The latest clang compiler (at least the one in Xcode 9.4.1) warns about the register keyword and macro expansions using defined(). Since these warnings come from third party code, we can't address them directly in Blender. Silencing them via #pramgas will at least keep the warnings during a build down to the ones that are relevant to Blender code.	2018-06-25 23:02:01 +02:00
Brecht Van Lommel	ce3e0afe59	Fix T54001: AMD OpenCL fails with certain resolutions, after recent changes. We should actually be using CL_DEVICE_MEM_BASE_ADDR_ALIGN for sub buffers, previous change in this code was incorrect. Renamed the function now to make the specific purpose of this alignment clear, it's not required for data types in general.	2018-02-05 22:19:49 +01:00
Sergey Sharybin	2f79d1c058	Cycles: Replace use_qbvh boolean flag with an enum-based property This was we can introduce other types of BVH, for example, wider ones, without causing too much mess around boolean flags. Thoughs: - Ideally device info should probably return bitflag of what BVH types it supports. It is possible to implement based on simple logic in device/ and mesh.cpp, rest of the changes will stay the same. - Not happy with workarounds in util_debug and duplicated enum in kernel. Maybe enbum should be stores in kernel, but then it's kind of weird to include kernel types from utils. Soudns some cyclkic dependency. Reviewers: brecht, maxim_d33 Reviewed By: brecht Differential Revision: https://developer.blender.org/D3011	2018-01-22 17:19:20 +01:00
Sergey Sharybin	fa91b43e8c	Cycles: Make it more proper check on vectorization flags from DebugFlags Mimics to checks in system_cpu_support() checks.	2018-01-19 15:48:42 +01:00
Sergey Sharybin	ccec1e7667	Cycles: Cleanup, stop using debug flags in system utilities Debug flags are to be controlling render behavior, nothing to do with low level system utilities. it was simple to hack, but logically is wrong. Lets do things where they are supposed to be done!	2018-01-19 15:22:32 +01:00
Lukas Stockner	2069102c56	Cycles: Fix constness for load_kernels in device_cpu.cpp	2017-12-06 00:00:18 +01:00
Lukas Stockner	fa3d50af95	Cycles: Improve denoising speed on GPUs with small tile sizes Previously, the NLM kernels would be launched once per offset with one thread per pixel. However, with the smaller tile sizes that are now feasible, there wasn't enough work to fully occupy GPUs which results in a significant slowdown. Therefore, the kernels are now launched in a single call that handles all offsets at once. This has two downsides: Memory accesses to accumulating buffers are now atomic, and more importantly, the temporary memory now has to be allocated for every shift at once, increasing the required memory. On the other hand, of course, the smaller tiles significantly reduce the size of the memory. The main bottleneck right now is the construction of the transformation - there is nothing to be parallelized there, one thread per pixel is the maximum. I tried to parallelize the SVD implementation by storing the matrix in shared memory and launching one block per pixel, but that wasn't really going anywhere. To make the new code somewhat readable, the handling of rectangular regions was cleaned up a bit and commented, it should be easier to understand what's going on now. Also, some variables have been renamed to make the difference between buffer width and stride more apparent, in addition to some general style cleanup.	2017-11-30 07:37:08 +01:00
Lukas Stockner	40f528a7da	Cycles: Add per-tile render time debug pass Reviewers: sergey, brecht Differential Revision: https://developer.blender.org/D2920	2017-11-17 16:40:24 +01:00
Brecht Van Lommel	bd4bea3e98	Cycles: avoid reallocating tile denoising memory many times during render.	2017-11-09 20:28:00 +01:00
Mai Lavelle	087331c495	Cycles: Replace __MAX_CLOSURE__ build option with runtime integrator variable Goal is to reduce OpenCL kernel recompilations. Currently viewport renders are still set to use 64 closures as this seems to be faster and we don't want to cause a performance regression there. Needs to be investigated. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2775	2017-11-09 01:04:06 -05:00
Brecht Van Lommel	5801ef71e4	Code refactor: device memory cleanups, preparing for mapped host memory.	2017-11-05 15:22:04 +01:00
Brecht Van Lommel	6ec599c682	Fix T53247: mixed CPU + GPU render wrong texture limits.	2017-11-03 20:32:29 +01:00
Brecht Van Lommel	070a668d04	Code refactor: move more memory allocation logic into device API. * Remove tex_* and pixels_* functions, replace by mem_. Add MEM_TEXTURE and MEM_PIXELS as memory types recognized by devices. * No longer create device_memory and call mem_* directly, always go through device_only_memory, device_vector and device_pixels.	2017-10-24 01:25:19 +02:00
Brecht Van Lommel	aa8b4c5d81	Code refactor: use device_only_memory and device_vector in more places.	2017-10-24 01:25:13 +02:00
Brecht Van Lommel	7ad9333fad	Code refactor: store device/interp/extension/type in each device_memory.	2017-10-24 01:03:59 +02:00
Brecht Van Lommel	ae41f38f78	Code refactor: pass device to scene, check OSL with device info.	2017-10-24 01:03:59 +02:00

1 2 3

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

148 Commits