blender-archive

Archived

Author	SHA1	Message	Date
Brecht Van Lommel	7778a1a0a1	Cycles: optimization for constant background colors. Skip shader evaluation then, as we already do for lights. Less than 1% faster in my tests, but might as well be consistent for both.	2019-03-17 12:01:19 +01:00
Brecht Van Lommel	9873005ecd	Cleanup: simplify kernel features definition. No functional changes, logic here got too complex after many changes over the years.	2019-03-17 12:01:19 +01:00
Brecht Van Lommel	e17f7af0ce	Cleanup: remove Cycles advanced shading features toggle. It's effectively always enabled, only not on some unsupported OpenCL devices. For testing those it's not useful to disable these features. This is replaced by the more fine grained feature toggles that we have now.	2019-03-17 01:58:39 +01:00
Brecht Van Lommel	bc8bd87dff	Merge branch 'blender2.7'	2019-03-15 18:31:48 +01:00
Brecht Van Lommel	65d95879f7	Cycles: upgrade to CUDA 10.1 as the one officially supported version. This version fixes various bugs, and there is no need anymore to use both 9.1 and 10.0 for different cards. There is a bug related to WITH_CYCLES_CUBIN_COMPILER and bump mapping in the regression tests, so that remains disabled same as it was for CUDA 10.0. Fix T59286: CUDA bake failing on some cards. Fix T56858: CUDA 9.2 and 10 issues.	2019-03-15 16:52:28 +01:00
Jeroen Bakker	5051e580e4	Merge branch 'blender2.7'	2019-03-15 16:28:33 +01:00
Jeroen Bakker	2f6257fd7f	Cycles/OpenCL: Compile Kernels During Scene Update The main goals of this change is faster starting when using foreground rendering. This patch will build kernels in parallel to the update process of the scene. When these optimized kernels are not available (yet) an AO kernel will be used. These AO kernels are fast to compile (3-7 seconds) and can be reused by all scenes. When the final kernels become available we will switch to these kernels. In background mode the AO kernels will not be used. Some kernels are being used during Scene update (displace, background light). When these kernels are being used the process can halt until these become available. Reviewed By: brecht, #cycles Maniphest Tasks: T61752 Differential Revision: https://developer.blender.org/D4428	2019-03-15 16:18:21 +01:00
Stefan Werner	d8f1b18d9b	Merge branch 'blender2.7' of git.blender.org:blender	2019-03-14 11:47:27 +01:00
Stefan Werner	47da8dcbca	Cycles: Improved thread order for better CUDA performance. This patch puts threads that render the same pixel closer together, as opposed to threads that render the same sample. Thus threads within a warp are more coherent in memory access and control flow, leading to performance improvements. Example benchmarks on a Quadro RTX4000 (WDDM) on Windows 10: Koro: 4:23 -> 3:46 BMW: 1:18 -> 1:25 Barbershop Interior: 17:52 -> 14:55 Classroom: 4:37 -> 3:45 Performance differences on OpenCL/AMD were hit and miss, some scenes became faster, others lost significantly. Therefore, this is kept as CUDA only change for now.	2019-03-14 11:45:58 +01:00
Brecht Van Lommel	645cc3e871	Merge branch 'blender2.7'	2019-03-12 14:22:53 +01:00
Brecht Van Lommel	e3b1ae9a81	Fix T62481: Cycles crash rendering with UV pass after recent changes.	2019-03-12 14:11:36 +01:00
Brecht Van Lommel	f608964549	Merge branch 'blender2.7'	2019-03-11 14:34:17 +01:00
Brecht Van Lommel	56a633fd2c	Fix T61103: Cycles bevel wrong on objects with negative scale.	2019-03-11 14:26:06 +01:00
Julian Eisel	4041249943	Merge branch 'blender2.7' Conflicts: intern/cycles/blender/addon/properties.py intern/cycles/device/opencl/opencl_split.cpp	2019-03-09 17:19:52 +01:00
Jeroen Bakker	02a7e875d7	Cycles OpenCL: Remove single program Part of the cleanup of the OpenCL codebase. Single program is not effective when using OpenCL, it is slower to compile and slower during rendering (when used in for example `barbershop` or `victor`). Reviewers: brecht, #cycles Maniphest Tasks: T62267 Differential Revision: https://developer.blender.org/D4481	2019-03-08 16:31:35 +01:00
Stefan Werner	c891fb2fbe	Merge branch 'blender2.7'	2019-03-05 15:06:09 +01:00
Brecht Van Lommel	db7f9a70b0	Cycles: Added Float2 attribute type. Float2 are now a new type for attributes in Cycles. Before, the choices for attribute storage were float and float3, the latter padded to float4. This meant that UV maps were inflated to twice the size necessary. Reviewers: brecht, sergey Reviewed By: brecht Subscribers: #cycles Tags: #cycles Differential Revision: https://developer.blender.org/D4409	2019-03-05 14:55:21 +01:00
Jeroen Bakker	6d110a03b7	Merge branch 'blender2.7'	2019-03-05 14:26:28 +01:00
Jeroen Bakker	a325bc6bf3	Fix T58953: Lamp data not always set The Lamp data was not always set. When using CUDA or CPU it was, but when using OpenCL without `OBJECT_MOTION` `sd->lamp` not updated to the actual lamp. This made the TextureCoordinate output the wrong normal when used in a light shader. As the normal was incorrect it made the IES node render incorrectly. (what is the default for the IES node). By setting the lamp data when no `__OBJECT_MOTION__` compile directive is present makes sure that the normal is correctly calculated. Fix D4450 Reviewed By: Brecht van Lommel	2019-03-05 14:22:54 +01:00
Jeroen Bakker	15edae617f	Merge branch 'blender2.7'	2019-02-26 14:07:57 +01:00
Jeroen Bakker	e6099c7e46	T61576: Do Not (Re-)Compile OpenCL kernels The goal of this patch is to have limit the number of times kernels needs to be compiled and are reused as kernels with different compile directives can lead to identical same binaries. The implementation does this by stripping the compile directives. and reshuffling kernels so the output is more likely to be the same. We focussed on the kernels where it was easy to detect and maintain (bundle, bake, displace, do_volume and background). More optimizations could be done but they are probably less obvious. Merged the data_init and state_buffer_size kernels to split_bundle. This patch will also remove empty kernels for do_volume and bake when their features are not enabled. When using the benchmark files there are less background, bake and do_volume kernels compiled. Fix: T61576, T61501, T61466 Reviewed By: brecht, #cycles Differential Revision: https://developer.blender.org/D4390	2019-02-26 12:45:26 +01:00
Sergey Sharybin	8986c92b65	Merge branch 'blender2.7'	2019-02-21 15:33:07 +01:00
Jeroen Bakker	a51d08f473	Fix: Missing closing brackets in include	2019-02-21 14:36:51 +01:00
Jeroen Bakker	fab6c5040d	Fix: OpenCL Displacement and light sampling The bake kernels are also used during mesh displacement and light importance sampling. We disabled the implementation of these kernels when baking was not enabled.	2019-02-21 08:11:02 +01:00
Sergey Sharybin	9e4d561a8b	Merge branch 'blender2.7'	2019-02-20 23:20:43 +01:00
Sergey Sharybin	ccd291aafb	Cycles: Fix uninitialized number of hits Was happening when looking for all intersections for transparent shadow rays in the case the ray is degenerate. Still quesitonable whether we should consider this a transparent or opaque configuraiton. Ideally, we should prevent such rays from happening, but that is another vector of debugging.	2019-02-20 23:20:07 +01:00
Ray molenkamp	4ec6b16b4e	cycles/opencl: Fix compile error. added missing quote, introduced in rB15edda3a8e07003bef695cca939744bbea80ad18	2019-02-20 11:44:06 -07:00
Jeroen Bakker	8a4cdda373	Merge branch 'blender2.7'	2019-02-20 15:22:23 +01:00
Jeroen Bakker	949ab753bb	Cycles OpenCL: Remove OpenCL MegaKernel Using OpenCL MegaKernel has been slow and therefore not usefull. This patch will remove the mega kernel from the OpenCL codebase and the OpenCLDeviceBase class. T61736: removal of mega kernel T61703: baking does not work with mega kernel Tags: #cycles Differential Revision: https://developer.blender.org/D4383	2019-02-20 15:17:22 +01:00
Jeroen Bakker	667033e89e	T61463: Separate Baking kernels Cycles OpenCL: Split baking kernels in own program Fix T61463. Before this patch baking was part of the base kernels. There are 3 baking kernels that and all 3 uses shader evaluation. Only for one of these kernels the functionality was wrapped in the __NO_BAKING__ compile directive. When you start baking this leads to long compile times. By separating in individual programs will reduce the compile times. Also wrapped all baking kernels with __NO_BAKING__ to reduce the compilation times. Impact on compilation time job \| scene_name \| previous \| new \| percentage --------+-----------------+----------+-------+------------ T61463 \| empty \| 10.63 \| 7.27 \| 32% T61463 \| bmw \| 17.91 \| 14.24 \| 20% T61463 \| fishycat \| 19.57 \| 15.08 \| 23% T61463 \| barbershop \| 54.10 \| 48.18 \| 11% T61463 \| classroom \| 17.55 \| 14.42 \| 18% T61463 \| koro \| 18.92 \| 17.15 \| 9% T61463 \| pavillion \| 17.43 \| 14.23 \| 18% T61463 \| splash279 \| 16.48 \| 15.33 \| 7% T61463 \| volume_emission \| 36.22 \| 34.19 \| 6% Impact on render time job \| scene_name \| previous \| new \| percentage --------+-----------------+----------+---------+------------ T61463 \| empty \| 21.06 \| 20.54 \| 2% T61463 \| bmw \| 198.44 \| 189.59 \| 4% T61463 \| fishycat \| 394.20 \| 388.50 \| 1% T61463 \| barbershop \| 1188.16 \| 1185.49 \| 0% T61463 \| classroom \| 341.08 \| 339.27 \| 1% T61463 \| koro \| 472.43 \| 360.70 \| 24% T61463 \| pavillion \| 905.77 \| 902.14 \| 0% T61463 \| splash279 \| 55.26 \| 54.92 \| 1% T61463 \| volume_emission \| 62.59 \| 39.09 \| 38% I don't have a grounded explanation why koro and volume_emission is this much faster; I have done several tests though... Maniphest Tasks: T61463 Differential Revision: https://developer.blender.org/D4376	2019-02-19 16:34:55 +01:00
Jeroen Bakker	15edda3a8e	T61463: Separate Baking kernels Cycles OpenCL: Split baking kernels in own program Fix T61463. Before this patch baking was part of the base kernels. There are 3 baking kernels that and all 3 uses shader evaluation. Only for one of these kernels the functionality was wrapped in the __NO_BAKING__ compile directive. When you start baking this leads to long compile times. By separating in individual programs will reduce the compile times. Also wrapped all baking kernels with __NO_BAKING__ to reduce the compilation times. Impact on compilation time job \| scene_name \| previous \| new \| percentage --------+-----------------+----------+-------+------------ T61463 \| empty \| 10.63 \| 7.27 \| 32% T61463 \| bmw \| 17.91 \| 14.24 \| 20% T61463 \| fishycat \| 19.57 \| 15.08 \| 23% T61463 \| barbershop \| 54.10 \| 48.18 \| 11% T61463 \| classroom \| 17.55 \| 14.42 \| 18% T61463 \| koro \| 18.92 \| 17.15 \| 9% T61463 \| pavillion \| 17.43 \| 14.23 \| 18% T61463 \| splash279 \| 16.48 \| 15.33 \| 7% T61463 \| volume_emission \| 36.22 \| 34.19 \| 6% Impact on render time job \| scene_name \| previous \| new \| percentage --------+-----------------+----------+---------+------------ T61463 \| empty \| 21.06 \| 20.54 \| 2% T61463 \| bmw \| 198.44 \| 189.59 \| 4% T61463 \| fishycat \| 394.20 \| 388.50 \| 1% T61463 \| barbershop \| 1188.16 \| 1185.49 \| 0% T61463 \| classroom \| 341.08 \| 339.27 \| 1% T61463 \| koro \| 472.43 \| 360.70 \| 24% T61463 \| pavillion \| 905.77 \| 902.14 \| 0% T61463 \| splash279 \| 55.26 \| 54.92 \| 1% T61463 \| volume_emission \| 62.59 \| 39.09 \| 38% I don't have a grounded explanation why koro and volume_emission is this much faster; I have done several tests though... Maniphest Tasks: T61463 Differential Revision: https://developer.blender.org/D4376	2019-02-19 16:33:50 +01:00
Jeroen Bakker	e6f5632eb1	T61513: Refactored Cycles Attribute Retrieval There is a generic function to retrieve float and float3 attributes `primitive_attribute_float` and primitive_attribute_float3`. Inside these functions an prioritised if-else construction checked where the attribute is stored and then retrieved from that location. Actually the calling function most of the time already knows where the data is stored. So we could simplify this by splitting these functions and remove the check logic. This patch splits the `primitive_attribute_float?` functions into `primitive_surface_attribute_float?` and `primitive_volume_attribute_float?`. What leads to less branching and more optimum kernels. The original function is still being used by OSL and `svm_node_attr`. This will reduce the compilation time and render time for kernels. Especially in production scenes there is a lot of benefit. Impact in compilation times job \| scene_name \| previous \| new \| percentage -------+-----------------+----------+-------+------------ t61513 \| empty \| 10.63 \| 10.66 \| 0% t61513 \| bmw \| 17.91 \| 17.65 \| 1% t61513 \| fishycat \| 19.57 \| 17.68 \| 10% t61513 \| barbershop \| 54.10 \| 24.41 \| 55% t61513 \| classroom \| 17.55 \| 16.29 \| 7% t61513 \| koro \| 18.92 \| 18.05 \| 5% t61513 \| pavillion \| 17.43 \| 16.52 \| 5% t61513 \| splash279 \| 16.48 \| 14.91 \| 10% t61513 \| volume_emission \| 36.22 \| 21.60 \| 40% Impact in render times job \| scene_name \| previous \| new \| percentage -------+-----------------+----------+--------+------------ 61513 \| empty \| 21.06 \| 20.35 \| 3% 61513 \| bmw \| 198.44 \| 190.05 \| 4% 61513 \| fishycat \| 394.20 \| 401.25 \| -2% 61513 \| barbershop \| 1188.16 \| 912.39 \| 23% 61513 \| classroom \| 341.08 \| 340.38 \| 0% 61513 \| koro \| 472.43 \| 471.80 \| 0% 61513 \| pavillion \| 905.77 \| 899.80 \| 1% 61513 \| splash279 \| 55.26 \| 54.86 \| 1% 61513 \| volume_emission \| 62.59 \| 61.70 \| 1% There is also a possitive impact when using CPU and CUDA, but they are small. I didn't split the hair logic from the surface logic due to: * Hair and surface use same attribute types. It was not clear if it could be splitted when looking at the code only. * Hair and surface are quick to compile and to read. So the benefit is quite small. Differential Revision: https://developer.blender.org/D4375	2019-02-19 16:28:25 +01:00
Jeroen Bakker	d6d306441f	T61513: Refactored Cycles Attribute Retrieval There is a generic function to retrieve float and float3 attributes `primitive_attribute_float` and primitive_attribute_float3`. Inside these functions an prioritised if-else construction checked where the attribute is stored and then retrieved from that location. Actually the calling function most of the time already knows where the data is stored. So we could simplify this by splitting these functions and remove the check logic. This patch splits the `primitive_attribute_float?` functions into `primitive_surface_attribute_float?` and `primitive_volume_attribute_float?`. What leads to less branching and more optimum kernels. The original function is still being used by OSL and `svm_node_attr`. This will reduce the compilation time and render time for kernels. Especially in production scenes there is a lot of benefit. Impact in compilation times job \| scene_name \| previous \| new \| percentage -------+-----------------+----------+-------+------------ t61513 \| empty \| 10.63 \| 10.66 \| 0% t61513 \| bmw \| 17.91 \| 17.65 \| 1% t61513 \| fishycat \| 19.57 \| 17.68 \| 10% t61513 \| barbershop \| 54.10 \| 24.41 \| 55% t61513 \| classroom \| 17.55 \| 16.29 \| 7% t61513 \| koro \| 18.92 \| 18.05 \| 5% t61513 \| pavillion \| 17.43 \| 16.52 \| 5% t61513 \| splash279 \| 16.48 \| 14.91 \| 10% t61513 \| volume_emission \| 36.22 \| 21.60 \| 40% Impact in render times job \| scene_name \| previous \| new \| percentage -------+-----------------+----------+--------+------------ 61513 \| empty \| 21.06 \| 20.35 \| 3% 61513 \| bmw \| 198.44 \| 190.05 \| 4% 61513 \| fishycat \| 394.20 \| 401.25 \| -2% 61513 \| barbershop \| 1188.16 \| 912.39 \| 23% 61513 \| classroom \| 341.08 \| 340.38 \| 0% 61513 \| koro \| 472.43 \| 471.80 \| 0% 61513 \| pavillion \| 905.77 \| 899.80 \| 1% 61513 \| splash279 \| 55.26 \| 54.86 \| 1% 61513 \| volume_emission \| 62.59 \| 61.70 \| 1% There is also a possitive impact when using CPU and CUDA, but they are small. I didn't split the hair logic from the surface logic due to: * Hair and surface use same attribute types. It was not clear if it could be splitted when looking at the code only. * Hair and surface are quick to compile and to read. So the benefit is quite small. Differential Revision: https://developer.blender.org/D4375	2019-02-19 16:25:48 +01:00
Brecht Van Lommel	9800837b98	Cycles: Support multithreaded compilation of kernels This patch implements a workaround to get the multithreaded compilation from D2231 working. So far, it only works for Blender, not for Cycles Standalone. Also, I have only tested the Linux codepath in the helper function. Depends on D2231. Patch by lukasstockner97, jbakker, brecht job \| scene_name \| compilation_time ----------+-----------------+------------------ Baseline \| empty \| 22.73 D2264 \| empty \| 13.94 Baseline \| bmw \| 56.44 D2264 \| bmw \| 41.32 Baseline \| fishycat \| 59.50 D2264 \| fishycat \| 45.19 Baseline \| barbershop \| 212.28 D2264 \| barbershop \| 169.81 Baseline \| victor \| 67.51 D2264 \| victor \| 53.60 Baseline \| classroom \| 51.46 D2264 \| classroom \| 39.02 Baseline \| koro \| 62.48 D2264 \| koro \| 49.03 Baseline \| pavillion \| 54.37 D2264 \| pavillion \| 38.82 Baseline \| splash279 \| 47.43 D2264 \| splash279 \| 37.94 Baseline \| volume_emission \| 145.22 D2264 \| volume_emission \| 121.10 This patch reduced compilation time as the split kernels and base kernels are compiled in parallel. In cycles debug mode (256) you can set unmark the opencl single program file, what reduces the compilation time even further (bmw 17 seconds, barbershop 53 seconds). Reviewers: brecht, dingto, sergey, juicyfruit, lukasstockner97 Reviewed By: brecht Subscribers: Loner, jbakker, candreacchio, 3dLuver, LazyDodo, bliblubli Differential Revision: https://developer.blender.org/D2264	2019-02-15 08:56:20 +01:00
Brecht Van Lommel	4ce9785e01	Cycles: Support multithreaded compilation of kernels This patch implements a workaround to get the multithreaded compilation from D2231 working. So far, it only works for Blender, not for Cycles Standalone. Also, I have only tested the Linux codepath in the helper function. Depends on D2231. Reviewers: brecht, dingto, sergey, juicyfruit, lukasstockner97 Reviewed By: brecht Subscribers: Loner, jbakker, candreacchio, 3dLuver, LazyDodo, bliblubli Differential Revision: https://developer.blender.org/D2264	2019-02-15 08:49:25 +01:00
Brecht Van Lommel	7a41c1634b	Merge branch 'blender2.7'	2019-02-14 20:00:37 +01:00
Brecht Van Lommel	de0e456a6c	Cleanup: fix compiler warnings.	2019-02-14 19:39:39 +01:00
Brecht Van Lommel	9886ae6331	Fix T61470: incorrect saturation clamping in recent bugfix. We should clamp the result after multiplication.	2019-02-14 19:28:44 +01:00
Brecht Van Lommel	dbd9b7590a	Merge branch 'blender2.7'	2019-02-13 19:02:43 +01:00
Brecht Van Lommel	ec559912fb	Fix T61470: inconsistent HSV node results with saturation > 1.0. Values outside the 0..1 range produce negative colors, so now clamp to that range everywhere. Also fixes improper handling of hue > 2.0 in some places.	2019-02-13 17:06:30 +01:00
Brecht Van Lommel	e21ae0bb26	Merge branch 'blender2.7'	2019-02-06 15:22:53 +01:00
Lukas Stockner	fccf506ed7	Cycles: animation denoising support in the kernel. This is the internal implementation, not available from the API or interface yet. The algorithm takes into account past and future frames, both to get more coherent animation and reduce noise. Ref D3889.	2019-02-06 15:18:42 +01:00
Lukas Stockner	c183ac73dc	Cycles: tweak outlier detection, preparing for animation denoising. Ref D3889.	2019-02-06 15:18:38 +01:00
Lukas Stockner	405cacd4cd	Cycles: prefilter feature passes separate from denoising. Prefiltering of feature passes will happen during rendering, which can then be used for denoising immediately or written as a render pass for later (animation) denoising. The number of denoising data passes written is reduced because of this, leaving out the feature variance passes. The passes are now Normal, Albedo, Depth, Shadowing, Variance and Intensity. Ref D3889.	2019-02-06 15:18:29 +01:00
Campbell Barton	8c68ed6df1	Cleanup: remove redundant, invalid info from headers BF-admins agree to remove header information that isn't useful, to reduce noise. - BEGIN/END license blocks Developers should add non license comments as separate comment blocks. No need for separator text. - Contributors This is often invalid, outdated or misleading especially when splitting files. It's more useful to git-blame to find out who has developed the code. See P901 for script to perform these edits.	2019-02-02 02:40:00 +11:00
Campbell Barton	65ec7ec524	Cleanup: remove redundant, invalid info from headers BF-admins agree to remove header information that isn't useful, to reduce noise. - BEGIN/END license blocks Developers should add non license comments as separate comment blocks. No need for separator text. - Contributors This is often invalid, outdated or misleading especially when splitting files. It's more useful to git-blame to find out who has developed the code. See P901 for script to perform these edits.	2019-02-02 01:36:28 +11:00
Brecht Van Lommel	409a21b32e	Merge branch 'blender2.7'	2019-01-28 12:05:51 +01:00
Brecht Van Lommel	d918217d35	OSL: remove fresnel template that was not public domain. Convention is to only have public domain code templates. Also fixes wrong license header in Cycles.	2019-01-28 12:04:54 +01:00
Brecht Van Lommel	728f43d585	Merge branch 'blender2.7'	2019-01-14 12:13:10 +01:00
Brecht Van Lommel	10fa3b790f	Fix T60450: Cycles broken GPU denoising after recent changes.	2019-01-14 11:42:38 +01:00

... 3 4 5 6 7 ...

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

2410 Commits