blender-archive

Archived

Author	SHA1	Message	Date
Brecht Van Lommel	68dd7617d7	Cycles: add utility functions for zero float2/float3/float4/transform Ref D8237, T78710	2021-02-17 16:26:24 +01:00
Patrick Mours	9ff7820f62	Fix T79259: OptiX render with fisheye camera is different to CUDA The fisheye camera setup causes the edges of the image to not shoot primary rays. This was not respected by OptiX because of an optimization that tried to reduce conditionals around trace calls. Removing that does not seem to have an impact on performance anymore however and it fixes the issue.	2020-07-28 15:45:46 +02:00
Brecht Van Lommel	d8e648c352	Fix T78776: Cycles OpenCL error after recent changes for holdouts	2020-07-10 17:10:05 +02:00
Brecht Van Lommel	6435acd8f6	Cycles: support shader transparency for holdout objects Now transparent areas of the object will render objects behind. Fixes T78728.	2020-07-08 13:18:19 +02:00
Brecht Van Lommel	9f1f7ba2bb	Fix T76925: more Cycles OpenCL compile errors with some drivers on Linux	2020-05-25 17:06:10 +02:00
Brecht Van Lommel	1162ba206d	Cycles: change volume step size controls, auto adjust based on voxel size By default it will now set the step size to the voxel size for smoke and volume objects, and 1/10th the bounding box for procedural volume shaders. New settings are: * Scene render/preview step rate: to globally adjust detail and performance * Material step rate: multiplied with auto detected per-object step size * World step size: distance to steo for world shader Differential Revision: https://developer.blender.org/D1777	2020-03-18 11:23:05 +01:00
Brecht Van Lommel	c8ac760c59	Cleanup: tweak Cycles #includes in preparation for clang-format sorting	2020-03-06 14:44:42 +01:00
Stefan Werner	51e898324d	Adaptive Sampling for Cycles. This feature takes some inspiration from "RenderMan: An Advanced Path Tracing Architecture for Movie Rendering" and "A Hierarchical Automatic Stopping Condition for Monte Carlo Global Illumination" The basic principle is as follows: While samples are being added to a pixel, the adaptive sampler writes half of the samples to a separate buffer. This gives it two separate estimates of the same pixel, and by comparing their difference it estimates convergence. Once convergence drops below a given threshold, the pixel is considered done. When a pixel has not converged yet and needs more samples than the minimum, its immediate neighbors are also set to take more samples. This is done in order to more reliably detect sharp features such as caustics. A 3x3 box filter that is run periodically over the tile buffer is used for that purpose. After a tile has finished rendering, the values of all passes are scaled as if they were rendered with the full number of samples. This way, any code operating on these buffers, for example the denoiser, does not need to be changed for per-pixel sample counts. Reviewed By: brecht, #cycles Differential Revision: https://developer.blender.org/D4686	2020-03-05 12:21:38 +01:00
Lukas Stockner	3437c9c3bf	Cycles: perform clamping per light contribution instead of whole path With upcoming light group passes, for them to sum up correctly to the combined pass the clamping must be more fine grained. This also has the advantage that if one light is particularly noisy, it does not diminish the contribution from other lights which do not need as much clamping. Clamp values on existing scenes will need to be tweaked to get similar results, there is no automatic conversion possible which would give the same results as before. Implemented by Lukas, with tweaks by Brecht. Part of D4837	2019-12-12 13:04:43 +01:00
Lukas Stockner	e760972221	Cycles: support for custom shader AOVs Custom render passes are added in the Shader AOVs panel in the view layer settings, with a name and data type. In shader nodes, an AOV Output node is then used to output either a value or color to the pass. Arbitrary names can be used for these passes, as long as they don't conflict with built-in passes that are enabled. The AOV Output node can be used in both material and world shader nodes. Implemented by Lukas, with tweaks by Brecht. Differential Revision: https://developer.blender.org/D4837	2019-12-10 20:44:46 +01:00
Brecht Van Lommel	ad84f22628	Fix T70602: error baking with Cycles OpenCL after recent changes	2019-10-07 16:53:46 +02:00
Patrick Mours	53932f1f06	Cycles: add Optix support in the kernel This adds all the kernel side changes for the Optix backend. Ref D5363	2019-09-13 11:46:22 +02:00
Brecht Van Lommel	d133934ea4	Cycles: code to optionally zero initialize some structs in the kernel This will be used by Optix to help the compiler figure out scoping. It is not used by other devices currently, but worth testing if it helps there too. Ref D5363	2019-08-26 16:07:01 +02:00
Patrick Mours	fd52dc58dd	Cycles: GPU code generation optimizations for direct lighting Use a single loop to iterate over all lights, reducing divergence and amount of code to generate. Moving ray intersection calls out of conditionals will also help the Optix compiler. Ref D5363	2019-08-26 10:26:53 +02:00
Patrick Mours	db257e679a	Cycles: remove workaround to pass ray by value CUDA is working correct without it now, and it's more efficient not to do this. Ref D5363	2019-08-26 10:26:53 +02:00
Patrick Mours	edbb755dfe	Cycles: tweaks for better GPU code generation Uninitialized variables are harder to handle for the compiler. Ref D5363	2019-08-26 10:26:53 +02:00
Patrick Mours	b05e7ea719	Cycles: fixes for building kernel without certain features Ref D5363	2019-08-26 10:10:35 +02:00
Campbell Barton	c47d669f24	Cleanup: comments (long lines) in cycles	2019-05-01 21:41:07 +10:00
Brecht Van Lommel	7a92b8820b	Cycles: remove hair minimum width support. This never really worked as it was supposed to. The main goal of this is to turn noise from sampling tiny hairs into multiple layers of transparency that do not need to be sampled stochastically. However the implementation of this worked by randomly discarding hair intersections in BVH traversal, which defeats the purpose. If it ever comes back, it's best implemented outside the kernel as a preprocess that changes hair radius before BVH building. This would also make it work with Embree, where it's not supported now. But it's not so clear anymore that with many AA samples and GPU rendering this feature is as helpful as it once was for CPU raytracers with few AA samples. The benefit of removing this feature is improved hair ray tracing performance, tested on NVIDIA Titan Xp: bmw27: +0.37% classroom: +0.26% fishy_cat: -7.36% koro: -12.98% pabellon: -0.12% Differential Revision: https://developer.blender.org/D4532	2019-04-24 14:39:47 +02:00
Campbell Barton	e12c08e8d1	ClangFormat: apply to source, most of intern Apply clang format as proposed in T53211. For details on usage and instructions for migrating branches without conflicts, see: https://wiki.blender.org/wiki/Tools/ClangFormat	2019-04-17 06:21:24 +02:00
Lukas Stockner	7fa6f72084	Cycles: Add sample-based runtime profiler that measures time spent in various parts of the CPU kernel This commit adds a sample-based profiler that runs during CPU rendering and collects statistics on time spent in different parts of the kernel (ray intersection, shader evaluation etc.) as well as time spent per material and object. The results are currently not exposed in the user interface or per Python yet, to see the stats on the console pass the "--cycles-print-stats" argument to Cycles (e.g. "./blender -- --cycles-print-stats"). Unfortunately, there is no clear way to extend this functionality to CUDA or OpenCL, so it is CPU-only for now. Reviewers: brecht, sergey, swerner Reviewed By: brecht, swerner Differential Revision: https://developer.blender.org/D3892	2018-11-29 02:45:24 +01:00
Sergey Sharybin	cb4b5e12ab	Cycles: Cleanup, spacing after preprocessor It is supposed to be two spaces before comment stating which if else/endif statements corresponds to. Was mainly violated in the header guards.	2018-11-09 11:34:54 +01:00
Campbell Barton	1daa20ad9f	Cleanup: strip trailing space for cycles	2018-07-06 10:17:58 +02:00
Lukas Stockner	799779d432	Cycles: change Ambient Occlusion shader to output colors. This means the shader can now be used for procedural texturing. New settings on the node are Samples, Inside, Local Only and Distance. Original patch by Lukas with further changes by Brecht. Differential Revision: https://developer.blender.org/D3479	2018-06-15 22:16:06 +02:00
Brecht Van Lommel	7f86afec9d	Cycles: don't count volume boundaries as transparent bounces. This is more important now that we will have tigther volume bounds that we hit multiple times. It also avoids some noise due to RR previously affecting these surfaces, which shouldn't have been the case and should eventually be fixed for transparent BSDFs as well. For non-volume scenes I found no performance impact on NVIDIA or AMD. For volume scenes the noise decrease and fixed artifacts are worth the little extra render time, when there is any.	2018-03-01 01:21:29 +01:00
Brecht Van Lommel	2d81758aa6	Cycles: better path termination for transparency. We now continue transparent paths after diffuse/glossy/transmission/volume bounces are exceeded. This avoids unexpected boundaries in volumes with transparent boundaries. It is also required for MIS to work correctly with transparent surfaces, as we also continue through these in shadow rays. The main visible changes is that volumes will now be lit by the background even at volume bounces 0, same as surfaces. Fixes T53914 and T54103.	2018-02-22 00:55:32 +01:00
Brecht Van Lommel	606bc5f301	Fix T54105: random walk SSS missing in branched indirect paths. Unify the path and branched path indirect SSS code. No performance impact found on CUDA, for AMD split kernel the extra code was already there.	2018-02-21 17:56:26 +01:00
Brecht Van Lommel	0df9b2c715	Cycles: random walk subsurface scattering. It is basically brute force volume scattering within the mesh, but part of the SSS code for faster performance. The main difference with actual volume scattering is that we assume the boundaries are diffuse and that all lighting is coming through this boundary from outside the volume. This gives much more accurate results for thin features and low density. Some challenges remain however: * Significantly more noisy than BSSRDF. Adding Dwivedi sampling may help here, but it's unclear still how much it helps in real world cases. * Due to this being a volumetric method, geometry like eyes or mouth can darken the skin on the outside. We may be able to reduce this effect, or users can compensate for it by reducing the scattering radius in such areas. * Sharp corners are quite bright. This matches actual volume rendering and results in some other renderers, but maybe not so much real world objects. Differential Revision: https://developer.blender.org/D3054	2018-02-09 19:58:33 +01:00
Brecht Van Lommel	aabafece03	Code refactor: tweaks in SSS code to prepare for coming changes. This also fixes a subtle bug in the split kernel branched path SSS, the volume stack update can't be shared between multiple hit points.	2018-02-08 16:56:11 +01:00
Lukas Stockner	322f0223d0	Cycles: option to make background visible through glass transparent. This can be enabled in the Film panel, with an option to control the transmisison roughness below which glass becomes transparent. Differential Revision: https://developer.blender.org/D2904	2018-01-12 01:34:28 +01:00
Mathieu Menuet	83e80db56e	Fix T53349: AO bounces not working correct with OpenCL.	2017-11-26 15:53:00 +01:00
Lukas Stockner	f78e963858	Cycles: Refactor PassType from bitflag to index in order to allow for more passes	2017-11-17 16:34:19 +01:00
Mai Lavelle	087331c495	Cycles: Replace __MAX_CLOSURE__ build option with runtime integrator variable Goal is to reduce OpenCL kernel recompilations. Currently viewport renders are still set to use 64 closures as this seems to be faster and we don't want to cause a performance regression there. Needs to be investigated. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2775	2017-11-09 01:04:06 -05:00
Brecht Van Lommel	8a72be7697	Cycles: reduce closure memory usage for emission/shadow shader data. With a Titan Xp, reduces path trace local memory from 1092MB to 840MB. Benchmark performance was within 1% with both RX 480 and Titan Xp. Original patch was implemented by Sergey. Differential Revision: https://developer.blender.org/D2249	2017-11-05 20:48:33 +01:00
Brecht Van Lommel	2c02a04c46	Code refactor: remove emission and background closures, sum directly.	2017-11-05 18:13:44 +01:00
Brecht Van Lommel	171c4e982f	Cycles: use AO factor to let user adjust intensity of AO bounces. We are already using the AO distance, so might as well offer this extra control over the intensity. Useful when an interior scene is supposed to be significantly darker than the background shader.	2017-10-25 21:46:23 +02:00
Brecht Van Lommel	cdb0b3b1dc	Code refactor: use DeviceInfo to enable QBVH and decoupled volume shading.	2017-10-08 13:17:33 +02:00
Brecht Van Lommel	5bb677e592	Code refactor: zero render buffers outside of kernel. This was originally done with the first sample in the kernel for better performance, but it doesn't work anymore with atomics. Any benefit was very minor anyway, too small to measure it seems.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	e3e16cecc4	Code refactor: remove rng_state buffer and compute hash on the fly. A little faster on some benchmark scenes, a little slower on others, seems about performance neutral on average and saves a little memory.	2017-10-04 21:11:14 +02:00
Brecht Van Lommel	400e6f37b8	Cycles: reduce subsurface stack memory usage. This is done by storing only a subset of PathRadiance, and by storing direct light immediately in the main PathRadiance. Saves about 10% of CUDA stack memory, and simplifies subsurface indirect ray code.	2017-09-28 15:18:43 +02:00
Brecht Van Lommel	90d4b823d7	Cycles: use defensive sampling for picking BSDFs and BSSRDFs. For the first bounce we now give each BSDF or BSSRDF a minimum sample weight, which helps reduce noise for a typical case where you have a glossy BSDF with a small weight due to Fresnel, but not necessarily small contribution relative to a diffuse or transmission BSDF below. We can probably find a better heuristic that also enables this on further bounces, for example when looking through a perfect mirror, but I wasn't able to find a robust one so far.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	095a01a73a	Cycles: slightly improve BSDF sample stratification for path tracing. Similar to what we did for area lights previously, this should help preserve stratification when using multiple BSDFs in theory. Improvements are not easily noticeable in practice though, because the number of BSDFs is usually low. Still nice to eliminate one sampling dimension.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	b3afc8917c	Code cleanup: refactor BSSRDF closure sampling, for next commit.	2017-09-20 19:38:08 +02:00
Brecht Van Lommel	d750d182e5	Code cleanup: remove hack to avoid seeing transparent objects in noise. Previously the Sobol pattern suffered from some correlation issues that made the outline of objects like a smoke domain visible. This helps simplify the code and also makes some other optimizations possible.	2017-09-20 19:38:08 +02:00
Carlo Andreacchio	ab9079f459	Fix Cycles adaptive compile without volumes broken after recent changes. Differential Revision: https://developer.blender.org/D2847	2017-09-18 12:52:32 +02:00
Brecht Van Lommel	32449e1b21	Code cleanup: store branch factor in PathState.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	37d9e65ddf	Code cleanup: abstract shadow catcher logic more into accumulation code.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	f77cdd1d59	Code cleanup: deduplicate some branched and split kernel code. Benchmarks peformance on GTX 1080 and RX 480 on Linux is the same for bmw27, classroom, pabellon, and about 2% faster on fishy_cat and koro.	2017-09-13 15:24:14 +02:00
Brecht Van Lommel	c4c450045d	Code cleanup: tweak inlining for 2% better CUDA performance with hair.	2017-09-13 15:24:14 +02:00
Mathieu Menuet	659ba012b0	Cycles: change AO bounces approximation to do more glossy and transmission. Rather than treating all ray types equally, we now always render 1 glossy bounce and unlimited transmission bounces. This makes it possible to get good looking results with low AO bounces settings, making it useful to speed up interior renders for example. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2818	2017-09-12 15:37:35 +02:00

1 2 3 4 5 ...

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

267 Commits