blender-archive

Archived

Author	SHA1	Message	Date
Mai Lavelle	f2a2d5492b	Cycles: Fix building of OpenCL kernels after volume optimization commit. OpenCL is C based, so no support for operators. Related commit: `7377d411b4`	2018-03-02 04:53:13 -05:00
Kévin Dietrich	7377d411b4	Cycles volume: fast empty space optimization by generating a tight mesh around the volume. We generate a tight mesh around the active voxels of the volume in order to effectively skip empty space, and start volume ray marching as close to interesting volume data as possible. See code comments for details on how the mesh generation algorithm works. This gives up to 2x speedups in some scenes. Reviewed by: brecht, dingto Reviewers: #cycles Subscribers: lvxejay, jtheninja, brecht Differential Revision: https://developer.blender.org/D3038	2018-03-01 11:54:01 +01:00
Brecht Van Lommel	8cc7f48581	Cycles: principled absorption color now has more effect at lower values.	2018-02-28 20:11:53 +01:00
Thomas Dinges	9e717c0495	Cycles: Remove Fermi texture code. This should be the last Fermi removal commit, unless I missed something. It's been a pleasure Fermi!	2018-02-17 22:56:58 +01:00
Thomas Dinges	e1ef902058	Cycles: Remove Fermi related defines from the code. Did not touch Texture related defines, that comes next.	2018-02-17 22:19:54 +01:00
Brecht Van Lommel	f6107af4cf	Cycles: change Index output of Hair and Particle Info to Random, in 0..1 range. These are used for randomization, so it's convenient if the index is already hashed and consistent with the Object Info node.	2018-02-14 14:55:46 +01:00
Brecht Van Lommel	0df9b2c715	Cycles: random walk subsurface scattering. It is basically brute force volume scattering within the mesh, but part of the SSS code for faster performance. The main difference with actual volume scattering is that we assume the boundaries are diffuse and that all lighting is coming through this boundary from outside the volume. This gives much more accurate results for thin features and low density. Some challenges remain however: * Significantly more noisy than BSSRDF. Adding Dwivedi sampling may help here, but it's unclear still how much it helps in real world cases. * Due to this being a volumetric method, geometry like eyes or mouth can darken the skin on the outside. We may be able to reduce this effect, or users can compensate for it by reducing the scattering radius in such areas. * Sharp corners are quite bright. This matches actual volume rendering and results in some other renderers, but maybe not so much real world objects. Differential Revision: https://developer.blender.org/D3054	2018-02-09 19:58:33 +01:00
Ray molenkamp	36c1122b96	msvc: Use source folder structure for project file. This patch changes the huge list of projects in visual studio into a nice tree matching the source folder structure. see D2823 for details. Differential Revision: http://developer.blender.org/D2823	2018-02-03 16:38:27 -07:00
Sergey Sharybin	ff54dbd8fa	Cycles: Attempt to fix 32 bit linux compilation	2018-02-01 15:13:54 +01:00
Sergey Sharybin	7bd86d74ba	Cycles: Fix for non-vectorized version of bitscan(). It was doing the bit search in the opposite direction compared to the vectorized version.	2018-02-01 15:11:17 +01:00
Brecht Van Lommel	1eeb846e78	Fix Cycles viewport render not updating when tweaking displacement shader. This was disabled to avoid updating the geometry every time when the material includes displacement, because there was no way to distinguish between surface shader and displacement updates. As a solution, we now compute an MD5 hash of the nodes linked to the displacement socket, and only update the mesh if that changes. Differential Revision: https://developer.blender.org/D3018	2018-01-29 17:07:08 +01:00
Brecht Van Lommel	848f0c5b5b	Code cleanup: simpler and faster detection of BVH refit.	2018-01-26 08:41:19 +01:00
Sergey Sharybin	2f79d1c058	Cycles: Replace use_qbvh boolean flag with an enum-based property. This way we can introduce other types of BVH, for example, wider ones, without causing too much mess around boolean flags. Thoughts: - Ideally device info should probably return a bitflag of what BVH types it supports. It is possible to implement based on simple logic in device/ and mesh.cpp, rest of the changes will stay the same. - Not happy with workarounds in util_debug and duplicated enum in kernel. Maybe the enum should be stored in kernel, but then it's kind of weird to include kernel types from utils. Sounds like a cyclic dependency. Reviewers: brecht, maxim_d33 Reviewed By: brecht Differential Revision: https://developer.blender.org/D3011	2018-01-22 17:19:20 +01:00
Sergey Sharybin	fa91b43e8c	Cycles: Make the check on vectorization flags from DebugFlags more proper. Mimics the checks in system_cpu_support().	2018-01-19 15:48:42 +01:00
Sergey Sharybin	ccec1e7667	Cycles: Cleanup, stop using debug flags in system utilities. Debug flags are meant to control render behavior and have nothing to do with low level system utilities. It was simple to hack, but logically it is wrong. Let's do things where they are supposed to be done!	2018-01-19 15:22:32 +01:00
Sergey Sharybin	8e1dd7ed81	Cycles: Remove unneeded include statements. Also try to move them from headers to implementation files as much as possible.	2018-01-19 15:19:45 +01:00
Brecht Van Lommel	0fe41009f0	Fix T53830: Cycles OpenCL debug assert on macOS. This was probably harmless besides some unnecessary memory usage due to aligning allocations too much.	2018-01-19 11:35:07 +01:00
Stefan Werner	25b794a39d	Cycles: support animated object scale in motion blur. This was disabled previously due to CUDA compiler bugs, see T32900. Differential Revision: https://developer.blender.org/D2937	2018-01-11 02:58:29 +01:00
Brecht Van Lommel	c621832d3d	Cycles: CUDA support for rendering scenes that don't fit on GPU. In that case it can now fall back to CPU memory, at the cost of reduced performance. For scenes that fit in GPU memory, this commit should not cause any noticeable slowdowns. We don't use all physical system RAM, since that can cause OS instability. We leave at least half of system RAM or 4GB to other software, whichever is smaller. For image textures in host memory, performance was maybe 20-30% slower in our tests (although this is highly hardware and scene dependent). Once other types of data don't fit on the GPU, performance can be e.g. 10x slower, and at that point it's probably better to just render on the CPU. Differential Revision: https://developer.blender.org/D2056	2018-01-02 23:50:18 +01:00
Lukas Stockner	fa3d50af95	Cycles: Improve denoising speed on GPUs with small tile sizes. Previously, the NLM kernels would be launched once per offset with one thread per pixel. However, with the smaller tile sizes that are now feasible, there wasn't enough work to fully occupy GPUs, which resulted in a significant slowdown. Therefore, the kernels are now launched in a single call that handles all offsets at once. This has two downsides: Memory accesses to accumulating buffers are now atomic, and more importantly, the temporary memory now has to be allocated for every shift at once, increasing the required memory. On the other hand, of course, the smaller tiles significantly reduce the size of the memory. The main bottleneck right now is the construction of the transformation - there is nothing to be parallelized there, one thread per pixel is the maximum. I tried to parallelize the SVD implementation by storing the matrix in shared memory and launching one block per pixel, but that wasn't really going anywhere. To make the new code somewhat readable, the handling of rectangular regions was cleaned up a bit and commented, it should be easier to understand what's going on now. Also, some variables have been renamed to make the difference between buffer width and stride more apparent, in addition to some general style cleanup.	2017-11-30 07:37:08 +01:00
Maxym Dmytrychenko	7e349f2745	Cycles: improve triangle intersection performance. Reduces render time by about 1-2% in benchmark scenes. Differential Revision: https://developer.blender.org/D2911	2017-11-29 18:11:40 +01:00
Lukas Stockner	d8066fb0f1	Cycles: Refactor closure roughness detection to fix a potential bug with Denoising of specular shaders	2017-11-14 04:17:54 +01:00
Sergey Sharybin	d1a761c4d4	Cycles: Fix compilation error of standalone application	2017-11-13 10:49:05 +01:00
Sergey Sharybin	42dff6cc2e	Cycles: Fix compilation error with OIIO compiled against system PugiXML	2017-11-13 10:42:29 +01:00
Sergey Sharybin	db7a78a2be	Cycles: Fix compilation error with latest OIIO. There were some changes to namespaces, which cause ambiguities. Replaces using namespace with the explicit symbols we need. It is a good idea to NOT pull in the whole namespace anyway!	2017-11-10 10:04:33 +01:00
Sergey Sharybin	46963f359d	Cycles: Bump version number to 1.9.0. This matches Blender Release 2.79.	2017-10-31 13:34:34 +01:00
Brecht Van Lommel	070a668d04	Code refactor: move more memory allocation logic into device API. * Remove tex_* and pixels_* functions, replace by mem_. Add MEM_TEXTURE and MEM_PIXELS as memory types recognized by devices. * No longer create device_memory and call mem_* directly, always go through device_only_memory, device_vector and device_pixels.	2017-10-24 01:25:19 +02:00
Brecht Van Lommel	57a0cb797d	Code refactor: avoid some unnecessary device memory copying.	2017-10-21 20:58:28 +02:00
Brecht Van Lommel	f61c340bc1	Cycles: OpenCL bicubic and tricubic texture interpolation support.	2017-10-08 02:55:44 +02:00
Brecht Van Lommel	23098cda99	Code refactor: make texture code more consistent between devices. * Use common TextureInfo struct for all devices, except CUDA Fermi. * Move image sampling code to kernels/*/kernel_*_image.h files. * Use arrays for data textures on Fermi too, so device_vector<Struct> works.	2017-10-07 14:53:14 +02:00
Brecht Van Lommel	4537e85584	Fix T53001: more workarounds for crash in AMD compiler with recent drivers.	2017-10-05 17:57:58 +02:00
Brecht Van Lommel	18a353dd24	Fix T52368: Cycles OSL trace() failing on Windows 32 bit.	2017-09-20 19:38:08 +02:00
Sergey Sharybin	885c0a5f90	Cycles: Fix compilation warning	2017-09-04 13:28:15 +02:00
Brecht Van Lommel	1457e5ea73	Fix Cycles Windows render errors with BVH2 CPU rendering. One problem is that it was always using __mm_blendv_ps emulation even if the instruction was supported. The other is that the emulation function was wrong. Thanks a lot to Ray Molenkamp for tracking this one down.	2017-08-29 22:55:35 +02:00
Sergey Sharybin	90299e4216	Cycles: Add utility function to query current value of scoped timer	2017-08-25 14:27:34 +02:00
Sergey Sharybin	436d1b4e90	Cycles: Fix issue with -0 being considered a non-finite value	2017-08-24 14:32:56 +02:00
Mai Lavelle	2540741dee	Fix implementation of atomic update max and move to a central location. While unlikely to have had any serious effects because of limited use, the previous implementation was not actually atomic due to a data race and an incorrectly coded CAS loop. We also had duplicates of this code in a few places; it has now been moved to a single location with all other atomic operations.	2017-08-23 06:54:25 -04:00
Brecht Van Lommel	296d74c4b1	Cycles: reorganize Performance panel layout, move viewport BVH type to debug.	2017-08-21 19:05:17 +02:00
Brecht Van Lommel	4d428d14af	Fix T52443: Cycles OpenCL build error after recent mesh lights changes.	2017-08-19 01:02:55 +02:00
Brecht Van Lommel	6919393a51	Fix T52372: CUDA build error after recent changes.	2017-08-12 20:37:06 +02:00
Brecht Van Lommel	d7639d57dc	Fix T52368: OSL trace() crash after recent changes.	2017-08-12 14:32:52 +02:00
Brecht Van Lommel	267e75158a	Fix T52322: denoiser broken on Windows after recent changes. It's not clear why this only happened on Windows, but the code was wrong and should do a bitcast here instead of conversion.	2017-08-11 01:09:35 +02:00
Sergey Sharybin	fd397a7d28	Cycles: Add utility macro ccl_ref. It is defined to & for CPU side compilation, and defined to empty for any GPU platform. The idea here is to use this macro instead of an #ifdef block with a bunch of duplicated lines just to make the CPU code efficient. Eventually we might switch to references on CUDA as well, but that would require some intensive testing.	2017-08-08 15:27:25 +02:00
Brecht Van Lommel	dc4d850d10	Fix Windows build errors with recent Cycles SIMD refactoring.	2017-08-07 17:54:26 +02:00
Sergey Sharybin	580741b317	Cycles: Cleanup, space after keyword	2017-08-07 14:47:51 +02:00
Brecht Van Lommel	ee77c1e917	Code refactor: use float4 instead of intrinsics for CPU denoise filtering. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a24fbf3323	Code refactor: add, remove, optimize various SSE functions. * Remove some unnecessary SSE emulation defines. * Use full precision float division so we can enable it. * Add sqrt(), sqr(), fabs(), shuffle variations, mask(). * Optimize reduce_add(), select(). Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Brecht Van Lommel	a8cc0d707e	Code refactor: split defines into separate header, changes to SSE type headers. I need to use some macros defined in util_simd.h for float3/float4, to emulate SSE4 instructions on SSE2. But due to issues with order of header includes this was not possible, this does some refactoring to make it work. Differential Revision: https://developer.blender.org/D2764	2017-08-07 14:01:24 +02:00
Sergey Sharybin	0d01cf4488	Cycles: Extra tweaks to performance of header expansion. Two main things here: 1. Replace all characters that are unsafe for the #line directive in a single loop, avoiding multiple iterations and multiple temporary strings. 2. Don't merge tokens char by char, but calculate the start and end points and then copy the whole substring at once. This gives about 15% speedup of source processing time. At this point (with all previous commits from today) we've shrunk the compiled sources size from 108 MB down to ~5.5 MB and lowered processing time from 4.5 sec down to 0.047 sec on my laptop running Linux (this was a constant time which Blender would always spend the first time loading a kernel, even if we've got a compiled clbin).	2017-08-03 08:07:06 +02:00
Sergey Sharybin	f879cac032	Cycles: Avoid some expensive operations in header expansions. Basically gather lines as-is during traversal, avoiding allocating memory for all the lines in headers. Brings an additional performance improvement of about 20%.	2017-08-02 20:59:19 +02:00
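
Note on commit 2540741dee: the message mentions a data race and an incorrectly coded CAS loop in the old atomic update max. The sketch below is not the Cycles code; it only illustrates what a correct compare-and-swap loop for this operation generally looks like, using standard C++ `std::atomic`. The function name `atomic_update_max_sketch` and the choice of `unsigned int` are assumptions made for illustration.

```cpp
#include <atomic>

// Hypothetical helper (not the Cycles function): atomically raise `target`
// to at least `value`. Another thread may change `target` between our read
// and our write, so the write only happens if `target` still holds the
// value we last observed.
inline void atomic_update_max_sketch(std::atomic<unsigned int> &target, unsigned int value)
{
    unsigned int observed = target.load(std::memory_order_relaxed);
    // On failure, compare_exchange_weak stores the current contents of
    // `target` into `observed`, so every retry compares against fresh data.
    while (observed < value &&
           !target.compare_exchange_weak(observed, value, std::memory_order_relaxed)) {
        // Retry until the stored value is already >= value or our swap succeeds.
    }
}
```

A plain "read, compare, then write" sequence without the exchange step can lose a larger value written by another thread in between, which is the kind of race the commit message describes.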

642 Commits