blender-archive

Archived

Author	SHA1	Message	Date
Sergey Sharybin	94c919349b	Cycles: Cleanup file headers Some of the files were wrongly attributing code to some other organizations and in few places proper attribution was missing. This is mainly either a copy-paste error (when new file was created from an existing one and header wasn't updated) or due to some refactor which split non-original-BF code with purely BF code. Should solve some confusion around.	2016-09-29 10:11:40 +02:00
Lukas Stockner	aae2cea28d	Cycles: Also support the constant emission speedup for mesh lights Reviewers: brecht, sergey, dingto, juicyfruit Differential Revision: https://developer.blender.org/D2220	2016-09-14 18:53:35 +02:00
Mai Lavelle	013b46d6bd	Cycles: Replace object index hack with actual checks for SD_TRANSFORM_APPLIED Using ones complement for detecting if transform has been applied was confusing and led to several bugs. With this proper checks are made. Also added a few transforms where they were missing, mostly affecting baking and displacement when `P` is used in the shader (previously `P` was in the wrong space for these shaders) Also removed `TIME_INVALID` as this may have resulted in incorrect transforms in some cases. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2192	2016-09-11 13:49:05 -04:00
Brecht Van Lommel	e76e8fcdcc	Fix a few OpenCL compiler warnings.	2016-09-03 23:06:12 +02:00
Mai Lavelle	8cac980a28	Cycles: Fix regression where smoke wouldn't show in renders	2016-08-17 10:43:13 -04:00
Mai Lavelle	76b6c77f2c	Cycles microdisplacement: Allow kernels to be built without patch evaluation Kernels can now be built without patch evaluation when not needed by the scene (Catmull-Clark subdivision not in use), giving a performance boost for some devices.	2016-08-15 11:13:18 -04:00
Mai Lavelle	0b68c68006	Cycles microdisplacement: Support for Catmull-Clark subdivision via OpenSubdiv Enables Catmull-Clark subdivision meshes with support for creases and attribute subdivision. Still waiting on OpenSubdiv to fully support face varying interpolation for subdividing uv coordinates tho. Also there may be some inconsistencies with Blender's subdivision which will be resolved at a later time. Code for reading patch tables and creating patch maps is borrowed from OpenSubdiv. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2111	2016-08-07 11:13:11 -04:00
Mai Lavelle	cd809b95d8	Cycles: Add AttributeDescriptor Adds a descriptor for attributes that can easily be passed around and extended to contain more data. Will be used for attributes on subdivision meshes. Reviewed By: brecht Differential Revision: https://developer.blender.org/D2110	2016-08-05 23:49:21 -04:00
Mai Lavelle	41a4967b30	Fix T49003: Cycles volumes have wrong results after recent microdisp commits Problem was that sd->prim can be -1 for volumes and was causing check in subd code to access out of bounds	2016-08-02 15:28:07 -04:00
Sergey Sharybin	6353ecb996	Cycles: Tweaks to support CUDA 8 toolkit All the changes are mainly giving explicit tips on inlining functions, so they match how inlining worked with previous toolkit. This make kernel compiled by CUDA 8 render in average with same speed as previous kernels. Some scenes are somewhat faster, some of them are somewhat slower. But slowdown is within 1% so far. On a positive side it allows us to enable newer generation cards on buildbots (so GTX 10x0 will be officially supported soon).	2016-08-01 15:54:29 +02:00
Campbell Barton	710ab5be36	Cleanup: spelling, style	2016-07-31 17:41:05 +10:00
Mai Lavelle	c96ae81160	Cycles microdisplacement: ngons and attributes for subdivision meshes This adds support for ngons and attributes on subdivision meshes. Ngons are needed for proper attribute interpolation as well as correct Catmull-Clark subdivision. Several changes are made to achieve this: - new primitive `SubdFace` added to `Mesh` - 3 more textures are used to store info on patches from subd meshes - Blender export uses loop interface instead of tessface for subd meshes - `Attribute` class is updated with a simplified way to pass primitive counts around and to support ngons. - extra points for ngons are generated for O(1) attribute interpolation - curves are temporally disabled on subd meshes to avoid various bugs with implementation - old unneeded code is removed from `subd/` - various fixes and improvements Reviewed By: brecht Differential Revision: https://developer.blender.org/D2108	2016-07-29 03:36:30 -04:00
Lukas Stockner	d9cc3ea2c6	Cycles: Fix rays parallel to the surface in the triangle refine and MultiGGX code In the triangle intersection refinement code, rays that are parallel to the triangle caused a divide by zero. These rays might initially hit the triangle due to the watertight intersection test, but are very rare - therefore, just skipping the refinement for them works fine. Also, a few remaining issues in the MultiGGX code are fixed that were caused by rays parallel to the surface (which happened more often there due to smooth shading).	2016-07-25 16:14:25 +02:00
Sergey Sharybin	e7721f5ec8	Cycles: Fix SSS with spatial splits and motion blur	2016-07-25 13:55:03 +02:00
Brecht Van Lommel	f23fecf306	Fix use of uninitialized variable in recent SSS fix.	2016-07-24 16:40:30 +02:00
Sergey Sharybin	9946cca146	Fix T48860: Cycles SSS artifacts with spatially split BVH The issue was caused by SSS intersection code gathering all intersections without check for duplicated ones. This caused situations when same intersection will be recorded twice in the case if triangle is shared by several BVH nodes. Usually this is handled by checking intersection distance after sorting intersections (in shadow_blocked for example) but for SSS we don't do such sorting and using number of intersections to calculate various things. Didn't find anything smarter than to check intersection distance in triangle_intersect_subsurface(). This solves render artifacts in the cost of 1.5% slowdown of extreme case rendering (SSS object filling in whole FullHD screen). Reviewers: brecht Reviewed By: brecht Differential Revision: https://developer.blender.org/D2105	2016-07-18 10:04:20 +02:00
Sergey Sharybin	4355603790	Cycles: Move BVK kernel files to own directory BVH traversal is not really that much a geometry and we've got quite some traversals now. Makes sense to keep them separate in the name of source structure clarity.	2016-07-11 13:58:47 +02:00
Sergey Sharybin	a62967787c	Fix T48808: Regression: Cycles OpenCL broken after Hair BVH commit	2016-07-08 09:41:36 +02:00
Sergey Sharybin	a08e2179f1	Cycles: Implement unaligned nodes BVH traversal This commit implements traversal of unaligned BVH nodes. QBVH traversal is fully SIMD optimized and calculates orientation for all 4 children at a time, regular BVH might probably be optimized a bit more.	2016-07-07 17:25:48 +02:00
Sergey Sharybin	1a2012145d	Cycles: Switch node address to absolute values in BVH tree This seems to be straightforward way to support heterogeneous nodes in the same tree. There is some penalty related on 4gig limit of the address space now, but here's are the thing: Traversal code was already using ints to store final offset, so there can't be regressions really. This is a required commit to make it possible to encode both aligned and unaligned nodes in the same array. Also, in the future we can use this to get rid of __leaf_nodes array (which is a bit tricky to do since trickery in pack_instances().	2016-07-07 17:25:48 +02:00
Sergey Sharybin	17e7454263	Cycles: Reduce memory usage by de-duplicating triangle storage There are several internal changes for this: First idea is to make __tri_verts to behave similar to __tri_storage, meaning, __tri_verts array now contains all vertices of all triangles instead of just mesh vertices. This saves some lookup when reading triangle coordinates in functions like triangle_normal(). In order to make it efficient needed to store global triangle offset somewhere. So no __tri_vindex.w contains a global triangle index which can be used to read triangle vertices. Additionally, the order of vertices in that array is aligned with primitives from BVH. This is needed to keep cache as much coherent as possible for BVH traversal. This causes some extra tricks needed to fill the array in and deal with True Displacement but those trickery is fully required to prevent noticeable slowdown. Next idea was to use this __tri_verts instead of __tri_storage in intersection code. Unfortunately, this is quite tricky to do without noticeable speed loss. Mainly this loss is caused by extra lookup happening to access vertex coordinate. Fortunately, tricks here and there (i,e, some types changes to avoid casts which are not really coming for free) reduces those losses to an acceptable level. So now they are within couple of percent only, On a positive site we've achieved: - Few percent of memory save with triangle-only scenes. Actual save in this case is close to size of all vertices. On a more fine-subdivided scenes this benefit might become more obvious. - Huge memory save of hairy scenes. For example, on koro.blend there is about 20% memory save. Similar figure for bunny.blend. This memory save was the main goal of this commit to move forward with Hair BVH which required more memory per BVH node. So while this sounds exciting, this memory optimization will become invisible by upcoming Hair BVH work. But again on a positive side, we can add an option to NOT use Hair BVH and then we'll have same-ish render times as we've got currently but will have this 20% memory benefit on hairy scenes.	2016-07-07 17:25:48 +02:00
Sergey Sharybin	1eacbf47e3	Cycles: Support visibility check for inner nodes of QBVH It was initially unsupported because initial idea of checking visibility of all children was slowing scenes down a lot. Now the idea has changed and we only perform visibility check of current node. This avoids huge slowdown (from tests here it seems to be withing 1-2%, but more tests would never hurt) and gives nice speedup of ray traversal for complex scenes which utilized ray visibility. Here's timing of koro.blend: Without visibility check With visibility check Original file 4min 20sec 4min 23sec Camera rays only 1min 43 sec 55sec Unfortunately, this doesn't come for free and requires extra data in BVH node, which increases memory usage of BVH nodes by 15%. This we can solve with some future trickery of avoiding __tri_storage created for curve segments.	2016-07-07 17:25:48 +02:00
Brecht Van Lommel	39ae324918	Cycles: remove extended precision hacks, no longer needed with SSE2 requirement. Differential Revision: https://developer.blender.org/D2079	2016-07-04 18:22:11 +02:00
Campbell Barton	9f5621bb4a	Cleanup: comment blocks	2016-07-02 10:08:33 +10:00
Sergey Sharybin	b595a692c8	Cycles: Limit degenerated triangle check got CUDA only OpenCL seems to work fine here, and for some reason that comparison was giving compilation error on OpenCL here. Better to compile OpenCL kernel than to be fully robust to weird corner cases.	2016-06-07 15:48:56 +02:00
Sergey Sharybin	f54a98a1c5	Cycles: Simplify check for degenerated faces on GPU Still not sure how to properly solve the issue, needs some trickery to get actual optimized values from intersection function (using printf() avoids some optimization and makes stuff render correct). For the time being let's just simplify check.	2016-06-03 10:36:04 +02:00
Sergey Sharybin	7dae62cde0	Cycles: Simplify code around debug stats in BVH traversing	2016-05-27 10:55:48 +02:00
Thomas Dinges	c9f1ed1e4c	Cycles: Add support for bindless textures. This adds support for CUDA Texture objects (also known as Bindless textures) for Kepler GPUs (Geforce 6xx and above). This is used for all 2D/3D textures, data still uses arrays as before. User benefits: * No more limits of image textures on Kepler. We had 5 float4 and 145 byte4 slots there before, now we have 1024 float4 and 1024 byte4. This can be extended further if we need to (just change the define). * Single channel textures slots (byte and float) are now supported on Kepler as well (1024 slots for each type). ToDo / Issues: * 3D textures don't work yet, at least don't show up during render. I have no idea whats wrong yet. * Dynamically allocate bindless_mapping array? I hope Fermi still works fine, but that should be tested on a Fermi card before pushing to master. Part of my GSoC 2016. Reviewers: sergey, #cycles, brecht Subscribers: swerner, jtheninja, brecht, sergey Differential Revision: https://developer.blender.org/D1999	2016-05-19 13:14:37 +02:00
Thomas Dinges	16ce1b78b0	Cleanup: Remove outdated comment and add new one about slot IDs.	2016-05-11 22:25:48 +02:00
Thomas Dinges	3807bcb3a8	Cleanup: Rename texture slots to float4 and byte, to distinguish from future float (single channel) and half_float slots. Should be no functional changes, tested CPU and CUDA.	2016-05-06 14:37:35 +02:00
Sergey Sharybin	980f3c3693	Fix T48346: Transparent shadows do not work for instanced objects	2016-05-04 14:46:30 +02:00
Sergey Sharybin	ac00c17900	Cycles: Remove hair support from volume BVH traversal There are couple of reasons: - Volume shader on hair does behave really weird anyway and it's not something considered a bug really. - Volume BVH traversal were only used by camera-in-volume check, which doesn't really make sense to take hair into account since it'll be rendered wrong anyway. Such a removal makes both code easier to extend further (as in, no need to worry about those traversal for hair bvh) and also reduces stress on GPU compilers.	2016-04-11 17:18:14 +02:00
Sergey Sharybin	6cd13a221f	Cycles: Rename tri_woop to tri_storage It's no longer a pre-computed data and just a storage of triangle coordinates which are faster to access to.	2016-04-11 17:18:14 +02:00
Sergey Sharybin	65f279b770	Cycles: Fix wrong camera in volume check when domain is only visible to camera rays	2016-04-04 19:30:38 +02:00
Sergey Sharybin	700722f686	Cycles: Cleanup, indent nested preprocessor directives Quite straightforward, main trick is happening in path_source_replace_includes(). Reviewers: brecht, dingto, lukasstockner97, juicyfruit Differential Revision: https://developer.blender.org/D1794	2016-03-25 13:55:42 +01:00
Sergey Sharybin	0e47e0cc9e	Cycles: Use dedicated BVH for subsurface ray casting This commit makes it so casting subsurface rays will totally ignore all the BVH nodes and primitives which do not belong to a current object, making it much simpler traversal code and reduces number of intersection tests. Reviewers: brecht, juicyfruit, dingto, lukasstockner97 Differential Revision: https://developer.blender.org/D1823	2016-03-25 13:42:13 +01:00
Sergey Sharybin	1c4f21f85e	Cycles: Initial support of 3D textures for CUDA rendering Supports both smoke/fire and point density textures now. Reduces number of textures available for sm_20 and sm_21, but you have to compromise somewhere on such a limited hardware. Currently limited to linear interpolation only, and decoupled ray marching is not supported yet. Think those could be considered just a further improvement. Some quick example: https://developer.blender.org/F282934 Code is minimal and we can fully consider it a fix for missing support of 3D textures with CUDA. Reviewers: lukasstockner97, brecht, juicyfruit, dingto Reviewed By: brecht, juicyfruit, dingto Subscribers: mib2berlin Differential Revision: https://developer.blender.org/D1806	2016-02-15 21:26:29 +01:00
Sergey Sharybin	c93069083e	Cycles: Tweaks for 32bit CUDA binaries Tweak some inline policies. Not totally crazy yet, and in fact we now have one less ifdef statement now.	2016-02-15 19:11:02 +01:00
Sergey Sharybin	3aa74828ab	Cycles: Cleanup, indentation and braces	2016-02-03 15:00:55 +01:00
Sergey Sharybin	72e31d6a72	Cycles: Always inline triangle precalc for CUDA devices Since the SSS changes compiling Experimental sm_52 kernel seems to work just fine.	2016-01-11 21:41:00 +05:00
Campbell Barton	fc9505c9c5	Cleanup: warnings & spelling	2015-12-02 13:15:52 +11:00
Sergey Sharybin	a6bbf05ba6	Cycles: Fix wrong SSS intersection refinement when this option is disabled The code is disabled by default, but we'd better keep it all correct.	2015-12-02 03:14:54 +05:00
Sergey Sharybin	8bca34fe32	Cysles: Avoid having ShaderData on the stack This commit introduces a SSS-oriented intersection structure which is replacing old logic of having separate arrays for just intersections and shader data and encapsulates all the data needed for SSS evaluation. This giver a huge stack memory saving on GPU. In own experiments it gave 25% memory usage reduction on GTX560Ti (722MB vs. 946MB). Unfortunately, this gave some performance loss of 20% which only happens on GPU. This is perhaps due to different memory access pattern. Will be solved in the future, hopefully. Famous saying: won in memory - lost in time (which is also valid in other way around).	2015-11-25 13:01:22 +05:00
Sergey Sharybin	47b1279762	Cycles: Watertight fix for SSS intersection Same as previous commit, just was missing in there.	2015-10-22 22:10:40 +05:00
Sergey Sharybin	f84cbae43e	Cycles: Fix for watertight intersection It was possible to miss some intersection caused by wrong barycentric coordinates sign. Cases when one of the coordinate is zero and other are negative was not handled correct.	2015-10-22 22:07:28 +05:00
Sergey Sharybin	29247a7a05	Cycles: Fix wrong intersection with motion blur and degenerate object transform	2015-10-09 15:58:03 +05:00
Sergey Sharybin	8fa4fccec4	Cycles: Fix intersection issues caused by degenerate instance matrix Issue was caused by wrong intersection distance scaling on instance pop, which could cause intersection distance to become zero, confusing following intersection checks.	2015-10-09 15:58:03 +05:00
Sergey Sharybin	3cee28ebf3	Fix T46143: Faces missing with GPU render Epsilon was quite arbitrary for GPU, replaced with checking for zero-sized faces. It should solve both original report and the new one. After the release we can check why GPU doesn't produce accurate math here and go to the root of the issue.	2015-09-17 17:21:17 +05:00
Sergey Sharybin	1a04179802	Cycles: Cleanup, typo Spotted by Campbell, thanks!	2015-09-09 14:25:43 +05:00
Sergey Sharybin	d13a0e8f4a	Cycles: Limit triangle magnitude check for only GPU Found a way to make AVX2 CPUs happy by reshuffling instructions a bit, so now there's no weird precision errors happening in there. This solves some render speed regressions on CPU, but unfortunately this doesn't help for GPU rendering.	2015-09-09 13:39:36 +05:00

1 2 3 4

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

168 Commits