blender-archive

Archived

Author	SHA1	Message	Date
Sergey Sharybin	77e6f2212f	Cycles: Allow paths customization via environment variables This is for development and test environment setup only, not for regular users usage hence no mentioning in the man page needed.	2015-02-02 02:02:10 +05:00
Sergey Sharybin	a922be9270	Cycles: Repot CPU and CUDA capabilities to system info operator For CPU it gives available instructions set (SSE, AVX and so). For GPU CUDA it reports most of the attribute values returned by cuDeviceGetAttribute(). Ideally we need to only use set of those which are driver-specific (so we don't clutter system info with values which we can get from GPU specifications and be sure they stay the same because driver can't affect on them).	2015-01-06 14:13:21 +05:00
Sergey Sharybin	9e2e408323	Cycles: Add logging to OSL and CUDA initialization/compilation This is what was handy troubleshooting issues in the studio, plus this is exactly the same thing which would be helpful when solving issues with paths to compiled shaders and cubins for standalone repository.	2015-01-01 01:31:08 +05:00
Thomas Dinges	ee36e75b85	Cleanup: Fix Cycles Apache header. This was already mixed a bit, but the dot belongs there.	2014-12-25 02:50:24 +01:00
Campbell Barton	c07f6c02b3	Docs: reference the new manual	2014-12-08 11:18:58 +01:00
Thomas Dinges	e3a6f1c152	Cycles: Remove workaround for missing sm_52 kernel, now we require it for Maxwell cards.	2014-12-02 13:45:39 +01:00
Thomas Dinges	4ff8744669	Cycles / CUDA: Better fix for missing sm_52 kernel, in case user compiles himself.	2014-10-30 11:42:59 +01:00
Sergey Sharybin	e556670b36	Cycles: Do cuda pointer arithmetic in integers, don't use pointer arithmetic This should hopefully fix https://bugs.debian.org/cgi-bin/bugreport.cgi?bug=765187	2014-10-14 17:54:41 +02:00
Sergey Sharybin	0106b94f9d	Cycles: Fix for debug kernel not working with CUDA	2014-10-05 15:31:48 +06:00
Thomas Dinges	a613290775	Cycles / CUDA: Workaround to make sm_52 (Maxwell) cards work. * sm_52 can run a sm_50 kernel, so tell runtime detection to use that until we build a dedicated sm_52 kernel.	2014-10-05 04:13:40 +02:00
Sergey Sharybin	fbed2047c8	Fix wrong track of the memory when doing device vector resize before freeing it This is rather legit case which happens i.e. when having persistent images enabled and session is updating the lookup tables. Now device_memory keeps track of amount of memory being allocated on the device, which makes freeing using the proper allocated size, not the CPU side buffer size.	2014-09-04 17:25:12 +06:00
Thomas Dinges	fb3f32760d	Cycles: Add an experimental CUDA kernel. Now we build 2 .cubins per architecture (e.g. kernel_sm_21.cubin, kernel_experimental_sm_21.cubin). The experimental kernel can be used by switching to the Experimental Feature Set: http://wiki.blender.org/index.php/Doc:2.6/Manual/Render/Cycles/Experimental_Features This enables Subsurface Scattering and Correlated Multi Jitter Sampling on GPU, while keeping the stability and performance of the regular kernel. Differential Revision: https://developer.blender.org/D762 Patch by Sergey and myself. Developer / Builder Note: CUDA Toolkit 6.5 is highly recommended for this, also note that building the experimental kernel requires a lot of system memory (~7-8GB).	2014-08-26 17:02:26 +02:00
Thomas Dinges	603348c56e	Cycles: Drop support for CUDA 5.0 Toolkit, only 6.0 and 6.5 (recommended) are supported now.	2014-08-21 23:35:20 +02:00
Dalai Felinto	8d3cc431d7	Fix T41471 Cycles Bake: Setting small tile size results in wrong bake with stripes rather than the expected noise pattern This problem was introduced in `983cbafd18` Basically the issue is that we were not getting a unique index in the baking routine for the RNG (random number generator). Reviewers: sergey Differential Revision: https://developer.blender.org/D749	2014-08-19 11:40:33 +02:00
Dalai Felinto	2c5b6859d9	Revert "Fix T41222 Blender gives weird output when baking (4096*4096) resolution on GPU" This reverts commit `a48b372b04`. Leaving only the part that fix device_multi.cpp	2014-08-15 11:27:42 +02:00
Dalai Felinto	a48b372b04	Fix T41222 Blender gives weird output when baking (4096*4096) resolution on GPU In collaboration with Sergey Sharybin. Also thanks to Wolfgang Faehnle (mib2berlin) for help testing the solutions. Reviewers: sergey Differential Revision: https://developer.blender.org/D690	2014-08-05 13:50:50 -03:00
Sergey Sharybin	77b7e1fe9a	Deduplicate CUDA and OpenCL wranglers For now it was mainly about OpenCL wrangler being duplicated between Cycles and Compositor, but with OpenSubdiv work those wranglers were gonna to be duplicated just once again. This commit makes it so Cycles and Compositor uses wranglers from this repositories: - https://github.com/CudaWrangler/cuew - https://github.com/OpenCLWrangler/clew This repositories are based on the wranglers we used before and they'll be likely continued maintaining by us plus some more players in the market. Pretty much straightforward change with some tricks in the CMake/SCons to make this libs being passed to the linker after all other libraries in order to make OpenSubdiv linked against those wranglers in the future. For those who're worrying about Cycles being less standalone, it's not truth, it's rather more flexible now and in the future different wranglers might be used in Cycles. For now it'll just mean those libs would need to be put into Cycles repository together with some other libs from Blender such as mikkspace. This is mainly platform maintenance commit, should not be any changes to the user space. Reviewers: juicyfruit, dingto, campbellbarton Reviewed By: juicyfruit, dingto, campbellbarton Differential Revision: https://developer.blender.org/D707	2014-08-05 13:57:50 +06:00
Campbell Barton	9c3025cd26	Spelling	2014-08-02 16:53:52 +10:00
Dalai Felinto	fc55c41bba	Cycles Bake: show progress bar during bake Baking progress preview is not possible, in parts due to the way the API was designed. But at least you get to see the progress bar while baking. Reviewers: sergey Differential Revision: https://developer.blender.org/D656	2014-07-25 11:42:53 -03:00
Martijn Berger	bae2b3a688	Switch to Cuda 4.0 style api for kernel invocation. This is a small clean-up that has no functional changes but makes code a bit more readable. Differential revision: https://developer.blender.org/D659 Reviewed by: Sergey Sharybin, Thomas Dinges	2014-07-25 13:33:19 +02:00
Thomas Dinges	9acabc13de	Cleanup: Typo fixes.	2014-07-05 14:25:34 +02:00
Thomas Dinges	5898abe99d	Cycles: Update CUDA error messages, based on Toolkit 6.0. * Removed deprecated erros, and added some new ones, which might help to figure out problems in the future.	2014-07-02 01:50:42 +02:00
Thomas Dinges	4800c52700	Cleanup: Remove unused checks in CUDA device code.	2014-07-02 01:12:13 +02:00
Brecht Van Lommel	e4e58d4612	Fix T40370: cycles CUDA baking timeout with high number of AA samples. Now baking does one AA sample at a time, just like final render. There is also some code for shader antialiasing that solves T40369 but it is disabled for now because there may be unpredictable side effects.	2014-06-06 15:39:04 +02:00
Brecht Van Lommel	865dfa8a7e	Fix T40228: cycles CUDA multi GPU + world MIS giving error.	2014-06-05 18:10:32 +02:00
Brecht Van Lommel	69c7522b24	Fix T40379: world MIS causing too much CUDA memory usage. The kernel for baking the world texture was the same as the one used for baking. Now that's separate which allows the kernel to reserve much less memory.	2014-05-27 15:11:32 +02:00
Brecht Van Lommel	3b53fffb77	Cycles: revert async CUDA changes, these are giving too much trouble still. Fixes T40027. This means we get more CPU usage again when using multiple CUDA, but the impact on performance is too big a problem with the current code.	2014-05-19 19:33:09 +02:00
Thomas Dinges	c08c931fb6	Cycles / CUDA: Increase maximum image textures on GPU. Instead of 95, we can use 145 images now. This only affects Kepler and above (sm30, sm_35 and sm_50). This can be increased further if needed, but let's first test if this does not come with a performance impact. Originally developed during my GSoC 2013.	2014-05-11 03:38:39 +02:00
Thomas Dinges	fd26a32aa5	Fix T40119, CUDA Toolkit version mismatch	2014-05-10 01:26:04 +02:00
Campbell Barton	dc13969e48	Style cleanup: indentation, braces	2014-05-05 02:19:08 +10:00
Campbell Barton	1618329b00	Code cleanup: style, require ; for cuda_assert, opencl_assert	2014-05-04 03:57:50 +10:00
Brecht Van Lommel	198f5e506a	Cycles: CUDA changes for kernel evaluation cancel	2014-05-02 21:19:10 -03:00
Campbell Barton	8d16869d83	Code cleanup: Add -Werror=float-conversion to Cycles	2014-05-03 07:31:46 +10:00
Brecht Van Lommel	741f17f05b	Cycles CUDA: make CUDA toolkit 6.0 the official supported version. This also updates the configurations to build kernels for compute capability 5.0 cards, when using and older CUDA toolkit version this will be skipped. Also includes tweaks to improve performance with this version: * Increase max registers on sm_30, sm_35 and sm_50 * No longer use texture storage on sm_30	2014-04-30 16:07:27 +02:00
Brecht Van Lommel	39bfde674c	Cycles CUDA: don't use cuLaunchGridAsync at all for display devices. As suggested by Martijn, this is slower than cuLaunchGrid.	2014-04-17 12:18:49 +02:00
Brecht Van Lommel	18da79f471	Cycles CUDA: only do async execution for GPUs not used for display. Otherwise devices used for display will lock up the UI too much. This means you might still get 100% CPU for the display device, but for others CPU usage should be low still. The check to see if a device is used for display may not be entirely reliable, it checks if there is a watchdog timeout on the device, but I'm not entirely sure that always exists for display devices or is disabled for non-display devices, though some tools like cuda-gdb seem to make the same assumption. Ref T39559	2014-04-17 12:08:18 +02:00
Brecht Van Lommel	415e10a0ef	Fix another compile error with recent commit on visual studio.	2014-04-16 21:36:19 +02:00
Brecht Van Lommel	6f1afdbbfc	Cycles CUDA: enabled branched path kernel again, with more registers.	2014-04-16 21:05:04 +02:00
Brecht Van Lommel	2851ed4a55	Cycles code refactor: use __launch_bounds__ instead of -maxrregcount for CUDA. This makes it easier to have per kernel number of registers. Also, all the tunable parameters for this are now in kernel.cu, rather than spread over cmake, scons and device_cuda.cpp.	2014-04-16 21:05:04 +02:00
Thomas Dinges	297a2223b5	Cycles / CUDA: Increase sm_2x registers to 40. This fixes the ptaxs "ACCESS_VIOLATION" error and should allow our Linux and Windows build bots to compile again. Unfortunately this comes with a performance penalty on sm_2x cards, so this is only a workaround for now. Branched Path is still globally disabled on GPU.	2014-04-08 23:25:54 +02:00
Thomas Dinges	d923720312	Cycles: Disable Branched Path on all GPUs for now, until we separate the cubins. SM_20 fails now as well, reported by Zanqdo in IRC.	2014-04-03 22:18:40 +02:00
Brecht Van Lommel	a2e4ebd36a	Cycles code internals: add CPU kernel support for 3D image textures.	2014-03-29 13:03:48 +01:00
Thomas Dinges	859039f732	Cycles: Raise a proper error message when using Branched Path on sm_30, this is currently still disabled.	2014-03-27 10:29:22 +01:00
Sergey Sharybin	74518b2826	Fix T39420: Cycles viewport/preview flickers, when moving mouse across editors Issue was caused by the wrong usage of OCIO GLSL binding API. To make it work properly on pre-GLSL-1.3 drivers shader is to be enabled after the texture is binded to the opengl context. Otherwise it wouldn't know the proper texture size. This is actually a regression in 2.70 and to be ported to 'a'.	2014-03-26 15:58:53 +06:00
Martijn Berger	28c1a860e2	Fix T39247 Changes to interpolation break texture allocation on sm35 and greater.	2014-03-19 07:37:18 +01:00
Martijn Berger	dd2dca2f7e	Add support for multiple interpolation modes on cycles image textures All textures are sampled bi-linear currently with the exception of OSL there texture sampling is fixed and set to smart bi-cubic. This patch adds user control to this setting. Added: - bits to DNA / RNA in the form of an enum for supporting multiple interpolations types - changes to the image texture node drawing code ( add enum) - to ImageManager (this needs to know to allocate second texture when interpolation type is different) - to node compiler (pass on interpolation type) - to device tex_alloc this also needs to get the concept of multiple interpolation types - implementation for doing non interpolated lookup for cuda and cpu - implementation where we pass this along to osl ( this makes OSL also do linear untill I add smartcubic to the interface / DNA/ RNA) Reviewers: brecht, dingto Reviewed By: brecht CC: dingto, venomgfx Differential Revision: https://developer.blender.org/D317	2014-03-07 23:16:33 +01:00
Martijn Berger	1d01675833	Cuda use streams and async to avoid busywaiting This switches api usage for cuda towards using more of the Async calls. Updating only once every second is sufficiently cheap that I don't think it is worth doing it less often. Reviewed By: brecht Differential Revision: https://developer.blender.org/D262	2014-03-06 20:51:46 +01:00
Brecht Van Lommel	6b1a4fc66e	Cycle CUDA: revert the `f1aeb2ccf4` and `84f958754` busywait fixes for now. It's unclear what kind of impact they have on performance at the moment, so I rather play it safe and postpone this for 2.71. Ref T38679, Ref T38712	2014-02-19 16:08:08 +01:00
Martijn Berger	f1aeb2ccf4	this is an attempted Fix: T38679 Cycles GPU Performance Regression From my testing this (what i should have done in the first place) reduces the regression a lot. Lets hope it is enough or we have to go back to busy waiting.	2014-02-17 20:11:45 +01:00
Martijn Berger	84f9587540	Cuda use streams and async to avoid busywaiting This is my first stab at this and is based on this IRC converstation: <mib2berlin> brecht: this is meaning as reminder only, I know you have other things to do > http://openvidia.sourceforge.net/index.php/Optimization_Notes#avoiding_busy_waits <brecht> mib2berlin: thanks, bookmarked only tested on Ubuntu 14.04 / cuda 5.0 but ill do some more testing tomorrow. Also unsure about the placement and the lifetime of the stream and the event. But creating / deleting these seems to incur a non trivial cost. Reviewers: brecht Reviewed By: brecht CC: mib2berlin, dingto Differential Revision: https://developer.blender.org/D262	2014-01-28 18:40:08 +01:00

... 2 3 4 5 6

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

256 Commits