blender-archive

Archived

Author	SHA1	Message	Date
Jeroen Bakker	08a6361b3f	Vulkan: Texture Data Conversions This PR adds basic support for texture update, read back and clearing for Vulkan. In Vulkan we need to convert each data type ourselves as vulkan buffers are untyped. Therefore this change mostly is about data conversions. Considerations: - Use a compute shader to do the conversions: - Leads to performance regression as compute pipeline can stall graphics pipeline - Lead to additional memory usage as two staging buffers are needed one to hold the CPU data, and one to hold the converted data. - Do inline conversion when sending the data to Vulkan using `eGPUDataFormat` - Additional CPU cycles required and not easy to optimize as it the implementation requires many branches. - Do inline conversion when sending the data to Vulkan (optimized for CPU) For this solution it was chosen to implement the 3rd option as it is fast and doesn't require additional memory what the other options do. Use Imath/half.h This patch uses `Imath/half.h` (dependency of OpenEXR) similar to alembic. But this makes vulkan dependent of the availability of OpenEXR. For now this isn't checked, but when we are closer to a working Vulkan backend we have to make a decision how to cope with this dependency. Missing Features Framebuffer textures This doesn't include all possible data transformations. Some of those transformation can only be tested after the VKFramebuffer has been implemented. Some texture types are only available when created for a framebuffer. These include the depth and stencil variations. Component format Is more relevant when implementing VKVertexBuffer. SRGB textures SRGB encoded textures aren't natively supported on all platforms, in all usages and might require workarounds. This should be done in a separate PR in a later stage when we are required to use SRGB textures. Test cases The added test cases gives an overview of the missing bits and pieces of the patch. When the implementation/direction is accepted more test cases can be enabled/implemented. Some of these test cases will skip depending on the actual support of platform the tests are running on. For example OpenGL/NVidia will skip the next test as it doesn't support the texture format on OpenGL, although it does support it on Vulkan. ``` [ RUN ] GPUOpenGLTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ SKIPPED ] GPUOpenGLTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ RUN ] GPUVulkanTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI [ OK ] GPUVulkanTest.texture_roundtrip__GPU_DATA_2_10_10_10_REV__GPU_RGB10_A2UI ``` Pull Request: blender/blender#105762	2023-03-24 08:09:19 +01:00
Jeroen Bakker	d5df77199b	Vulkan: Resource Submission Tracking In Vulkan multiple commands can be in flight simultaneously. These commands can share resources like descriptor sets or push constants. When between commands these resources are updated a new version of the resources should be created. When a resource is updated it should check the submission id of the command buffer. If this is different than last known by the resources, the previous resources should be freed. If the submission id is the same than previously it has to create a new version of the resource to not intervene with other commands that uses the resource before the update. When the resource wasn't updated between multiple usages in the same submission id it could reuse the previous resource. This PR introduces a `ResourceTracker` and a `SubmissionTracker`. A submission tracker can check if the command buffer is submitted. In this case all resources of the resource tracker should be freed. Unmodified resources in the same submission can be shared. A resource tracker will keep track of all resources that are in flight. After the resources are used (submission + execution) have finished the resources can be cleared. Pull Request: blender/blender#105183	2023-03-24 07:47:50 +01:00
Jeroen Bakker	fda65ad5ca	GPU: Renderdoc Frame Capturing This PR uses renderdoc for frame capturing when enabled. It enabled an easier workflow for frame capturing. - Capture GPU API calls from test cases - Capture GPU API calls from background threads - Capture GPU API calls from background rendering. Renderdoc is an important GPU debugger used by the Eevee/ Viewport module. Previously we needed to change code in order to record background rendering, that could on its own lead to other side-effects. The integration with renderdoc can be enabled using `WITH_RENDERDOC=On` compiler option. `GPU_debug_capture_begin` and `GPU_debug_capture_end` can be added to the section of the code you want to debug. When running Blender inside renderdoc this part will automatically be captured. All GPU test cases are now guarded by these calls. In order to capture the test cases you need to start the test cases from renderdoc and the captured GPU API calls will appear where each capture is a single test case. Pull Request: blender/blender#105921	2023-03-23 16:37:52 +01:00
Campbell Barton	80dfb885d7	License headers: add missing license identifiers	2023-03-23 15:12:09 +11:00
Campbell Barton	2ba1556e69	Cleanup: spelling in comments, use doxygen syntax	2023-03-22 12:22:55 +11:00
Campbell Barton	db762d5508	Cleanup: various C/C++ code cleanups, use utility macros	2023-03-21 19:47:21 +11:00
Hans Goudey	16fbadde36	Mesh: Replace MLoop struct with generic attributes Implements #102359. Split the `MLoop` struct into two separate integer arrays called `corner_verts` and `corner_edges`, referring to the vertex each corner is attached to and the next edge around the face at each corner. These arrays can be sliced to give access to the edges or vertices in a face. Then they are often referred to as "poly_verts" or "poly_edges". The main benefits are halving the necessary memory bandwidth when only one array is used and simplifications from using regular integer indices instead of a special-purpose struct. The commit also starts a renaming from "loop" to "corner" in mesh code. Like the other mesh struct of array refactors, forward compatibility is kept by writing files with the older format. This will be done until 4.0 to ease the transition process. Looking at a small portion of the patch should give a good impression for the rest of the changes. I tried to make the changes as small as possible so it's easy to tell the correctness from the diff. Though I found Blender developers have been very inventive over the last decade when finding different ways to loop over the corners in a face. For performance, nearly every piece of code that deals with `Mesh` is slightly impacted. Any algorithm that is memory bottle-necked should see an improvement. For example, here is a comparison of interpolating a vertex float attribute to face corners (Ryzen 3700x): Before (Average: 3.7 ms, Min: 3.4 ms) ``` threading::parallel_for(loops.index_range(), 4096, [&](IndexRange range) { for (const int64_t i : range) { dst[i] = src[loops[i].v]; } }); ``` After (Average: 2.9 ms, Min: 2.6 ms) ``` array_utils::gather(src, corner_verts, dst); ``` That's an improvement of 28% to the average timings, and it's also a simplification, since an index-based routine can be used instead. For more examples using the new arrays, see the design task. Pull Request: blender/blender#104424	2023-03-20 15:55:13 +01:00
Clément Foucault	335688dd42	EEVEE-Next: Deferred Pipeline Implement GBuffer prepass and deferred lighting (lights only). This decouple lighting from the material shaders making them lighter, less expensive and faster to compile. Trying to keep a nice data flow so we could potentially use the subpass programable blending feature on tiled GPU arch. Not everything is covered yet and #105880 is making the GBuffer layout a bit awkward and not easily extendable. Pull Request: blender/blender#105868	2023-03-18 20:54:20 +01:00
Jeroen Bakker	8ded95c175	Vulkan: Clearing Storage Buffers This PR adds support for `GPU_storagebuf_clear` and `GPU_storagebuf_clear_zero` to the Vulkan backend. It also adds test cases for all backends. Pull Request: blender/blender#105487	2023-03-17 13:48:39 +01:00
Campbell Barton	3b8a050fc1	Cleanup: use function-style casts, ELEM & STREQ macros	2023-03-17 16:45:42 +11:00
Miguel Pozo	8bb411ad1a	EEVEE Next: IrradianceCache surfels debug display Add a surfels debug pass to aid with the GI implementation development. Pull Request: blender/blender#105802	2023-03-16 14:14:33 +01:00
Jeroen Bakker	5e0cc9d277	Cleanup: Silence Unused parameter warnings. - blender::gpu::GLContext - blender::gpu::VKContext	2023-03-16 09:44:09 +01:00
Jeroen Bakker	a25cc51db9	Cleanup: Spelling in previous commit.	2023-03-16 08:55:45 +01:00
Jason Fielder	3d9d67594c	GPU: Add GPU frame capture support. Adds two modes of GPU frame capture support for enhanced debugging. GPU frame capture begin/end allow instantaneous frame capture of all GPU commands within the capture boundary. GPU frame capture scopes allow several user-defined capture regions which can wrap key parts of code. These scopes are exposed to connected GPU tools allowing the user to manually trigger a capture of a known scope at the desired time. This is currently integrated with the Metal backend for support with Xcode. Related to #105591 Pull Request: blender/blender#105717	2023-03-16 08:54:05 +01:00
Jeroen Bakker	397f6250a5	Merge branch 'blender-v3.5-release'	2023-03-16 08:28:21 +01:00
Jason Fielder	7bdd82eca0	Metal: Resolve Race Condition in Memory Manager Fix race condition if several competing threads are inserting Metal buffers into the MTLSafeFreeList simultaneously while a new list chunk is being created. Also raise the limit for an MTLSafeFreeListChunk size to optimize for interactivity when releasing lots of memory simultaneously. Authored by Apple: Michael Parkin-White Pull Request: blender/blender#105254	2023-03-16 08:25:15 +01:00
Jason Fielder	d3409f2159	Fix: Uncached Metal Materials not Being Released Optimized node graphs do not get cached and were not correctly freed once their reference count reached zero, due to being excluded from the GPUPass garbage collection. Also suppress Metal shader warnings, which are prevalent during material optimization. Authored by Apple: Michael Parkin-White Pull Request: blender/blender#105795	2023-03-16 08:19:32 +01:00
Jason Fielder	94855119da	Fix: Metal validation error when shader has no uniforms Metal buffer binding validation would trigger an error when a given shader had an empty PushConstantBlock. This patch removes the default uniform code gen if no uniforms are present, to avoid any possible issues with buffers being bound to a shader where the destination data block is size zero. Authored by Apple: Michael Parkin-White Pull Request: blender/blender#105796	2023-03-16 08:10:38 +01:00
Jason Fielder	ac6a70e5f8	Fix #104012 : Selection crash with AMD on Metal Crash when selecting objects on AMD platforms running Metal. This was caused by shader compilation warnings being treated as errors in macOS 10.15. Wrapping compilation failure with success check resolves error. Authored by Apple: Michael Parkin-White Pull Request: blender/blender#105739	2023-03-16 08:03:15 +01:00
Jason Fielder	b96f8ac9fe	Fix #105606 : Metal texture upload regression immDrawPixels performs significantly slower in Metal than OpenGL. This was caused by two main factors. Firstly, the additional overhead of tiled texture update, where all memory needed to be kept in flight for each update, but caused update to take a slow path. Avoiding tile update with Metal is more efficient for both memory pressure and GPU pipelining. Secondly, on AMD platforms, the staging buffer used for temporary texture data was page-faulting when several texture updates would occur within one frame. This is due to limitations of allocating one large contiguous memory chunk. Using the Metal buffer pool for staging data is more efficient. Authored by Apple: Michael Parkin-White Pull Request: blender/blender#105794	2023-03-16 07:59:22 +01:00
Miguel Pozo	1fc892744c	GPU: Fix: Use 2 slots for each UDIMs texture instead of 4 Create GPUNodeLinks for tiled_image and tiled_image_mapping together, to ensure they are the same texture. See blender/blender#105661 (comment) for context and a more in-depth explanation. Pull Request: blender/blender#105772	2023-03-15 17:58:25 +01:00
Miguel Pozo	ffb120c560	Fix #105661 : (Regression) Materials can use fewer images than before Skip explicit binding location for samplers in OpenGL when not needed, since drivers can usually handle more sampler declarations this way (as long as they're not actually used by the shader). Pull Request: blender/blender#105770	2023-03-15 13:58:28 +01:00
Miguel Pozo	c31ba08f76	Fix #105661 : (Regression) Materials can use fewer images than before Skip explicit binding location for samplers in OpenGL when not needed, since drivers can usually handle more sampler declarations this way (as long as they're not actually used by the shader). Pull Request: blender/blender#105770	2023-03-15 13:37:19 +01:00
Brecht Van Lommel	7b66168bcb	Merge branch 'blender-v3.5-release' into main	2023-03-14 18:19:53 +01:00
Jason Fielder	96c6349cbf	Fix #103605 : Metal barycentric coordinate compilation failure Fix support for Wireframe and parametric nodes by resolving compilation failures surrounding barycentric coordinates. A final missing part of the Metal implementation for barycentric coordinates was missing. Feedback also addressed to move barycentric calculation out of code-gen and into surface_lib. Authored by Apple: Michael Parkin-White This also resolves #103606. Ref #96261 Pull Request: blender/blender#105740	2023-03-14 08:23:02 +01:00
Jeroen Bakker	cf2c9fe186	Vulkan: Add initial VkPixelBuffer. VKPixelBuffer is used by external render engines to allocate buffers using the same GPU backend that Blender runs in. In a later stage we should test what exact binding flags are needed. I expect that it should be able to use as a transfer buffer to copy the pixels over to a texture using transfer commands. Pull Request: blender/blender#105741	2023-03-13 19:25:18 +01:00
Jeroen Bakker	4238080568	Cleanup: Remove gpu prefix in gpu test cases. The _gpu_ prefix is redundant as they are inside a namespace and run as part of a test suite that already contain the name gpu. (GPUOpenGLTest) This patch also moved the texture test cases to its own compile module.	2023-03-13 11:14:17 +01:00
Jeroen Bakker	6567ff558c	Fix: Failing test case due to resource lifetime. When a shader is bound it should outlife the pipeline. In one test case where only the shader data was accessed it also bound the shader. This isn't needed as the shader data should be retrieved without binding the shader. This change fixes the issue by not binding the GPU shader.	2023-03-13 09:54:57 +01:00
Hans Goudey	1dc57a89e9	Mesh: Move functions to C++ header Refactoring mesh code, it has become clear that local cleanups and simplifications are limited by the need to keep a C public API for mesh functions. This change makes code more obvious and makes further refactoring much easier. - Add a new `BKE_mesh.hh` header for a C++ only mesh API - Introduce a new `blender::bke::mesh` namespace, documented here: https://wiki.blender.org/wiki/Source/Objects/Mesh#Namespaces - Move some functions to the new namespace, cleaning up their arguments - Move code to `Array` and `float3` where necessary to use the new API - Define existing inline mesh data access functions to the new header - Keep some C API functions where necessary because of RNA - Move all C++ files to use the new header, which includes the old one In the future it may make sense to split up `BKE_mesh.hh` more, but for now keeping the same name as the existing header keeps things simple. Pull Request: blender/blender#105416	2023-03-12 22:29:15 +01:00
Campbell Barton	ecc3e8c751	Cleanup: format, sort CMake file lists	2023-03-10 12:50:17 +11:00
Jeroen Bakker	af5a115f65	GPU: Refactor API for Clearing Storage Buffers The previous API for clearing storage buffers was following the OpenGL api. OpenGL has many options to support for data conversions, striding and sizzling. Metal and Vulkan don't have these features and we have to deal it ourselves. Blender internally only uses a tiny subset for what is possible in OpenGL. Making the current API to difficult to implement on our future platforms as we had to implement all cases, most even not used at all. By changing the API we make future development easier as we only need to implement what we are actually using. New API `GPU_storagebuf_clear(GPUStorageBuf* ssbo, uint32_t clear_value)` Related issue: #105492 Pull Request: blender/blender#105521	2023-03-09 18:46:28 +01:00
Jeroen Bakker	c20bf8d61e	GPU: Replace old shader tests with shader builder. Both the shader_builder and existing shader tests eventually tested the same aspects. shader_builder is more modern and handles more cases. The old shader test requires a full backend in order to run This commit replaces the old tests to just use the shader builder for validation. Shader builder can still be run at compile time, this is just a convenience to have as a test case as well for CI/CD. Ref: #105482	2023-03-09 10:40:57 +01:00
Jeroen Bakker	0ad06cd39d	Vulkan: Automap Buffers. It is recommended to map buffers once and not each time the mapped memory is needed. This patch will map the buffer when created and unmap the buffer when the buffer is freed. This patch will reduce the overhead where the Vulkan driver or the virtual memory manager needs to be accessed. Pull Request: blender/blender#105588	2023-03-09 09:27:42 +01:00
Campbell Barton	b3625e6bfd	Cleanup: comment blocks	2023-03-09 10:39:49 +11:00
Weizhen Huang	b9649c2e72	Eevee: fall back to Diffuse BSDF before proper implementation of Hair BSDFs Co-authored by @fclem	2023-03-08 15:05:58 +01:00
Jeroen Bakker	63a05f1e18	Vulkan: Fix Shader Compilation Issues. Fixes recent introduced shader compilation issues. - `sampler` is a reserved keyword in Vulkan GLSL.	2023-03-08 11:10:05 +01:00
Omar Emara	cbf569523c	Realtime Compositor: Implement bicubic interpolation This patch implements the bicubic interpolation option in the transform nodes. The path merely reuse the code in the shader image texture and adds bicubic variants to the domain realization shader. Pull Request: blender/blender#105533	2023-03-07 18:02:20 +01:00
Sebastian Parborg	023524765a	Merge branch 'blender-v3.5-release'	2023-03-07 17:35:05 +01:00
Jason Fielder	d31083583c	Fix 105449: Resolve selection in Metal backend MTLFramebuffer's viewport was not correctly updated when updating attachments. Behaviour modified to be consistent with OpenGL. Authored by Apple: Michael Parkin-White Ref #96261 Pull Request: blender/blender#105529	2023-03-07 16:00:23 +01:00
Campbell Barton	5a004ccc6a	Cleanup: use function style casts, nullptr	2023-03-07 15:59:14 +11:00
Campbell Barton	90dc655951	Cleanup: spelling in comments	2023-03-07 15:00:05 +11:00
Chris Blackbourn	b309feb71c	Cleanup: format	2023-03-07 10:24:05 +13:00
Jeroen Bakker	61b457d390	Vulkan: Push constants What are push constants? Push constants is a way to quickly provide a small amount of uniform data to shaders. It should be much quicker than UBOs but a huge limitation is the size of data - spec requires 128 bytes to be available for a push constant range. What are the challenges with push constants? The challenge with push constants is that the limited available size. According to the Vulkan spec each platform should at least have 128 bytes reserved for push constants. Current Mesa/AMD drivers supports 256 bytes, but Mesa/Intel is only 128 bytes. What is our solution? Some shaders of Blender uses more than these boundaries. When more data is needed push constants will not be used, but the shader will be patched to use an uniform buffer instead. This mechanism will be part of the Vulkan backend and shader developers should not see any difference on API level. Known limitations Current state of the vulkan backend does not track resources that are in the command queue. This patch includes some test cases that identified this issue as well. See #104771. Pull Request #104880	2023-03-06 12:28:55 +01:00
Jeroen Bakker	4e32864786	Cleanup: Remove compilation warning. In MTLTexture it was checked that this was valid. What in that case should always be true.	2023-03-06 08:40:30 +01:00
Clément Foucault	b4d36b3efe	GPU: Compute: Document and cleanup header No fonctional changes.	2023-03-05 18:14:17 +01:00
Clément Foucault	4862d56a0e	GPUFrameBuffer: Document and cleanup header No fonctional changes.	2023-03-05 17:57:51 +01:00
Clément Foucault	d782569682	Cleanup: GPU: Improve eGPUBarrier documentation	2023-03-05 17:57:51 +01:00
Clément Foucault	0a10571501	GPU: Replace GPU_finish by correct memory barrier	2023-03-05 17:57:51 +01:00
Clément Foucault	f5e9a78216	GPU: Fix wrong barrier in tests Texture and buffer reads were using the incorrect barrier.	2023-03-04 07:44:34 +01:00
Clément Foucault	fcedc97d11	GPU: Add GPU_BARRIER_BUFFER_UPDATE barrier type This barrier types is needed for correct readback of buffers GPU memory to CPU memory.	2023-03-04 07:44:34 +01:00

1 2 3 4 5 ...

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

4274 Commits