Vulkan: Render graph core #120427

Jeroen Bakker · 2024-04-09T14:00:50+02:00

Jeroen Bakker commented

2024-04-09 14:00:50 +02:00

This PR adds the core of the render graph. The render graph isn't used.
Current implementation of the Vulkan Backend is slow by design. We
focused on stability, before performance. With the new introduced render
graph the focus will shift to performance and keep the stability at where
it is.

Some highlights:

Every context will get its own render graph. (VKRenderGraph).
Resources (and resource state tracking) is device specific (VKResourceStateTracker).
No node reordering / sub graph execution has been implemented. Currently
All nodes in the graph is executed in the order they were added. (VKScheduler).
The links inside the graph describe the resources the nodes read from (input links)
or writes to (output links)
When resources are written to a resource stamp is incremented allowing keeping
track of which nodes needs which stamp of a resource.
At each link the access information (how does the node accesses the resource)
and image layout (for image resources) are stored. This allows the render graph
to find out how a resource was used in the past and will be used in the future.
That is important to construct pipeline barriers that don't stall the whole GPU.

Defined nodes

This implementation has nodes for:

Blit image
Clear color image
Copy buffers to buffers
Copy buffers to images
Copy images to images
Copy images to buffers
Dispatch compute shader
Fill buffers
Synchronization

Each node has a node info, create info and data struct. The create info
contains all data to construct the node, including the links of the graph.
The data struct only contains the data stored inside the node. The node info
contains the node specific implementation.

NOTE: Other nodes will be added after this PR lands to main.

Resources

Before a render graph can be used, the resources should be registered
to VKResourceStateTracker. In the final implementation this will be owned by
the VKDevice. Registration of resources can be done by calling
VKResources.add_buffer or VKResources.add_image.

Render graph

Nodes can be added to the render graph. When adding a node its read/
write dependencies are extracted and converted into links (VKNodeInfo. build_links).
When the caller wants to have a resource up to date the functions
VKRenderGraph.submit_for_read or VKRenderGraph.submit_for_present
can be called.

These functions will select and order the nodes that are needed
and convert them to vkCmd* commands. These commands include pipeline
barrier and image layout transitions.

The vkCmd are recorded into a command buffer which is sent to the
device queue.

Walking the graph

Walking the render graph isn't implemented yet. The idea is to have a
Map<ResourceWithStamp, Vector<NodeHandle>> consumers and
Map<ResourceWithStamp, NodeHandle> producers. These attributes can
be stored in the render graph and created when building the links, or
can be created inside the VKScheduler as a variable. The exact detail
which one would be better is unclear as there aren't any users yet. At
the moment the scheduler would need them we need to figure out the best
way to store and retrieve the consumers/producers.

Unit tests

The render graph can be tested by enabling WITH_GTEST and use
vk_render_graph as a filter.

bin/tests/blender_test --gtest_filter="vk_render_graph*"

**Design Task**: blender/blender#118330 This PR adds the core of the render graph. The render graph isn't used. Current implementation of the Vulkan Backend is slow by design. We focused on stability, before performance. With the new introduced render graph the focus will shift to performance and keep the stability at where it is. Some highlights: - Every context will get its own render graph. (`VKRenderGraph`). - Resources (and resource state tracking) is device specific (`VKResourceStateTracker`). - No node reordering / sub graph execution has been implemented. Currently All nodes in the graph is executed in the order they were added. (`VKScheduler`). - The links inside the graph describe the resources the nodes read from (input links) or writes to (output links) - When resources are written to a resource stamp is incremented allowing keeping track of which nodes needs which stamp of a resource. - At each link the access information (how does the node accesses the resource) and image layout (for image resources) are stored. This allows the render graph to find out how a resource was used in the past and will be used in the future. That is important to construct pipeline barriers that don't stall the whole GPU. # Defined nodes This implementation has nodes for: - Blit image - Clear color image - Copy buffers to buffers - Copy buffers to images - Copy images to images - Copy images to buffers - Dispatch compute shader - Fill buffers - Synchronization Each node has a node info, create info and data struct. The create info contains all data to construct the node, including the links of the graph. The data struct only contains the data stored inside the node. The node info contains the node specific implementation. > NOTE: Other nodes will be added after this PR lands to main. # Resources Before a render graph can be used, the resources should be registered to `VKResourceStateTracker`. In the final implementation this will be owned by the `VKDevice`. Registration of resources can be done by calling `VKResources.add_buffer` or `VKResources.add_image`. # Render graph Nodes can be added to the render graph. When adding a node its read/ write dependencies are extracted and converted into links (`VKNodeInfo. build_links`). When the caller wants to have a resource up to date the functions `VKRenderGraph.submit_for_read` or `VKRenderGraph.submit_for_present` can be called. These functions will select and order the nodes that are needed and convert them to `vkCmd*` commands. These commands include pipeline barrier and image layout transitions. The `vkCmd` are recorded into a command buffer which is sent to the device queue. ## Walking the graph Walking the render graph isn't implemented yet. The idea is to have a `Map<ResourceWithStamp, Vector<NodeHandle>> consumers` and `Map<ResourceWithStamp, NodeHandle> producers`. These attributes can be stored in the render graph and created when building the links, or can be created inside the VKScheduler as a variable. The exact detail which one would be better is unclear as there aren't any users yet. At the moment the scheduler would need them we need to figure out the best way to store and retrieve the consumers/producers. # Unit tests The render graph can be tested by enabling `WITH_GTEST` and use `vk_render_graph` as a filter. ``` bin/tests/blender_test --gtest_filter="vk_render_graph*" ```

👍 3 🎉 3 🚀 2

Jeroen Bakker added this to the 4.2 LTS milestone 2024-04-09 14:00:50 +02:00

Jeroen Bakker added the

 @ -0,0 +17,4 @@
 namespace blender::gpu::render_graph {
 /**
  * Base class for node class.

 @ -0,0 +39,4 @@
   VKBoundPipelines active_pipelines;
   VkPipelineStageFlags src_stage_mask_ = VK_PIPELINE_STAGE_NONE;
   VkPipelineStageFlags dst_stage_mask_ = VK_PIPELINE_STAGE_NONE;

 @ -0,0 +50,4 @@
    * Needs to be called before each build_image/buffer. It ensures that swapchain images are reset
    * to the correct layout and that the pipelines are reset.
    */
   void reset(VKRenderGraph &render_graph);

 @ -0,0 +53,4 @@
   void reset(VKRenderGraph &render_graph);
   /**
    * Build the commands to update the given vk_image to the last version

 @ -0,0 +55,4 @@
   /**
    * Build the commands to update the given vk_image to the last version
    */
   void build_image(VKRenderGraph &render_graph, VkImage vk_image);

 @ -0,0 +64,4 @@
   /**
    * After the commands have been submitted the nodes that have been send to the GPU can be
    * removed.

 @ -0,0 +29,4 @@
  * prefetched and removing a level of indirection. A consequence is that we cannot use class based
  * nodes.
  */
 struct VKNodeData {

 @ -0,0 +31,4 @@
       break;
   }
   memset(&node_data, 0, sizeof(VKNodeData));

 @ -0,0 +19,4 @@
 class VKCommandBuilder;
 class VKNodes {

 @ -0,0 +5,4 @@
 /** \file
  * \ingroup gpu
  *
  * Render graph is a render solution that is able to track resource usages in a single submission

				`@ -0,0 +1,214 @@`
				`/* SPDX-FileCopyrightText: 2024 Blender Authors`

 @ -0,0 +37,4 @@
 class VKScheduler;
 class VKRenderGraph : public NonCopyable {
   VKResourceDependencies resource_dependencies_;

 @ -0,0 +46,4 @@
   /**
    * Not owning pointer to device resources.
    *
    * Is marked optional as device could

 @ -0,0 +56,4 @@
   /**
    * Free all resources held by the render graph.
    */
   void deinit();

 @ -0,0 +63,4 @@
    * Add a node to the render graph.
    */
   template<typename NodeClass, typename NodeCreateInfo>
   void add_node(const NodeCreateInfo &create_info)

 @ -0,0 +68,4 @@
     std::scoped_lock lock(resources_.mutex_get());
     NodeHandle handle = nodes_.add_node<NodeClass, NodeCreateInfo>(create_info);
     NodeClass node_class;
     node_class.build_resource_dependencies(

 @ -0,0 +107,4 @@
   }
   /**
    * Submit the commands to readback the given vk_buffer to the command queue.

 @ -0,0 +122,4 @@
   void submit_for_present(VkImage vk_swapchain_image);
  private:
   friend class VKCommandBuilder;

 @ -0,0 +15,4 @@
 #include "vk_resources.hh"
 #include "vk_types.hh"
 namespace blender::gpu::render_graph {

 @ -0,0 +18,4 @@
 namespace blender::gpu::render_graph {
 class VKResourceDependencies : NonCopyable, NonMovable {
  public:
   struct ResourceUsage {

 @ -0,0 +19,4 @@
  * List for working with handles and items.
  * Reference to the first empty slot is stored internally to stop iterating over all the elements.
  */
 template<typename Handle, typename Item> class VKResourceHandles {

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations