Geometry Nodes: Rewrite Scale Elements node #115142

Iliya Katushenock · 2023-11-19T16:23:22+01:00

Iliya Katushenock commented

2023-11-19 16:23:22 +01:00

This is a rewrite of the Scale Elements node. The main change is the removal of
unnecessary abstractions (such as structs of fields). In addition, a grouping
approach is now used so that all data is represented as spans, which makes it
possible to share code between the different domains. Using general utilities
such as IndexMask, group processing, and array utils gives much more
parallelism and better memory usage. As a result, this refactoring gives a
4-10x average speedup in the attached benchmark file, measured across
different element probabilities and scales.
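The grouping approach described above can be pictured as two flat arrays: a list of offsets and a list of indices. The following is a minimal sketch of that layout, assuming plain `std::vector` in place of Blender's span types. `GroupedIndices` and its members are hypothetical names used only for illustration, not the API used in this PR.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

/* Hypothetical stand-in for a grouped span. Group `g` owns the entries
 * indices[offsets[g]] up to (not including) indices[offsets[g + 1]]. */
struct GroupedIndices {
  std::vector<int> offsets; /* One entry per group, plus a final total. */
  std::vector<int> indices; /* All groups stored back to back. */

  std::size_t groups_num() const
  {
    return offsets.empty() ? 0 : offsets.size() - 1;
  }

  std::size_t group_size(const std::size_t g) const
  {
    assert(g < groups_num());
    return std::size_t(offsets[g + 1] - offsets[g]);
  }

  /* First element of group `g`; the group is a contiguous slice. */
  const int *group_begin(const std::size_t g) const
  {
    assert(g < groups_num());
    return indices.data() + offsets[g];
  }
};
```

With a layout like this, no per-group allocation is needed, and the face and edge cases would differ only in how the two arrays are filled, which is presumably what allows the domain code to be shared.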

Measurements of speed improvement for Face domain (ms):
Before:

Probability \ Scale	0.1	1	2	3	4
0.1	870.63	831.13	826.69	833.84	838.3
0.25	832.87	845.32	861.84	834.01	841.95
0.5	451.98	423.2	268.47	453.06	379.79
0.75	20.92	19.46	20.62	20.76	21.11
0.9	21.21	19.8	20.07	21.82	20.85

After:

Probability \ Scale	0.1	1	2	3	4
0.1	214.63	176.11	176.54	176.95	183.03
0.25	186.04	170.19	187.91	182.91	182.32
0.5	126.39	119.48	108.46	114.83	122.53
0.75	6.19	7.69	6.02	5.91	17.55
0.9	5.46	5.77	5.62	6.02	6.1

Average speed up:

Probability \ Scale	0.1	1	2	3	4
0.1	4.0	4.7	4.6	4.7	4.5
0.25	4.4	4.9	4.5	4.5	4.6
0.5	3.5	3.5	2.4	3.9	3.0
0.75	3.3	2.5	3.4	3.5	1.2
0.9	3.8	3.4	3.5	3.6	3.4

Measurements of speed improvement for Edge domain (ms):
Before:

Probability \ Scale	0.1	1	2	3	4
0.1	4'294.62	5'100	5'200	5'200	5'300
0.25	5'100	5'100	5'200	5'100	4'702.83
0.5	2'426.06	2'242.14	1'533.39	2'247.04	1'687.36
0.75	28.24	30.58	27.49	31.08	28.96
0.9	27.79	24.91	27.31	29.35	26.47

After:

Probability \ Scale	0.1	1	2	3	4
0.1	507.54	491.09	493.88	503.8	507.35
0.25	489.82	488.39	518.28	729.58	569.13
0.5	294.78	271.01	305.45	292.44	298.79
0.75	11.35	14.91	12.23	12.52	16.55
0.9	12.36	12.57	12.04	11.58	11.45

Average speed up:

Probability \ Scale	0.1	1	2	3	4
0.1	8.4	10.3	10.5	10.3	10.4
0.25	10.4	10.4	10.0	6.9	8.2
0.5	8.2	8.2	5.0	7.6	5.6
0.75	2.4	2.0	2.2	2.4	1.7
0.9	2.2	1.9	2.2	2.5	2.3
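
Each entry in the "Average speed up" tables is the ratio of the corresponding "Before" and "After" timings. As a worked example, for the Face domain at probability 0.1 and scale 0.1:

```latex
\text{speed-up} = \frac{t_{\text{before}}}{t_{\text{after}}}
                = \frac{870.63\ \text{ms}}{214.63\ \text{ms}} \approx 4.06
```

The table lists this entry as 4.0; the listed ratios appear to be truncated to one decimal place rather than rounded.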


scale elements node benchmark 2.blend

1000 KiB


Iliya Katushenock added 1 commit 2023-11-19 16:23:33 +01:00

3d4d61748d init

Iliya Katushenock added 1 commit 2023-11-19 20:41:14 +01:00

18f7762f09 progress

Iliya Katushenock added 3 commits 2023-11-22 21:16:12 +01:00

5ad7c6a108 Merge branch 'main' into tmp_speedup_scale_elements

89f899d2db Merge branch 'main' into tmp_speedup_scale_elements

48f473b771 index mask assertion

Iliya Katushenock changed title from ~~WIP: Geometry Nodes: Scale Elements node speedup~~ to Geometry Nodes: Scale Elements node speedup

2023-11-24 13:33:56 +01:00

Iliya Katushenock added this to the Nodes & Physics project 2023-11-24 13:34:32 +01:00

Iliya Katushenock added the

 @ -104,0 +132,4 @@
   });
 }
 static Array<int> reverse_indices_in_groups(const Span<int> group_indices,
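
The hunk above is cut off after the first parameter, so the rest of the signature is not shown. As a rough sketch of what a helper with this name plausibly does (an assumption based on the name alone): given the group each element belongs to and the offsets of the groups, list the element indices group by group. The name `gather_indices_by_group`, the second parameter, and the `std::vector` types are hypothetical, and the loop is sequential for brevity.

```cpp
#include <cassert>
#include <vector>

/* Hypothetical sketch. `group_indices[i]` is the group of element `i`;
 * `offsets` has one entry per group plus a final total. Returns the element
 * indices ordered by group, keeping the original order inside each group. */
std::vector<int> gather_indices_by_group(const std::vector<int> &group_indices,
                                         const std::vector<int> &offsets)
{
  assert(!offsets.empty());
  assert(offsets.back() == int(group_indices.size()));
  /* Next free slot of every group, starting at the group's first slot. */
  std::vector<int> next_slot(offsets.begin(), offsets.end() - 1);
  std::vector<int> result(group_indices.size());
  for (int i = 0; i < int(group_indices.size()); i++) {
    const int group = group_indices[i];
    result[next_slot[group]++] = i;
  }
  return result;
}
```

For example, with group indices `{1, 0, 1, 0, 2}` and offsets `{0, 2, 4, 5}`, elements 1 and 3 form group 0, elements 0 and 2 form group 1, and element 4 forms group 2.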

 @ -177,0 +302,4 @@
       const float total = elem_island.size();
       BLI_assert(total > 0.0f);
       const float scale = accumulate<float>(scale_varray, elem_island) / total;
       const float3 center = accumulate<float3>(center_varray, elem_island) / total;
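
The hunk above averages the scale and center inputs over the elements of one island. A self-contained sketch of that averaging follows, assuming plain arrays instead of virtual arrays. `Float3`, `IslandParams`, and `island_average` are hypothetical names for illustration.

```cpp
#include <cassert>
#include <vector>

struct Float3 {
  float x, y, z;
};

struct IslandParams {
  float scale;
  Float3 center;
};

/* Average the per-element scale and center over the elements of one island.
 * `island` holds the indices of the elements that belong to it. */
IslandParams island_average(const std::vector<float> &scales,
                            const std::vector<Float3> &centers,
                            const std::vector<int> &island)
{
  assert(!island.empty());
  float scale_sum = 0.0f;
  Float3 center_sum{0.0f, 0.0f, 0.0f};
  for (const int i : island) {
    scale_sum += scales[i];
    center_sum.x += centers[i].x;
    center_sum.y += centers[i].y;
    center_sum.z += centers[i].z;
  }
  const float total = float(island.size());
  return {scale_sum / total,
          {center_sum.x / total, center_sum.y / total, center_sum.z / total}};
}
```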

 @ -290,0 +371,4 @@
   /* If the result indices are for a gathered array, map them back into global indices. */
   if (face_mask.size() != mesh.faces_num) {
     parallel_transform<int>(r_item_indices, 4098, [&](const int pos) { return face_mask[pos]; });
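
The comment in the hunk above concerns indices that were computed on a gathered (compacted) array and must be translated back to indices in the full mesh. A sketch of that mapping, assuming the mask is stored as a list of the selected global indices; `map_to_global` is a hypothetical name, and the loop is sequential rather than parallel.

```cpp
#include <cassert>
#include <vector>

/* Replace every position in the gathered array by the global index that the
 * mask stores at that position. `mask[k]` is the global index of the k-th
 * selected element. */
void map_to_global(std::vector<int> &item_indices, const std::vector<int> &mask)
{
  for (int &pos : item_indices) {
    assert(pos >= 0 && pos < int(mask.size()));
    pos = mask[pos];
  }
}
```

When the mask covers every element, position and global index coincide, which is why the hunk skips the remapping in that case.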

 @ -344,3 +393,1 @@
   edge_selection.foreach_index([&](const int edge_index) {
     const int2 &edge = edges[edge_index];
     disjoint_set.join(edge[0], edge[1]);
   edge_mask.foreach_index_optimized<int>(GrainSize(4098), [&](const int edge_i) {
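
The lines above join the two vertices of every selected edge in a disjoint set, so that each connected group of vertices ends up with a common root. The sketch below shows the idea with a simple sequential union-find. The `DisjointSet` class and `vertex_island_roots` are simplified, hypothetical stand-ins and not the data structure used in the PR.

```cpp
#include <algorithm>
#include <cassert>
#include <numeric>
#include <utility>
#include <vector>

class DisjointSet {
 public:
  explicit DisjointSet(const int size) : parents_(size)
  {
    /* Every element starts as the root of its own set. */
    std::iota(parents_.begin(), parents_.end(), 0);
  }

  int find_root(int x)
  {
    assert(x >= 0 && x < int(parents_.size()));
    while (parents_[x] != x) {
      parents_[x] = parents_[parents_[x]]; /* Path halving. */
      x = parents_[x];
    }
    return x;
  }

  void join(const int a, const int b)
  {
    const int root_a = find_root(a);
    const int root_b = find_root(b);
    if (root_a != root_b) {
      /* Keep the smaller index as the root so results are deterministic. */
      parents_[std::max(root_a, root_b)] = std::min(root_a, root_b);
    }
  }

 private:
  std::vector<int> parents_;
};

/* Root of the island that each vertex belongs to. */
std::vector<int> vertex_island_roots(const int verts_num,
                                     const std::vector<std::pair<int, int>> &selected_edges)
{
  DisjointSet disjoint_set(verts_num);
  for (const std::pair<int, int> &edge : selected_edges) {
    disjoint_set.join(edge.first, edge.second);
  }
  std::vector<int> roots(verts_num);
  for (int v = 0; v < verts_num; v++) {
    roots[v] = disjoint_set.find_root(v);
  }
  return roots;
}
```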

 @ -449,0 +487,4 @@
       const GroupedSpan<int> item_islands(item_offsets.as_span(), item_indices);
       const GroupedSpan<int> vert_islands(vert_offsets.as_span(), vert_indices);
       const VArray<float> &scale_varray = evaluator.get_evaluated<float>(0);

 @ -101,0 +211,4 @@
           for (const int i : indices.slice(range)) {
             value += values[i];
           }
           return join_accumulators({value / float(indices.size()), 1}, other);

 @ -207,2 +333,2 @@
     for (const int island_index : range) {
       const ElementIsland &island = islands[island_index];
   AtomicDisjointSet disjoint_set(vert_mask.size());
   const GroupedSpan<int> face_verts(mesh.face_offsets(), mesh.corner_verts());

 @ -394,1 +438,3 @@
   scale_vertex_islands_on_axis(mesh, island, params, get_edge_verts);
   /* If the result indices are for a gathered array, map them back into global indices. */
   if (edge_mask.size() != mesh.edges_num) {
     parallel_transform<int>(r_item_indices, 4096, [&](const int pos) { return edge_mask[pos]; });

 @ -112,0 +236,4 @@
                             Mesh &mesh)
 {
   MutableSpan<float3> positions = mesh.vert_positions_for_write();
   threading::parallel_for(
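
The hunk above is the start of the function that writes the new vertex positions. For uniform scaling, each vertex of an island moves along the line through the island center: `p' = center + (p - center) * scale`. A sketch of that step, with hypothetical names (`Float3`, `scale_island_uniform`) and a sequential loop in place of `threading::parallel_for`:

```cpp
#include <cassert>
#include <vector>

struct Float3 {
  float x, y, z;
};

/* Scale the vertices of one island uniformly around `center`. Vertices that
 * are not listed in `island_verts` are left untouched. */
void scale_island_uniform(std::vector<Float3> &positions,
                          const std::vector<int> &island_verts,
                          const Float3 center,
                          const float scale)
{
  for (const int v : island_verts) {
    assert(v >= 0 && v < int(positions.size()));
    Float3 &p = positions[v];
    p.x = center.x + (p.x - center.x) * scale;
    p.y = center.y + (p.y - center.y) * scale;
    p.z = center.z + (p.z - center.z) * scale;
  }
}
```

Because islands do not share vertices, a loop like this could process different islands independently of each other.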
