Geometry Nodes: Edge Paths to Curves node speedup #115115

Iliya Katushenock · 2023-11-19T00:22:42+01:00

A Shortest Path is a set of vertices in which each vertex knows the index of the next one.
For a subset of these vertices (the selection input), a curve is built from each vertex by
following the next indices. In many cases several curves merge at some vertex, and all
vertices after it are shared.

To know the size of each curve and allocate the result array, each curve has to be traversed.
While traversing a curve, many of its vertices may already have been visited by previous curves.
The idea is to cache, for each vertex, the number of vertices that follow it. Put differently,
this is a depth-first traversal of a directed graph, and caching the depth of each visited
vertex makes the algorithm linear in the number of points.
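
The caching idea can be illustrated with a minimal sketch. It is not the code from this patch: the function name `compute_path_sizes`, the use of `std::vector` instead of Blender containers, and the convention that an out-of-range next index ends a path are all assumptions made for illustration. Negative values temporarily mark vertices on the walk in progress, so cycles are detected without a separate visited array.

```cpp
#include <cassert>
#include <vector>

/* Hypothetical helper: for every vertex, count the distinct vertices on the
 * path that starts there. `next[v]` is the following vertex; any value
 * outside [0, n) is assumed to end the path. */
std::vector<int> compute_path_sizes(const std::vector<int> &next)
{
  const int n = static_cast<int>(next.size());
  /* 0: not computed yet, > 0: cached size, < 0: -(position on current walk + 1). */
  std::vector<int> size(n, 0);
  std::vector<int> walk;
  for (int start = 0; start < n; start++) {
    if (size[start] != 0) {
      continue;
    }
    walk.clear();
    int v = start;
    int tail = 0; /* Number of vertices that follow the end of the walk. */
    while (true) {
      if (v < 0 || v >= n) {
        break; /* Walked off the end of the path. */
      }
      if (size[v] > 0) {
        tail = size[v]; /* Reached a vertex cached by an earlier walk. */
        break;
      }
      if (size[v] < 0) {
        /* Closed a cycle: every vertex on it sees the whole cycle. */
        const int first = -size[v] - 1;
        const int cycle_size = static_cast<int>(walk.size()) - first;
        for (int i = first; i < static_cast<int>(walk.size()); i++) {
          size[walk[i]] = cycle_size;
        }
        walk.resize(first);
        tail = cycle_size;
        break;
      }
      size[v] = -(static_cast<int>(walk.size()) + 1);
      walk.push_back(v);
      v = next[v];
    }
    /* Unwind: each vertex is one longer than the vertex after it. */
    while (!walk.empty()) {
      size[walk.back()] = ++tail;
      walk.pop_back();
    }
  }
  return size;
}
```

Every vertex is pushed onto `walk` at most once over the whole run, which is where the linear bound comes from. For `next = {1, 2, -1, 1}` the paths from vertices 0 and 3 merge at vertex 1, and the second walk stops as soon as it reaches the cached value there.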

Speedup of curve size computation and point index gathering (ms):

| Number of points: | 5'577'362 | 4'040'246 | 2'752'442 | 1'905'715 | 1'367'849 | 1'020'922 |
| -- | -- | -- | -- | -- | -- | -- |
| Old: indices | 2'303.94 | 1'910.15 | 1'161.3 | 771.58 | 489.07 | 350.28 |
| New: indices | 427.94 | 285.64 | 169.2 | 104.76 | 70.27 | 49.9 |
| Speedup (Old / New) | 5.38x | 6.68x | 6.86x | 7.36x | 6.99x | 7.09x |
| Curves from indices | 358.92 | 286.53 | 181.02 | 109.04 | 73.54 | 44.34 |

Total time of node evaluation (ms):

| Number of points: | 5'577'362 | 4'040'246 | 2'752'442 | 1'905'715 | 1'367'849 | 1'020'922 |
| -- | -- | -- | -- | -- | -- | -- |
| Old | 2'730.42 | 2'026 | 1'222.84 | 854.09 | 561.47 | 403.97 |
| New | 959.93 | 638.06 | 706.43 | 349.53 | 238.29 | 149.29 |

Measurements were made with the attached file.

Possible future improvements:

1. This will not make the node 10x faster, but the graph traversal could be multithreaded.
2. Gathering the indices of curve points could involve fewer random memory reads. Each vertex of the graph could also know the start of an already written curve and its list of indices.
3. The hot loop of the branch size computation could do fewer random memory writes by holding visited vertices in a `std::set` (sorted by index) or a `blender::Set` stack, instead of writing `-i` into each of them.
4. Statically check whether the input field is `Shortest Edge Path` and use a faster algorithm that does not handle the cycle case.
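
As context for the second item, here is a rough sketch of what index gathering can look like once the curve sizes are known. It is an assumption about the general shape of the step, not the code in this patch: `CurveIndices`, `gather_curve_indices`, and the `starts` list of selected vertices are hypothetical names, and `sizes` is assumed to hold the number of distinct vertices on the path from each vertex.

```cpp
#include <cassert>
#include <cstddef>
#include <vector>

/* Hypothetical result layout: curve `c` owns
 * indices[offsets[c]] .. indices[offsets[c + 1] - 1]. */
struct CurveIndices {
  std::vector<int> offsets;
  std::vector<int> indices;
};

CurveIndices gather_curve_indices(const std::vector<int> &next,
                                  const std::vector<int> &sizes,
                                  const std::vector<int> &starts)
{
  assert(next.size() == sizes.size());
  CurveIndices result;
  result.offsets.reserve(starts.size() + 1);
  /* Prefix sum over the selected curves gives the offsets and the total size. */
  int total = 0;
  for (const int start : starts) {
    result.offsets.push_back(total);
    total += sizes[start];
  }
  result.offsets.push_back(total);
  result.indices.resize(total);
  /* Walk every curve again and write its vertices into its own slice. */
  for (std::size_t c = 0; c < starts.size(); c++) {
    int v = starts[c];
    for (int i = result.offsets[c]; i < result.offsets[c + 1]; i++) {
      result.indices[i] = v;
      v = next[v]; /* Scattered read: neighbors on a path are rarely adjacent in memory. */
    }
  }
  return result;
}
```

In this form, curves that share a suffix each walk the shared vertices again through `next`. That is the random-read cost the second item proposes to reduce by letting a vertex point at a slice that has already been written.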

Path to Curves speedup test.blend

988 KiB

 @ -49,1 +51,3 @@
       if (next_vert < 0 || next_vert >= mesh.totvert) {
   Array<int> rang(mesh.totvert, non_checked);
   Array<bool> visited(mesh.totvert, false);
   const auto rang_for_vertex = [&](const int vertex) -> int {
