blender-archive

Archived

Author	SHA1	Message	Date
Campbell Barton	02226e9069	Cleanup: spelling in comments	2023-01-09 17:41:08 +11:00
Jacques Lucke	73a2c79c07	Functions: free memory of unused sockets earlier During geometry nodes evaluation some sockets can be determined to be unused, for example based on the condition input in a switch node. Once a socket is determined to be unused, that information has to be propagated backwards through the tree to free any memory that may have been reserved for those sockets already. This is happening before this commit already, but in a less ideal way. Determining that sockets are unused early is good because it helps with memory reuse and avoids copy-on-write copies caused by shared data. Now, nodes that are scheduled because an output became unused have priority over nodes scheduled for other reasons.	2023-01-08 21:09:33 +01:00
Jacques Lucke	891b973029	Functions: optimize multi-function evaluation in materialized mode This allows auto-vectorization to happen when the a multi-function is evaluated in "materialized" mode, i.e. it is processed in chunks where all input and outputs values are stored in contiguous arrays. It also unifies the handling input, mutable and output parameters a bit. Now they all can use tempory buffers in the same way.	2023-01-08 17:21:57 +01:00
Jacques Lucke	5f9a48ed59	Functions: improve compiler optimizability of multi-function evaluation This simplifies the code enough so that msvc is able to unroll and vectorize some multi-functions like simple addition. The performance improvements are almost as good as the GCC improvements shown in D16942 (for add and multiply at least).	2023-01-08 15:04:51 +01:00
Jacques Lucke	ef78811ac7	Cleanup: add missing inline	2023-01-07 23:49:36 +01:00
Jacques Lucke	c4d4db39dc	Functions: enable more gcc optimizations for multi-functions This mainly helps GCC catch up with Clang in terms of field evaluation performance in some cases. In some cases this patch can speedup field evaluation 2-3x (e.g. when there are many float math nodes). See D16942 for a more detailed benchmark.	2023-01-07 20:23:20 +01:00
Jacques Lucke	a2ea32a600	Cleanup: inline signatures into multi-function constructors This reduces the amount of code. Also the signature should be thought of as being setup in the constructor, so it's good if the code is there as well.	2023-01-07 18:00:37 +01:00
Jacques Lucke	eedcf1876a	Functions: introduce multi-function namespace This moves all multi-function related code in the `functions` module into a new `multi_function` namespace. This is similar to how there is a `lazy_function` namespace. The main benefit of this is that many types names that were prefixed with `MF` (for "multi function") can be simplified. There is also a common shorthand for the `multi_function` namespace: `mf`. This is also similar to lazy-functions where the shortened namespace is called `lf`.	2023-01-07 17:32:28 +01:00
Jacques Lucke	a5b27f9858	Functions: simplify multi-function signature type * `depends_on_context` was not used for a long time already. * `param_data_indices` is not used since rB42b88c008861b6. * The remaining data is moved to a single `Vector` to avoid having to do two allocations when the size signature becomes larger than fits into the inline buffer.	2023-01-07 16:51:26 +01:00
Jacques Lucke	577442a26f	Functions: build multi-function signature in-place This avoids a move of the signature after building it. Tthe value had to be moved out of `MFSignatureBuilder` in the `build` method. This also makes the naming a bit less confusing where sometimes both the `MFSignature` and `MFSignatureBuilder` were referred to as "signature".	2023-01-07 16:30:56 +01:00
Jacques Lucke	b3146200a8	Functions: refactor multi-function builder API * New `build_mf` namespace for the multi-function builders. * The type name of the created multi-functions is now "private", i.e. the caller has to use `auto`. This has the benefit that the implementation can change more freely without affecting the caller. * `CustomMF` does not use `std::function` internally anymore. This reduces some overhead during code generation and at run-time. * `CustomMF` now supports single-mutable parameters.	2023-01-07 16:19:59 +01:00
Jacques Lucke	1bbf1ed03c	Functions: improve devirtualization in multi-function builder This refactors how devirtualization is done in general and how multi-functions use it. * The old `Devirtualizer` class has been removed in favor of a simpler solution. It is also more general in the sense that it is not coupled with `IndexMask` and `VArray`. Instead there is a function that has inputs which control how different types are devirtualized. The new implementation is currently less general with regard to the number of parameters it supports. This can be changed in the future, but does not seem necessary now and would make the code less obvious. * Devirtualizers for different types are now defined in their respective headers. * The multi-function builder works with the `GVArray` stored in `MFParams` directly now, instead of first converting it to a `VArray<T>`. This reduces some constant overhead, which makes the multi-function slightly faster. This is only noticable when very few elements are processed though. No functional changes or performance regressions are expected.	2023-01-07 12:55:48 +01:00
Jacques Lucke	1942d55c07	Cleanup: remove unused code	2023-01-07 12:25:46 +01:00
Jacques Lucke	f7e9bc65ab	Cleanup: simplify getting value of generic ValueOrField	2023-01-06 22:30:14 +01:00
Jacques Lucke	e756b0fea0	Cleanup: remove dead code	2023-01-06 22:30:14 +01:00
Clément Foucault	8f44c37f5c	Cleanup: Rename BLI_math_vec_types* files to BLI_math_vector_types This is for the sake of consistency and clarity.	2023-01-06 20:09:51 +01:00
Jacques Lucke	42b88c0088	Functions: simplify multi-function parameters The use of `std::variant` allows combining the four vectors into one which more closely matches the intend and avoids a workaround used before. Note that this uses `std::get_if` instead of `std::get` because `std::get` is only available since macOS 10.14.	2023-01-06 11:50:56 +01:00
Campbell Barton	14fc02f91d	Cleanup: spelling in comments	2023-01-06 14:00:36 +11:00
Jacques Lucke	3819a9b15a	Fix T103614: crash during geometry nodes evaluation with tbb disabled	2023-01-05 15:36:39 +01:00
Jacques Lucke	2ffd08e952	Geometry Nodes: deterministic anonymous attribute lifetimes Previously, the lifetimes of anonymous attributes were determined by reference counts which were non-deterministic when multiple threads are used. Now the lifetimes of anonymous attributes are handled more explicitly and deterministically. This is a prerequisite for any kind of caching, because caching the output of nodes that do things non-deterministically and have "invisible inputs" (reference counts) doesn't really work. For more details for how deterministic lifetimes are achieved, see D16858. No functional changes are expected. Small performance changes are expected as well (within few percent, anything larger regressions should be reported as bugs). Differential Revision: https://developer.blender.org/D16858	2023-01-05 14:05:30 +01:00
Jacques Lucke	83f519b7c1	Functions: initialize node storage and default values on first execution Previously, this happened when the "node task" first runs, which might not actually execute the node if there are missing inputs. Deferring the allocation of storage and default inputs allows for better memory reuse later (currently the memory is not reused).	2023-01-04 18:46:50 +01:00
Brecht Van Lommel	bd2d7a4a81	Fix build error in debug mode due to wrong code in assert	2023-01-04 11:20:01 +01:00
Jacques Lucke	30753f7692	Functions: add method to iterate over all inputs of a field This is part of D16858. Iterating over all field inputs allows us to extract all anonymous attributes used by a field relatively easily which is necessary for D16858. This could potentially be used for better field tooltips for nested fields, but that needs further investigation.	2023-01-03 12:37:18 +01:00
Jacques Lucke	0bc0e3f9f7	Fix: geometry nodes crashes with large trees This was an oversight in rBdba2d828462ae22de5. The evaluator uses multiple threads to initialize node states but it is still in single threaded mode. `get_main_or_local_allocator` did not return the right allocator in this case.	2023-01-02 18:34:01 +01:00
Jacques Lucke	dba2d82846	Geometry Nodes: avoid using enumerable thread specific on single thread The geometry nodes evaluator supports "lazy threading", i.e. it starts out single-threaded. But when it determines that multi-threading can be benefitial, it switches to multi-threaded mode. Now it only creates an enumerable-thread-specific if it is actually using multiple threads. This results in a 6% speedup in my test file with many node groups and math nodes.	2022-12-29 21:05:58 +01:00
Jacques Lucke	b6ca942e47	Functions: support cycles in lazy-function graph Lazy-function graphs are now evaluated properly even if they contain cycles. Note that cycles are only ok if there is no data dependency cycle. For example, a node might output something that is fed back into itself. As long as the output can be computed without the input that it feeds into, everything is ok. The code that builds the graph is responsible for making sure that there are no actual data dependencies.	2022-12-29 16:39:40 +01:00
Jacques Lucke	7e4f988072	BLI: improve node graph export in dot format This makes it bit easier to export node graphs and also allows for more customization of links and sockets.	2022-12-29 15:09:52 +01:00
Jacques Lucke	c37e07bc01	Geometry Nodes: improve dot graph export of lazy function graph * Dim default input values. * Print default input values instead of type name. * Add node/socket names to group input/output nodes.	2022-12-16 12:18:49 +01:00
Hans Goudey	2155bdd500	Cleanup: Remove "done" variable from node runtime The runtime storage is meant for more persistent things. These local states for an algorithm are much better handled by an array now.	2022-12-02 14:14:14 -06:00
Hans Goudey	584089879c	BLI: Follow up and fix recent span slicing change `a5e7657cee` didn't account for slices of zero sizes, and the asserts were slightly incorrect otherwise. Also, the change didn't apply to `Span`, only `MutableSpan`, which was a mistake. This also adds "safe" methods to `IndexMask`, and switches function calls where necessary.	2022-11-23 11:36:06 -06:00
Hans Goudey	14d0b57be7	Cleanup: Use array_utils to copy evaluated field array	2022-11-22 12:49:51 -06:00
Jacques Lucke	0ebb7ab41f	Geometry Nodes: disable unreachable nodes in evaluator Nodes that were not connected to any output could still impact performance. While they were never executed, sometimes their inputs could keep references to geometries that other nodes want to modify. That caused unnecessary geometry copies, because a geometry can only be modified if it is not shared. Now, inputs that will never be used are tagged accordingly and they will never have references to geometries that others might want to modify.	2022-11-16 14:26:11 +01:00
Jacques Lucke	edcce2c073	Cleanup: correct inverted variable name	2022-11-16 13:19:23 +01:00
Jacques Lucke	a6c822733a	BLI: improve CPPType system * Support bidirectional type lookups. E.g. finding the base type of a field was supported, but not the other way around. This also removes the todo in `get_vector_type`. To achieve this, types have to be registered up-front. * Separate `CPPType` from other "type traits". For example, previously `ValueOrFieldCPPType` adds additional behavior on top of `CPPType`. Previously, it was a subclass, now it just contains a reference to the `CPPType` it corresponds to. This follows the composition-over-inheritance idea. This makes it easier to have self-contained "type traits" without having to put everything into `CPPType`. Differential Revision: https://developer.blender.org/D16479	2022-11-12 18:33:31 +01:00
Hans Goudey	97746129d5	Cleanup: replace UNUSED macro with commented args in C++ code This is the conventional way of dealing with unused arguments in C++, since it works on all compilers. Regex find and replace: `UNUSED$(\w+)$` -> `/$1/`	2022-10-03 17:38:16 -05:00
Campbell Barton	333e41eac6	Cleanup: replace C-style casts with functional casts for numeric types Use function style casts in C++ headers & source.	2022-09-26 17:58:36 +10:00
Jacques Lucke	c6e70e7bac	Cleanup: follow C++ type cast style guide in some files https://wiki.blender.org/wiki/Style_Guide/C_Cpp#C.2B.2B_Type_Cast This was discussed in https://devtalk.blender.org/t/rfc-style-guide-for-type-casts-in-c-code/25907.	2022-09-25 17:39:45 +02:00
Campbell Barton	f68cfd6bb0	Cleanup: replace C-style casts with functional casts for numeric types	2022-09-25 20:17:08 +10:00
Campbell Barton	5517c848bd	Cleanup: spelling in comments	2022-09-21 12:00:01 +10:00
Jacques Lucke	5c81d3bd46	Geometry Nodes: improve evaluator with lazy threading In large node setup the threading overhead was sometimes very significant. That's especially true when most nodes do very little work. This commit improves the scheduling by not using multi-threading in many cases unless it's likely that it will be worth it. For more details see the comments in `BLI_lazy_threading.hh`. Differential Revision: https://developer.blender.org/D15976	2022-09-20 11:08:05 +02:00
Jacques Lucke	c8a18fd239	Cleanup: add hint that typedef is used as compiler bug workaround https://developercommunity.visualstudio.com/t/Alias-template-inside-fold-expression-fa/10040507	2022-09-20 10:42:25 +02:00
Campbell Barton	95f05a6a4b	Cleanup: spelling in comments	2022-09-16 18:14:33 +10:00
Jacques Lucke	5c4295ee6f	Workaround for msvc compiler bug This is the same issue as in rB2e8089b6bf50.	2022-09-15 09:34:20 +02:00
Campbell Barton	2c53970bbf	Cleanup: use doxy sections, remove outdated comment	2022-09-15 15:27:21 +10:00
Campbell Barton	f78219c9a8	Cleanup: spelling in comments	2022-09-13 18:03:09 +10:00
Jacques Lucke	4130f1e674	Geometry Nodes: new evaluation system This refactors the geometry nodes evaluation system. No changes for the user are expected. At a high level the goals are: * Support using geometry nodes outside of the geometry nodes modifier. * Support using the evaluator infrastructure for other purposes like field evaluation. * Support more nodes, especially when many of them are disabled behind switch nodes. * Support doing preprocessing on node groups. For more details see T98492. There are fairly detailed comments in the code, but here is a high level overview for how it works now: * There is a new "lazy-function" system. It is similar in spirit to the multi-function system but with different goals. Instead of optimizing throughput for highly parallelizable work, this system is designed to compute only the data that is actually necessary. What data is necessary can be determined dynamically during evaluation. Many lazy-functions can be composed in a graph to form a new lazy-function, which can again be used in a graph etc. * Each geometry node group is converted into a lazy-function graph prior to evaluation. To evaluate geometry nodes, one then just has to evaluate that graph. Node groups are no longer inlined into their parents. Next steps for the evaluation system is to reduce the use of threads in some situations to avoid overhead. Many small node groups don't benefit from multi-threading at all. This is much easier to do now because not everything has to be inlined in one huge node tree anymore. Differential Revision: https://developer.blender.org/D15914	2022-09-13 08:44:32 +02:00
Hans Goudey	6bcda04d1f	Geometry Nodes: Port sample curves node to new data-block Use the newer more generic sampling and interpolation functions developed recently (`ab444a80a2`) instead of the `CurveEval` type. Functions are split up a bit more internally, to allow a separate mode for supplying the curve index directly in the future (T92474). In one basic test, the performance seems mostly unchanged from 3.1. Differential Revision: https://developer.blender.org/D14621	2022-07-22 09:59:28 -05:00
Jacques Lucke	ef8bb8c0d5	Functions: improve span buffer reuse in procedure execution This potentially overallocates buffers so that they are usable for more data types, which allows buffers to be reused more easily. That leads to fewer separate allocations and improved cache usage (in one of my test files the number of separate allocations went down from 1826 to 1555).	2022-06-25 19:41:44 +02:00
Jacques Lucke	b513c89e84	Functions: avoid using Map for small values This leads to a 5-10% performance improvement in my benchmark that runs a procedure many times on a single element.	2022-06-25 18:53:26 +02:00
Hans Goudey	3e5a4d1412	Geometry Nodes: Optimize selection for virtual array input This makes calculation of selected indices slightly faster when the input is a virtual array (the direct output of various nodes like Face Area, etc). The utility can be helpful for other areas that need to find selected indices besides field evaluation. With the face area node used as a selection with 4 million faces, the speedup is 3.51 ms to 3.39 ms, just a slight speedup. Differential Revision: https://developer.blender.org/D15127	2022-06-23 11:51:33 -05:00

1 2 3 4 5 ...

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

272 Commits