Refactor how we handle versionning during readfile, to allow creation of new IDs in a reliable way. #111932

New Issue

Bastien Montagne · 2023-09-04T17:41:05+02:00

Bastien Montagne commented

2023-09-04 17:41:05 +02:00

This design task aims at enabling creation of new IDs at any stage of the read file process, in a safe and sane way. In particular, the place-holders for missing linked data, and the IDs created as part of versioning.

Problem

Several recent issues and incidents have shine some light over several weaknesses of our current do_versions code, regarding adding IDs:

It is forbidden in after_liblink, with some exceptions for certain ID types (!).
It is allowed by code, but not considered good practice, in regular do_version.
It is recommended to use BLO_read_do_version_after_setup when adding new IDs, however this adds some serious draw-backs too, and its usage should remain as exceptional as possible.
In any case, 'new' IDs are added at readtime too, before any do_version code is ran: the empty 'place holders' generated when some directly linked data reference cannot be found anymore.
The main issue with creating new IDs before/during versioning is that these new IDs, even though they have been created with current BKE code, will still go through all the versioning code required by the version of the loaded blendfile.
Adding new IDs before the lib-linking process also means that they need to be taken into account during the liblinking process itself, as their addresses are not known by default by the readfile code. Although to my knowledge there were never a reported bug about it, this is a very nasty 'potential' bug in our current code.

Proposal

The general idea is that IDs actually read by readfile process should be tagged for do_version, such that versioning code can only process them, and skip the others (which are assumed created according to current data version) entirely.

While tagging read IDs is fairly trivial, the problem is to avoid processing them in current do_version code.

Create a new LIB_TAG_DO_VERSIONING ID tag.
Tag all IDs read from blendfile with this new tag (except for the placeholders generated for missing linked data).
- This tag needs to be cleared out at the end of the readfile code, which means that BLO_read_do_version_after_setup code will not be aware of this. This is not expected to be an issue in practice, as code there is supposed to work on high-level data info (like 'is this object a proxy'), not on version-based info.
Refactor the do_version code to let the generic readfile code decide whether a given ID needs to be versioned or not.
- This is the complex part, see below for details.

`do_version` Refactor

The proposal is to replace the usages of LISTBASE_FOREACH over ID types listbases in versioning code, by a dedicated iterator. This iterator code would ensure the generic checks (placeholders, newly created IDs, etc.).

This should also cover slightly more specialized iterators, like e.g. FOREACH_NODETREE, or the generic FOREACH_MAIN_ID.

This change can be implemented in two steps, the first one by proxying current iterators with new defined names. This would be a very noisy commit, but it would be guaranteed to have no effect at all on the behavior of the code.

Proposed new names are to re-use existing ones, prefixed with DO_VERSION_.

The second commit will then be the one implementing the new 'filtered' behavior, together with the other aspects of this design.

Other Ideas

Move versioning code to IDTypes.

Initial idea was to have some sort of IDType structure defined in versioning code, which would gather all versioning for each type. Could even have been added to the actual IDTypeInfo maybe?

But this is likely a fairly complicated change to implement, in case e.g. there are interactions between IDs.

Further more, defining a code structure that would work nicely with the do_version requirements is challenging (each ID can be processed many times, for each version increment, and it's very likely not a good idea to switch to a model where each ID would be processed in one go over all the required versions).

So for now it feels like a potentially huge time consuming task, which does not seems to be worth it.

Add newly created IDs to a temporary separate Main.

While this would avoid the need for new tag for these IDs, and the change to the versioning code itself to filter them out, this has several drawbacks that are likely harder to address:

It requires a new Main with special meanings, and special handling, in the whole readfile code. Probably even one extra main for each library too.
It requires specific handling of naming for the added IDs, to avoid name collision with IDs from the 'real' read Mains.

Notes:

Somewhat related to #92333.
The current 'multi-stage' versioning process causes another type of problems, which is that a later versioning code before lib-linking can make an earlier versioning code in after_liblink invalid/broken. The same is (even more) true when it comes to code in BLO_read_do_version_after_setup, however this is a known and expected issue, since that one is fairly version-agnostic.
There is no clear solution to this problem currently, since it does not seem to be possible to process versioning at a single point in readfile code.

This design task aims at enabling creation of new IDs at any stage of the read file process, in a safe and sane way. In particular, the place-holders for missing linked data, and the IDs created as part of versioning. ## Problem Several recent issues and incidents have shine some light over several weaknesses of our current do_versions code, regarding adding IDs: * It is forbidden in `after_liblink`, with some exceptions for certain ID types (!). * It is allowed by code, but not considered good practice, in regular `do_version`. * It is recommended to use `BLO_read_do_version_after_setup` when adding new IDs, however this adds some serious draw-backs too, and its usage should remain as exceptional as possible. * In any case, 'new' IDs are added at readtime too, before any do_version code is ran: the empty 'place holders' generated when some directly linked data reference cannot be found anymore. * The main issue with creating new IDs before/during versioning is that these new IDs, even though they have been created with current BKE code, will still go through all the versioning code required by the version of the loaded blendfile. * Adding new IDs before the lib-linking process also means that they need to be taken into account during the liblinking process itself, as their addresses are not known by default by the readfile code. Although to my knowledge there were never a reported bug about it, this is a very nasty 'potential' bug in our current code. ## Proposal The general idea is that IDs actually read by readfile process should be tagged for do_version, such that versioning code can only process them, and skip the others (which are assumed created according to current data version) entirely. While tagging read IDs is fairly trivial, the problem is to avoid processing them in current do_version code. * [ ] Create a new `LIB_TAG_DO_VERSIONING` ID tag. * [ ] Tag all IDs read from blendfile with this new tag (except for the placeholders generated for missing linked data). * This tag needs to be cleared out at the end of the readfile code, which means that `BLO_read_do_version_after_setup` code will not be aware of this. This is not expected to be an issue in practice, as code there is supposed to work on high-level data info (like 'is this object a proxy'), not on version-based info. * [ ] Refactor the do_version code to let the generic readfile code decide whether a given ID needs to be versioned or not. * This is the complex part, see below for details. ### `do_version` Refactor The proposal is to replace the usages of `LISTBASE_FOREACH` over ID types listbases in versioning code, by a dedicated iterator. This iterator code would ensure the generic checks (placeholders, newly created IDs, etc.). This should also cover slightly more specialized iterators, like e.g. `FOREACH_NODETREE`, or the generic `FOREACH_MAIN_ID`. This change can be implemented in two steps, the first one by proxying current iterators with new defined names. This would be a very noisy commit, but it would be guaranteed to have no effect at all on the behavior of the code. Proposed new names are to re-use existing ones, prefixed with `DO_VERSION_`. The second commit will then be the one implementing the new 'filtered' behavior, together with the other aspects of this design. #### Other Ideas ##### Move versioning code to IDTypes. Initial idea was to have some sort of IDType structure defined in versioning code, which would gather all versioning for each type. Could even have been added to the actual IDTypeInfo maybe? But this is likely a fairly complicated change to implement, in case e.g. there are interactions between IDs. Further more, defining a code structure that would work nicely with the do_version requirements is challenging (each ID can be processed many times, for each version increment, and it's very likely not a good idea to switch to a model where each ID would be processed in one go over all the required versions). So for now it feels like a potentially huge time consuming task, which does not seems to be worth it. ##### Add newly created IDs to a temporary separate Main. While this would avoid the need for new tag for these IDs, and the change to the versioning code itself to filter them out, this has several drawbacks that are likely harder to address: * It requires a new Main with special meanings, and special handling, in the whole readfile code. Probably even one extra main for each library too. * It requires specific handling of naming for the added IDs, to avoid name collision with IDs from the 'real' read Mains. ## Notes: * Somewhat related to #92333. * The current 'multi-stage' versioning process causes another type of problems, which is that a later versioning code before lib-linking can make an earlier versioning code in `after_liblink` invalid/broken. The same is (even more) true when it comes to code in `BLO_read_do_version_after_setup`, however this is a known and expected issue, since that one is fairly version-agnostic. _There is no clear solution to this problem currently, since it does not seem to be possible to process versioning at a single point in readfile code._

Bastien Montagne added the

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

Refactor how we handle versionning during readfile, to allow creation of new IDs in a reliable way. #111932