Realtime Compositor: Implement Anisotropic Kuwahara #110786

Omar Emara · 2023-08-04T09:39:49+02:00

Omar Emara commented

2023-08-04 09:39:49 +02:00

This patch implements the Anisotropic Kuwahara filter for the Realtime
compositor. The implementation is based on three papers on Anisotropic Kuwahara
filtering, presented and detailed in the code. The implementation is different
from the existing CPU implementation, but is a higher quality one that is also
an order of magnitude faster and conforms to the methods described in the paper.

The new implementation exposes two extra parameters that control the sharpness
and directionality of the output, giving more artistic freedom.

A comparison between an original image, the existing CPU algorithm, and the new GPU algorithm:

A comparison between the differenet sharpness levels:

A comparison between the different eccentricity levels:

Another comparison between an original image, the existing CPU algorithm, and the new GPU algorithm:

Iterative application of filter:

Temporal stability:

This patch implements the Anisotropic Kuwahara filter for the Realtime compositor. The implementation is based on three papers on Anisotropic Kuwahara filtering, presented and detailed in the code. The implementation is different from the existing CPU implementation, but is a higher quality one that is also an order of magnitude faster and conforms to the methods described in the paper. The new implementation exposes two extra parameters that control the sharpness and directionality of the output, giving more artistic freedom. A comparison between an original image, the existing CPU algorithm, and the new GPU algorithm: ![Algorithms](https://projects.blender.org/attachments/64e85dc6-a6ac-4f53-8cd1-3146192f68b7) A comparison between the differenet sharpness levels: ![Sharpness](https://projects.blender.org/attachments/9f29c100-7de8-46f5-b62c-79f2ac26f465) A comparison between the different eccentricity levels: ![Eccentricity](https://projects.blender.org/attachments/6d52c285-a2e6-45f9-92cf-c6a0a953daaf) Another comparison between an original image, the existing CPU algorithm, and the new GPU algorithm: ![Algorithms](https://projects.blender.org/attachments/d202ada7-29b2-4a70-9fc2-a40dced8419c) Iterative application of filter: ![Iterations](https://projects.blender.org/attachments/3b11f55f-fa79-42f8-9842-09d16060787b) Temporal stability: <video src="https://projects.blender.org/attachments/70934539-9906-48c9-8ce6-f3d10b854a42" controls> </video>

algorithms.png

1.4 MiB

algorithms.png

1.1 MiB

eccentricity.png

925 KiB

sharpness.png

933 KiB

iterations.png

1.3 MiB

temporalStability.mp4

2.8 MiB

👍 2 ❤️ 1

Omar Emara added 1 commit 2023-08-04 09:40:00 +02:00

9ac60dffa4 Realtime Compositor: Implement Anisotropic Kuwahara

This patch implements the Anisotropic Kuwahara filter for the realtime
compositor.

- [ ] Fix noisy minimum eigenvector.
- [ ] Expose Eccentricity and Sharpness to the user.

Omar Emara added 1 commit 2023-08-04 11:55:38 +02:00

e29c397177 Fix noisy minimum eigenvector

Omar Emara added 1 commit 2023-08-04 13:32:20 +02:00

2af2931544 Expose eccentricity and sharpness to the user

Omar Emara added 3 commits 2023-08-04 21:19:17 +02:00

e375a57345 Use the sector weighting function from the multi-scale paper

790728d913 Use a more intuitive range for sharpness

f55b9eff31 Define a reasonable eccentricity range

Omar Emara added 1 commit 2023-08-04 21:20:58 +02:00

buildbot/vexp-code-patch-coordinator Build done. Details

3712151710 Correct old documentation

Omar Emara changed title from ~~WIP: Realtime Compositor: Implement Anisotropic Kuwahara~~ to Realtime Compositor: Implement Anisotropic Kuwahara

2023-08-04 21:56:39 +02:00

Omar Emara requested review from Sergey Sharybin 2023-08-04 21:57:34 +02:00

Omar Emara requested review from Clément Foucault 2023-08-04 21:57:41 +02:00

Omar Emara requested review from Habib Gahbiche 2023-08-04 21:57:48 +02:00

Omar Emara commented

2023-08-04 22:01:09 +02:00

@blender-bot package

Blender Bot commented

2023-08-04 22:01:13 +02:00

Package build started. Download here when ready.

Package build started. [Download here](https://builder.blender.org/download/patch/PR110786) when ready.

Habib Gahbiche reviewed 2023-08-05 14:39:36 +02:00

source/blender/makesdna/DNA_node_types.h Outdated

						
				@ -996,6 +996,8 @@ typedef struct NodeKuwaharaData {

				  short size;

				  short variation;

				  int smoothing;

Habib Gahbiche commented

2023-08-05 14:39:36 +02:00

I think sharpness and smoothing are the same thing from a user's perspective. Ideally we shouldn't duplicate it, even if GPU and CPU implementations are different.

Omar Emara commented

2023-08-05 15:52:08 +02:00

They are not really the same thing though, smoothing controls the homogeneity of the directions of neighbouring kuwahara sectors, while sharpness controls the sharpness of the sectors themselves. And to clarify, the GPU implementation uses both.

However, I do agree that we need to clarify the names, smoothing can probably be renamed to something that reflects its function more.

They are not really the same thing though, smoothing controls the homogeneity of the directions of neighbouring kuwahara sectors, while sharpness controls the sharpness of the sectors themselves. And to clarify, the GPU implementation uses both. However, I do agree that we need to clarify the names, smoothing can probably be renamed to something that reflects its function more.

OmarEmaraDev marked this conversation as resolved

Habib Gahbiche reviewed 2023-08-05 16:09:44 +02:00

source/blender/compositor/realtime_compositor/shaders/infos/compositor_kuwahara_info.hh

						
				@ -24,0 +31,4 @@

				GPU_SHADER_CREATE_INFO(compositor_kuwahara_anisotropic)

				    .local_group_size(16, 16)

				    .push_constant(Type::INT, "radius")

Habib Gahbiche commented

2023-08-05 16:09:44 +02:00

radius is actually called size in UI, right? I think we should unify naming here. radius is also a fitting name for the classic variation so I don't mind renaming the UI parameter to radius.

`radius` is actually called `size` in UI, right? I think we should unify naming here. `radius` is also a fitting name for the classic variation so I don't mind renaming the UI parameter to radius.

Omar Emara commented

2023-08-05 16:21:45 +02:00

Size is used throughout filter nodes to denote filter window radius, so it is probably okay to leave it as Size for consistency.

Habib Gahbiche commented

2023-08-05 16:32:17 +02:00

Is it possible to rename radius to size in code then? I found it a bit hard to navigate code if variables get renamed from UI to implementation

Omar Emara commented

2023-08-05 17:01:36 +02:00

Radius makes the code clearer and is what the paper uses, so it is a necessary evil I think.

zazizizou marked this conversation as resolved

Habib Gahbiche reviewed 2023-08-05 16:12:37 +02:00

source/blender/compositor/realtime_compositor/shaders/infos/compositor_kuwahara_info.hh

						
				@ -24,0 +32,4 @@

				GPU_SHADER_CREATE_INFO(compositor_kuwahara_anisotropic)

				    .local_group_size(16, 16)

				    .push_constant(Type::INT, "radius")

				    .push_constant(Type::FLOAT, "eccentricity")

Habib Gahbiche commented

2023-08-05 16:12:37 +02:00

For me it's a bit hard to guess the effect of the parameter without reading documentation. Is preserve edge a better name maybe?

For me it's a bit hard to guess the effect of the parameter without reading documentation. Is `preserve edge` a better name maybe?

Omar Emara commented

2023-08-05 16:24:48 +02:00

You mean eccentricity? Preserve Edge is a bit misleading in that case, because as its documentation says, it defines how directional the filter is. Googling eccentricity gives good visualizations, so I think it is fine personally.

Habib Gahbiche commented

2023-08-05 16:39:23 +02:00

Yes I meant eccentricity sorry. I think I understand what it means, I was just thinking how hard it would be from a user's perspective. I'm ok with the naming if most users can understand the effect of the parameter from reading the tip tool.

Habib Gahbiche commented

2023-08-05 16:39:25 +02:00

Yes I meant eccentricity sorry. I think I understand what it means, I was just thinking how hard it would be from a user's perspective. I'm ok with the naming if most users can understand the effect of the parameter from reading the tip tool.

OmarEmaraDev marked this conversation as resolved

Omar Emara added 1 commit 2023-08-05 16:16:41 +02:00

buildbot/vexp-code-patch-coordinator Build done. Details

51befa50d0 Merge branch 'main' into gpu-anisotropic-kuwahara

Habib Gahbiche reviewed 2023-08-05 16:17:44 +02:00

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic.glsl

						
				@ -0,0 +150,4 @@

				      vec2 rotated_disk_point = M_SQRT1_2 *

				                                vec2(disk_point.x - disk_point.y, disk_point.x + disk_point.y);

				      /* Finally, we compute the other every other 4 weights starting from the 45 degreed rotated

Habib Gahbiche commented

2023-08-05 16:17:44 +02:00

typo?

Omar Emara commented

2023-08-05 16:25:53 +02:00

I will reword that.

OmarEmaraDev marked this conversation as resolved

Omar Emara commented

2023-08-05 16:19:51 +02:00

@blender-bot package

Blender Bot commented

2023-08-05 16:19:54 +02:00

Package build started. Download here when ready.

Package build started. [Download here](https://builder.blender.org/download/patch/PR110786) when ready.

Habib Gahbiche commented

2023-08-05 16:28:50 +02:00

@OmarEmaraDev Results actually look very good. Your solution also solves a few issues we have with CPU implementation. Nicely done! :)

I did a few tests and looked at the code and have a few comments:

Overall the implementation looks good to me.
I can confirm the speedup against the current full frame implementation.
I didn't measure a significant speedup from the CPU implementation proposed here: https://devtalk.blender.org/t/compositor-new-node-kuwahara-filter/29205/16?u=izo, though I'm on a macbook pro laptop with a relatively fast CPU and weak GPU, so probably not a fair comparison.
When setting radius to 1, I see some black artefacts in the image. I had this issue with CPU implementation and currently it's solved by adding an offset to the user input. This way, the user can't input a wrong value.

@OmarEmaraDev Results actually look very good. Your solution also solves a few issues we have with CPU implementation. Nicely done! :) I did a few tests and looked at the code and have a few comments: - Overall the implementation looks good to me. - I can confirm the speedup against the current full frame implementation. - I didn't measure a significant speedup from the CPU implementation proposed here: https://devtalk.blender.org/t/compositor-new-node-kuwahara-filter/29205/16?u=izo, though I'm on a macbook pro laptop with a relatively fast CPU and weak GPU, so probably not a fair comparison. - When setting `radius` to 1, I see some black artefacts in the image. I had this issue with CPU implementation and currently it's solved by adding an offset to the user input. This way, the user can't input a wrong value.

Omar Emara added 2 commits 2023-08-07 13:18:32 +02:00

8956492f4c Merge branch 'main' into gpu-anisotropic-kuwahara

buildbot/vexp-code-patch-coordinator Build done. Details

2d3d11ba0b Address review and fix missing center weight

- Rename the Smoothing parameter to Uniformity.
- Change Eccentricity range to [0, 2].
- Accumulate missing center pixel.
- Marginally better quality.

Omar Emara commented

2023-08-07 13:19:41 +02:00

@blender-bot package

Blender Bot commented

2023-08-07 13:19:45 +02:00

Package build started. Download here when ready.

Package build started. [Download here](https://builder.blender.org/download/patch/PR110786) when ready.

Sergey Sharybin commented

2023-08-08 16:24:00 +02:00

Nice presentation and well documented code! I really like the results of the GPU implementation. Testing on M2 MacBook is waaay faster.

While the code and functionality of the GPU side all looks good to me, it would be really nice to align CPU implementation to the same algorithm. It is kind of unideal to leave parameters exposed which do not have affect outside of an experimental feature set.

@OmarEmaraDev Is it something you look into already, or can help with?

Nice presentation and well documented code! I really like the results of the GPU implementation. Testing on M2 MacBook is waaay faster. While the code and functionality of the GPU side all looks good to me, it would be really nice to align CPU implementation to the same algorithm. It is kind of unideal to leave parameters exposed which do not have affect outside of an experimental feature set. @OmarEmaraDev Is it something you look into already, or can help with?

xZaki commented

2023-08-08 16:34:52 +02:00

First-time contributor

Nice presentation and well documented code! I really like the results of the GPU implementation. Testing on M2 MacBook is waaay faster.

While the code and functionality of the GPU side all looks good to me, it would be really nice to align CPU implementation to the same algorithm. It is kind of unideal to leave parameters exposed which do not have affect outside of an experimental feature set.

@OmarEmaraDev Is it something you look into already, or can help with?

I asked about this on the Dev Forum and got the answer that the CPU implementation will be updated to match after this patch gets approved.
https://devtalk.blender.org/t/real-time-compositor-feedback-and-discussion/25018/427

> Nice presentation and well documented code! I really like the results of the GPU implementation. Testing on M2 MacBook is waaay faster. > > While the code and functionality of the GPU side all looks good to me, it would be really nice to align CPU implementation to the same algorithm. It is kind of unideal to leave parameters exposed which do not have affect outside of an experimental feature set. > > @OmarEmaraDev Is it something you look into already, or can help with? I asked about this on the Dev Forum and got the answer that the CPU implementation will be updated to match after this patch gets approved. [https://devtalk.blender.org/t/real-time-compositor-feedback-and-discussion/25018/427](url)

Omar Emara commented

2023-08-08 17:26:27 +02:00

@Sergey Yes, definitely. I was just waiting for this patch to get approved, and I will follow it with a patch for the CPU compositor just like we did for classic Kuwahara.

Sergey Sharybin commented

2023-08-09 13:12:32 +02:00

@OmarEmaraDev It is a bit of chicken and egg problem then :( If patch is approved then it means it can land. But landing code which exposes parameters which do not have affect in a default configuration is not something we do.

I don't think there are some major algorithmical changes which would need to be done to the GPU integration. There is tweak to be done for the radius of 1, but other than that I think all of us are happy with the results. Maybe you can work on a CPU implementation as part of this PR now? That would avoid situation when we violate UI/UX topics.

Just trying to find a way forward which keeps UI/UX principles we follow, but also keep all us happy :)

@OmarEmaraDev It is a bit of chicken and egg problem then :( If patch is approved then it means it can land. But landing code which exposes parameters which do not have affect in a default configuration is not something we do. I don't think there are some major algorithmical changes which would need to be done to the GPU integration. There is tweak to be done for the radius of 1, but other than that I think all of us are happy with the results. Maybe you can work on a CPU implementation as part of this PR now? That would avoid situation when we violate UI/UX topics. Just trying to find a way forward which keeps UI/UX principles we follow, but also keep all us happy :)

Omar Emara commented

2023-08-09 13:39:33 +02:00

@Sergey Alright, I will update the pull request with the CPU implementation.

👍 1 ❤️ 1

Omar Emara added 1 commit 2023-08-10 19:42:02 +02:00

bbe86c989e Update CPU implementation to match new algorithm

Omar Emara added 1 commit 2023-08-10 19:44:35 +02:00

buildbot/vexp-code-patch-coordinator Build done. Details

77f5104f8e Merge branch 'main' into gpu-anisotropic-kuwahara

Omar Emara commented

2023-08-10 19:45:13 +02:00

@blender-bot package

Blender Bot commented

2023-08-10 19:45:16 +02:00

Package build started. Download here when ready.

Package build started. [Download here](https://builder.blender.org/download/patch/PR110786) when ready.

Omar Emara added 1 commit 2023-08-10 21:16:36 +02:00

buildbot/vexp-code-patch-coordinator Build done. Details

2a2968ebe9 Fix missing include

Omar Emara commented

2023-08-10 21:16:49 +02:00

@blender-bot package

Blender Bot commented

2023-08-10 21:16:52 +02:00

Package build started. Download here when ready.

Package build started. [Download here](https://builder.blender.org/download/patch/PR110786) when ready.

Sergey Sharybin approved these changes 2023-08-11 09:45:20 +02:00

Sergey Sharybin left a comment

From testing and reading the code did not see anything to wrong.

P.S. Would be nice to somehow avoid duplication, but that's a separate known topic, not tor this PR.

From testing and reading the code did not see anything to wrong. P.S. Would be nice to somehow avoid duplication, but that's a separate known topic, not tor this PR.

Habib Gahbiche reviewed 2023-08-14 02:02:53 +02:00

Habib Gahbiche left a comment

Some observations from testing:

When using GPU, Blender is crashing / whole computer freezing every now and then, can't reproduce reliably. My guess this is because I am rendering using GPU compositor, which is still experimental? Did you have any similar issues on your system?
Issue with radius = 1 is solved, thanks :)
Some UI nitpicking: Sharpness and Eccentricity labels are cropped (see screenshot) after switching to anisotropic variation, so it looks like a typo.

The overall patch still looks very good to me, so not sure my comments should be considered blocking...

Some observations from testing: - When using GPU, Blender is crashing / whole computer freezing every now and then, can't reproduce reliably. My guess this is because I am rendering using GPU compositor, which is still experimental? Did you have any similar issues on your system? - Issue with `radius = 1` is solved, thanks :) - Some UI nitpicking: `Sharpness` and `Eccentricity` labels are cropped (see screenshot) after switching to anisotropic variation, so it looks like a typo. The overall patch still looks very good to me, so not sure my comments should be considered blocking...

Screenshot 2023-08-14 at 00.57.51.png

126 KiB

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic_compute_structure_tensor.glsl

						
				@ -0,0 +1,40 @@

				#pragma BLENDER_REQUIRE(gpu_shader_compositor_texture_utilities.glsl)

Habib Gahbiche commented

2023-08-14 01:32:30 +02:00

Looking at the CPU implementation made me wonder, isn't this a more general (approximation of) structure tensor which is not specific to Kuwahara filter? So is there a reason the implementation is not in blender/compositor/realtime_compositor/algorithms?

Looking at the CPU implementation made me wonder, isn't this a more general (approximation of) structure tensor which is not specific to Kuwahara filter? So is there a reason the implementation is not in `blender/compositor/realtime_compositor/algorithms`?

Omar Emara commented

2023-08-14 10:23:45 +02:00

Maybe, but that could be done when it is really needed.

zazizizou marked this conversation as resolved

source/blender/nodes/composite/nodes/node_composite_kuwahara.cc

						
				@ -149,0 +237,4 @@

				   *

				   * Since the anisotropy is in the [0, 1] range, the factor tends to 1 as the eccentricity tends

				   * to infinity and tends to infinity when the eccentricity tends to zero. The stored eccentricity

				   * is in the range [0, 2], we map that to the range [infinity, 0.5] by taking the reciprocal,

Habib Gahbiche commented

2023-08-14 01:43:04 +02:00

Since the value gets mapped anyways, does it make sense to give the user the range [0, 1] and use PROP_FACTOR for UI instead?

Since the value gets mapped anyways, does it make sense to give the user the range `[0, 1]` and use `PROP_FACTOR` for UI instead?

Omar Emara commented

2023-08-14 10:27:47 +02:00

We do already use PROP_FACTOR, but there is a good reason why I made the UI range [0, 2], because the maximum 2 doubles the computed eccentricity, while the minimum 0 zeros it. And 1 is an identity and leaves the computed eccentricity as is.

We do already use `PROP_FACTOR`, but there is a good reason why I made the UI range `[0, 2]`, because the maximum `2` doubles the computed eccentricity, while the minimum `0` zeros it. And 1 is an identity and leaves the computed eccentricity as is.

Habib Gahbiche commented

2023-08-14 13:24:48 +02:00

True, PROP_FACTOR is already used. I doubt it's how users perceive it though, most artists probably would just see it as a slider with min and max value. But again, UI questions are always subjective, so it's fine for me if usage is clear to users :)

True, `PROP_FACTOR` is already used. I doubt it's how users perceive it though, most artists probably would just see it as a slider with min and max value. But again, UI questions are always subjective, so it's fine for me if usage is clear to users :)

OmarEmaraDev marked this conversation as resolved

Clément Foucault requested changes 2023-08-16 16:28:31 +02:00

Clément Foucault left a comment

Only some const correctness and avoiding pow(x, 2) which caused some issues in the past.

Only some `const` correctness and avoiding `pow(x, 2)` which caused some issues in the past.

source/blender/compositor/operations/COM_KuwaharaAnisotropicOperation.cc Outdated

						
				@ -101,0 +126,4 @@

				  /* Compute the overlap polynomial parameters for 8-sector ellipse based on the equations in

				   * section "3 Alternative Weighting Functions" of the polynomial weights paper. More on this

				   * later in the code. */

				  int number_of_sectors = 8;

Clément Foucault commented

2023-08-16 16:21:07 +02:00

Use const.

Use `const`.

OmarEmaraDev marked this conversation as resolved

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic.glsl Outdated

						
				@ -0,0 +30,4 @@

				  /* Compute the first and second eigenvalues of the structure tensor using the equations in

				   * section "3.1 Orientation and Anisotropy Estimation" of the paper. */

				  float eigenvalue_first_term = (dxdx + dydy) / 2.0;

				  float eigenvalue_square_root_term = sqrt(pow(dxdx - dydy, 2.0) + 4.0 * pow(dxdy, 2.0)) / 2.0;

Clément Foucault commented

2023-08-16 16:17:32 +02:00

Do not use pow(x, 2). Use pow2f or square_f. You can introduce a version for vectors.

Do not use `pow(x, 2)`. Use `pow2f` or `square_f`. You can introduce a version for vectors.

OmarEmaraDev marked this conversation as resolved

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic.glsl Outdated

						
				@ -0,0 +87,4 @@

				  /* Compute the overlap polynomial parameters for 8-sector ellipse based on the equations in

				   * section "3 Alternative Weighting Functions" of the polynomial weights paper. More on this

				   * later in the code. */

				  int number_of_sectors = 8;

Clément Foucault commented

2023-08-16 16:24:16 +02:00

Use const.

OmarEmaraDev marked this conversation as resolved

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic_compute_structure_tensor.glsl Outdated

						
				@ -0,0 +14,4 @@

				  /* The weight kernels of the filter optimized for rotational symmetry described in section "3.2.1

				   * Gradient Calculation". */

				  float corner_weight = 0.182;

Clément Foucault commented

2023-08-16 16:24:21 +02:00

Use const. On both.

OmarEmaraDev marked this conversation as resolved

source/blender/compositor/realtime_compositor/shaders/compositor_kuwahara_anisotropic_compute_structure_tensor.glsl Outdated

						
				@ -0,0 +33,4 @@

				  /* We encode the structure tensor in a vec4 using a column major storage order. */

				  vec4 structure_tensor = vec4(dot(x_partial_derivative, x_partial_derivative),

				                               dot(x_partial_derivative, y_partial_derivative),

Clément Foucault commented

2023-08-16 16:11:49 +02:00

Here you take twice the same dot product dot(x_partial_derivative, y_partial_derivative). I guess it is just because the dot product is commutative. Still a bit confusing thought. Also that raises the question as to why store twice the same value. Maybe having a R16F + a RG16F texture is a better choice here. But if the textures are not used anywhere it might not be beneficial.

Here you take twice the same dot product `dot(x_partial_derivative, y_partial_derivative)`. I guess it is just because the dot product is commutative. Still a bit confusing thought. Also that raises the question as to why store twice the same value. Maybe having a R16F + a RG16F texture is a better choice here. But if the textures are not used anywhere it might not be beneficial.

Omar Emara commented

2023-08-16 18:06:12 +02:00

That's correct. I just stored the extra value for clarity since that's what the matrix contains since I found it not worth it to use two R16F + a RG16F textures for that.

OmarEmaraDev marked this conversation as resolved

Omar Emara added 2 commits 2023-08-16 17:57:00 +02:00

4ddb96fa3e Merge branch 'main' into gpu-anisotropic-kuwahara

7909511d0f Use square_f and const and avoid duplicate derivatives

Clément Foucault approved these changes 2023-08-17 11:18:50 +02:00

Clément Foucault left a comment

Fine by me. Maybe use square_f on CPU code too. I don't know about how well pow is optimized theses days.

Fine by me. Maybe use `square_f` on CPU code too. I don't know about how well `pow` is optimized theses days.

Omar Emara added 2 commits 2023-08-17 16:18:40 +02:00

19494d47b5 Merge branch 'main' into gpu-anisotropic-kuwahara

buildbot/vexp-code-patch-coordinator Build done. Details

06531e61a6 Use square function

Omar Emara commented

2023-08-17 16:19:12 +02:00

@blender-bot build

Omar Emara merged commit 9ef2310e5f into main

2023-08-17 16:58:42 +02:00

Omar Emara referenced this issue from a commit

2023-08-17 16:58:43 +02:00

Realtime Compositor: Implement Anisotropic Kuwahara

Omar Emara deleted branch gpu-anisotropic-kuwahara

2023-08-17 16:58:44 +02:00

DGruwier commented

2023-09-18 21:08:40 +02:00

First-time contributor

I'm seeing more than a 100x (!) speedup compared to CPU on high resolutions and high kernel sizes in particular. I've gone from having to process individual frames from the command line, maxing out a 24-core CPU for minutes at a time, to working interactively. Wild.

The fixed kernel size is a bit of a creative limitation. For example, the paper "Oil Painting Style Rendering Based on Kuwahara Filter" uses generated saliency image segmentation as input to vary the kernel size around areas of interest, which makes a huge difference. The saliency segmentation is a different beast, but feeding the existing Kuwahara node with a depth map, a vertex-painted detail map AOV, or even a custom roto matte or similar could be very powerful.

This might not be practical with the particular implementation being used, and it seems like this is wrapping up anyway. However, I didn't see any discussion of variable kernel size as a feature, so I thought I'd point it out. If it's as easy as flicking a switch, it might be worth looking into.

Just for reference, I simulated the effect here by manually blending between kernel sizes.

I'm seeing more than a 100x (!) speedup compared to CPU on high resolutions and high kernel sizes in particular. I've gone from having to process individual frames from the command line, maxing out a 24-core CPU for minutes at a time, to working interactively. Wild. The fixed kernel size is a bit of a creative limitation. For example, the paper "Oil Painting Style Rendering Based on Kuwahara Filter" uses generated saliency image segmentation as input to vary the kernel size around areas of interest, which makes a huge difference. The saliency segmentation is a different beast, but feeding the existing Kuwahara node with a depth map, a vertex-painted detail map AOV, or even a custom roto matte or similar could be very powerful. This might not be practical with the particular implementation being used, and it seems like this is wrapping up anyway. However, I didn't see any discussion of variable kernel size as a feature, so I thought I'd point it out. If it's as easy as flicking a switch, it might be worth looking into. Just for reference, I simulated the effect here by manually blending between kernel sizes.

variable_size_anisotropic_kuwahara.jpg

112 KiB

Omar Emara commented

2023-09-22 13:43:42 +02:00

@DGruwier To clarify, you just want the radius to be exposed as an input, not the saliency map generation or the other methods in the paper, correct?

I haven't read the full paper yet, but that shouldn't be hard as far as can tell, except maybe for performance implications which we can avoid through specializations. I could make a test patch next week.

@DGruwier To clarify, you just want the radius to be exposed as an input, not the saliency map generation or the other methods in the paper, correct? I haven't read the full paper yet, but that shouldn't be hard as far as can tell, except maybe for performance implications which we can avoid through specializations. I could make a test patch next week.

👍 1

DGruwier commented

2023-09-22 17:30:04 +02:00

First-time contributor

@OmarEmaraDev Right, exposing the radius as an input, nothing more.
I only mentioned the paper in reference to the idea of varying the kernel size across the image.

@OmarEmaraDev Right, exposing the radius as an input, nothing more. I only mentioned the paper in reference to the idea of varying the kernel size across the image.

Sign in to join this conversation.

No reviewers