VSE: speedup timeline drawing, and improve waveform display #115311

Aras Pranckevicius · 2023-11-23T18:55:00+01:00

2023-11-23 18:55:00 +01:00

TL;DR: sequencer timeline UI repainting is 3x-4x faster now, for complex timelines.

VSE timeline rendering performance overview

For large timelines (used Sprite Fright Edit for testing, 2702 strips visible), most of time is spent on drawing quads, submitting draw calls to the GPU, two triangles at a time. That is because the whole code is built in a "draw one strip completely, move to drawing next strip" fashion, and it internally has to change between drawing triangles vs drawing lines vs drawing some striped shader, etc. Additionally, blender's "imm" code does not even allow to efficiently draw more than one quad in one go, due to the fact that each quad is a triangle fan.

There were other bottlenecks too -- drawing audio waveforms, drawing retiming key shapes one key at a time, and drawing thumbnails where GPU textures are being created and destroyed on every draw (this last one is not addressed in this PR).

Approach taken in this PR to improve performance

Performance numbers teaser first! Sprite Fright Edit data set, with whole timeline visible (2702 strips), repainting the timeline UI with all overlay options (waveforms, offsets, thumbnails etc.):

Windows (Ryzen 5950X, RTX 3080Ti, OpenGL): 62ms -> 18.6ms (so 16FPS -> 54FPS)
Mac (M1 Max, Metal): 39.8ms -> 11.5ms (so 25FPS -> 86FPS)

90% of what the timeline strip drawing is, are either quads or quad outlines (4 lines). So build a small utility class SeqQuadsBatch where you can keep on doing add_quad or add_wire_quad, and call draw in the end. Internally it batches up to 1024 quads to draw; if you keep on adding more it just internally draws them and proceeds to add from start again.

Having that batching utility class, change most of strip drawing code to just use that directly. It has to be broken up in the middle in two places: once for thumbnails (since it's no longer single colored quads, need to change the shader), and once more for locked strips (likewise, different shader to do the striped pattern). So that's overall how the code is structured now, pseudocode:

strips = create_all_strip_contexts
for all strips: batch quads for background, color band, offsets, transition, meta contents
draw_batched_quads

for all strips: draw thumbnail

for all strips: batch quads for fcurve overlay, waveform overlay
draw_batched_quads

for all strips: draw locked overlay

for all strips: batch quads for invalid, fx input / multicam / solo highlights, handles, outlines
draw_batched_quads

For retiming keys, change code so that instead of drawing one key at a time (i.e. telling GPU "draw this one point"), it draws many of them in one go ("draw all these points"). It creates imm batches of up to 1024 keys, when they are full it does immEnd and starts a new batch.

For waveform overlay, the changes both speed it up, but also change how it looks like, following design proposal in #115274. Previous code had complicated logic that switched between single-line drawing and triangle-strip drawing, which is not great for trying to submit larger batches to the GPU. The line part often looked weird when zoomed in, and the triangle part was sometimes displaying wrong data magnitude, due to sample interpolation wrongly using -0.5..+0.5 range instead of 0..1 (as a result, it was sometimes showing "clipped" red areas where in fact there aren't any!). Changing all of that to draw squares/quads for waveform samples is both much simpler code wise, and allows batching for the GPU, and displays waveform peaks better, when zoomed out.

What is not done in this PR

Potential future optimizations, since hey maybe let's not try to address everything in one go:

Drawing thumbnails: still expensive, mostly due to GPU textures being created and destroyed on each and every draw.
Drawing channel names: expensive (~3ms) due to O(N^2) complexity in querying all strips. Addressed part of that by only querying meta strips (which is all that's needed in that code).
Waveform and f-curve overlays: f-curve lookups are costly. Might be possible to multi-thread that for all strips before drawing.
Waveform overlay: when zoomed out, each pixel can cover like 50 samples, and all of them are scanned to find min/max values at render time. Might be possible to build kind-of "mipmaps" of waveform preview data when fetching it, and pick appropriate one based on zoom level.

Open Questions

I was surprised that there's no efficient and convenient way to draw many quads at once via imm API, actually. A good question is, should something like that be built as part of imm, instead of adding SeqQuadsBatch inside sequencer? While the helper class is very simple, it is actually not sequencer specific at all.
Things like waveform drawing, by rendering each pixel column as a quad, are not "terribly efficient". While way faster than the previous code, an even faster approach would be to put all waveform preview data into a large GPU buffer (SSBO), and draw one quad for each preview waveform overlay, with a special shader that would read from said buffer and render the preview. A similar thing could be done for f-curve previews, again put data into a GPU buffer and have a shader sample it. However, this leads to a somewhat more complicated code (strip preview is no longer in just C++ code, but also GLSL). So not sure if it's worth doing. The current approach "one quad for each pixel column" sounds like a waste, but as long as GPU submission is in large enough batches, it is fast enough.

TL;DR: sequencer timeline UI repainting is 3x-4x faster now, for complex timelines. ### VSE timeline rendering performance overview For large timelines (used Sprite Fright Edit for testing, 2702 strips visible), most of time is spent on drawing quads, submitting draw calls to the GPU, two triangles at a time. That is because the whole code is built in a "draw one strip completely, move to drawing next strip" fashion, and it internally has to change between drawing triangles vs drawing lines vs drawing some striped shader, etc. Additionally, blender's "imm" code does not even allow to efficiently draw more than one quad in one go, due to the fact that each quad is a triangle fan. There were other bottlenecks too -- drawing audio waveforms, drawing retiming key shapes one key at a time, and drawing thumbnails where GPU textures are being created and destroyed on every draw (this last one is not addressed in this PR). ### Approach taken in this PR to improve performance Performance numbers teaser first! Sprite Fright Edit data set, with whole timeline visible (2702 strips), repainting the timeline UI with all overlay options (waveforms, offsets, thumbnails etc.): - Windows (Ryzen 5950X, RTX 3080Ti, OpenGL): **62ms -> 18.6ms** (so 16FPS -> 54FPS) - Mac (M1 Max, Metal): **39.8ms -> 11.5ms** (so 25FPS -> 86FPS) 90% of what the timeline strip drawing is, are either quads or quad outlines (4 lines). So build a small utility class `SeqQuadsBatch` where you can keep on doing `add_quad` or `add_wire_quad`, and call `draw` in the end. Internally it batches up to 1024 quads to draw; if you keep on adding more it just internally draws them and proceeds to add from start again. Having that batching utility class, change most of strip drawing code to just use that directly. It has to be broken up in the middle in two places: once for thumbnails (since it's no longer single colored quads, need to change the shader), and once more for locked strips (likewise, different shader to do the striped pattern). So that's overall how the code is structured now, pseudocode: ``` strips = create_all_strip_contexts for all strips: batch quads for background, color band, offsets, transition, meta contents draw_batched_quads for all strips: draw thumbnail for all strips: batch quads for fcurve overlay, waveform overlay draw_batched_quads for all strips: draw locked overlay for all strips: batch quads for invalid, fx input / multicam / solo highlights, handles, outlines draw_batched_quads ``` For retiming keys, change code so that instead of drawing one key at a time (i.e. telling GPU "draw this one point"), it draws many of them in one go ("draw all these points"). It creates imm batches of up to 1024 keys, when they are full it does immEnd and starts a new batch. For waveform overlay, the changes both speed it up, but also **change how it looks like**, following design proposal in #115274. Previous code had complicated logic that switched between single-line drawing and triangle-strip drawing, which is not great for trying to submit larger batches to the GPU. The line part often looked weird when zoomed in, and the triangle part was sometimes displaying _wrong_ data magnitude, due to sample interpolation wrongly using -0.5..+0.5 range instead of 0..1 (as a result, it was sometimes showing "clipped" red areas where in fact there aren't any!). Changing all of that to draw squares/quads for waveform samples is both much simpler code wise, and allows batching for the GPU, and displays waveform peaks better, when zoomed out. ### What is _not_ done in this PR Potential future optimizations, since hey maybe let's not try to address _everything_ in one go: - Drawing thumbnails: still expensive, mostly due to GPU textures being created and destroyed on each and every draw. - Drawing channel names: expensive (\~3ms) due to O(N^2) complexity in querying all strips. Addressed part of that by only querying meta strips (which is all that's needed in that code). - Waveform and f-curve overlays: f-curve lookups are costly. Might be possible to multi-thread that for all strips before drawing. - Waveform overlay: when zoomed out, each pixel can cover like 50 samples, and all of them are scanned to find min/max values at render time. Might be possible to build kind-of "mipmaps" of waveform preview data when fetching it, and pick appropriate one based on zoom level. ### Open Questions 1) I was surprised that there's no efficient and convenient way to draw many quads at once via `imm` API, actually. A good question is, should something like that be built as part of imm, instead of adding `SeqQuadsBatch` inside sequencer? While the helper class is very simple, it is actually not sequencer specific _at all_. 2) Things like waveform drawing, by rendering each pixel column as a quad, are not "terribly efficient". While way faster than the previous code, an even faster approach would be to put all waveform preview data into a large GPU buffer (SSBO), and draw one quad for each preview waveform overlay, with a special shader that would read from said buffer and render the preview. A similar thing could be done for f-curve previews, again put data into a GPU buffer and have a shader sample it. However, this leads to a somewhat more complicated code (strip preview is no longer in just C++ code, but also GLSL). So not sure if it's worth doing. The current approach "one quad for each pixel column" sounds like a waste, but as long as GPU submission is in large enough batches, it is fast enough.

❤️ 5 🎉 1 🚀 4

Aras Pranckevicius added 3 commits 2023-11-23 18:55:11 +01:00

b25ac443ef VSE: speedup timeline drawing

The strips and retiming items were drawn one at a time, switching
between GPU shaders, geometry topologies and using tiny immediate
mode batches (often two triangles) for everything.

Optimize that by going way larger GPU batches, like:
- Draw all strip backgrounds at once,
- Draw all strip handles at once,
- Draw all strip outlines at once,
- Draw all retiming continuity sections at once,
- Draw all retiming keyframes at once.

There's no efficient way to draw separate quads in GPU IMM API, so
instead this adds SeqQuadsBatch utility that batches up to 1024
quads for drawing, using indexed vertices.

In Sprite Fright Edit data set, with whole timeline visible (2702
strips), repainting the timeline UI with audio waveforms on takes:

Windows (Ryzen 5950X, RTX 3080Ti): 46ms -> 23ms. Of the time still
left, 14ms is drawing sound waveforms. Remaining cost is other overheads
not directly related to actual rendering (some squared complexity
selection queries, f-curve existence lookups etc.).

aceff20c3b VSE: speedup timeline drawing pt 2

Drawing of retiming keys was spending almost all the time inside
SEQ_retiming_selection_contains, with essentially squared complexity
(for each retiming key, scanning all existing keys).

Instead, build selected keys into a hashmap via
SEQ_retiming_selection_get for way faster lookup.

In Sprite Fright Edit data set, with whole timeline visible (2702
strips), repainting the timeline UI with audio waveforms takes
(Windows, Ryzen 5950X, RTX 3080Ti): 23ms -> 19.5ms (retime_keys_draw
5.7ms -> 0.9ms)

67435f15d0 VSE: fixes for previous commits

Aras Pranckevicius force-pushed vse-draw-opt from 11b3cf02af to b485388b57

2023-11-23 19:11:49 +01:00

Compare

Aras Pranckevicius added 1 commit 2023-11-23 19:54:15 +01:00

84fd6a17eb VSE: batch draw optimization for draw_strip_offsets

Same story as in previous commits; something like 12ms -> 1ms
with whole sprite fright edit timeline visible.

Aras Pranckevicius added this to the Video Sequencer project 2023-11-24 09:04:34 +01:00

Aras Pranckevicius added 2 commits 2023-11-24 10:48:42 +01:00

cfcd46d855 VSE: optimize draw_channels by speeding up rna_SeqTimelineChannel_owner_get

Drawing the channels list was taking about 3ms on my machine, almost all
of that time querying *all* the VSE strips for each channel mute/lock
button. That is done due to some complexicated code from, where deep
from inside button it calls ui_but_get_fcurve which eventually calls
rna_SeqTimelineChannel_path, which calls rna_SeqTimelineChannel_owner_get,
which was proceeding to construct a set of all the strips.

It's a good question whether this "all the strips" set could be cached
somewhere.

But for now, just change the code to not query all the strips, but
instead query a set of all meta strips, since that's what the code is
only interested in anyway. This makes draw_channels go from 3.0ms
down to 1.3ms.

ed057916a8 VSE: optimize fcurve overlay drawing

Similar to waveform overlay: draw it via batched quads.
On Sprite Fright Edit, fcurve overlay drawing goes
from 11.1ms down to 5.8ms, with most of the remaining cost inside
id_data_find_fcurve.

Sebastian Parborg commented

2023-11-24 12:08:31 +01:00

Thank you for working on this!
The open questions is probably best answered by the viewport module.
@Jeroen-Bakker ^

Thank you for working on this! The open questions is probably best answered by the viewport module. @Jeroen-Bakker ^

Jeroen Bakker requested review from Jeroen Bakker 2023-11-24 12:12:53 +01:00

Jeroen Bakker commented

2023-11-24 12:31:15 +01:00

Seems like expected changes :-) See #114265 for a related task, but then for the outliner. There I also think of drawing layers which collect the information and then send in a few batches to the GPU.

imm is very basic for drawing multiple quads you could check the widget drawing (GPU_SHADER_2D_WIDGET_BASE_INST) , which can do more than only quads. The widget shader is already there, if that isn't enough, we could check for a dedicated shader.

Video previews I am not familiar with But as they are relatively tiny I would expect to do most parts on the CPU for now. Perhaps just cache the result and draw it as a texture. As I am not familiar with the in and outs, you're free to come with your own ideas here as well.

For waveforms I expect an ssbo (one value per pixel in the current zoom level) and a special shader to draw the lines.

Seems like expected changes :-) See #114265 for a related task, but then for the outliner. There I also think of drawing layers which collect the information and then send in a few batches to the GPU. imm is very basic for drawing multiple quads you could check the widget drawing (GPU_SHADER_2D_WIDGET_BASE_INST) , which can do more than only quads. The widget shader is already there, if that isn't enough, we could check for a dedicated shader. Video previews I am not familiar with But as they are relatively tiny I would expect to do most parts on the CPU for now. Perhaps just cache the result and draw it as a texture. As I am not familiar with the in and outs, you're free to come with your own ideas here as well. For waveforms I expect an ssbo (one value per pixel in the current zoom level) and a special shader to draw the lines.

Aras Pranckevicius added 1 commit 2023-11-24 14:21:02 +01:00

b6f54f5b4a VSE: change most of remaining code to batch quads, cleanup

Where possible, change most of remaining strip drawing parts to use
batched drawing. And now that's done, there's much less need
to separate all of it by "layers", and most of it can go back to the
style of "for all strips: draw these parts" since that all goes
through the same batcher. However thumbnails and the locked
state are still treated as "separate layers" since they use different
shader.

Aras Pranckevicius added 1 commit 2023-11-24 14:22:58 +01:00

62115186a4 Merge branch 'main' into vse-draw-opt

Aras Pranckevicius added 1 commit 2023-11-24 14:40:31 +01:00

93217906f2 VSE: cleanup

Aras Pranckevicius commented

2023-11-24 17:30:48 +01:00

See #114265 for a related task, but then for the outliner

Oh, cool! Looking for volunteers to tackle that one? :)

imm is very basic for drawing multiple quads you could check the widget drawing (GPU_SHADER_2D_WIDGET_BASE_INST) , which can do more than only quads. The widget shader is already there, if that isn't enough, we could check for a dedicated shader.

This sounds a bit like overkill. Most of timeline strips are just simple flat-colored quads, but depending on {a lot of factors}, there can be many quads over one strip, all placed slightly differently. And it also might have some single-line outlines, or single-line lines atop/below some parts.

I actually started to experiment with doing things similar to how "a widget" shader is doing, thinking perhaps even going as far as making a complete "timeline strip shader" that would do all the logic in there. But then upon looking further, realized that most of it is just flat-colored rectangles, but with complicated/hairy logic driving when and where they are placed. And so adding a way to "just batch some flat rectangles, and some lines plz" code sounded like the easiest approach. Does that make sense?

Video previews I am not familiar with But as they are relatively tiny I would expect to do most parts on the CPU for now. Perhaps just cache the result and draw it as a texture. As I am not familiar with the in and outs, you're free to come with your own ideas here as well.

The results are cached, but they are cached into CPU-side only images. Which are then, for some reason, turned into GPU textures each and every frame, drawn, and said textures destroyed immediately. But I'd rather keep this part for a separate PR.

For waveforms I expect an ssbo (one value per pixel in the current zoom level) and a special shader to draw the lines.

I thought doing that initially, but then tried doing a "ehh just draw waveform as a series of quads placed next to each other" (which are already batched due to above), and turns out that made all that code completely disappear from profiling results. So this sounded easier :)

> See #114265 for a related task, but then for the outliner Oh, cool! Looking for volunteers to tackle that one? :) > imm is very basic for drawing multiple quads you could check the widget drawing (GPU_SHADER_2D_WIDGET_BASE_INST) , which can do more than only quads. The widget shader is already there, if that isn't enough, we could check for a dedicated shader. This sounds a bit like overkill. Most of timeline strips are just simple flat-colored quads, but depending on {a lot of factors}, there can be _many_ quads over one strip, all placed slightly differently. And it also might have some single-line outlines, or single-line lines atop/below some parts. I actually started to experiment with doing things similar to how "a widget" shader is doing, thinking perhaps even going as far as making a complete "timeline strip shader" that would do all the logic in there. But then upon looking further, realized that most of it is just flat-colored rectangles, but with complicated/hairy logic driving when and where they are placed. And so adding a way to "just batch some flat rectangles, and some lines plz" code sounded like the easiest approach. Does that make sense? > Video previews I am not familiar with But as they are relatively tiny I would expect to do most parts on the CPU for now. Perhaps just cache the result and draw it as a texture. As I am not familiar with the in and outs, you're free to come with your own ideas here as well. The results are cached, but they are cached into CPU-side only images. Which are then, for some reason, turned into GPU textures each and every frame, drawn, and said textures destroyed immediately. But I'd rather keep this part for a separate PR. > For waveforms I expect an ssbo (one value per pixel in the current zoom level) and a special shader to draw the lines. I thought doing that initially, but then tried doing a "ehh just draw waveform as a series of quads placed next to each other" (which are already batched due to above), and turns out that made all that code completely disappear from profiling results. So this sounded easier :)

Aras Pranckevicius requested review from Richard Antalik 2023-11-24 18:03:14 +01:00

Aras Pranckevicius changed title from ~~WIP: VSE: speedup timeline drawing, and improve waveform display~~ to VSE: speedup timeline drawing, and improve waveform display

2023-11-24 18:03:27 +01:00

Aras Pranckevicius added 1 commit 2023-11-25 10:31:23 +01:00

5f4c5c7369 VSE: code comments

Jeroen Bakker commented

2023-11-27 11:17:28 +01:00

I actually started to experiment with doing things similar to how "a widget" shader is doing, thinking perhaps even going as far as making a complete "timeline strip shader" that would do all the logic in there. But then upon looking further, realized that most of it is just flat-colored rectangles, but with complicated/hairy logic driving when and where they are placed. And so adding a way to "just batch some flat rectangles, and some lines plz" code sounded like the easiest approach. Does that make sense?

Makes sense bordered rects with colors batch

The results are cached, but they are cached into CPU-side only images. Which are then, for some reason, turned into GPU textures each and every frame, drawn, and said textures destroyed immediately. But I'd rather keep this part for a separate PR.

Yes a separate PR would be appreciated.

I thought doing that initially, but then tried doing a "ehh just draw waveform as a series of quads placed next to each other" (which are already batched due to above), and turns out that made all that code completely disappear from profiling results. So this sounded easier :)

👍

> I actually started to experiment with doing things similar to how "a widget" shader is doing, thinking perhaps even going as far as making a complete "timeline strip shader" that would do all the logic in there. But then upon looking further, realized that most of it is just flat-colored rectangles, but with complicated/hairy logic driving when and where they are placed. And so adding a way to "just batch some flat rectangles, and some lines plz" code sounded like the easiest approach. Does that make sense? Makes sense bordered rects with colors batch > The results are cached, but they are cached into CPU-side only images. Which are then, for some reason, turned into GPU textures each and every frame, drawn, and said textures destroyed immediately. But I'd rather keep this part for a separate PR. Yes a separate PR would be appreciated. > I thought doing that initially, but then tried doing a "ehh just draw waveform as a series of quads placed next to each other" (which are already batched due to above), and turns out that made all that code completely disappear from profiling results. So this sounded easier :) 👍

Richard Antalik commented

2023-11-27 16:36:21 +01:00

Thanks for patch, with your helper class this is even simpler than I expected :)
So far, I can see some "Z fighting" in selection:

The sound waveform line drawing was implemented by @ZedDB, probably useful for debugging purposes only. Seems, that is is still working, but at certain zoom levels the lines are lost:

I have uploaded wav file for reference.

Other than that, I think that functionally the patch is fine.

Thanks for patch, with your helper class this is even simpler than I expected :) So far, I can see some "Z fighting" in selection: ![Screenshot_2023-11-27_15-41-57.png](/attachments/44e51202-7afd-4cc2-8a1b-855e1a9c362b) The sound waveform line drawing was implemented by @ZedDB, probably useful for debugging purposes only. Seems, that is is still working, but at certain zoom levels the lines are lost: ![Screenshot_2023-11-27_15-41-57.png](/attachments/fade700d-fdca-402d-a251-9f3b47af9743) I have uploaded wav file for reference. Other than that, I think that functionally the patch is fine.

untitled.wav

172 KiB

Screenshot_2023-11-27_16-27-41.png

2.0 KiB

Screenshot_2023-11-27_15-41-57.png

400 B

Sebastian Parborg commented

2023-11-27 17:14:11 +01:00

Ah, right. Besides making is possible to visualize some more synthectic waveforms like this:

It also solved the issue of indicating when there were no audio data left in the strip (however this is not an issue anymore now that the strip color is different for "out of bounds" data.

I were planning to use that file to do a automated test to check for any audio sync issues. However I didn't have the time to finish that.
I've attached the file in a .zip.

I do think being able to visualize data like that is probably quite useful as it can help the end user notice issues with the audio visially that might be lost if the waveform drawing method can't handle drawing it.

While opening the test file, I noticed a bug in the current master. Perhaps it has already been solved with this PR?
If I drag the start frame of the sound strip the drawn waveform gets offset in the wrong direction. But the audio is not offset either and stays in place. I guess this is deeper than just UI code though because the audio playback doesn't change.
(See attached video)

Ah, right. Besides making is possible to visualize some more synthectic waveforms like this: ![image](/attachments/a7fb4d25-9b17-4686-a8c0-fb54fff33ccf) It also solved the issue of indicating when there were no audio data left in the strip (however this is not an issue anymore now that the strip color is different for "out of bounds" data. I were planning to use that file to do a automated test to check for any audio sync issues. However I didn't have the time to finish that. I've attached the file in a .zip. I do think being able to visualize data like that is probably quite useful as it can help the end user notice issues with the audio visially that might be lost if the waveform drawing method can't handle drawing it. While opening the test file, I noticed a bug in the current master. Perhaps it has already been solved with this PR? If I drag the start frame of the sound strip the drawn waveform gets offset in the wrong direction. But the audio is not offset either and stays in place. I guess this is deeper than just UI code though because the audio playback doesn't change. (See attached video)

out.mp4

69 KiB

vse_waveform_sync.zip

410 KiB

image.png

7.1 KiB

Aras Pranckevicius commented

2023-11-27 17:30:33 +01:00

So far, I can see some "Z fighting" in selection

Good catch, will investigate.

The sound waveform line drawing was implemented by @ZedDB, probably useful for debugging purposes only. Seems, that is is still working, but at certain zoom levels the lines are lost

Ah ok. I think I have an idea how to keep the "a line that's not at zero" working from the original code, without alternating between quads and lines that ends up being "not terribly great" for performance :)

While opening the test file, I noticed a bug in the current master. Perhaps it has already been solved with this PR?
If I drag the start frame of the sound strip the drawn waveform gets offset in the wrong direction. But the audio is not offset either and stays in place. I guess this is deeper than just UI code though because the audio playback doesn't change.

In my quick test, that seemed to work fine in 3.6, but is broken in both 4.0 and 4.1, so sounds like an unrelated regression. I can try fixing that in this PR too, but maybe it would be better as a separate commit/PR by someone, for easier backporting to 4.0 if wanted.

> So far, I can see some "Z fighting" in selection Good catch, will investigate. > The sound waveform line drawing was implemented by @ZedDB, probably useful for debugging purposes only. Seems, that is is still working, but at certain zoom levels the lines are lost Ah ok. I think I have an idea how to keep the "a line that's not at zero" working from the original code, without alternating between quads and lines that ends up being "not terribly great" for performance :) > While opening the test file, I noticed a bug in the current master. Perhaps it has already been solved with this PR? If I drag the start frame of the sound strip the drawn waveform gets offset in the wrong direction. But the audio is not offset either and stays in place. I guess this is deeper than just UI code though because the audio playback doesn't change. In my quick test, that seemed to work fine in 3.6, but is broken in both 4.0 and 4.1, so sounds like an unrelated regression. I can try fixing that in this PR too, but maybe it would be better as a separate commit/PR by someone, for easier backporting to 4.0 if wanted.

Sebastian Parborg commented

2023-11-27 17:34:53 +01:00

Maybe it would be better as a separate commit/PR by someone, for easier backporting to 4.0 if wanted.

Probably better to fix it as a separate commit, yes.

> Maybe it would be better as a separate commit/PR by someone, for easier backporting to 4.0 if wanted. Probably better to fix it as a separate commit, yes.

Aras Pranckevicius added 3 commits 2023-11-27 18:32:37 +01:00

7e72cda82a Merge branch 'main' into vse-draw-opt

79bb07df3b VSE: fix visual ordering issues between unselected and selected strips

Also fixes already existing problem where strip text overlay would
appear "behind" other text overlays, while dragging selected
strip over others.

2202c6f6f4 VSE: bring back "line" waveform drawing parts

Their use case was explained in the code review here
#115311 (comment)

Richard Antalik requested changes 2023-11-27 18:39:52 +01:00

source/blender/editors/space_sequencer/sequencer_quads_batch.cc

						
				@ -0,0 +101,4 @@

				    draw();

				  }

				  if (quads_num == 0) {

				    /* Don't actually need this, but it's the only way to clear GPU_VERTBUF_DATA_UPLOADED flag. */

Richard Antalik commented

2023-11-27 17:32:39 +01:00

Do you need to clear the flag, because you are reusing the buffer? I would expect that there is some function to handle this.

Here it seems to work when I remove call to GPU_vertbuf_attr_get_raw_data(). Would have to look how exactly this flag is used.

Do you need to clear the flag, because you are reusing the buffer? I would expect that there is some function to handle this. Here it seems to work when I remove call to `GPU_vertbuf_attr_get_raw_data()`. Would have to look how exactly this flag is used.

aras_p marked this conversation as resolved

source/blender/editors/space_sequencer/sequencer_quads_batch.cc Outdated

						
				@ -0,0 +105,4 @@

				    GPUVertBufRaw attr_data;

				    GPU_vertbuf_attr_get_raw_data(vbo_quads, 0, &attr_data);

				    verts_quads = (ColorVertex *)GPU_vertbuf_get_data(vbo_quads);

Richard Antalik commented

2023-11-27 17:18:21 +01:00

Use static_cast for casting types, see https://wiki.blender.org/wiki/Style_Guide/C_Cpp#C.2B.2B_Type_Cast

Use `static_cast` for casting types, see https://wiki.blender.org/wiki/Style_Guide/C_Cpp#C.2B.2B_Type_Cast

aras_p marked this conversation as resolved

source/blender/editors/space_sequencer/sequencer_retiming_draw.cc Outdated

						
				@ -313,6 +303,12 @@ static void retime_key_draw(const bContext *C, const Sequence *seq, const SeqRet

				  CLAMP(key_position, left_pos_min, right_pos_max);

				  const float alpha = SEQ_retiming_data_is_editable(seq) ? 1.0f : 0.3f;

				  if (*r_point_counter >= MAX_KEYS_IN_BATCH) {

Richard Antalik commented

2023-11-27 17:58:17 +01:00

Bit nitpicky, but to me this feels bit out of place here. This could be done in loop in retime_keys_draw(). That way you don't have to pass point_counter around. You can assume, that fake_keys_draw() draws 2 vertices, and retime_key_draw() draws 1.

Bit nitpicky, but to me this feels bit out of place here. This could be done in loop in `retime_keys_draw()`. That way you don't have to pass `point_counter` around. You can assume, that `fake_keys_draw()` draws 2 vertices, and `retime_key_draw()` draws 1.

aras_p marked this conversation as resolved

source/blender/editors/space_sequencer/sequencer_retiming_draw.cc Outdated

						
				@ -414,2 +415,4 @@

				  }

				  /* Get the selection here once, for faster "is key selected?" lookups later.

				     This is similar to SEQ_retiming_selection_get, except we want only a set. */

Richard Antalik commented

2023-11-27 17:51:06 +01:00

Can't you just pass map returnd by SEQ_retiming_selection_get() and use contains() method? This seems like unnecessary code duplication.

Can't you just pass map returnd by `SEQ_retiming_selection_get()` and use `contains()` method? This seems like unnecessary code duplication.

Aras Pranckevicius commented

2023-11-27 18:55:14 +01:00

That's what I tried initially, but SEQ_retiming_selection_get returns non-const Sequence pointers as map keys, and my C++-fu is not strong enough to find the magic incantation, how to use contains() when I have a const-Sequence pointer. contains_as() with const pointer does not compile, at least on visual studio.

That's what I tried initially, but SEQ_retiming_selection_get returns non-const Sequence pointers as map keys, and my C++-fu is not strong enough to find the magic incantation, how to use `contains()` when I have a const-Sequence pointer. `contains_as()` with const pointer does not compile, at least on visual studio.

Richard Antalik commented

2023-11-28 16:18:18 +01:00

I am not great with C++ either, but you can use non-const SeqRetimingKey for iteration.

There is also const_cast which seems to work, but not sure if it is good idea to use it in this case.

I am not great with C++ either, but you can use non-const `SeqRetimingKey` for iteration. There is also `const_cast` which seems to work, but not sure if it is good idea to use it in this case.

aras_p marked this conversation as resolved

source/blender/editors/space_sequencer/sequencer_timeline_draw.cc Outdated

						
				@ -637,3 +518,1 @@

				    float y_mid = (strip_ctx->bottom + strip_ctx->strip_content_top) / 2.0f;

				    /* The length from the middle of the strip to the top/bottom. */

				    float y_scale = (strip_ctx->strip_content_top - strip_ctx->bottom) / 2.0f;

				    /* Do not draw the bars below 2px height. */

Richard Antalik commented

2023-11-27 18:15:05 +01:00

I did not try to change this, but I have noticed, that in current main I was able to see faint noise, which can be useful to user.
Keep in mind, that assuming 32px total height, a waveform with "1px amplitude" is only -30dB, which is pretty loud. At 64px total height it would be -36dB, still quite loud. Math can be a bit off, but close enough...

I did not try to change this, but I have noticed, that in current main I was able to see faint noise, which can be useful to user. Keep in mind, that assuming 32px total height, a waveform with "1px amplitude" is only -30dB, which is pretty loud. At 64px total height it would be -36dB, still quite loud. Math can be a bit off, but close enough...

Aras Pranckevicius commented

2023-11-27 18:57:05 +01:00

I've re-added "line segments" drawing based on your and Sebastian comments wrt mostly synthetic or low frequency waveforms. Now this code, when below 2px draws a line segment instead, conceptually similar to main.

aras_p marked this conversation as resolved

Richard Antalik commented

2023-11-27 18:42:44 +01:00

So far, I can see some "Z fighting" in selection

Good catch, will investigate.

Maybe I should have mentioned, that the reason why in draw_seq_strips() you need ordered set of strips is, priority of selection overlay drawing, so I would look for suspect there. But from looking at code that doesn't seem to be an issue.

> > So far, I can see some "Z fighting" in selection > > Good catch, will investigate. Maybe I should have mentioned, that the reason why in `draw_seq_strips()` you need ordered set of strips is, priority of selection overlay drawing, so I would look for suspect there. But from looking at code that doesn't seem to be an issue.

Aras Pranckevicius commented

2023-11-27 18:58:05 +01:00

So far, I can see some "Z fighting" in selection

Good catch, will investigate.

Maybe I should have mentioned, that the reason why in draw_seq_strips() you need ordered set of strips is, priority of selection overlay drawing, so I would look for suspect there. But from looking at code that doesn't seem to be an issue.

It was, just pushed a fix (while at it, fixed text overlays being the opposite order - was mentioned on blender chat thread by someone).

> > > So far, I can see some "Z fighting" in selection > > > > Good catch, will investigate. > > Maybe I should have mentioned, that the reason why in `draw_seq_strips()` you need ordered set of strips is, priority of selection overlay drawing, so I would look for suspect there. But from looking at code that doesn't seem to be an issue. It was, just pushed a fix (while at it, fixed text overlays being the opposite order - was mentioned on blender chat thread by someone).

🎉 1

Aras Pranckevicius added 2 commits 2023-11-28 09:27:42 +01:00

d9c163c8b8 VSE: cleanup VBO usage code in SeqQuadsBatch

27817960d2 VSE: simplify code from PR review

Aras Pranckevicius requested review from Richard Antalik 2023-11-28 09:32:30 +01:00

Richard Antalik requested changes 2023-11-28 16:26:33 +01:00

Richard Antalik left a comment

I 've got 2 more minor points, otherwise this seems fine for me.

source/blender/editors/space_sequencer/sequencer_retiming_draw.cc Outdated

						
				@ -333,0 +315,4 @@

				static void draw_continuity(const bContext *C,

				                            const Sequence *seq,

				                            const SeqRetimingKey *key,

				                            const blender::Set<const SeqRetimingKey *> *selection,

Richard Antalik commented

2023-11-28 16:13:09 +01:00

Sorry, I have noticed this only now, you should avoid passing pointers to objects, pass reference instead.

So argument would be const blender::Set<const SeqRetimingKey *> &selection

Sorry, I have noticed this only now, you should avoid passing pointers to objects, pass reference instead. So argument would be `const blender::Set<const SeqRetimingKey *> &selection`

aras_p marked this conversation as resolved

source/blender/editors/space_sequencer/sequencer_retiming_draw.cc Outdated

						
				@ -432,0 +460,4 @@

				    }

				    /* If batch is full, start a new one (fake keys add 2 points max). */

				    if (point_counter + 2 > MAX_KEYS_IN_BATCH) {

Richard Antalik commented

2023-11-28 16:25:11 +01:00

I could have specified this a bit more, you can have just 1 condition
if (point_counter + 3 > MAX_KEYS_IN_BATCH) { after retime_key_draw()

This way batch should never be overfilled, and from what I read, with immBeginAtMost() you can always draw less vertices.

I could have specified this a bit more, you can have just 1 condition `if (point_counter + 3 > MAX_KEYS_IN_BATCH) {` after `retime_key_draw()` This way batch should never be overfilled, and from what I read, with `immBeginAtMost()` you can always draw less vertices.

aras_p marked this conversation as resolved

Richard Antalik commented

2023-11-28 16:31:45 +01:00

@Jeroen-Bakker Do you want to review the code as well? I think, this could be merged after next update, but I can wait for your review.

Aras Pranckevicius added 1 commit 2023-11-28 17:27:42 +01:00

buildbot/vexp-code-patch-coordinator Build done. Details

be09dfdc29 VSE: retiming draw code: code style, simplify

Richard Antalik approved these changes 2023-11-28 20:37:42 +01:00

Richard Antalik left a comment

Thanks for updates, seems fine now.

Jeroen Bakker commented

2023-11-29 17:06:41 +01:00

I looked into the code, but seems all to be isolated in the sequencer code.
I don't see any obvious issue with the code to land. So fine by me to land.

I looked into the code, but seems all to be isolated in the sequencer code. I don't see any obvious issue with the code to land. So fine by me to land.

Jeroen Bakker approved these changes 2023-11-29 17:06:59 +01:00

Jeroen Bakker commented

2023-11-29 17:07:10 +01:00

@blender-bot build

Jeroen Bakker added this to the 4.1 milestone 2023-11-29 17:07:36 +01:00

Aras Pranckevicius merged commit df16f4931e into main

2023-11-29 20:25:30 +01:00

Aras Pranckevicius referenced this issue from a commit

2023-11-29 20:25:32 +01:00

VSE: speedup timeline drawing, and improve waveform display

Aras Pranckevicius deleted branch vse-draw-opt

2023-11-29 20:25:32 +01:00

Sign in to join this conversation.

No reviewers

No Label

Download

What's New

Blender Studio

Manual

Developers Blog

Documentation

Benchmark

Blender Conference

Development Fund

One-time Donations

VSE: speedup timeline drawing, and improve waveform display #115311

VSE timeline rendering performance overview

Approach taken in this PR to improve performance

What is not done in this PR

Open Questions