Generally looks fine, and works better than it used to. There are a couple of comments about keeping the ifdef.
The issue I've run into is: i was testing this patch on Linux on top of…
@LazyDodo and @Sergey
That's the simplest way I could come up with 9333515ce3
Let me know if I need to make…
I think I know what the problem is. I'll try to submit a fix by early next week.
I have a slight lean towards just always requiring the linker since
Originally, HIPCC_EXECUTABLE was required but not HIP_LINKER_EXECUTABLE. At some point, I noticed the linker was no longer…
I will add this (hiprt not installing any of its headers) to the list of changes I am compiling to hand over to the hiprt team. The project does install some of its headers (in the dist folder)…
HIP_LINKER_EXECUTABLE which points to clang++ is a HIP component but if it is missing, it only affects enabling HIP-RT. When HIP-RT headers are present, Found HIPRT is appropriate, but if the HIP…
Thank you @Sergey I'm glad the PR is now in a reasonable shape. @LazyDodo and @ideasman42 let me know what you think. I will be away until next Tuesday. If there are any concern, I will…
Yes, I confirm I can reproduce the issue, and that it is not related to blender/blender#120702
Not sure it worth going more complicated route
Good! If this option reliably works, then I'd much rather stick with the way it is set up now, rather than splitting the work like I did in the…
CustomData_copy
to CustomData_initi_from
.
Do you mean we can break down the kernel_rt_gfx into individual architectures?
Yes.
Is there a flag we can pass to the HIP compiler to build more things in parallel?
Yes, -parallel-jo…
The kernel_rt_gfx.hipfb takes about 30min to compile with rocm 6.1.2 and 6.1.3
More or less the same for me (although I have a very old cpu on my Ubuntu machine.) We can mitigate it by some…