GPU trace shows wrong timeline when using 2 GPUs

imrodriguezro · September 19, 2024, 12:44pm

Hi!
In my Vulkan application, I render one frame in each of my 2 GPUs, I record one command buffer for each GPU, submit both in a single VkQueueSubmit call and wait on a fence before recording the commands for the next frames.

When running my application through Nsight Graphics I see that before one of my GPUs finishes rendering the current frame the other starts rendering the next frame, which shouldn’t be possible since the fence blocks on the CPU. In my opinion, the timeline for both GPUs is not synchronized. Can someone explain why this happens?

AYan · September 23, 2024, 8:19am

Hi imrodriguezro,

Thank you for using Nsight Graphics and providing your feedback. I am not quite sure about your issue, could you please provide a simple example that would allow us to reproduce the issue? This will help us in investigating and resolving the problem more efficiently.

Thanks
An

imrodriguezro · September 23, 2024, 8:53am

Hi Ayan,
thanks for your quick reply. I’m a bit busy right now and cannot provide a simple example (not so simple anyway with Vulkan). Maybe it helps if I show you an image of what I meant.

I drew rectangles on top of a capture to highlight the work done for each frame. The upper row corresponds to GPU 0 and the bottom one to GPU 1. The work for the frames with rectangles red and green is submitted at the same time and I wait on the CPU for them to be done with a fence before recording any more commands.

As you can see, Nsight Graphics shows that the work for the following frame in GPU 1 (yellow rectangle) starts before the work for the previous frames (red and green) finishes. That’s why it makes me think that the timelines for both GPUs is not synchronized.

I hope that illustrates the issue better.

AYan · September 23, 2024, 12:12pm

Hi imrodriguezro,

It’s hard to say anything, but maybe you can try to capture more than 1 frame and see what happen? Just set Max Number of Frames within the Start Activity dialog box.

Thanks
An

imrodriguezro · September 23, 2024, 12:18pm

Hi An,

My application uses an offscreen engine, so the only way I found to use nsight graphics was with GPU Trace Profile and One-Shot, so it basically collects everything. I could show you even more frames but they follow the same pattern. Can you check from the image what I meant?

AYan · September 24, 2024, 5:25am

Hi imrodiguezro,

Yes, I understand your meaning, the workload within yellow should run after the green rectangle, right? How do you get access to multiple GPUs?

Thanks
An

imrodriguezro · September 24, 2024, 7:25am

Hi An,

yes, that’s the issue I’m seeing. I create two logical devices (VkDevice) that correspond to my two physical GPUs.

Now I realize I need to correct what I said before. The commands are recorded separately for each GPU, sent with different VkQueueSubmit calls and waited on one VkFence object each, one after the other.

Ivan

AYan · September 24, 2024, 8:24am

Hi imrodriguezro,

What version of Nsight Graphics are you using? Could you take a try on some latest release of Nsight Graphics? I have to mention that Nsight Graphics/GPUTrace doesn’t support multiple GPUs, only 1 GPU is supported. In theory, you should be able to see only 1 GPU’s workload in recent release of Nsight Graphics.

Thanks
An

imrodriguezro · September 24, 2024, 9:32am

Hi An,

I’m using the version 2022.7.0.0. I can check tomorrow with the latest version since I need to update my drivers in order to use it. But well, I guess your comment explains why multi-GPU doesn’t work. Is there any plan to support multiple GPUs at some point?

imrodriguezro · September 24, 2024, 12:51pm

Side question: how should I modify my offscreen application in order to use the Frame Debugger and not the GPU Trace Profiler? I think that I could get more information about the GPU utilization with it, but is it worth the trouble anyway?

AYan · September 25, 2024, 6:45am

Hi imrodriguezro,

Roughly you need some Present call to let Frame Debugger know it’s a Frame.

Thanks
An

imrodriguezro · September 26, 2024, 7:10am

Hi An,

Is there any plan to support multiple GPUs in the GPU trace profiler?

Thanks

Ivan

imrodriguezro · October 3, 2024, 7:41am

Well, I guess there is no plan to support multiple GPUs. Thanks anyway.

AYan · October 7, 2024, 8:33am

Hi imrodriguezro,

Maybe you can try to use Nsight System to profiler on multi GPU system.

Thanks
An

imrodriguezro · October 8, 2024, 11:40am

Hi An,

I’ll try it then!

Ivan

system · October 22, 2024, 11:41am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.

Topic		Replies	Views
NSight: gpu trace profiler fails to capture trace on 4070 Nsight Graphics	20	57	April 11, 2025
NVIDIA Nsight Systems Adds Vulkan Support Technical Blog	1	442	May 9, 2019
Vulkan GPU Marker seem off Profiling x86 Windows Targets	3	40	March 10, 2025
Inconsistent times when profiling Vulkan-based render engine compared to D3D11 profiling Nsight Graphics	5	1394	April 11, 2022
Vulkan program crashes on GTX 1080 Vulkan	1	1765	September 2, 2018
Program performs far better with NSight Nsight Graphics opengl	9	1404	July 11, 2023
Internal error attempting to replay event Nsight Graphics vulkan , vulkan-raytracing	3	1300	March 16, 2023
Vulkan frame delimiter / headless applications Nsight Graphics headless , vulkan	10	953	November 18, 2024
Why is my application waiting for a semaphore every 5 frames or so? Nsight Graphics	10	1181	July 11, 2022
Poor multithreading performance compared to DX12 Vulkan	17	5454	September 29, 2020

GPU trace shows wrong timeline when using 2 GPUs

Related topics