The latest update to NVIDIA Nsight Systems—a performance analysis tool—is now available for download. Designed to help you tune and scale software across CPUs and GPUs, this release introduces several improvements aimed to enhance the profiling experience.
Nsight Systems is part of the powerful debugging and profiling NVIDIA Nsight Tools Suite. You can start with Nsight Systems for an overall system view and avoid picking less efficient optimizations based on assumptions and false-positive indicators.
- Statistics now available in graphical user interface
- Multireport view with horizontal and vertical layouts to aid investigations across server nodes, VMs, containers, ranks, and processes (coming soon)
- Expert system now includes GPU utilization analysis for OpenGL and DX12
- NVIDIA NIC Infiniband metrics sampling (experimental)
- DirectX12 memory operations and warnings
- DXGI/DX12/Vulkan API calls correlation to WDDM queue packets
- Windows 11 support
Multireport view enhancements can improve investigations. They support merging into a single timeline reports that are continuations of existing sessions or reports captured simultaneously from other server nodes, VMs, container, rank, and process.
Figure 1. Two MPI ranks from separate report files viewed together on a shared timeline
NVIDIA NIC Infiniband metrics sampling (experimental) enables you to understand details of server communications, such as throughput, packet counts, and congestion notifications.
Figure 2. NVIDIA NIC Infiniband metrics sampling
Using DirectX12 trace, a new memory operations row highlights memory usage warnings and situations where expensive functions are called when resources are non-persistently mapped.
Figure 3. DirectX12 memory operations and warnings
WDDM trace now correlated graphics API calls to queue packets so that you can better understand workload creation and its progress through the Windows display driver model.
Figure 4. DXGI, DX12, and Vulkan API call correlation to WDDM queue packets
For more information, see the following resources:
- Optimizing DX12 Resource Uploads to the GPU Using CPU-Visible VRAM
- If you are an Nvprof or NVIDIA Visual Profiler user:
- For future release highlights and feature spotlights, subscribe to the NVIDIA Developer YouTube channel
- To download the latest release of Nsight Systems, see the Nsight Systems product page
- Ask questions or give feedback in the forums