Performance Regressions in 32.4.2 and newer GPU driver versions

theofficialgman · May 19, 2021, 11:24pm

There are GPU performance regressions in the 32.4.2 and newer userspace gpu drivers (shown here with a vulkan benchmark suite).

I have tested the 32.3.1, 32.4.2. 32.4.4, and 32.5.1 package releases by starting with a 32.3.1 jetpack release and upgrading incrementally by pinning the packages, changing the nvidia-l4t-apt-source.list when necessary and then benchmarking each version.

For the ease of representation and reproduction, I used an open benchmark suite to generate my data GitHub - RippeR37/GL_vs_VK: Comparison of OpenGL and Vulkan API in terms of performance.
Attached you will find my source download and compiled binary. I have edited the source slightly to result in a GPU bottleneck on the TX1. Extract the source attached to your users home folder and run the vk_bench_table.sh shell script (found in the bin folder) to generate tables of data for benchmark 1 and 3 for the full frequency range of the GPU.
The script pins the CPU to the maximum allowed frequency and sets the RAM frequency to 1600MHz.
https://drive.google.com/file/d/1N4orvxpx34JRRJXgWCgeiQ3jY8YKEcBc/view?usp=sharing

Below is a description of the two benchmarks taken from the readme:

Test #1 - static scene

This test resolves around single static scene with variable number of rendered objects which quality can be chosen (each is a sphere with specific ammount of vertices).

Number of vertices, number of vertices and update work is customizable to give possibility to emulate different ammount of CPU and GPU work (this gives us an opportunity to test CPU-bound and GPU-bound scenarios).

Test #3 - shadow mapping

In this test we render a “checkboard” floor with differently-colored cubes and above that we render one high-res sphere and many cubes in different positions. We render it in two passes - depth pass from light PoV to acquire shadowmap and then real render pass which simply renderes scene shadowing necessary fragments.

kayccc · May 20, 2021, 7:18am

Thanks for reporting this issue, our team will do the investigation soon.

theofficialgman · June 10, 2021, 9:06pm

any update/new on this issue?

kayccc · June 16, 2021, 7:05am

Currently there is no much update from team regarding this regression. Sorry for that.

theofficialgman · August 6, 2021, 8:59pm

just to confirm, 32.6.1 has the same poor performance

to add more information, there appears to be a significant frame skipping in the benchmark for 32.4.2 and newer that was not present in 32.3.1.

_Diablo · March 5, 2023, 4:41pm

How status for this?
It is still present in the latest 32.7.3 BSP.

Topic		Replies	Views
Performance downgrade on Jetpack 4.2 comparing to Jetpack 3.3 on TX2 Jetson TX2	9	1205	January 20, 2020
Chromium Vulkan Crashes and fallsback to OpenGL on L4T 32.4.X+ Jetson Nano opengl	7	1146	July 4, 2023
Jetpack4.2 based TX2 (Ubuntu18.04) GL performance issue Jetson TX2	14	1375	November 13, 2019
Vulkan FIFO broken (tears) on 32.4.X+ Jetson Nano vulkan	9	434	May 15, 2026
Vulkan shader experiences a significant performance drop and issues with procedural noise with driver versions > 470 Vulkan	1	1603	September 10, 2022
32.6.1-20210726122000 vs 32.6.1-20210916211029 nvidia-l4t-* packages Jetson Nano reflash	3	568	December 15, 2021
HDMI was flicker on demo kit when running stress on gpu Jetson Nano nvbugs , hdmi	65	3142	August 4, 2020
Announcing JetPack 4.6.1 Release with L4T 32.7.1 Jetson Nano	22	11192	December 5, 2022
Poor performance in comparison with OpenGL 4 Vulkan	4	1817	October 21, 2019
Severe performance regression on L4T32 Jetson TX2 performance	3	197	March 5, 2025

Performance Regressions in 32.4.2 and newer GPU driver versions

Test #1 - static scene

Test #3 - shadow mapping

Related topics