CUDA 7.5: Pinpoint Performance Problems with Instruction-Level Profiling

jwitsoe · September 8, 2015, 7:20am

Originally published at: https://developer.nvidia.com/blog/cuda-7-5-pinpoint-performance-problems-instruction-level-profiling/

[Note: Thejaswi Rao also contributed to the code optimizations shown in this post.] Today NVIDIA released CUDA 7.5, the latest release of the powerful CUDA Toolkit. One of the most exciting new features in CUDA 7.5 is new Instruction-Level Profiling support in the NVIDIA Visual Profiler. This powerful new feature, available on Maxwell (GM200) and…

anon69078816 · October 18, 2015, 2:48pm

i cant setup cuda toolkit. My gtx 980ti is not vga cuda compatible? ??

anon95180265 · October 18, 2015, 10:34pm

You'll need to provide more information on the problems you are having. All NVIDIA GPUs are CUDA-compatible.

anon65459637 · October 31, 2015, 4:44am

I upgraded to 7.5 on my Ubuntu host, but now I can't debug on Jetson TK1 target due to error "cuda-gdb version (7.5.123) is not compatible with cuda-gdbserver version (6.5.121)". Is there some way to get 7.5 on the TK1?

anon56094974 · December 3, 2015, 12:06pm

nvvp not shows the information in columns and rows (for example, Utilization (column) and stacks in "Kernel Performance is Bound By Instruction And Memory Latency".
Why?

anon95180265 · December 8, 2015, 11:54pm

I don't fully understand the question. Is there a figure from this post where you see something different? Which figure? Can you link to a screenshot showing what you see instead? Thanks!

anon56094974 · December 9, 2015, 7:14am

In attached screenshots, you could see the difference

anon44529095 · December 11, 2015, 6:21am

We tried with fermi GPU on win7 and could not reproduce this issue.
It seems you have taken both the screenshots on the same platform with same GPU, is that correct?
If you can give detailed steps to reproduce the issue along with the platform/operating system you are working on, it will be helpful for us to reproduce the issue quicker.

anon56094974 · December 11, 2015, 10:24am

Yes, both screenshots are taken on the same computer and the same GPU, one running cuda 7.0 (OK) and the other running cuda 7.5 (not OK).
System is running Scientific Linux 6.7 x86_64 in a i7 processor with 8 GB RAM

anon44529095 · January 12, 2016, 5:35am

We are unable to reproduce this behaviour on
CentOS-7/GTX 480 setup with the CUDA 7.5 Production release(7.5.18).
"Scientific Linux 6.7 x86_64" is not supported officially in CUDA 7.5.

anon56094974 · January 18, 2016, 11:50am

Well... Scientific Linux is "not" supported officially in CUDA, but is very similar to CentOS... so... I suppose CentOS will return the same problem... But, if I have free time now, I will install a CentOS 6.x machine with CUDA 7.0 and 7.5

anon4205104 · February 19, 2016, 8:35pm

Found the same problem on a CentOS 6.6 machine with K80s. Have you fixed the problem?

anon56094974 · February 22, 2016, 11:38am

Now, in a CentOS 7.0, both Cuda 7.0 and Cuda 7.5 runs OK and nvvp shows correctly the information in columns and rows (for example,
Utilization (column) and stacks in "Kernel Performance is Bound By
Instruction And Memory Latency".
So, in CentOS 7.x we could say "OK", but in CentOS 6.x (and SL-6.x) the problem persists...

anon24581448 · December 6, 2017, 10:13pm

Thanks for the tip!

Would you be able to post the modified source code (estimated_combined4.cu)?

anon24188788 · April 13, 2018, 4:40pm

great explanation. but, how can i do this Instruction-Level Profiling on command line via nvprof?

Topic		Replies	Views
solved: instruction level sampling in visual profiler (eclipse edition), not available? CUDA Programming and Performance	5	1112	November 5, 2018
Failed Cuda Driver and Runtime version may be mismatched Cuda installation fails on Ubuntu 10.4 x86_ CUDA Programming and Performance	13	5090	November 17, 2010
Problem with cuda 7 toolkit on centos 6.6 CUDA Setup and Installation	3	4762	June 11, 2015
NVIDIA Visual Profiler is unable to profile application Visual Profiler and nvprof	8	11039	March 31, 2021
CUDA Drivers Fail in Multiple Ways After Fresh Install (Linux) CUDA Setup and Installation	5	2448	August 13, 2023
Issues with Nvidia Drivers on CentOS 7.6/7.7 CUDA Setup and Installation	7	4034	September 29, 2019
Instruction-Level Profiling via nvprof? CUDA Programming and Performance	0	1257	January 21, 2016
Installer Fails ~ CUDA 5.5.20 on GeForce GTX 780 Ti CUDA Setup and Installation	10	6941	March 23, 2014
CUDA driver version is insufficient for CUDA runtime version on CentOS 7.5 CUDA Setup and Installation	2	1434	October 8, 2018
verify2install4CUDAtoolkit CUDA Setup and Installation	7	2453	April 4, 2014

CUDA 7.5: Pinpoint Performance Problems with Instruction-Level Profiling

Related topics