Visual profiler

bdi · September 29, 2011, 9:47pm

Hi all,

I am trying to figure out the capabilities of Compute Visual Profiler of NVidia and I have some questions though:

Can we drill down the kernel code or any kind of code? For example, if a kernel takes 8000 gpu cycles to execute, does the tool support further analysis in order to find out which specific instruction of the kernel is the most gpu consuming?
Does the profiler extract information from mixed code (host + device) or only from pure CUDA code?
Does the profiler support analysis for pure host code? For example, if we have cudaMalloc function, will the profiler show the internal system calls which probably are executed in the host side?

All I’ve seen so far from the Internet and the documentation is that the tool provides mostly numerical statistics and it does not analyze the code thoroughly. i.e. The user will understand for example that a allocation function takes, let’s say, 400 cycles, but he can’t find out which specific instruction or system call from the allocation function is the most time consuming!

Thanks in advance

P.S. Please, please I am new to CUDA and I need guidance! Please help me!

bdi · October 3, 2011, 12:24am

nobody??? External Image External Image External Image

Topic		Replies	Views
CUDA Visual profiler Use on early verson of final program? CUDA Programming and Performance	1	1278	January 28, 2010
CUDA Visual Profiler CUDA Programming and Performance	0	43832	January 29, 2008
visual studio performance profiler on CUDA code CUDA Programming and Performance	1	6941	March 20, 2008
analysis inside kernel CUDA Programming and Performance	2	1451	July 2, 2012
More information from profiler Is there a way to get it? CUDA Programming and Performance	1	862	July 25, 2009
Trace __device__ functions in Visual Profiler CUDA Programming and Performance	2	10615	March 16, 2011
preview of NVIDIA Visual Profiler CUDA Programming and Performance	76	89175	May 18, 2010
Profiler + cufft CUDA Programming and Performance	0	2454	March 6, 2009
Help profiling cuda code CUDA Programming and Performance	2	3899	December 12, 2008
Profiling a computationally bound kernel CUDA Programming and Performance	1	2959	May 19, 2009

Visual profiler

Related topics