Hello,
I’m trying to get the profiler running and I’m getting odd results.
System: MacBook Pro, 10.5.4, NVIDIA GeForce 8600M GT, 256 MB.
I have the CUDA system installed and have implemented my kernels and integrated them into some test code. Now I want to make sure I’ve done all the coalescing correctly etc., so it’s time to use the profiler.
If I set my environment variables as follows:
CUDA_PROFILE_CONFIG=/Users/sussillo/.cuda_profile_config
CUDA_PROFILE_CSV=1
CUDA_PROFILE_LOG=.cuda_profile_log.csv
CUDA_PROFILE=1
and the contents of the config file are:
more ~/.cuda_profile_config
gld_coherent
branch
then when I run my program (“>testger”) and then examine the profiler log file, it’s empty.
If I unset CUDA_PROFILE_CONFIG, run the program “>testger” then examine the profiler output:
$ more .cuda_profile_log.csv
method,gputime,cputime,occupancy
memcopy,1425.376
memcopy,8.928
_Z18my_cuda_syr_kernelfPfS_i,9638.784,9740.377,0.667
memcopy,2496.224
it works. So basically any reference to the config file gives no profiler output! But it’s really the global memory access information that I’m interested in.
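To make the setup easier to reproduce, here is a minimal sketch of the same steps as one shell script. It is not a fix. A few things in it are assumptions that were not part of my original run: the `PROFILE_DIR` variable is hypothetical, the log path is made absolute, the config file is rewritten with `printf` so it has no trailing whitespace, and `od -c` is added only to show the file byte by byte in case stray characters are the problem.

```shell
#!/bin/sh
# Sketch of the profiler setup described above. PROFILE_DIR is a hypothetical
# variable; it defaults to the current directory.
PROFILE_DIR="${PROFILE_DIR:-$PWD}"
CONFIG="$PROFILE_DIR/.cuda_profile_config"

# Write the two counters from the post, one per line, no trailing whitespace.
printf 'gld_coherent\nbranch\n' > "$CONFIG"

# Same four variables as in the post; the log path is absolute here (assumption).
export CUDA_PROFILE=1
export CUDA_PROFILE_CSV=1
export CUDA_PROFILE_CONFIG="$CONFIG"
export CUDA_PROFILE_LOG="$PROFILE_DIR/.cuda_profile_log.csv"

# Dump the config byte by byte to rule out carriage returns or stray spaces.
od -c "$CONFIG"

# Run the test program only if it exists here, then show whatever log it wrote.
if [ -x ./testger ]; then
    ./testger
    if [ -f "$CUDA_PROFILE_LOG" ]; then
        cat "$CUDA_PROFILE_LOG"
    else
        echo "no profiler log written at $CUDA_PROFILE_LOG"
    fi
fi
```

If the `od -c` output shows anything other than the two counter names each followed by `\n`, that would at least narrow things down.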
Any ideas? (Sorry if this is a newbie question and I’m just too new to know.)
Thank you,
-David