OpenCL - Empty profile timeline

VladimirS · April 20, 2016, 8:27am

Hi everyone,

I’m working on an OpenCL project and need to optimize my program for speed.
So I am using Nsight Eclipse Edition to profile it.

The problem is that no timeline is generated when I click on the “Profile” button.
With a CUDA sample, however, it works fine.
Have I missed something in the Nsight settings?

I would be grateful for any ideas.
Thanks.

My configuration:
Ubuntu 14
Nvidia GTX 960 (OpenCL 1.2)
Nsight 7.5

An OpenCL sample I tried to profile, which also produces an empty timeline:

#include <cstdlib>   // exit
#include <iostream>
#include <string>
#include <vector>
#include <CL/cl.hpp>

int main() {
    //get all platforms (drivers)
    std::vector<cl::Platform> all_platforms;
    cl::Platform::get(&all_platforms);
    if(all_platforms.size()==0){
        std::cout<<" No platforms found. Check OpenCL installation!\n";
        exit(1);
    }
    cl::Platform default_platform=all_platforms[0];
    std::cout << "Using platform: "<<default_platform.getInfo<CL_PLATFORM_NAME>()<<"\n";

    //get default device of the default platform
    std::vector<cl::Device> all_devices;
    default_platform.getDevices(CL_DEVICE_TYPE_ALL, &all_devices);
    if(all_devices.size()==0){
        std::cout<<" No devices found. Check OpenCL installation!\n";
        exit(1);
    }
    cl::Device default_device=all_devices[0];
    std::cout<< "Using device: "<<default_device.getInfo<CL_DEVICE_NAME>()<<"\n";


    // create the context on the selected device so that it matches the command queue below
    cl::Context context(default_device);

    cl::Program::Sources sources;

    // kernel calculates for each element C=A+B
    std::string kernel_code=
            "   void kernel simple_add(global const int* A, global const int* B, global int* C){       "
            "       C[get_global_id(0)]=A[get_global_id(0)]+B[get_global_id(0)];                 "
            "   }                                                                               ";


    cl::Program program(context, kernel_code, true);


    // create buffers on the device
    cl::Buffer buffer_A(context,CL_MEM_READ_WRITE,sizeof(int)*10);
    cl::Buffer buffer_B(context,CL_MEM_READ_WRITE,sizeof(int)*10);
    cl::Buffer buffer_C(context,CL_MEM_READ_WRITE,sizeof(int)*10);

    int A[] = {0, 1, 2, 3, 4, 5, 6, 7, 8, 9};
    int B[] = {0, 1, 2, 0, 1, 2, 0, 1, 2, 0};

    //create queue to which we will push commands for the device.
    cl::CommandQueue queue(context,default_device, CL_QUEUE_PROFILING_ENABLE);

    //write arrays A and B to the device
    queue.enqueueWriteBuffer(buffer_A,CL_TRUE,0,sizeof(int)*10,A);
    queue.enqueueWriteBuffer(buffer_B,CL_TRUE,0,sizeof(int)*10,B);


    //run the kernel
    auto vadd = cl::make_kernel<cl::Buffer, cl::Buffer, cl::Buffer>(program, "simple_add");
    vadd(cl::EnqueueArgs(queue, cl::NDRange(10)), buffer_A, buffer_B, buffer_C);


    int C[10];
    //read result C from the device to array C
    queue.enqueueReadBuffer(buffer_C,CL_TRUE,0,sizeof(int)*10,C);

    // wait for all queued commands to complete; finish() already flushes the queue
    queue.finish();


    std::cout<<" result: \n";
    for(int i=0;i<10;i++){
        std::cout<<C[i]<<" ";
    }

    return 0;
}

harryz · April 25, 2016, 6:21am

Hi VladimirS,

Thanks for asking. I just tested your OpenCL code and got no timeline either.

The Nsight profiler uses nvprof to profile GPU kernels and generate the timeline, and as far as I know nvprof does not support OpenCL applications.

Best Regards

VladimirS · April 25, 2016, 7:45am

Thank you harryz_ for your answer,

I tested this OpenCL code with Nsight integrated into Visual Studio, and it worked great!
So I’m wondering why OpenCL profiling works in the Visual Studio edition but not in the Eclipse edition.

harryz · April 26, 2016, 8:17am

Hi VladimirS,

You’re right. Performance Analysis in Nsight Visual Studio Edition uses the Nsight Monitor, and it can analyze OpenCL applications. nvprof on Windows cannot profile OpenCL applications either; Nsight VSE simply works very differently from the Eclipse Edition.

By the way, NVIDIA Visual Profiler cannot profile OpenCL applications either, because it uses nvprof on both Windows and Linux.
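
A note for readers who hit the same limitation: the sample above already creates its queue with CL_QUEUE_PROFILING_ENABLE, so one possible workaround (not proposed in the thread itself) is to read per-command timestamps from OpenCL events instead of relying on a profiler timeline. The sketch below only shows the arithmetic on those timestamps, which OpenCL reports in nanoseconds. The struct and function names are hypothetical, and the commented usage assumes the cl.hpp bindings from the sample.

```cpp
#include <cassert>
#include <cstdint>

// Timestamps of one OpenCL command, in nanoseconds, as returned by
// event.getProfilingInfo<CL_PROFILING_COMMAND_*>() (cl_ulong values).
// The struct itself is a hypothetical helper, not part of OpenCL.
struct CommandTimestamps {
    std::uint64_t queued;     // CL_PROFILING_COMMAND_QUEUED
    std::uint64_t submitted;  // CL_PROFILING_COMMAND_SUBMIT
    std::uint64_t started;    // CL_PROFILING_COMMAND_START
    std::uint64_t ended;      // CL_PROFILING_COMMAND_END
};

// Convert a nanosecond count to milliseconds.
inline double nanosToMillis(std::uint64_t ns) {
    return static_cast<double>(ns) / 1.0e6;
}

// Time the command spent executing on the device.
inline double executionMillis(const CommandTimestamps& t) {
    assert(t.ended >= t.started);  // guard against unsigned underflow
    return nanosToMillis(t.ended - t.started);
}

// Time between enqueueing the command and the start of its execution.
inline double launchDelayMillis(const CommandTimestamps& t) {
    assert(t.started >= t.queued);
    return nanosToMillis(t.started - t.queued);
}

// Assumed usage with the sample above (requires CL/cl.hpp and a device):
//   cl::Event ev = vadd(cl::EnqueueArgs(queue, cl::NDRange(10)),
//                       buffer_A, buffer_B, buffer_C);
//   ev.wait();
//   CommandTimestamps t = {
//       ev.getProfilingInfo<CL_PROFILING_COMMAND_QUEUED>(),
//       ev.getProfilingInfo<CL_PROFILING_COMMAND_SUBMIT>(),
//       ev.getProfilingInfo<CL_PROFILING_COMMAND_START>(),
//       ev.getProfilingInfo<CL_PROFILING_COMMAND_END>()};
//   std::cout << "kernel: " << executionMillis(t) << " ms\n";
```

This gives per-kernel and per-transfer timings rather than a graphical timeline, so it complements the Nsight Visual Studio Edition analysis mentioned above rather than replacing it.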
