I’m profiling the CNN model implemented with PyTorch. I ran inference code with image dataset through ResNet18, MobilenetV2, and I’m going to do this with other models, too.
I was trying to use a lot of profiling tool CUDA toolkit provides. And I noticed that the data I want to get is available with Visual Profiller.(e.g. Memcpy/Kernel Overlap, Kernel Concurrency is available with GPU Usage mode)
However, I’m not sure this data is reliable. My Inference code is really fast (about 10 seconds) when I ran it in my console. (which is done by the command ‘python inference.py’) However, in my time result in Visual Profiler, it said it took 349 seconds. Why it happened?
And it always said there was an error after it finish execution with the message ‘timeline options cannot be enabled for this profile data and will be ignored.’ And sometimes no result appears.