Hello,
My software loads a PTX kernel via the CUDA driver API.
I think currently there might be a serious problem with CUDA driver API.
It has no API to determine/learn/detect/query the compute capability version that the PTX kernel was compiled for ?!
This means that my application is unable to set the correct compute capability launch parameters ?!
So far I have seen the driver api only have functionality to learn:
PTX version
Binary version
I doubt that these fields correspond directly with a compute capability version ?!
(Perhaps binary version means compute capability version ???)
A solution could be to convert a PTX version number to a compute capability version number.
(For example via a ptx-version-to-compute-capability version table or so).
Please clearify the situation.
Thanks and bye,
Skybuck.