For my applications I need to call linear algebra, signal and image processing, and neural network functions from within a GPU kernel, avoiding any interaction with the CPU. I understand that support for the cuBLAS device functions has been discontinued, and that NPP never offered device function support.
What are my options for device-callable HPC library functions (possibly third-party)?
I look forward to your reactions.