Using a CUDA library call as a device function instead of a kernel launch

himat15 · April 2, 2018, 10:45pm

I want to use the cuFFT library, but I don’t want the overhead of launching another kernel.
Is it possible to adapt the library call into a device function so that I can just call it from an already launched function?

Robert_Crovella · April 2, 2018, 11:03pm

not possible for CUFFT. CUBLAS has support for this.

himat15 · April 2, 2018, 11:21pm

So is there a common solution people have to dealing with such things if they want to call these library kernels if they don’t want a lot of kernel launch overhead?

Robert_Crovella · April 2, 2018, 11:36pm

common strategies for improving CUFFT efficiency:

batching of transforms
using the CUFFT API to manage temporary allocations (“workspace”) yourself
reuse of plans

Topic		Replies	Views
cufft on device how to call cufft library function from device function CUDA Programming and Performance	2	7376	January 24, 2011
CUDA based DLL CUDA Programming and Performance	2	526	November 25, 2012
Call cuBLAS from device function GPU-Accelerated Libraries	1	691	November 15, 2019
Calling FFT from device CUDA Programming and Performance	3	771	January 27, 2012
cuFFT Device-callable Library GPU-Accelerated Libraries	12	5078	November 21, 2013
What is the state of device-side libraries? CUDA Programming and Performance	0	335	August 15, 2019
cuFFT Device-callable Library CUDA Programming and Performance	1	661	January 27, 2013
Feature Request - Libraries CUDA Programming and Performance	0	780	April 22, 2011
What about calling non __device__ function inside kernel? Feature suggestion CUDA Programming and Performance	1	7822	June 3, 2011
cublas calls from device CUDA Programming and Performance	1	2319	December 26, 2008

Using a CUDA library call as a device function instead of a kernel launch

Related topics