How to compute mathematical operations in device memory?

kansai · June 10, 2021, 3:38am

I have a function that computes a value from a vector. I have checked that it can be parallelized, and I am in the process of turning it into a CUDA device function.

However, the operation this function performs involves sums, multiplications, and finally a square root.

The internal calculations will be done in device memory. The function will then return the value, which I will transfer to the host in the main function for display or any other purpose.

However, I still have to take the square root of the value, and I think I cannot use std::sqrt() for this.

My question is, how can I do this with CUDA?

(One suggestion, of course, is to return the value before taking the square root, transfer it to the host, and apply sqrt there, but that defeats the purpose of modularizing the function. I wonder if there is another way.)

Robert_Crovella · June 10, 2021, 3:41am

CUDA has a math API that includes sqrt(), which can be called from device code.

https://docs.nvidia.com/cuda/cuda-math-api/index.html
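
As a rough illustration of the point above, here is a minimal sketch of a sum-of-squares followed by a square root, all done on the device. The names norm_kernel and vector_norm, the single-block launch, and the block size of 256 are assumptions made for this example, not anything from the original question. The math API provides sqrtf() for float and sqrt() for double. Error checking is left out to keep it short.

```cuda
#include <cuda_runtime.h>

// Hypothetical kernel: a single block computes sqrt(sum(x[i]^2)) entirely on the device.
// Assumes it is launched with exactly 256 threads (a power of two).
__global__ void norm_kernel(const float* x, int n, float* out)
{
    __shared__ float partial[256];   // one slot per thread; must match the launch block size
    int tid = threadIdx.x;

    // Each thread accumulates a strided share of the squared elements.
    float acc = 0.0f;
    for (int i = tid; i < n; i += blockDim.x)
        acc += x[i] * x[i];
    partial[tid] = acc;
    __syncthreads();

    // Tree reduction of the per-thread partial sums in shared memory.
    for (int s = blockDim.x / 2; s > 0; s >>= 1) {
        if (tid < s)
            partial[tid] += partial[tid + s];
        __syncthreads();
    }

    // sqrtf() is the single-precision square root from the CUDA math API.
    if (tid == 0)
        *out = sqrtf(partial[0]);
}

// Hypothetical host wrapper: copies the input to the device, launches the
// kernel, and copies back only the final scalar. No error checking.
float vector_norm(const float* h_x, int n)
{
    float* d_x = nullptr;
    float* d_out = nullptr;
    float h_out = 0.0f;

    cudaMalloc(&d_x, n * sizeof(float));
    cudaMalloc(&d_out, sizeof(float));
    cudaMemcpy(d_x, h_x, n * sizeof(float), cudaMemcpyHostToDevice);

    norm_kernel<<<1, 256>>>(d_x, n, d_out);

    // This blocking copy on the default stream waits for the kernel to finish.
    cudaMemcpy(&h_out, d_out, sizeof(float), cudaMemcpyDeviceToHost);

    cudaFree(d_x);
    cudaFree(d_out);
    return h_out;
}
```

Calling vector_norm on a host array holding {3, 4} should give 5. For large vectors a multi-block reduction would likely be a better fit; the single block here is only to keep the sketch short.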
