Why does powf(-0.07346, 2) return NaN?

I want to use OptiX to accelerate ray tracing. When I use the math libraries, I find that powf(-0.07346, 2) returns NaN, but powf(2, 2) returns 4.0000.

Looking forward to your reply. Sincerely.

I don’t find that to be true:

$ cat t490.cu
#include <math.h>
#include <stdio.h>

__global__ void k(float x, float y){

  printf("%f\n", powf(x, y));
}

int main(){

  k<<<1,1>>>(-0.07346,2);
  cudaDeviceSynchronize();
}
$ nvcc -o t490 t490.cu
$ cuda-memcheck ./t490
========= CUDA-MEMCHECK
0.005396
========= ERROR SUMMARY: 0 errors
$

Hi 346221593,

Are you using powf() inside an OptiX program, or a CUDA kernel? If, after seeing Robert_Crovella’s response, you can still reproduce your issue, please let us know some details: your system type, driver version, and OptiX version. And if you have a very small code snippet that can reproduce the issue, that will be exceptionally helpful.


David.

I use powf() in OptiX 6.0.0 on Ubuntu 16.04, and my CUDA version is 10.1.
Here is my test code:

RT_PROGRAM void pinhole_camera()
{
    if (launch_index.x == 261 && launch_index.y == 0)
    {
        float x = -0.45;
        int y = 2;
        rtPrintf("powf(%f,%d) = %f \n\n", -x, y, powf(-x, y));
        rtPrintf("powf(%f,%d) = %f \n\n", x, y, powf(x, y));
    }
}

but its output is

powf(0.450000,2) = 0.202500
powf(-0.450000,2) = nan

my card info:

nvidia-smi

Tue Jan 7 10:47:15 2020
+-----------------------------------------------------------------------------+
| NVIDIA-SMI 418.87.00    Driver Version: 418.87.00    CUDA Version: 10.1     |
|-------------------------------+----------------------+----------------------+
| GPU  Name        Persistence-M| Bus-Id        Disp.A | Volatile Uncorr. ECC |
| Fan  Temp  Perf  Pwr:Usage/Cap|         Memory-Usage | GPU-Util  Compute M. |
|===============================+======================+======================|
|   0  GeForce GTX 970     Off  | 00000000:01:00.0  On |                  N/A |
| 47%   26C    P2    70W / 250W |    824MiB /  4039MiB |      0%      Default |
+-------------------------------+----------------------+----------------------+

+-----------------------------------------------------------------------------+
| Processes:                                                       GPU Memory |
|  GPU       PID   Type   Process name                             Usage      |
|=============================================================================|
|    0      1109      G   /usr/lib/xorg/Xorg                           305MiB |
|    0      1935      G   compiz                                       167MiB |
|    0      2244      G   /usr/lib/firefox/firefox                       3MiB |
|    0      3833      C   ./burgercpp                                  334MiB |
+-----------------------------------------------------------------------------+

I used pow(double, double) instead, and it worked. Thanks.

Hi, it’s unfortunate to use double precision if you don’t need it. I assume your exponent is not always 2, correct?

I was able to reproduce this behavior, and then I noticed that it is caused by the “fast math” option. It is documented in the CUDA programming guide here: https://docs.nvidia.com/cuda/cuda-c-programming-guide/index.html#intrinsic-functions Table 9. You can reproduce this behavior in the nvcc example above if you add the nvcc command line option --use_fast_math.

You have a few alternative options, if you want. Fast math is something you can turn on per compilation unit, so one option is to leave it turned on, but compile a second unit with fast math disabled and put a pow() wrapper function in there. A second option is to turn off fast math and use the fast float device intrinsics explicitly everywhere you need them. That way you can freely mix the fast versions with the robust versions. This might be a little painful to manage, and runs the risk of slowing things down, or of later accidentally introducing slow math.


David.