Nvcc on Linux tries to resolve ::lerp as std::lerp with compute 80 or higher

...(93): error: more than one instance of overloaded function "lerp" matches the argument list:
            function "std::lerp(float, float, float) noexcept" (declared at line 1911 of /usr/include/c++/11/cmath)
            function "lerp(float, float, float)" (declared at line 1226 of .../helper_math.h)
            argument types are: (const float, const float, const float)
   auto x0 = ::lerp(a, b, relPos.x);

The above error happens on Linux when compiling with compute architecture 8.0 but not 6.1. It also doesn’t happen on Windows.

On Linux, removing the lerp(float,float,float) definition from helper_math.h solves this. On Windows it causes compilation to fail.

The Linux behaviour seems like an error, since ::lerp shouldn’t be resolved as std::lerp.

I tried it with SDKs 12.4 and 12.8.

Anyone encountered this? Any idea how to bypass it except for different code for Windows and Linux?


helper_math.h is not part of the CUDA toolkit.

It is part of CUDA sample codes which are not intended to be used for production code.

So don’t include or use helper_math.h

Then fix your code.

I understand, though this looks to me like a problem on the compiler side, so "fix your code" feels both condescending and wrong. I think it would be better if NVIDIA fixed its own code.

(But thanks for the quick reply.)


sorry to have been condescending and wrong

It looks to me like you have two candidates on the Linux side (because one is coming from the Linux system headers, which I don't necessarily expect to match the Windows system headers).

What is the problem on the compiler side? Are you referring to the claim that ::lerp shouldn't be resolved as std::lerp?

Can you provide a short, complete example of the issue?

I tried this on CUDA 12.8.1, but that did not show an issue:

$ cat test.cu
#include <./cuda-samples/Common/helper_math.h>

__device__ float f(float a, float b, float c){
  return ::lerp(a,b,c);
}
$ nvcc -arch=sm_80 -I.  -dc test.cu
$
inline __device__ __host__ float lerp(float a, float b, float t)
{
	return a + t * (b - a);
}

__global__ void interpolate(const float* a, const float* b, const float* t, float* result)
{
	int i = blockIdx.x * blockDim.x + threadIdx.x;

	result[i] = ::lerp(a[i], b[i], t[i]);
}

This shows the problem.

It’s compiled with CMake with:

set(CMAKE_CUDA_ARCHITECTURES "61;80")
add_compile_options($<$<COMPILE_LANGUAGE:CUDA>:--std=c++20>)
add_compile_options($<$<COMPILE_LANGUAGE:CUDA>:--extended-lambda>)
add_compile_options($<$<COMPILE_LANGUAGE:CUDA>:--expt-relaxed-constexpr>)

I don’t think the flags matter. They’re just there because I used what I normally use. Compiles fine with arch 61 but not with arch 80.

Edit: I initially posted with ā€˜lerp’ instead of ā€˜::lerp’, but got the same problem with both.


I didn’t have any trouble compiling that on CUDA 12.8.1:

$ cat test2.cu
inline __device__ __host__ float lerp(float a, float b, float t)
{
        return a + t * (b - a);
}

__global__ void interpolate(const float* a, const float* b, const float* t, float* result)
{
        int i = blockIdx.x * blockDim.x + threadIdx.x;

        result[i] = lerp(a[i], b[i], t[i]);
}
$ nvcc -arch=sm_80 -I.  -dc test2.cu
$

Can you show the full CMake verbose compilation output?

Thanks. I’ll try to debug the compilation to see when the problem surfaces.


ahh

The -std=c++20 flag shows the issue:

$ nvcc -arch=sm_80 -I.  -dc test2.cu -std=c++20
test2.cu(10): error: more than one instance of overloaded function "lerp" matches the argument list:
            function "std::lerp(float, float, float) noexcept" (declared at line 1911 of /usr/include/c++/11/cmath)
            function "lerp(float, float, float)" (declared at line 1)
            argument types are: (const float, const float, const float)
   result[i] = lerp(a[i], b[i], t[i]);
               ^

1 error detected in the compilation of "test2.cu".
$
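Since std::lerp only exists from C++20 onward, pinning the CUDA standard to C++17 (if the rest of your code allows it) should also make the ambiguity disappear. A sketch of that in CMake, matching the setup you posted:

```cmake
# sketch: compile CUDA sources as C++17; std::lerp is a C++20 addition,
# so the helper_math.h overload becomes the only candidate again
set(CMAKE_CUDA_STANDARD 17)
set(CMAKE_CUDA_STANDARD_REQUIRED ON)
```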

Thanks. Good to know you see it.

I have seen other situations where the device code compiler "resolves" undecorated functions using the standard library (perhaps unexpectedly). So it's not obvious to me that this is disallowed. (I don't know.)

If you think it is a (nvcc) bug, probably best to file a bug.

If you are looking for workarounds, it seems like one possibility would be to include your own definition in a namespace, then call out that namespace specifically:

$ cat test2.cu
namespace foo{
inline __device__ __host__ float lerp(float a, float b, float t)
{
        return a + t * (b - a);
}
}

__global__ void interpolate(const float* a, const float* b, const float* t, float* result)
{
        int i = blockIdx.x * blockDim.x + threadIdx.x;

        result[i] = foo::lerp(a[i], b[i], t[i]);
}
$ nvcc -arch=sm_80 -I.  -dc test2.cu -std=c++20
$

And, FWIW, I see the same failure using the non-namespace code, whether I compile for -arch=sm_61 or -arch=sm_80. That error doesn’t appear to be arch-dependent.

Okay, thanks. It surfaced when I first used arch 80, but that might have coincided with other compiler defaults changing. IIRC I was using C++17 at the time, so perhaps the standard version is what actually mattered.

Anyway, I opened a bug, so I'll wait to see what happens there and work around it in the meantime. Thanks for the help.


Thanks for filing a ticket (ID 5238032). It is under review and we will report the conclusion back here.