Bug on Windows

peastman · August 4, 2011, 5:38pm

I’ve received a bug report from a user. He’s running my code on 64 bit Windows 7 with Cuda 4.0, driver 275.33, and a Quadro FX 380M (which supports compute level 1.2). The kernel compiler fails with the following log:

ptxas application ptx input, line 128; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 129; error   : Instruction 'mov' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 130; error   : Instruction 'div' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 163; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 164; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 165; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 166; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 167; error   : Instruction 'mul' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 168; error   : Instruction 'mul' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 169; error   : Instruction 'mul' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 170; error   : Instruction 'mul' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 178; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 179; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas application ptx input, line 180; error   : Instruction 'cvt' requires SM 1.3 or higher, or map_f64_to_f32 directive

ptxas fatal   : Ptx assembly aborted due to errors

ptxas application ptx input, line 128; warning : Double is not supported. Demoting to float

It looks like the compiler is getting confused and generating 1.3 instructions on a device that only supports 1.2. It clearly realizes it shouldn’t do that, as seen from the last line: “Double is not supported. Demoting to float” But it does it anyway.

Has anyone seen anything like this before? Any idea what I can do about it?

Peter

eyebex · August 4, 2011, 8:54pm

This will probably not help much, but doesn’t SM stand for Shader Model and aren’t you confusing that with the Compute Capability? You’re right that the Quadro FX 380M has Compute Capability 1.2, though.

peastman · August 4, 2011, 9:08pm

I don’t believe that’s correct. CUDA generally uses the terms “compute capability” and “SM” as equivalent. For example, from section 3.1.2 of the CUDA C Programming Guide: “A cubin object is generated using the compiler option â€“code that specifies the targeted architecture: For example, compiling with â€“code=sm_13 produces binary code for devices of compute capability 1.3.” The reference to a “map_f64_to_f32 directive” also shows pretty clearly it’s a single vs. double precision issue.

Peter

peastman · August 8, 2011, 7:14pm

Does anyone at Nvidia actually read this forum?

Peter

JavaDev · August 9, 2011, 3:42am

Quadro FX 380M not support Double-precision floating-point operations CUDA - Wikipedia

peastman · August 9, 2011, 8:25pm

Yes, I’m well aware of that. Yet the compiler appears to be generating double precision instructions for it anyway.

Peter

Topic		Replies	Views
double precision with Quadro FX770M? CUDA Programming and Performance	1	4341	November 26, 2008
Compute 1.3 and invalid device function CUDA Programming and Performance	2	3152	January 30, 2009
Unable to do double precision calcs CUDA Programming and Performance	4	2194	April 7, 2009
Hidden double: Search and destroy ptx file: Double is not supported. Demoting to float CUDA Programming and Performance	4	2696	August 30, 2011
Quadro FX 3800 compute capability CUDA Programming and Performance	6	4743	August 7, 2009
double's on the GTX 285 CUDA Programming and Performance	2	1883	June 25, 2009
CUDA SDK CUDA Programming and Performance	3	1210	March 15, 2010
Problem with running code with double precision values Double precision gives wrong result CUDA Programming and Performance	2	1241	August 28, 2009
Double not supported; demoting to float Compiling on a comp: 2.1 device and getting precision errors CUDA Programming and Performance	5	2625	February 28, 2012
Is CUDA backward compatible? CUDA Programming and Performance	2	2911	February 26, 2009

Bug on Windows

Related topics