On my Core i7 PC (running Windows 7) with CUDA 3.1, the simplified version of my PTX code below apparently causes an internal error in ptxas. Am I doing something wrong, or is this an actual bug?
I switched to CUDA 3.2, but alas, my actual program still dies… :( The original source is some 900 lines of PTX code; here’s a similar test case that still causes ptxas to complain. Obviously the program below doesn’t actually make sense: adding the addresses of two labels, masking away the high 16 bits, and then jumping to the result isn’t the smartest thing to do if you want a stable machine. :) But the essence of what I’m trying to do is there. Well, apparently not from ptxas’s point of view… :(
I assume that you are a registered developer. If so, it would be helpful if you could file a bug so our toolchain team can follow up on this. If you are unable to file a bug, please let me know (you can also send me a personal message through the forums). Thanks.