When I compile matrxMul from the command line, I get an executable that works. When I build it with the make command, either on its own or as part of a batch build, I get an executable that hangs when it is invoked. The two executables are also different sizes. The output shown should demonstrate what I am saying.
Only you can answer that, I suspect. The big clue is that the build process isn’t the same and the resulting executable isn’t the same, so what is different in the build process? You can’t seriously expect someone to answer that.
How long are you waiting before ctrl-c’ing? I ask because I’ve run into a mysterious problem where, in the first execution after compiling, the first CUDA call takes an extra 12-14 seconds, and subsequent executions seem to be fine after that (even after rebooting). Maybe if you wait long enough it will eventually complete.
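If you want to check whether the time is going into the first CUDA call, something like the rough sketch below might help. It is not taken from matrxMul: the helper name timedCudaFree is made up for illustration, and it assumes the delay shows up in the first runtime API call, which may not be what is happening in your case. It times cudaFree(0), a no-op free that forces the runtime to initialize, and then times the same call again for comparison.

```cuda
// Sketch: compare the wall-clock cost of the first CUDA runtime call
// with a repeat of the same call. timedCudaFree is a hypothetical helper.
#include <chrono>
#include <cstdio>
#include <cuda_runtime.h>

// Returns the duration of one cudaFree(0) call in milliseconds and
// stores the call's status in *status. cudaFree(0) frees nothing, but
// the first runtime call triggers the runtime's lazy initialization.
static double timedCudaFree(cudaError_t *status) {
    auto start = std::chrono::steady_clock::now();
    *status = cudaFree(0);
    auto stop = std::chrono::steady_clock::now();
    return std::chrono::duration<double, std::milli>(stop - start).count();
}

int main() {
    cudaError_t err;

    // First call: includes whatever one-time startup work the runtime does.
    double first = timedCudaFree(&err);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "first call failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    // Second call: the runtime is already initialized at this point.
    double second = timedCudaFree(&err);
    if (err != cudaSuccess) {
        std::fprintf(stderr, "second call failed: %s\n", cudaGetErrorString(err));
        return 1;
    }

    std::printf("first call:  %.1f ms\n", first);
    std::printf("second call: %.1f ms\n", second);
    return 0;
}
```

If the first number is in the range of seconds and the second is tiny, the program is probably slow to start rather than hung. If both are small, the hang is somewhere else and the difference between the two builds is the next thing to look at.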