compute_13 flag versus sm_13 flag

nasacort · June 20, 2009, 9:38pm

I noticed that if I compile using the -arch compute_13 flag, the resulting .cu_o file is more than double in size compared to that generated using the -arch sm_13 flag. However, there is no noticeable difference in performance between the two cases (running on a GTX 280). What is the main difference between the two flags?

MisterAnderson42 · June 21, 2009, 3:17pm

According to the documentation, compute_13 stores more abstract GPU code which will then be compiled to the real device cubin at runtime. sm_13 buids the cubin at compile time.

nasacort · June 22, 2009, 5:33pm

Thanks for the clarification.