I am trying to decide whether the Tesla C870 is for me.
My applications use no floating-point operations; they consist of
integer and logic operations, including shifts, along with many loads and
stores. Can the cores on the card execute such programs efficiently
if every thread has these characteristics?
Can C++ programs be compiled efficiently for execution on the C870?
Do you have tutorials about porting single-threaded applications
written in C++ to execute efficiently on the C870?