Optimization Guide for TK1 SoC

Hello All,

I am working on algorithm development and porting on the Nvidia TK1.
I am looking for a C/C++ optimization guide specifically for the Nvidia TK1 SoC, if one is available; otherwise, a general Nvidia optimization guide would also work.

By optimization, I mean an efficient way of using the GPU. I am currently exploring CUDA. Are there any other ways to use the GPU for general-purpose computing, such as running algorithms?

Any help is highly appreciated…

Hi KapilMehta, you might want to check the CUDA Programming and Performance forum instead: https://devtalk.nvidia.com/default/board/57/cuda-programming-and-performance/

They may, however, refer you to the CUDA C Best Practices Guide: http://docs.nvidia.com/cuda/cuda-c-best-practices-guide/index.html

CUDA should be mostly the same between the x86 and ARM platforms. For anything outside of CUDA, general ARM optimization information should apply, and searching this forum should answer a few of the gaps in between. The eLinux wiki should also be useful, both inside and outside the Jetson section: http://elinux.org/Jetson_TK1