How can I make a single-threaded C++ program exploit the multiple CPUs and cores of the NVIDIA Jetson TX2?

I am developing an image processing application. At present it is written as a single-threaded C++ application, and it works. Please enlighten me as to how I can make this application exploit the NVIDIA Jetson TX2's hardware features, such as its multiple CPUs and their cores. Any guidance would be appreciated.