Some question about maximizing CPU performance

Hi!
I have some problem about maximizing TX1 CPU, I have done the following step:
http://elinux.org/Jetson/Performance
https://devtalk.nvidia.com/default/topic/886502/?comment=4701015

but I execute “tegrastats”, the CPU seems didn’t use it’s maximum efficiency(I have already run some large program about file access):

RAM 1655/3995MB (lfb 247x4MB) cpu [7%,4%,71%,27%]@1734 GR3D 0%@998 EDP limit 0
RAM 1594/3995MB (lfb 247x4MB) cpu [6%,9%,17%,81%]@1734 GR3D 0%@998 EDP limit 0
RAM 1630/3995MB (lfb 247x4MB) cpu [2%,8%,96%,3%]@1734 GR3D 0%@998 EDP limit 0
RAM 1653/3995MB (lfb 247x4MB) cpu [28%,5%,70%,6%]@1734 GR3D 12%@998 EDP limit 0
RAM 1655/3995MB (lfb 247x4MB) cpu [6%,5%,96%,3%]@1734 GR3D 0%@998 EDP limit 0
RAM 1655/3995MB (lfb 247x4MB) cpu [3%,6%,97%,3%]@1734 GR3D 0%@998 EDP limit 0
RAM 1655/3995MB (lfb 247x4MB) cpu [3%,7%,97%,4%]@1734 GR3D 0%@998 EDP limit 0
RAM 1655/3995MB (lfb 247x4MB) cpu [6%,4%,43%,57%]@1734 GR3D 0%@998 EDP limit 0
RAM 1607/3995MB (lfb 247x4MB) cpu [83%,8%,7%,18%]@1734 GR3D 2%@998 EDP limit 0

Who can I maximizing CPU performance to reduce the programming time?
Whether it is the only method to do?

Thank you !!

Hi Chiang_Kuan_Ting,

The cpu clk is already up to maximum and your program does not use 100% on any core at all.