We are getting unexpected results when using the TX2’s CPU. We are running OpenCV’s stereo block matching algorithm, and the TX2 appears to be far slower than a standard x86 processor: the Jetson runs at 2000 ms/frame, while a standard x86 processor runs at 100 ms/frame.
Does anyone happen to know what could be causing this 20x performance hit?
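To compare the two machines on equal terms, it may help to time the same call the same way on both. The following is a minimal sketch, not the original poster's code: `ms_per_frame` and `run_stereo_bm` are hypothetical helper names, and the frame size, `numDisparities=64`, `blockSize=15`, and the synthetic noise images are all assumptions chosen for illustration.

```python
import time


def ms_per_frame(process, frames, warmup=1):
    """Mean wall-clock milliseconds per call of process(frame)."""
    frames = list(frames)
    if not frames:
        raise ValueError("need at least one frame to time")
    for frame in frames[:warmup]:
        process(frame)  # untimed warm-up call(s)
    start = time.perf_counter()
    for frame in frames:
        process(frame)
    return 1000.0 * (time.perf_counter() - start) / len(frames)


def run_stereo_bm(num_frames=5, size=(480, 640)):
    """Time StereoBM on synthetic noise; returns None if OpenCV is missing."""
    try:
        import cv2
        import numpy as np
    except ImportError:
        return None
    rng = np.random.default_rng(0)
    # Assumption: 8-bit single-channel inputs of a made-up size (rows, cols).
    pairs = [(rng.integers(0, 256, size, dtype=np.uint8),
              rng.integers(0, 256, size, dtype=np.uint8))
             for _ in range(num_frames)]
    # Assumption: illustrative matcher parameters, not the poster's settings.
    stereo = cv2.StereoBM_create(numDisparities=64, blockSize=15)
    print("optimized code paths enabled:", cv2.useOptimized())
    print("OpenCV worker threads:", cv2.getNumThreads())
    return ms_per_frame(lambda pair: stereo.compute(pair[0], pair[1]), pairs)


if __name__ == "__main__":
    result = run_stereo_bm()
    print("OpenCV not installed" if result is None else f"{result:.1f} ms/frame")
```

Running the same script on the x86 machine and on the Jetson gives a like-for-like number. Printing `cv2.getBuildInformation()` on both machines may also be worth doing, since differences between the two OpenCV builds would be one thing to rule out before blaming the hardware.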
A pure OpenCV application is CPU-based, so some hardware blocks (GPU, NVENC, NVDEC) may not be leveraged. We would suggest trying GStreamer or tegra_multimedia_api.
Every single core is at 100% utilization and running at its maximum clock speed. The only reason I’m asking is that we really cannot explain why the Jetson’s ARM CPU would be 20x slower than a regular x86 processor.
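One way to double-check the clock speed claim is to read the per-core frequencies that the Linux cpufreq subsystem exposes through sysfs while the workload is running. This is a minimal sketch: `read_cpu_freqs_khz` is a hypothetical helper, and it assumes the Jetson kernel exposes the standard `scaling_cur_freq` and `cpuinfo_max_freq` files under `/sys/devices/system/cpu`. Cores without those files are skipped.

```python
from pathlib import Path


def read_cpu_freqs_khz(root="/sys/devices/system/cpu"):
    """Map core name -> (current kHz, maximum kHz) for cores exposing cpufreq."""
    freqs = {}
    for node in sorted(Path(root).glob("cpu[0-9]*")):
        cur = node / "cpufreq" / "scaling_cur_freq"
        top = node / "cpufreq" / "cpuinfo_max_freq"
        try:
            freqs[node.name] = (int(cur.read_text()), int(top.read_text()))
        except (OSError, ValueError):
            continue  # core offline, file missing, or not readable
    return freqs


if __name__ == "__main__":
    for core, (cur_khz, max_khz) in read_cpu_freqs_khz().items():
        print(f"{core}: {cur_khz / 1000:.0f} MHz of {max_khz / 1000:.0f} MHz")
```

A core that is missing from the output may be offline, which would also be worth knowing when comparing against the x86 machine.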
We suspect the cause might be the L2 cache, which is 2 MB.
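A rough way to sanity-check the cache suspicion is to compare the size of the images against the 2 MB figure. The sketch below is back-of-the-envelope arithmetic only: `stereo_working_set_bytes` is a hypothetical helper, the frame sizes are made up because the thread does not state them, and the estimate counts just the two 8-bit input images plus a 16-bit disparity map, ignoring any internal buffers the matcher allocates. It cannot show that the cache is the cause, only whether the data plausibly fits.

```python
L2_BYTES = 2 * 1024 * 1024  # the 2 MB L2 cache mentioned in the thread


def stereo_working_set_bytes(width, height,
                             input_bytes_per_px=1,
                             disparity_bytes_per_px=2):
    """Lower-bound footprint: left + right inputs plus one disparity map."""
    inputs = 2 * width * height * input_bytes_per_px
    output = width * height * disparity_bytes_per_px
    return inputs + output


if __name__ == "__main__":
    # Assumption: hypothetical frame sizes, not taken from the thread.
    for width, height in [(640, 480), (1280, 720)]:
        size = stereo_working_set_bytes(width, height)
        print(f"{width}x{height}: {size} bytes, "
              f"{size / L2_BYTES:.2f}x the L2 cache")
```

With these assumed sizes, a 640x480 pair stays under 2 MB while a 1280x720 pair does not, so the actual frame size matters when judging this theory.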