OpenVX-VisionWorks' nodes executed sequentially

stereoIV · January 25, 2017, 9:37am

Hi,

I am developing a stereo vision application using the OpenVX framework. It will be deployed on the Jetson TX1 embedded platform, but currently I am testing it on an Ubuntu 16.04 machine with a Quadro K420 GPU and an Intel i5 processor.

It is a graph-based pipeline containing several OpenVX nodes, such as vxRemapNode and vxMeanStdDevNode, plus a custom node that I implemented in CUDA. It takes two images, from the left and right cameras, and produces a disparity map. Both vxRemapNode and vxMeanStdDevNode are applied to both images, so I expected to see some concurrency here. However, the two images are processed one after the other. The other thing I don’t understand is why those two nodes are executed on the GPU (according to the profiling results).
I also tried the medianBlur node, and it behaved the same way in terms of execution.
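
To make the expected concurrency concrete, here is a minimal sketch of the data dependencies in plain Python. It does not use the OpenVX API. The node labels and the exact wiring (each image going through remap, then mean/std-dev, with everything feeding the custom node) are assumptions made for illustration only.

```python
from collections import defaultdict

# Hypothetical model of the pipeline: node -> nodes whose output it consumes.
# The wiring is an assumption; the names are labels, not OpenVX calls.
DEPS = {
    "remap_left": [],
    "remap_right": [],
    "meanstddev_left": ["remap_left"],
    "meanstddev_right": ["remap_right"],
    "custom_disparity": [
        "remap_left",
        "remap_right",
        "meanstddev_left",
        "meanstddev_right",
    ],
}


def dependency_levels(deps):
    """Group nodes by depth in the DAG.

    Nodes that land in the same level have no data dependency on each other.
    """
    depth = {}

    def visit(node):
        # Roots get depth 0; every other node sits one below its deepest input.
        if node not in depth:
            depth[node] = 1 + max((visit(d) for d in deps[node]), default=-1)
        return depth[node]

    levels = defaultdict(list)
    for node in deps:
        levels[visit(node)].append(node)
    return [sorted(levels[i]) for i in sorted(levels)]


LEVELS = dependency_levels(DEPS)
for i, level in enumerate(LEVELS):
    print(f"level {i}: {level}")
```

Under this assumed wiring, the left and right remap nodes share a level, and so do the two mean/std-dev nodes, so nothing in the data flow forces them to run one after the other. Whether a particular OpenVX implementation actually overlaps them is a separate question that this model does not answer.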

To see what is happening, I profiled the application with the NVIDIA Visual Profiler. You can see the profiling results here:
https://s29.postimg.org/mscja0fgn/profiling.png
