VisionWorks primitive implementation

I’m new to VisionWorks programming.

Today I learned that OpenVX only defines the primitives, and each vendor implements them.

In VisionWorks, I know that primitives like Canny edge detection are already implemented, and that one primitive can have multiple implementations, e.g. a CPU version and a GPU (CUDA) version.

My question is: how can I check which implementation is used, via VisionWorks programming and the NVIDIA profiler?

When I test the VisionWorks sample code, only the GPU implementation is executed.

Is there any way to execute the CPU implementation?

I also understand that the selection among a primitive’s implementations is controlled inside VisionWorks.

Is that right?

Hi,

Thanks for your question.

VisionWorks primitives are CUDA optimized, except for the MedianFlow & FindHomography extensions.

85% of VisionWorks OpenVX API is also accelerated with NEON.
Please find more information from VisionWorks document: https://developer.nvidia.com/embedded/dlc/visionworks-1-6-documentation-for-l4t-27-1

Go to “VisionWorks API” > “NVIDIA Extensions API” > “Vision Primitives API”

If you want to check implementation by profiler, please use nvvp:
https://developer.nvidia.com/nvidia-visual-profiler
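As a sketch of how this looks in practice (the sample binary name below is just an example, assuming the VisionWorks samples are built), you can capture a run with nvprof and open the result in nvvp. GPU-implemented primitives launch CUDA kernels that appear in the timeline; a node running on the CPU launches no kernel for that primitive:

```shell
# Summarize CUDA kernels launched by the app (binary name is hypothetical --
# substitute your own VisionWorks application).
nvprof --print-gpu-summary ./nvx_demo_feature_tracker

# Or record a timeline file and inspect it in the Visual Profiler (nvvp):
nvprof -o timeline.nvvp ./nvx_demo_feature_tracker
nvvp timeline.nvvp
```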

Thanks for the answer,
but I already knew that there are CUDA-optimized and CPU implementations,
and I already know how to use nvvp.
Sorry, but your reply is different from what I expected.

I already found some information about my question.
In the reference, there is an API for resource control, “vxSetNodeTarget”.

And in the VisionWorks release notes, there are other implementations of primitives;
some implementations were added as the version changed.

So here are my questions:

  1. Are there other implementations of the primitives for the CPU or GPU?

  2. If I change a node to the CPU target with “vxSetNodeTarget”, could I see that node running on the CPU?
    When I ran the samples, all implementations ran on the GPU, not the CPU.

Hi,

Thanks for your feedback and also sorry for my previous unclear reply.

vx_*: Khronos OpenVX API (NVIDIA optimized); please see the document for implementation details (mentioned in #2).
nvx_*: NVIDIA extension API; only a GPU implementation is available.
nvxcu_*: Low-level CUDA API; only a GPU implementation is available.
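To make the prefixes concrete, here is a minimal sketch using only standard vx_* graph-mode calls (the image size and the Gaussian node are arbitrary examples, not from the thread). nvx_* nodes would be added to the same graph through NVIDIA’s extension headers, while nvxcu_* functions operate directly on CUDA buffers outside any graph:

```c
#include <VX/vx.h>

int main(void)
{
    /* vx_* layer: standard OpenVX graph API (NVIDIA-optimized in VisionWorks). */
    vx_context context = vxCreateContext();
    vx_graph   graph   = vxCreateGraph(context);

    vx_image input  = vxCreateImage(context, 640, 480, VX_DF_IMAGE_U8);
    vx_image output = vxCreateImage(context, 640, 480, VX_DF_IMAGE_U8);

    /* A standard primitive: VisionWorks selects the implementation
     * (GPU by default) unless a target is set explicitly. */
    vx_node node = vxGaussian3x3Node(graph, input, output);

    if (vxVerifyGraph(graph) == VX_SUCCESS)
        vxProcessGraph(graph);

    vxReleaseNode(&node);
    vxReleaseImage(&input);
    vxReleaseImage(&output);
    vxReleaseGraph(&graph);
    vxReleaseContext(&context);
    return 0;
}
```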

Ideally, the target should be switched by vxSetNodeTarget.

But we found this function works abnormally when the target string is set to “CPU”.
We are checking this issue internally and will update you later.

Thanks.

Hi,

Please use this API to set target to CPU.

vxSetNodeTarget(node, NVX_TARGET_CPU, NULL)
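For example, a minimal sketch of how this call could be used (the helper function, the `NVX/nvx.h` header for `NVX_TARGET_CPU`, and the re-verify step are my assumptions, following the general OpenVX convention that target changes take effect at graph verification):

```c
#include <VX/vx.h>
#include <NVX/nvx.h>   /* assumed header providing NVX_TARGET_CPU */

/* Pin an existing node to the CPU implementation, then re-verify and run. */
static vx_status run_node_on_cpu(vx_graph graph, vx_node node)
{
    vx_status status = vxSetNodeTarget(node, NVX_TARGET_CPU, NULL);
    if (status != VX_SUCCESS)
        return status;          /* CPU target not supported for this node */

    /* The new target takes effect on the next verification. */
    status = vxVerifyGraph(graph);
    if (status == VX_SUCCESS)
        status = vxProcessGraph(graph);
    return status;
}
```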

Thanks for your patience.

Hi @AastaLLL,

I am new to VisionWorks programming, and I am confused about:

  1. What is the difference between vx_*, nvx_* and nvxcu_*? From your previous answer and the official documents, I know that vx_* and nvx_* are both based on the OpenVX framework, but vx_* is NVIDIA-optimized and accelerated with NEON, while nvxcu_* is based on CUDA and does not use the OpenVX framework. Is that right?
  2. Which one is preferred on the TX2 platform for efficiency? Or can these be mixed, and how?

Hi,

  1. vx_* is from the OpenVX standard, nvx_* is the NVIDIA extension, and nvxcu_* is our low-level API.
    Please check this tutorial for more details.
    https://www.khronos.org/assets/uploads/developers/library/2016-embedded-vision-summit/V2_VisionWorks_OpenVX_tutorial.pdf

  2. If using GPU nodes, the different APIs should have similar performance.

  3. We also have lots of VisionWorks samples. Please find them here:

$ /usr/share/visionworks/sources/install-samples.sh .
$ cd /home/ubuntu/VisionWorks-1.6-Samples