VPI Pyramidal LK Optical Flow Poor Tracking Results

lee.schloesser · June 20, 2022, 3:19pm

I create a class wrapping the VPI PLK offerings and I am disappointed by the tracking results. When tracking a sparse set of points on an image to itself (so, 0 motion and with useInitialFlow set to 0), every single feature point is tracked successfully. However if I warp the target image ever so slightly (small rotations, translations, shear, etc.) many of the feature points end far from their starting positions and so give garbage solutions. This is the same with both CPU and CUDA backends.

This first image shows the tracked points color coded by angle.

This second image show the inlier set from an affine transform.

This third image shows the two input images and the diff after warping:

Does anyone else experience this? Are there know bugs in this library? Or, maybe more likely, any thoughts on what I am doing wrong?

static void vpi_track(const VPIImage& i0_wrapper, const VPIImage& i1_wrapper,   
                      VPIImage& gray0, VPIImage& gray1, VPIPyramid& pyr0,          
                      VPIPyramid& pyr1, VPIArray& pts0, VPIArray& pts1,         
                      VPIArray& status, VPIPayload& plk,                        
                      VPIOpticalFlowPyrLKParams* plk_params, VPIStream& stream, 
                      VPIBackend backend) {                                        
  try {                                                                          
    // Convert to grayscale                                                        
    vpi_check_status(vpiSubmitConvertImageFormat(stream, backend, i0_wrapper,   
                                                 gray0, nullptr));               
    vpi_check_status(vpiSubmitConvertImageFormat(stream, backend, i1_wrapper,   
                                                 gray1, nullptr));                 
                                                                                   
    // Fill pyramids                                                               
    vpi_check_status(vpiSubmitGaussianPyramidGenerator(stream, backend, gray0,  
                                                       pyr0, VPI_BORDER_CLAMP));
    vpi_check_status(vpiSubmitGaussianPyramidGenerator(stream, backend, gray1,  
                                                       pyr1, VPI_BORDER_CLAMP));
                                                                                   
    // Run optical flow                                                            
    vpi_check_status(vpiSubmitOpticalFlowPyrLK(stream, 0, plk, pyr0, pyr1, pts0,
                                               pts1, status, plk_params));      
                                                                                
    // Wait for processing to finish                                            
    vpi_check_status(vpiStreamSync(stream));                                    
  } catch (std::exception& e) {                                                 
    LOG(ERROR) << e.what();                                                     
  }                                                                             
}

I can paste more code if needed. But again, everything is “great” if there is zero motion between image0 and image1.

lee.schloesser · June 20, 2022, 4:22pm

Hmm, seems that the parameters don’t exactly match OpenCV and so more pyramid levels and iterations are needed than expected. I’m also having no luck with VPI tracking across time, as in images taken from the same vantage point at different times. I first run an edge magnitude operation on the images to avoid gradient reversal issues. OpenCV PLK responds well to this preprocessing step and usually tracks successfully:

and

VPI PLK fails miserably at this task, no matter how I parameterize. My guess for the discrepancy is that epsilon in OpenCV corresponds to feature motion from one iteration to the next, whereas in VPI it corresponds to avg. L1 over the feature window. I think the solution is to change the meaning of epsilon in VPI to match OpenCV and to add a new parameter to allow “successful” tracking of features with much higher avg. L1 differences, something like maxAvgL1Error.

lee.schloesser · June 20, 2022, 4:39pm

I’d like to also complain about the name windowDimension. It’s odd to me that it can be an even number. Is it a square radius or a square diameter? Please improve the documentation.

ilya9 · June 21, 2022, 10:45am

We had some issues with the optical flow as well. Increasing the pyramid level to 3 or 4 gave much better results. Also, we use VPI 1.1, I wonder if on newer versions the performance is better.

lee.schloesser · June 22, 2022, 1:16pm

I’m using 2.0, so it seems like not much has changed. It’s a shame there’s no github issue tracking or similar for this library.

shiremath · June 23, 2022, 8:35pm

@lee.schloesser - Forwarding this to the VPI team to investigate this matter. We will get back to you soon. Thanks for your patience!

Topic		Replies	Views
Providing points to VPI's PyrLK optical flow algorithm Jetson AGX Orin vpi	3	45	October 8, 2024
Maximum of numIterations in VPI LK Optical Flow Jetson TX2 vpi	5	753	October 18, 2021
Accuracy of NV-VPI sparse Optical Flow on Orin Jetson AGX Orin hw , cuda , jetson-inference	14	914	September 1, 2023
Basic VPI questions Computer Vision & Image Processing vpi	1	573	July 19, 2023
I use klt tracker in vpi 1.0 with usb camera but tracker not smooth and usually lost object Jetson Nano vpi	7	672	September 27, 2021
Optical flow acceleration Jetson AGX Orin	25	3212	April 27, 2023
visionworks-1.6 vxOpticalFlowPyrLKNode issue Jetson AGX Xavier	6	818	October 18, 2021
Use vpi to calculate the dense optical flow , error！ Jetson AGX Xavier vpi	2	515	April 5, 2023
Timing performance of vpiSubmitOpticalFlowPyrLK() with different backend flags Jetson AGX Orin vpi	4	186	May 27, 2024
VPI Optical Flow settings for performance similar to NVOFA Jetson Orin NX vpi	3	51	July 30, 2025

VPI Pyramidal LK Optical Flow Poor Tracking Results

Related topics