2D cross correlation

seddona · January 14, 2009, 1:49am

Hi,

I’m looking to do 2D cross correlation on some image sets. Typical image resolution is VGA, with maybe a 100x200 template. I’m looking into OpenVIDIA, but it appears to support only small templates. I was hoping somebody could comment on the availability of any libraries or example code for my task and, if there are none, on the suitability of the task for GPU acceleration. I am currently using the Intel IPP ippiCrossCorr function, but I’m hoping to accelerate it with CUDA.

Thanks,

Smokey · January 14, 2009, 2:28am

Speaking from personal experience, a 2D normalised cross-correlation can be implemented relatively efficiently in CUDA, although I’ve yet to get ‘significant’ performance improvements over IPP: depending on the pixel format of the image/template and the size of the template, I’m anywhere from 20% faster to 20% slower. So yes, you’d likely see some gains from using CUDA.

As for existing libraries that implement this, I’m not aware of any…

seddona · January 14, 2009, 9:51am

20% over IPP is a bit disappointing. Have you any idea where the bottleneck is?

FYI, my images are 8-bit greyscale and the correlation doesn’t need to be normalised. I could in fact get away with sum of absolute differences; perhaps there is code for this, as it’s a common video compression step.

papag62 · January 14, 2009, 1:28pm

Here is a stereo matching CUDA kernel that uses SSD. It may be of use to you. It is from NVIDIA.
http://sourceforge.net/project/showfiles.php?group_id=98913&package_id=283815

bog · January 20, 2009, 11:42am

You could use the __usad intrinsic. See the documentation for it. It’s pretty fast.
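For readers unfamiliar with it, __usad(x, y, z) returns |x - y| + z for unsigned integers, so it accumulates a sum of absolute differences in one instruction. The following is only a CPU reference sketch in Python of the search such a kernel would perform, not CUDA code from this thread; the names usad and sad_match are made up for illustration.

```python
def usad(x, y, z):
    """Mirror of the CUDA intrinsic __usad(x, y, z): |x - y| + z."""
    return abs(x - y) + z


def sad_match(image, template):
    """Return (row, col, sad) of the placement with the smallest SAD.

    image and template are lists of rows of 8-bit greyscale values.
    On a GPU, each (row, col) placement would typically be one thread.
    """
    ih, iw = len(image), len(image[0])
    th, tw = len(template), len(template[0])
    best = None
    for r in range(ih - th + 1):
        for c in range(iw - tw + 1):
            acc = 0
            for i in range(th):
                for j in range(tw):
                    acc = usad(image[r + i][c + j], template[i][j], acc)
            if best is None or acc < best[2]:
                best = (r, c, acc)
    return best


image = [
    [10, 10, 10, 10, 10],
    [10, 50, 60, 10, 10],
    [10, 70, 80, 10, 10],
    [10, 10, 10, 10, 10],
]
template = [[50, 60], [70, 80]]
print(sad_match(image, template))  # (1, 1, 0)
```

A SAD of 0 means an exact match; unlike correlation, the best match is the minimum, not the maximum.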

DaManu · January 20, 2009, 7:36pm

Perhaps look for another algorithm?
I implemented a really fast 2D correlation function with CUDA’s FFT lib.
Good old style: iFFT(FFT(image) * conj(FFT(what you’re looking for)))
Do normalization on phase, then search for maxima, then do a normalized traditional correlation on the result to verify the match.
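As a rough illustration of the FFT approach, here is a stdlib-only Python sketch (a CPU reference, not DaManu's code and not cuFFT). Note that the correlation theorem needs the complex conjugate of the template spectrum; a plain product gives a convolution instead. Both inputs are zero-padded to a power-of-two size so circular wrap-around does not pollute the valid region. All function names here are assumptions for illustration, and the phase normalization and final verification steps are left out.

```python
import cmath


def fft(x, inverse=False):
    """Recursive radix-2 FFT; len(x) must be a power of two. Inverse is unscaled."""
    n = len(x)
    if n == 1:
        return list(x)
    even = fft(x[0::2], inverse)
    odd = fft(x[1::2], inverse)
    sign = 1 if inverse else -1
    out = [0j] * n
    for k in range(n // 2):
        t = cmath.exp(sign * 2j * cmath.pi * k / n) * odd[k]
        out[k] = even[k] + t
        out[k + n // 2] = even[k] - t
    return out


def fft2(a, inverse=False):
    """2D FFT: transform every row, then every column."""
    rows = [fft(row, inverse) for row in a]
    cols = [fft(list(col), inverse) for col in zip(*rows)]
    return [list(row) for row in zip(*cols)]


def next_pow2(n):
    p = 1
    while p < n:
        p *= 2
    return p


def pad(a, h, w):
    """Zero-pad a 2D list to h rows by w columns."""
    return [[a[r][c] if r < len(a) and c < len(a[0]) else 0.0
             for c in range(w)] for r in range(h)]


def xcorr_fft(image, template):
    """Unnormalised cross-correlation via iFFT(FFT(image) * conj(FFT(template)))."""
    h = next_pow2(len(image) + len(template) - 1)
    w = next_pow2(len(image[0]) + len(template[0]) - 1)
    F = fft2(pad(image, h, w))
    G = fft2(pad(template, h, w))
    prod = [[F[r][c] * G[r][c].conjugate() for c in range(w)] for r in range(h)]
    corr = fft2(prod, inverse=True)
    return [[v.real / (h * w) for v in row] for row in corr]  # scale the inverse


def find_peak(corr, image, template):
    """Location of the maximum over placements where the template fits entirely."""
    rows = len(image) - len(template) + 1
    cols = len(image[0]) - len(template[0]) + 1
    return max(((r, c) for r in range(rows) for c in range(cols)),
               key=lambda rc: corr[rc[0]][rc[1]])


# 8x8 black image with the template pasted at row 2, column 3.
template = [[1.0, 2.0], [3.0, 4.0]]
image = [[0.0] * 8 for _ in range(8)]
for i in range(2):
    for j in range(2):
        image[2 + i][3 + j] = template[i][j]

corr = xcorr_fft(image, template)
print(find_peak(corr, image, template))  # (2, 3)
```

On real images the unnormalised peak is biased towards bright regions, which is presumably why a normalised check on the candidate maxima is suggested as the last step.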

Smokey · January 20, 2009, 11:13pm

I should note that my kernels still have room for optimization. They’re currently compute bound: each thread has a loop that iterates 9*9 times (my case is a 9x9 template against 30x30 source regions).

That kernel currently has only 66% occupancy, and I could use more tricks to share results between threads. Fixing these would (a) give me more processing power to work with, and (b) reduce the kernel’s instruction count by up to 9 times in this case.

I’m guessing I could probably get double the speed of IPP if it were a priority for me - and far more than double the speed for larger source/template sizes - however the cross correlation in my application isn’t the largest bottleneck - thus I haven’t given it that much attention.
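The kernel itself was not posted, so the following is only a guess at the per-thread work being described: a stdlib-only Python sketch in which ncc_at does what a single thread would do (one full pass over the template for one placement). The function names and the exact form of the normalisation (no mean subtraction) are assumptions.

```python
import math


def ncc_at(image, template, r, c):
    """Normalised cross-correlation score for the placement at (r, c).

    This inner double loop is the 9*9-iteration loop mentioned above
    when the template is 9x9.
    """
    th, tw = len(template), len(template[0])
    dot = img_energy = tpl_energy = 0.0
    for i in range(th):
        for j in range(tw):
            a = image[r + i][c + j]
            b = template[i][j]
            dot += a * b
            img_energy += a * a
            tpl_energy += b * b  # identical for every placement
    denom = math.sqrt(img_energy * tpl_energy)
    return dot / denom if denom > 0 else 0.0


def ncc_map(image, template):
    """Score every valid placement; on a GPU each entry would be one thread."""
    rows = len(image) - len(template) + 1
    cols = len(image[0]) - len(template[0]) + 1
    return [[ncc_at(image, template, r, c) for c in range(cols)]
            for r in range(rows)]


image = [
    [10, 10, 10, 10, 10],
    [10, 50, 60, 10, 10],
    [10, 70, 80, 10, 10],
    [10, 10, 10, 10, 10],
]
template = [[50, 60], [70, 80]]
scores = ncc_map(image, template)
best = max(((r, c) for r in range(len(scores)) for c in range(len(scores[0]))),
           key=lambda rc: scores[rc[0]][rc[1]])
print(best)  # (1, 1)
```

The sketch also shows where sharing could help: the template energy is recomputed for every placement, and neighbouring placements read overlapping image pixels.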

seddona · March 11, 2009, 5:57pm

Hi DaManu, is there any chance of posting your code? If not, what sort of performance are you seeing?

Ashutosh_Gupta · January 20, 2011, 4:48am

Hi, I have implemented NCC on the GPU using cuFFT. For smaller template and image sizes I am getting a 1.5~2X speedup, and for larger sizes a speedup of 6X is achievable. My observation is that if you do the correlation on the GPU and the normalization on the CPU, the results are good.

Small: image = 32x32, template = 16x16
Larger: image = 512x512, template = 64x64

RyuKa · May 18, 2011, 12:05pm

What is the easiest way to implement the classical iFFT[FFT(images1)*FFT(images2)]?

RyuKa · May 19, 2011, 7:05am

Another question:

Does anyone have a link to an example where a correlation is done with CUDA?

HenrikAndresen · May 19, 2011, 7:52am

The typical difference between correlation in the time domain and correlation in the frequency domain is going from O(n^2) to O(n log2(n)), assuming you can zero-pad your data to a size of 2^k.
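To put rough numbers on that for the original question (a VGA image with a 100x200 template, assumed here to mean 100 wide by 200 high), the Python sketch below compares a count of multiply-accumulates for the direct method with a very crude FFT cost model. The model (three 2D FFTs at N log2 N each plus one pointwise product) is an assumption; it ignores constant factors, complex arithmetic and memory traffic, so treat the ratio as an order-of-magnitude hint only.

```python
import math


def next_pow2(n):
    """Smallest power of two that is >= n."""
    p = 1
    while p < n:
        p *= 2
    return p


def direct_macs(iw, ih, tw, th):
    """Multiply-accumulates for direct correlation over all valid placements."""
    return (iw - tw + 1) * (ih - th + 1) * tw * th


def fft_ops(iw, ih, tw, th):
    """Crude cost model: three padded 2D FFTs plus one pointwise product."""
    pw = next_pow2(iw + tw - 1)
    ph = next_pow2(ih + th - 1)
    n = pw * ph
    return 3 * n * int(math.log2(n)) + n


d = direct_macs(640, 480, 100, 200)
f = fft_ops(640, 480, 100, 200)
print(d)  # 3040420000
print(f)  # 63963136
print(round(d / f, 1))
```

For a small template such as 9x9 the direct count shrinks dramatically while the padded FFT size barely changes, which fits the reports earlier in the thread that the best method depends on template size.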
