OpenCV face detection speedup in cvIntegral

nimals1986 · September 4, 2008, 12:15pm

Hi, I have been investigating the possibility of speeding up OpenCV face detection.

The main face detection work is done by the cvHaarDetectObjects function. Going through that function, most of it deals with data structures, and at one point it calls cvIntegral.
That seems to be an important step in the face detection (is it the Haar transform?).
Also, the cvIntegral function didn't seem clear to me. I tried googling how it works, and while doing so I came across a developer who has implemented cvIntegral in a simple manner:

Link (cvIntegral): http://opencv.blogspot.com/2005/04/cvintegral-on-32-bit-floating-point_16.html
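For readers equally unsure what cvIntegral computes: it builds an integral image (a summed-area table, as a later reply in this thread points out), which is a lookup table for rectangle sums rather than a Haar transform. Below is a minimal sketch in plain Python, not OpenCV's implementation. The names `integral_image`, `rect_sum` and `two_rect_feature` are made up for illustration, and the two-rectangle feature is only a simplified stand-in for the Haar-like features the detector evaluates. The table has one extra zero row and column, the same (height+1) x (width+1) layout that cvIntegral produces.

```python
def integral_image(img):
    """Return an (h+1) x (w+1) table where ii[y][x] is the sum of all
    pixels in rows < y and columns < x of img."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0  # running sum of the current row
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the w x h rectangle with top-left pixel (x, y): four lookups."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def two_rect_feature(ii, x, y, w, h):
    """Hypothetical edge feature: left half minus right half of a window."""
    half = w // 2
    return rect_sum(ii, x, y, half, h) - rect_sum(ii, x + half, y, half, h)


img = [[1, 2, 3, 4],
       [5, 6, 7, 8],
       [9, 10, 11, 12]]
ii = integral_image(img)
print(rect_sum(ii, 1, 1, 2, 2))          # 6 + 7 + 10 + 11 = 34
print(two_rect_feature(ii, 0, 0, 4, 3))  # 33 - 45 = -12
```

The cost of `rect_sum` does not depend on the rectangle size, which is why the detector can afford to evaluate many features per window.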

Making this code CUDA-enabled would help speed up the face detection, I feel.

But frankly I am finding it tough to get into CUDA programming.

Can you help me get started with the process? We could also work together :-)


nimals1986 · September 8, 2008, 9:08am

Hi, I timed the cvIntegral and cvCanny functions together for the facedetect demo in OpenCV that uses the Lena image. Together they took only about 5 to 6 ms of the roughly 200 ms average detection time.

Then I counted the calls to cvRunHaarClassifierCascade for the same example, and it is called almost 184254 times for the Lena image.

Since it iterates over the entire image with the cascade and checks each position for a face pattern, it gets that many calls.

This could be made to run in CUDA, couldn't it? Multiple processors could do the cascade comparison on different parts of the image simultaneously, which should give a very good speedup.

Does anyone have suggestions? Is it possible to convert cvRunHaarClassifierCascade into a parallel process?
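The independence this idea relies on can be sketched as follows. This is a toy illustration in plain Python, not OpenCV's cascade: `toy_cascade`, its stage thresholds, the window size and the stride are all invented for the example. The only point is that each window's result depends on the read-only integral image and its own (x, y), so the loop over windows is a plain map that could be handed to separate GPU threads. The thread pool here just demonstrates that the windows can be evaluated in any order; it is not meant to be fast.

```python
from concurrent.futures import ThreadPoolExecutor


def integral_image(img):
    """(h+1) x (w+1) table; ii[y][x] is the sum of img rows < y, cols < x."""
    h, w = len(img), len(img[0])
    ii = [[0] * (w + 1) for _ in range(h + 1)]
    for y in range(h):
        row_sum = 0
        for x in range(w):
            row_sum += img[y][x]
            ii[y + 1][x + 1] = ii[y][x + 1] + row_sum
    return ii


def rect_sum(ii, x, y, w, h):
    """Sum of the w x h rectangle with top-left pixel (x, y)."""
    return ii[y + h][x + w] - ii[y][x + w] - ii[y + h][x] + ii[y][x]


def toy_cascade(ii, x, y, size, stage_thresholds):
    """Hypothetical stand-in for a classifier cascade: the window must pass
    every stage, and evaluation stops at the first stage it fails."""
    window_sum = rect_sum(ii, x, y, size, size)
    for threshold in stage_thresholds:
        if window_sum < threshold:
            return False  # rejected early, later stages never run
    return True


def detect(img, size, stride, stage_thresholds, workers=4):
    """Evaluate every window position independently and return the hits."""
    ii = integral_image(img)  # built once, only read afterwards
    h, w = len(img), len(img[0])
    windows = [(x, y)
               for y in range(0, h - size + 1, stride)
               for x in range(0, w - size + 1, stride)]
    with ThreadPoolExecutor(max_workers=workers) as pool:
        hits = list(pool.map(
            lambda pos: toy_cascade(ii, pos[0], pos[1], size, stage_thresholds),
            windows))
    return [pos for pos, hit in zip(windows, hits) if hit]


img = [[0, 0, 0, 0],
       [0, 0, 0, 0],
       [0, 0, 9, 9],
       [0, 0, 9, 9]]
print(detect(img, size=2, stride=1, stage_thresholds=[10, 30]))  # [(2, 2)]
```

One caveat for a GPU port: because most windows are rejected in an early stage, the amount of work per window is very uneven, so neighbouring threads may finish at quite different times.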

nimals1986 · September 8, 2008, 9:43am

I came across this cool site that very nicely shows how the OpenCV facedetect demo works:

http://morph.cs.st-andrews.ac.uk/fof/haarDemo/index.html

nimals1986 · September 11, 2008, 11:00am

Hi, I did some experimenting with gprof on the OpenCV facedetect demo and found the most time-consuming functions. Here is the top of the list:

  %   cumulative    self                self     total
 time    seconds   seconds     calls  ms/call  ms/call  name
78.38       0.29      0.29   3101044     0.00     0.00  icvEvalHidHaarClassifier(CvHidHaarClassifier*, double, unsigned int)
10.81       0.33      0.04    184588     0.00     0.00  cvRunHaarClassifierCascade
 2.70       0.34      0.01     40689     0.00     0.00  icvXMLParseTag(CvFileStorage*, char*, CvStringHashNode**, CvAttrList**, int*)

It's a huge list actually, so I am attaching the full profile here.

Can CUDA be used to parallelize the algorithm?
gprofile.txt (46.8 KB)
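A quick back-of-the-envelope reading of the three rows above, as a sketch only. It assumes the gprof self-time percentages are representative (they are sampled over a short run) and it ignores host-to-device transfer costs that a CUDA port would add. `amdahl_speedup` is a helper written for this example and simply applies Amdahl's law.

```python
def amdahl_speedup(accelerated_fraction, factor):
    """Overall speedup when `accelerated_fraction` of the runtime is made
    `factor` times faster and the rest is left unchanged (Amdahl's law)."""
    return 1.0 / ((1.0 - accelerated_fraction) + accelerated_fraction / factor)


# Call counts taken from the gprof rows quoted above.
eval_calls = 3101044      # icvEvalHidHaarClassifier
cascade_calls = 184588    # cvRunHaarClassifierCascade

# Average number of classifier evaluations per cascade run (per window).
evals_per_window = eval_calls / cascade_calls
print(round(evals_per_window, 1))  # 16.8

# Share of self time spent in the two cascade functions together.
cascade_fraction = (78.38 + 10.81) / 100

# Ceiling on the overall speedup if only that part became infinitely fast.
upper_bound = 1.0 / (1.0 - cascade_fraction)
print(round(upper_bound, 2))  # 9.25

# A more modest assumption: the cascade part runs 10x faster on the GPU.
print(round(amdahl_speedup(cascade_fraction, 10), 2))  # 5.07
```

Read this way, the cascade evaluation accounts for roughly 89% of the profiled time, so it is the part worth parallelizing, while the remaining 11% caps the achievable overall speedup at about 9x under these assumptions.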

dbancajas · June 11, 2009, 9:25am

Hi,

Is anybody still in need of this? I am actually working on a project porting the Viola-Jones detector to CUDA. I hope I get it right.

Miguel_Mesa · August 17, 2009, 4:48am

Hi dbancajas, I’m really interested in having it ported to CUDA. Are you still around?

Thanks in advance!

ElecStu · January 3, 2011, 11:51pm

Hi,
Has anyone managed to parallelize OpenCV face detection?

I am trying to write an OpenCL version of OpenCV face detection. Any suggestions would be very helpful!

Thank you very much :)

Simon_Green · January 4, 2011, 5:49pm

Integral images are also known as summed-area tables (SATs), and are being used here to quickly sum all the values in a particular rectangle. You can compute SATs efficiently in CUDA using a parallel prefix sum; there’s a demo of this in the CUDPP library:

http://gpgpu.org/static/developer/cudpp/rel/cudpp_1.1/html/index.html

I agree it would be cool to have a fast face detection demo in CUDA!
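The prefix-sum formulation mentioned above can be sketched in a few lines. This is plain sequential Python, not CUDPP or CUDA code, and `summed_area_table` is a name chosen for the example. It shows the decomposition that makes the problem GPU-friendly: one inclusive scan along every row, then one inclusive scan down every column. Within each pass the rows (or columns) are independent of each other, and each individual scan is exactly the operation a parallel prefix-sum primitive provides.

```python
from itertools import accumulate


def summed_area_table(img):
    """Inclusive summed-area table: sat[y][x] is the sum of img[0..y][0..x]."""
    # Pass 1: prefix sum along each row (rows are independent of each other).
    row_scanned = [list(accumulate(row)) for row in img]
    # Pass 2: prefix sum down each column (columns are independent too).
    # zip(*...) transposes, so each column can be scanned like a row.
    col_scanned = [list(accumulate(col)) for col in zip(*row_scanned)]
    # Transpose back to row-major order.
    return [list(row) for row in zip(*col_scanned)]


img = [[1, 2, 3],
       [4, 5, 6]]
print(summed_area_table(img))  # [[1, 3, 6], [5, 12, 21]]
```

Note that this table is inclusive and has no extra zero row and column, so it is the same size as the input image.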

LeeL · January 4, 2011, 7:01pm

There is also a summed area table example in the Thrust examples.
http://code.google.com/p/thrust/

Download the examples .zip and check out:
summed_area_table.cu
