Parallelizing blob detection

DBacon · January 22, 2008, 2:18am

I am using CUDA to do basic industrial inspection of thin film materials on gray scale images gathered with a line scan camera.

I have a CPU-based blob implementation for finding connected regions, but it is dependent on going down the rows sequentially.

Does anybody have advice or references regarding parallelizing blob detection?

Thx

e.ping · January 22, 2008, 2:42am

I think that there are some parallel connected components analysis algorithms out there, but I haven’t investigated their applicability to CUDA. There also may be other, better approaches to finding blobs than connected components, but I don’t know about them.

If you find anything interesting about this, or want to collaborate on this, email me at geoff.langdale AT gmail.com and we’ll talk further. I’ve been meaning to try this for a while.

chris22 · April 19, 2008, 4:45am

Have any of you computer vision guys found a good way to do this with CUDA? This is a fairly canonical problem in computer vision, but most of the parallel algorithms that I have found are tightly tied to a specific architecture.

wumpus · April 21, 2008, 7:51am

A (wild) idea: you could probably find connected components recursively by doing a reduction. If that is possible, it would be very parallelizable.
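One possible reading of the reduction idea, as a CPU-side Python sketch rather than CUDA: link pixels within each row first, then merge neighbouring bands of rows pairwise, doubling the band height at each level. The function names, the 4-connectivity choice, and the union-find bookkeeping are all assumptions made for illustration and are not from the thread.

```python
def find(parent, a):
    # Follow parent links to the root, halving the path as we go.
    while parent[a] != a:
        parent[a] = parent[parent[a]]
        a = parent[a]
    return a


def union(parent, a, b):
    # Keep the smaller index as the root so results are deterministic.
    ra, rb = find(parent, a), find(parent, b)
    if ra != rb:
        parent[max(ra, rb)] = min(ra, rb)


def label_by_reduction(img):
    """Label 4-connected foreground pixels by tree-style band merging."""
    h, w = len(img), len(img[0])
    parent = list(range(h * w))

    # Leaf step: link horizontal neighbours. Every row is independent.
    for y in range(h):
        for x in range(1, w):
            if img[y][x] and img[y][x - 1]:
                union(parent, y * w + x, y * w + x - 1)

    # Reduction: merge bands of height 1, 2, 4, ... along their shared seam.
    # Seams at the same level touch disjoint pairs of bands, so they could
    # be processed concurrently; only the levels are sequential.
    band = 1
    while band < h:
        for seam in range(band, h, 2 * band):
            for x in range(w):
                if img[seam][x] and img[seam - 1][x]:
                    union(parent, seam * w + x, (seam - 1) * w + x)
        band *= 2

    # Final labels: root index + 1 for foreground, 0 for background.
    return [[find(parent, y * w + x) + 1 if img[y][x] else 0
             for x in range(w)] for y in range(h)]
```

For an image with H rows this needs about log2(H) merge levels, which is where the reduction shape comes from. Whether it maps efficiently onto CUDA is untested here.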

Ku-Sai_Sung · April 29, 2008, 5:16pm

It’s been a while since I started researching connected components labeling algorithms.
As far as I know, the best (sequential) implementation for CPU is the one shown in

http://www.springerlink.com/content/b67258v347158263/

I’ve been implementing some parallel labeling approaches in CUDA, but surprisingly the sequential implementation is still better than mine.

Basically, I’ve divided the operation into two phases: local labeling and label merging. My algorithm performs the local labeling phase, for a 1024x1024 input image, in just 1.3 ms. The problem is the merge phase, which is done entirely on the CPU. :(
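To make the two phases concrete, here is a minimal CPU-side Python sketch, not the poster's CUDA code. It assumes square tiles, 4-connectivity, and a union-find table for the merge; `local_label`, `merge_labels`, and the `tile` parameter are hypothetical names.

```python
def local_label(img, tile):
    """Phase 1: flood-fill each tile on its own, ignoring pixels outside it."""
    h, w = len(img), len(img[0])
    labels = [[0] * w for _ in range(h)]
    # Assumption: a parallel version would give each tile its own label
    # range instead of this shared counter.
    next_label = 1
    for ty in range(0, h, tile):
        for tx in range(0, w, tile):
            y1, x1 = min(ty + tile, h), min(tx + tile, w)
            for y in range(ty, y1):
                for x in range(tx, x1):
                    if not img[y][x] or labels[y][x]:
                        continue
                    labels[y][x] = next_label
                    stack = [(y, x)]
                    while stack:
                        cy, cx = stack.pop()
                        for ny, nx in ((cy + 1, cx), (cy - 1, cx),
                                       (cy, cx + 1), (cy, cx - 1)):
                            if (ty <= ny < y1 and tx <= nx < x1
                                    and img[ny][nx] and not labels[ny][nx]):
                                labels[ny][nx] = next_label
                                stack.append((ny, nx))
                    next_label += 1
    return labels, next_label


def merge_labels(labels, n_labels, tile):
    """Phase 2: union labels that touch across tile borders, then relabel."""
    h, w = len(labels), len(labels[0])
    parent = list(range(n_labels))

    def find(a):
        while parent[a] != a:
            parent[a] = parent[parent[a]]
            a = parent[a]
        return a

    def union(a, b):
        ra, rb = find(a), find(b)
        if ra != rb:
            parent[max(ra, rb)] = min(ra, rb)

    for y in range(tile, h, tile):          # horizontal tile borders
        for x in range(w):
            if labels[y][x] and labels[y - 1][x]:
                union(labels[y][x], labels[y - 1][x])
    for x in range(tile, w, tile):          # vertical tile borders
        for y in range(h):
            if labels[y][x] and labels[y][x - 1]:
                union(labels[y][x], labels[y][x - 1])

    return [[find(v) if v else 0 for v in row] for row in labels]
```

In this sketch the merge only reads pixels along tile borders and only writes to a table with one entry per local label, so its working set is much smaller than the image. That may or may not be enough to make a GPU-side merge worthwhile.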

manubot · July 9, 2008, 11:57am

Hi,

I have spent quite a lot of time thinking about a solution for doing connected components on the GPU, but I can’t come up with a correct (and faster than CPU) idea…

Has anybody made progress on this?

thanks!

Manu

ColinS · July 9, 2008, 7:24pm

If the algorithm is dependent on going down the rows sequentially, I would try this approach:

Break up the image into columns. This way, you can still run an algorithm that goes down the rows sequentially, except that now each row is much shorter, and you can process different columns in parallel with each other. It may be necessary to have some overlap between the columns, depending on the exact algorithm.
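A rough CPU-side Python sketch of this approach, under several assumptions that are not from the thread: 4-connectivity, a one-column overlap between neighbouring strips, a classic two-pass labeler inside each strip, and a union-find to stitch the strips. `label_strip`, `label_by_strips`, and `strip_width` are hypothetical names, and the thread pool is only there to show that the strips are independent.

```python
from concurrent.futures import ThreadPoolExecutor


class DisjointSet:
    def __init__(self):
        self.parent = [0]               # index 0 is reserved for background

    def make(self):
        self.parent.append(len(self.parent))
        return len(self.parent) - 1

    def find(self, a):
        while self.parent[a] != a:
            self.parent[a] = self.parent[self.parent[a]]
            a = self.parent[a]
        return a

    def union(self, a, b):
        ra, rb = self.find(a), self.find(b)
        if ra != rb:
            self.parent[max(ra, rb)] = min(ra, rb)


def label_strip(img, x0, x1):
    """Row-sequential two-pass labeling of columns [x0, x1)."""
    ds = DisjointSet()
    lab = [[0] * (x1 - x0) for _ in range(len(img))]
    for y in range(len(img)):
        for x in range(x0, x1):
            if not img[y][x]:
                continue
            up = lab[y - 1][x - x0] if y > 0 else 0
            left = lab[y][x - x0 - 1] if x > x0 else 0
            if up and left:
                ds.union(up, left)
            lab[y][x - x0] = up or left or ds.make()
    return [[ds.find(v) if v else 0 for v in row] for row in lab]


def label_by_strips(img, strip_width):
    h, w = len(img), len(img[0])
    # Each strip extends one column into its right neighbour (the overlap).
    bounds = [(x0, min(x0 + strip_width + 1, w))
              for x0 in range(0, w, strip_width)]
    with ThreadPoolExecutor() as pool:
        strips = list(pool.map(lambda b: label_strip(img, *b), bounds))

    # Offset each strip's labels so they are unique across the image.
    ds, offsets = DisjointSet(), []
    for s in strips:
        offsets.append(len(ds.parent) - 1)
        for _ in range(max(max(row) for row in s)):
            ds.make()

    # Stitch: the shared column carries a label from both neighbours.
    for i in range(len(strips) - 1):
        for y in range(h):
            a, b = strips[i][y][strip_width], strips[i + 1][y][0]
            if a and b:
                ds.union(a + offsets[i], b + offsets[i + 1])

    out = [[0] * w for _ in range(h)]
    for i, (x0, _) in enumerate(bounds):
        for y in range(h):
            for x in range(x0, min(x0 + strip_width, w)):
                v = strips[i][y][x - x0]
                out[y][x] = ds.find(v + offsets[i]) if v else 0
    return out
```

The overlap does the stitching work here: a pixel in the shared column gets a label from both strips, and equating the two is enough to join blobs that cross the boundary. With 8-connectivity the same one-column overlap should still suffice, but that case is not covered by this sketch.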
