CUDA for task parallelism?

Hello, I’m a CUDA newbie trying to decide whether switching from a cluster to CUDA would work for my application. I used MPI to parallelize a Fortran 90 code to run on a Beowulf cluster by taking advantage of task parallelism. Each task assigned to a core numerically integrates a function over the surface of an element in 3D space, then increments the Gauss rule, evaluates the integral again, and checks the change against a tolerance. The evaluation of the functions being integrated is itself an adaptive procedure with no closed-form expression, which leads to loop-carried dependencies. Would this program be a good candidate for a Tesla machine with the Portland Group’s CUDA Fortran compiler?

By the way, this is for a boundary element code that requires double-precision complex arithmetic.
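
To make the structure concrete, here is a stripped-down sketch of what one task does. Everything in it is a stand-in I made up for illustration (a midpoint rule plays the role of the Gauss rule, and the integrand here is just a formula, whereas my real integrand is itself an adaptive procedure); only the outer refine-until-the-change-is-below-tolerance loop mirrors the actual code.

```fortran
module adaptive_quad
  implicit none
  integer, parameter :: dp = kind(1.0d0)
contains

  ! Stand-in for the real integrand, which in my code is itself adaptive
  ! and has no closed form.
  complex(dp) function integrand(x)
    real(dp), intent(in) :: x
    integrand = cmplx(cos(x), sin(x), kind=dp)
  end function integrand

  ! Stand-in for an n-point rule over one element (here just a midpoint
  ! rule on [a, b] instead of a Gauss rule on a 3D surface).
  complex(dp) function quad_rule(a, b, n)
    real(dp), intent(in) :: a, b
    integer, intent(in) :: n
    real(dp) :: h, x
    integer :: i
    h = (b - a) / n
    quad_rule = (0.0_dp, 0.0_dp)
    do i = 1, n
       x = a + (i - 0.5_dp) * h
       quad_rule = quad_rule + integrand(x) * h
    end do
  end function quad_rule

  ! One task: refine the rule until successive results agree to tol.
  complex(dp) function integrate_element(a, b, tol)
    real(dp), intent(in) :: a, b, tol
    complex(dp) :: prev, cur
    integer :: n
    n = 2
    prev = quad_rule(a, b, n)
    do
       n = n * 2                          ! refine (my real code increments the Gauss order)
       cur = quad_rule(a, b, n)
       if (abs(cur - prev) < tol) exit    ! converged
       prev = cur
    end do
    integrate_element = cur
  end function integrate_element

end module adaptive_quad
```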

Thanks for your help.

I’ve never seen an integrand with no known closed form. I suppose an example would be integrating the Fibonacci function (even though Fibonacci actually does have a closed form).

If that really is the case, then your parallelism will be quite limited.

If the surface is fixed, you could compute the integrand once and reuse it, which would allow massive parallelism.
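
For instance, you could evaluate the integrand at every quadrature point with one thread per point, store the values, and reuse them for every integral afterwards. A rough CUDA Fortran sketch of that idea, with a made-up integrand and array names since I haven’t seen your code:

```fortran
module precompute_kernels
  use cudafor
  implicit none
contains
  ! One thread per quadrature point: evaluate the (expensive) integrand
  ! once, store it, and reuse the stored values for every later integral.
  attributes(global) subroutine eval_integrand(npts, pts, fvals)
    integer, value :: npts
    real(8), device :: pts(npts)
    complex(8), device :: fvals(npts)
    integer :: i
    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i <= npts) then
       fvals(i) = cmplx(cos(pts(i)), sin(pts(i)), kind=8)  ! stand-in integrand
    end if
  end subroutine eval_integrand
end module precompute_kernels
```

Launched with something like call eval_integrand<<<(npts + 255)/256, 256>>>(npts, d_pts, d_fvals), where d_pts and d_fvals are device arrays, every quadrature point is evaluated concurrently.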

If you really are limited to evaluating one surface per thread, you would need thousands of surfaces to keep the GPU busy.
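
Something along these lines, again just a sketch with invented names and a stand-in rule in place of your real quadrature:

```fortran
module element_kernels
  use cudafor
  implicit none
contains
  ! One thread per element: each thread runs its own adaptive loop.
  ! Threads that need more refinement stall the rest of their warp
  ! (divergence), so throughput depends on having many elements.
  attributes(global) subroutine integrate_all(n, centers, tol, results)
    integer, value :: n
    real(8), device :: centers(n)
    real(8), value :: tol
    complex(8), device :: results(n)
    integer :: i, m
    complex(8) :: prev, cur
    i = (blockIdx%x - 1) * blockDim%x + threadIdx%x
    if (i > n) return
    m = 2
    prev = rule(centers(i), m)
    do
       m = m * 2                          ! refine the rule
       cur = rule(centers(i), m)
       if (abs(cur - prev) < tol) exit    ! converged to tolerance
       prev = cur
    end do
    results(i) = cur
  end subroutine integrate_all

  ! Stand-in for an m-point rule over one element.
  attributes(device) complex(8) function rule(c, m)
    real(8), intent(in) :: c
    integer, intent(in) :: m
    integer :: k
    rule = (0.0d0, 0.0d0)
    do k = 1, m
       rule = rule + cmplx(cos(c * k), sin(c * k), kind=8) / m
    end do
  end function rule
end module element_kernels
```

You’d launch it with something like call integrate_all<<<(n + 255)/256, 256>>>(n, d_centers, tol, d_results). With only a few hundred elements most of the GPU sits idle; with tens of thousands it has a fighting chance.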

Without seeing exactly what you’re computing, I can’t really suggest a solution.