Does CUDA 2.0 support recursion? ... and ray tracing doubts.

korax · April 19, 2008, 10:24pm

Does CUDA 2.0 support recursion?

If not, is it due to a hardware or CUDA limitation?

I was hoping to implement a ray tracer and attempt to parallelize it across an SLI system. The algorithm becomes recursive when it's time to perform reflection, refraction, etc. As far as I know, lacking recursion would be a major problem.

Does anyone have thoughts on this?

chris22 · April 19, 2008, 10:36pm

No, recursion is not supported, and it is mainly a hardware limitation. A call stack that could get very deep would have to live in local memory, which is very slow, rather than in registers. CUDA does give the programmer full control over shared memory, so perhaps you could implement your own recursion stack in the multiprocessor's shared memory. Alternatively, you could launch the same kernel repeatedly from the host side with different data.

gonnet · April 20, 2008, 12:33am

Just a guess: given that the compiler has no real notion of a function call and everything is inlined, recursion seems hard to support (there is indeed no stack in the usual sense).

Perhaps this has changed since CUDA 1.x; please correct me if I’m wrong.

But in theory, can’t every recursive function be rewritten as a non-recursive one? (That’s really an algorithmic question.)

Cédric

seibert · April 20, 2008, 1:50am

This is certainly true, though some recursive algorithms will need a stack-like data structure in the non-recursive form. For example, a parser which recognizes strings described by a context-free grammar requires a stack.

You might be thinking about tail-recursive algorithms. These sorts of algorithms can be converted directly to a loop without any stack.
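To make the tail-recursion point concrete, here is a minimal sketch in plain C++ (not CUDA-specific). Greatest common divisor is used only as a familiar example; the function names are illustrative and not from any post in this thread.

```cpp
// Tail-recursive form: the recursive call is the very last operation,
// so nothing has to be remembered after it returns.
unsigned gcd_recursive(unsigned a, unsigned b) {
    if (b == 0) return a;
    return gcd_recursive(b, a % b);
}

// The same algorithm as a loop: the arguments of the tail call simply
// become assignments to the loop variables, and no stack is needed.
unsigned gcd_loop(unsigned a, unsigned b) {
    while (b != 0) {
        unsigned next = a % b;
        a = b;
        b = next;
    }
    return a;
}
```

A ray tracer that spawns both a reflection and a transmission ray at each hit is not tail-recursive, which is why it falls into the first category and needs some stack-like or queue-like structure.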

korax · April 20, 2008, 4:03am

This is the particular ray tracing algorithm at its core. It’s pretty easy to follow, but I’ve never tried to go from recursion to a stack.

color spawnRay(ray, depth) {

    // Code to determine if ray hit an object
    // ...

    // If ray didn't intersect anything, just return background color
    if (!intersection) {
        return background_color;

    // There has been an intersection
    } else {

        // MAX_DEPTH sets a limit to the recursion depth (10 is fine)
        if (depth < MAX_DEPTH) {

            // Spawn reflection ray if intersected object is reflective
            if (kr > 0) {
                retcolor += spawnRay(reflect_ray, depth+1);
            }

            // Spawn transmission ray if intersected object is transparent
            if (kt > 0) {
                retcolor += spawnRay(trans_ray, depth+1);
            }
        }
    }
    return retcolor;
}

Skribtsov · April 20, 2008, 6:24pm

From programming theory, any recursion can be replaced with processing of a tree-structured data set. Imagine that the rays form a tree. Proceed in iterations: cast all known rays, determine the new rays that need to be computed and put them into the nodes of the tree, then cast again. Repeat until you’re done. Once the tree is complete, process all nodes to integrate the information and form the final screen pixel values.

hieronymus · April 21, 2008, 1:55pm

I implemented a chess engine, which normally uses a lot of recursion (google the alpha-beta algorithm). I simulate recursion by implementing my own stack in shared memory together with a state machine. GpuChess

cronus98 · April 22, 2008, 4:05am

I just finished a CUDA ray tracer for a master’s project, and doing it iteratively wasn’t that bad. I just maintain two arrays: a ‘current’ array and a ‘new’ array. I loop through the current array, throwing any reflective or refractive rays into the ‘new’ array. Once the loop is done, I copy ‘new’ to ‘current’ and start over. I’ll be putting my code and report online in about a week.

cronus98 · July 10, 2008, 2:18am

I should have done this before, but my project is available at:

CUDA Ray Tracing Master’s Project

E.D_Riedijk · July 10, 2008, 4:26am

Very nice, I think a lot of people can base their work on this. I would like to have it when starting my raytrace-work ;)

Just a remark, if you are only using triangles (which it looked like), you can use a faster intersection algorithm than counting the number of times you cross a line of a polygon. It uses barycentric coordinates.

Ray-Triangle Intersection II: http://www.cs.princeton.edu/courses/archiv...cast/sld019.htm
http://www.cs.virginia.edu/~gfx/Courses/2003/ImageSynthesis/papers/Acceleration/Fast%20MinimumStorage%20RayTriangle%20Intersection.pdf
http://www.cs.lth.se/home/Tomas_Akenine_Moller/raytri/raytri.c
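For readers who do not want to follow the links, here is a compact sketch of a barycentric ray-triangle test in the style usually called Möller-Trumbore. It is written from the general algorithm rather than copied from the linked files, and `Vec3`, the helper functions and the epsilon value are assumptions chosen for the example.

```cpp
#include <cmath>

struct Vec3 {
    float x, y, z;
};

static Vec3 sub(Vec3 a, Vec3 b) { return {a.x - b.x, a.y - b.y, a.z - b.z}; }
static Vec3 cross(Vec3 a, Vec3 b) {
    return {a.y * b.z - a.z * b.y, a.z * b.x - a.x * b.z, a.x * b.y - a.y * b.x};
}
static float dot(Vec3 a, Vec3 b) { return a.x * b.x + a.y * b.y + a.z * b.z; }

// Returns true if the ray orig + t * dir hits triangle (v0, v1, v2) with
// t > 0. On a hit, t is the distance along the ray and (u, v) are the
// barycentric coordinates of the hit point relative to v1 and v2.
bool ray_triangle(Vec3 orig, Vec3 dir, Vec3 v0, Vec3 v1, Vec3 v2,
                  float& t, float& u, float& v) {
    const float EPS = 1e-7f;  // assumed tolerance
    Vec3 e1 = sub(v1, v0);
    Vec3 e2 = sub(v2, v0);
    Vec3 p = cross(dir, e2);
    float det = dot(e1, p);
    if (std::fabs(det) < EPS) return false;  // ray parallel to the triangle plane
    float inv = 1.0f / det;

    Vec3 s = sub(orig, v0);
    u = dot(s, p) * inv;
    if (u < 0.0f || u > 1.0f) return false;  // outside along the first edge

    Vec3 q = cross(s, e1);
    v = dot(dir, q) * inv;
    if (v < 0.0f || u + v > 1.0f) return false;  // outside the triangle

    t = dot(e2, q) * inv;
    return t > EPS;  // reject hits behind the ray origin
}
```

The early exits on `u` and `v` are part of what makes this cheap: most rays miss most triangles, and they are rejected before the remaining dot and cross products are computed.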

st5486 · March 6, 2009, 1:24pm

I’m confused. Can someone help explain cronus98’s code and how it works? I’ve tried adapting his code to work within my ray tracer, but I’m having trouble: it works for reflection and everything else apart from refraction. I think this is because I need to restructure the colouring of transparent objects, and my colouring code looks different from his, but I am not too familiar with ray tracing or how to fix it. Can anyone help by looking at my code below?

[codebox]__device__ RGBColour shade(ShadeRec& sr, const RGBColour& backgroundColour)
{
	RGBColour L;
	L.r = 0; L.g = 0; L.b = 0;
	RGBColour temp, temp2, temp3;
	temp.r = 0; temp.g = 0; temp.b = 0;
	temp2.r = 0; temp2.g = 0; temp2.b = 0;
	int i = 0;
	float3 wo, wi, wt;
	RGBColour fr, ft;
	// Ray queues: currRays holds the rays traced in this bounce,
	// newRays collects the rays they spawn. Note that there is no bounds
	// check, and a transparent hit can add two rays per current ray.
	Ray newRays[16];
	Ray currRays[16];
	ShadeRec prevSr;

	temp3 = backgroundColour;
	int numNewRays = 0;
	int numCurrRays = 0;

	// Shade the primary hit and queue its secondary rays.
	// material.y: 0 = Lambert, 1 = specular, 2 = reflective, otherwise transparent.
	if(sr.material.y == 0)
		temp += lambertShade(sr);
	else if(sr.material.y == 1)
		temp += specularShade(sr);
	else if(sr.material.y == 2)
	{
		temp += specularShade(sr);
		wo = -sr.ray.d;
		fr = reflectiveF(sr, wo, wi);
		temp += max_to_one(fr * temp * (dot(sr.normal, wi)));
		currRays[numCurrRays].o = sr.hitPoint;
		currRays[numCurrRays].d = wi;
		// Fix: this flag is read in the loop below, so initialise it here.
		currRays[numCurrRays].transparent = 0;
		numCurrRays++;
	}
	else
	{
		temp += specularShade(sr);
		wo = -sr.ray.d;
		fr = reflectiveF(sr, wo, wi);
		ft = transparentF(sr, wo, wt);
		currRays[numCurrRays].o = sr.hitPoint;
		currRays[numCurrRays].d = wi;
		currRays[numCurrRays].transparent = 0;
		numCurrRays++;
		if(!tir(sr))
		{
			currRays[numCurrRays].o = sr.hitPoint;
			currRays[numCurrRays].d = wt;
			currRays[numCurrRays].transparent = 1;
			numCurrRays++;
		}
	}

	// One iteration per bounce, up to the maximum depth.
	for(i = 0; i < vp.maxDepth; i++)
	{
		if(numCurrRays == 0)
			break;

		for (int currRay = 0; currRay < numCurrRays; currRay++)
		{
			prevSr = sr;
			sr = hitObjects(currRays[currRay]);

			if (sr.hitAnObject)
			{
				sr.ray = currRays[currRay];
				if(sr.material.y == 0)
					temp += lambertShade(sr);
				else if(sr.material.y == 1)
					temp += specularShade(sr);
				else if(sr.material.y == 2)
				{
					temp += specularShade(sr);
					wo = -sr.ray.d;
					fr = reflectiveF(sr, wo, wi);
					newRays[numNewRays].o = sr.hitPoint;
					newRays[numNewRays].d = wi;
					newRays[numNewRays].transparent = 0;
					numNewRays++;
					temp += max_to_one(fr * temp * (dot(sr.normal, wi)));
				}
				else
				{
					wo = -sr.ray.d;
					fr = reflectiveF(sr, wo, wi);
					ft = transparentF(sr, wo, wt);
					temp += specularShade(sr);
					if(tir(sr))
					{
						// Total internal reflection: only the reflected ray continues.
						newRays[numNewRays].o = sr.hitPoint;
						newRays[numNewRays].d = wi;
						// Fix: set the flag before incrementing; the original
						// incremented first and wrote to the next, unused slot.
						newRays[numNewRays].transparent = 0;
						numNewRays++;
					}
					else
					{
						// Reflected ray
						newRays[numNewRays].o = sr.hitPoint;
						newRays[numNewRays].d = wi;
						newRays[numNewRays].transparent = 0;
						numNewRays++;
						temp2 += max_to_one(fr * temp * fabs(dot(sr.normal, wi)));

						// Transmitted ray
						newRays[numNewRays].o = sr.hitPoint;
						newRays[numNewRays].d = wt;
						newRays[numNewRays].transparent = 1;
						numNewRays++;
						temp2 += max_to_one(ft * temp * fabs(dot(sr.normal, wt)));
						temp = temp2;
					}
				}
			}
			else
			{
				// The ray left the scene: add the background seen via the previous hit.
				if(prevSr.hitAnObject)
				{
					temp += (fr * backgroundColour * (dot(prevSr.normal, wi)));
					if(currRays[currRay].transparent == 1)
					{
						temp += max_to_one(ft * backgroundColour * fabs(dot(prevSr.normal, wt)));
					}
				}
			}
			//L += temp;
		}

		// The rays spawned in this bounce become the current rays of the next one.
		for (int j = 0; j < numNewRays; j++)
		{
			currRays[j] = newRays[j];
		}
		numCurrRays = numNewRays;
		numNewRays = 0;
	}
	return (temp);
}
[/codebox]
