graph traversal

can anybody help me in implementing depth first search? I am new to cuda and i don’t know how should i parallelize all the nodes? i am using stack and adjacency matrix.
thank you

Any luck with this one?