Hello, I am currently developing a GPU app. However, my GPU is slower than my CPU. What could be the problem? These are the specs of my comp and project environment:
It’s actually not that hard for a GPU to be a lot slower than a CPU.
A lot of what makes a GPU faster than a CPU depends on things like the size of the data you’re working on and how computationally intense the code is. Small data with few calculations is a poor fit for a GPU, for example. CPUs aren’t as slow as we’d like to think, so stuff like this does happen.
Hello, MutantJohn. Thanks for your reply
I’m working on a BVH CUDA ray tracing project. I use 200 blocks and 200 threads per block. The calculations include checking intersections and shadows using the BVH and calculating colors (reflections and refractions too). I think those are quite a lot of computations…
Any other opinions, please? The difference is 20 seconds for a teapot obj… (with and without CUDA)
First things first, make sure you’re compiling with the proper optimization flags. No -G or -O0 or anything like that. Balls to the wall -O2 or -O3.
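For example, the difference is just in the build line, something like this (app.cu is just a placeholder name):
// Debug build: -G disables device-code optimization, don't benchmark with this:
//   nvcc -G -O0 app.cu -o app_debug
// Optimized build, the one you should actually be timing:
//   nvcc -O3 app.cu -o app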
Next up is to profile your code itself. It’s near impossible to look at source code and go, “Oh hey, that’s a bottleneck!”
Okay, it is possible to do that! Some things are so obvious! But not all things are. So you’ll need to use nvprof or nvvp to find the slowest kernel invocations, and then you can figure out why those kernel invocations are slow.
Code where you need a big number of long-running threads, such as 1000 threads that all run for a long time, is a fiction. During my 30 years of experience I never had such a task. Real life is when you need a large number of short threads with frequent synchronization, and you need to sync not 100 times but 1,000,000 times. Those are the tasks I have always had, and I never had to code anything different.

I ran a test which showed me that CUDA is fake, or useless: it can’t do what I need, and it performs well on the tasks that I don’t need and never needed. The test is: make one vector of 1000 elements, make another vector of 200 elements, compute the cross-correlation (it can be done by 800 concurrent threads), then sync all threads, re-sort the 200-element vector, and repeat one million times. CUDA time: 24 seconds. A single-CPU application, no threading, all serial operations: 14 seconds. It is not just a failure, it is a miserable failure.

So CUDA solved only one problem, which is large matrix multiplication, like 10,000 by 10,000. I spent decades in data science and never faced such a task. I always need a concurrent/serial interleave. Stop doing fiction, start working on real problems.
// divisor_cross_single_launch.cu
// Compile: nvcc -O3 divisor_cross_single_launch.cu -o cross_test
// Run: ./cross_test
//
// WARNING: This kernel runs for maxEpochs iterations entirely on the device.
// On Windows laptop GPUs the OS driver may kill long-running kernels (TDR).
// For large maxEpochs, run on a headless Linux GPU or adjust TDR settings.
#include <cstdio>
#include <cstdlib>
#include <cuda_runtime.h>
static inline void cudaCheck(cudaError_t e, const char* file, int line) {
if (e != cudaSuccess) {
fprintf(stderr, "CUDA error %s:%d: %s\n", file, line, cudaGetErrorString(e));
exit(1);
}
}
#define CUDA_CHECK(x) cudaCheck((x), __FILE__, __LINE__)
const int nLongDataSize = 1000;
const int nShortDataSize = 200;
const int maxEpochs = 1000000;
const int PRINT_EVERY = 100000;
// Derived
const int N_OFFSETS = nLongDataSize - nShortDataSize; // number of positions to evaluate
// Simple xorshift32 RNG for device (used by thread 0 to shuffle)
__device__ unsigned int xorshift32(unsigned int& state) {
unsigned int x = state;
x ^= x << 13;
x ^= x >> 17;
x ^= x << 5;
return state = x ? x : 123456789u;
}
// Single-block kernel: each thread handles one offset index
// Uses shared memory to store per-thread stepMax values and do the reduction
__global__ void persistent_cross_kernel(const int* d_long, int* d_short,
                                        int nLong, int nShort, int epochs,
                                        int* d_out_epochMax) {
// single block only
const int tid = threadIdx.x;
const int nOffsets = nLong - nShort;
extern __shared__ int s_counts[]; // size = nOffsets * sizeof(int), provided at launch
// local registers
int local_best = 0;
// simple PRNG state for thread 0
unsigned int rng_state = 123456789u;
// bounds check: if more threads provided than offsets, idle threads just sync
bool active = (tid < nOffsets);
for (int ep = 0; ep < epochs; ++ep) {
// 1) compute stepMax for this thread's offset
int stepMax = 0;
if (active) {
int base = tid;
// compute inner dot-product: sum long[base + j] * short[j]
// Unrolling not done here to keep clarity
for (int j = 0; j < nShort; ++j) {
stepMax += d_long[base + j] * d_short[j];
}
}
// write per-thread results to shared memory
if (active) s_counts[tid] = stepMax;
// ensure all writes visible
__syncthreads();
// 2) reduction: find max across s_counts -> let thread 0 do serial reduction (nOffsets up to ~1024 ok)
int crossMax = 0;
if (tid == 0) {
// serial scan of shared memory (fast in shared mem)
int tmp = 0;
for (int k = 0; k < nOffsets; ++k) {
int v = s_counts[k];
if (v > tmp) tmp = v;
}
crossMax = tmp;
// update epoch-level max (in register)
if (crossMax > local_best) local_best = crossMax;
}
// make sure thread 0 has computed local_best before continuing
__syncthreads();
// 3) serial shuffle of d_short performed by thread 0 (Fisher-Yates)
if (tid == 0) {
// simple Fisher-Yates shuffle using xorshift rng
// Note: this is serial O(nShort) on device, but executes on GPU avoiding host launch
for (int j = nShort - 1; j > 0; --j) {
unsigned int r = xorshift32(rng_state);
int idx = (int)(r % (unsigned int)(j + 1));
// swap d_short[j] and d_short[idx]
int tmp = d_short[j];
d_short[j] = d_short[idx];
d_short[idx] = tmp;
}
}
// ensure shuffle and local_best updated before next epoch
__syncthreads();
}
// After all epochs, write result from thread 0 to global memory
if (tid == 0) {
d_out_epochMax[0] = local_best;
}
}
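For completeness, the host side wasn’t included above; a minimal launcher for the kernel as posted could look roughly like this (the main function, the rand()-based initialization, and the h_/d_ variable names are mine, not from the original test):
int main() {
    // Host-side test data: small random integers (the original test does not
    // say how the vectors are filled, so this is just a placeholder)
    int h_long[nLongDataSize], h_short[nShortDataSize];
    for (int i = 0; i < nLongDataSize; ++i)  h_long[i]  = rand() % 10;
    for (int i = 0; i < nShortDataSize; ++i) h_short[i] = rand() % 10;

    int *d_long, *d_short, *d_out;
    CUDA_CHECK(cudaMalloc(&d_long,  nLongDataSize  * sizeof(int)));
    CUDA_CHECK(cudaMalloc(&d_short, nShortDataSize * sizeof(int)));
    CUDA_CHECK(cudaMalloc(&d_out,   sizeof(int)));
    CUDA_CHECK(cudaMemcpy(d_long,  h_long,  nLongDataSize  * sizeof(int), cudaMemcpyHostToDevice));
    CUDA_CHECK(cudaMemcpy(d_short, h_short, nShortDataSize * sizeof(int), cudaMemcpyHostToDevice));

    // Single block of N_OFFSETS threads, dynamic shared memory for per-offset results.
    // (PRINT_EVERY is unused in this single-launch version.)
    persistent_cross_kernel<<<1, N_OFFSETS, N_OFFSETS * sizeof(int)>>>(
        d_long, d_short, nLongDataSize, nShortDataSize, maxEpochs, d_out);
    CUDA_CHECK(cudaGetLastError());
    CUDA_CHECK(cudaDeviceSynchronize());

    int result = 0;
    CUDA_CHECK(cudaMemcpy(&result, d_out, sizeof(int), cudaMemcpyDeviceToHost));
    printf("largest cross-correlation seen over %d epochs: %d\n", maxEpochs, result);

    cudaFree(d_long); cudaFree(d_short); cudaFree(d_out);
    return 0;
}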
That is certainly a very small problem size for a GPU. Even over a decade ago when I was teaching CUDA, I would give a general rule-of-thumb number of 10,000 threads minimum, for an interesting problem. Nowadays, many/most modern GPUs will prefer larger thread complements than that.
A serial scan in shared memory is generally not fast compared to parallel approaches. A parallel max-finding reduction is not difficult to code. It may or may not impact overall performance much; I haven’t studied your code in any detail.
So 2 of the 3 steps in your code are coded in an entirely serial fashion. A single GPU thread will not be faster than equivalent CPU computation.
As already mentioned, using only a single thread block is not how to get performance out of a GPU.
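To expand on the reduction point above: the serial thread-0 scan could be replaced with a tree-style max reduction in shared memory, roughly like the sketch below. This is a sketch, not drop-in code for your kernel; blockMax and SMEM_PADDED are names I’m introducing here, and it assumes the shared array is padded up to a power of two (e.g. 1024 entries for 800 offsets) and that the values are non-negative, as in your test.
// Sketch: block-wide max reduction to replace the serial scan done by thread 0.
// Assumes the dynamic shared array holds SMEM_PADDED ints (next power of two
// >= nOffsets) and that all values are >= 0, as in the posted test.
#define SMEM_PADDED 1024

__device__ int blockMax(int* s_vals, int tid, int nOffsets, int myVal)
{
    // each active thread deposits its value; idle tail entries get 0
    if (tid < nOffsets) s_vals[tid] = myVal;
    for (int i = nOffsets + tid; i < SMEM_PADDED; i += blockDim.x)
        s_vals[i] = 0;
    __syncthreads();

    // classic tree reduction: halve the active range each step
    for (int s = SMEM_PADDED / 2; s > 0; s >>= 1) {
        if (tid < s) s_vals[tid] = max(s_vals[tid], s_vals[tid + s]);
        __syncthreads();
    }
    return s_vals[0];  // valid for every thread after the final sync
}
In your kernel you would launch with SMEM_PADDED * sizeof(int) bytes of dynamic shared memory and call something like crossMax = blockMax(s_counts, tid, nOffsets, stepMax); every thread then has the epoch maximum rather than only thread 0. Whether that moves the needle here I can’t say without profiling, but it shows the pattern.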
I have 2 suggestions.
If you want to code this yourself, then there may be some CUDA basics that could be useful. This online tutorial series covers a number of topics for a basic understanding of CUDA programming (including how to code parallel reductions, as mentioned above).
Since part of your inquiry here seems to be questioning the usefulness/applicability of GPUs in general, perhaps you might want to start with an already-coded library implementation of a cross-correlation. Nowadays, with the beauty of AI-driven search, you can get a pretty good start with a google search, e.g. on “cupy cross correlation”. CuPy is a GPU-accelerated library that provides a variety of signal-processing functionality (in cupyx.scipy.signal), and the google search will also point out that a corresponding CPU baseline could be done using scipy.signal.
Even if you did suggestion 2 above, you will probably find that scipy is faster below some problem size. With a side-by-side comparison, you could then explore if/when the GPU variant becomes faster (at which problem size). You might then have more confidence that certain problems are solvable using GPUs in an interesting way.
None of this directly addresses your domain of problems. If all the problems you are working on are small enough to be uninteresting on a GPU, then that is certainly a good reason not to invest in GPU acceleration. Assessing that is really the starting point for any GPU journey.
And there is no doubt that fast matrix-matrix multiply is an important factor in modern GPUs.
Here is an example of what I had in mind (suggestion 2 above). It took me less than 1 hour to create in colab, using AI-provided snippets like I suggested earlier:
import cupy as cp
import numpy as np
import scipy.signal as sp
arr_size1 = 100000
arr_size2 = 200
# Create two 1D CuPy arrays
a = cp.ones(arr_size1,np.float32)
v = cp.ones(arr_size2,np.float32)
# Perform cross-correlation with 'full' mode
result_full = cp.correlate(a, v, mode='full')
print("Full mode result:")
print(result_full)
a_sp = np.ones(arr_size1,dtype=np.float32)
v_sp = np.ones(arr_size2,dtype=np.float32)
sresult_full = sp.correlate(a_sp,v_sp, mode='full')
print(sresult_full)
%%time
result_full = cp.correlate(a, v, mode='full')
cp.cuda.runtime.deviceSynchronize()
%%time
sresult_full = sp.correlate(a_sp,v_sp, mode='full')
When I run that in colab with a T4 GPU, and an arr_size1 of 1000, I do indeed observe that the CPU is faster than the GPU. The cupy variant time is 1.11ms and the scipy variant time is ~550us. When I change arr_size1 to 100000 (100x larger), the reported cupy variant time is 1.7ms whereas the reported scipy variant time is 5ms.
You should be able to duplicate that on colab if you wish. Be sure to select the available T4 GPU on the free instance type in the Runtime menu.
The above example does not time the cost of data transfer to/from the CPU (for the cupy variant). Including that would certainly make the comparison worse for the GPU, although there may still be a size breakpoint above which the GPU is faster, or maybe not. It didn’t appear to me from your overall problem description that it would be necessary to do that; you appear to be doing repeated steps on data that is mostly already resident on the GPU, but of course I may have interpreted things incorrectly.