Low or normal performance?

geohei · October 29, 2020, 5:17pm

Shouldn’t this be (*iters)++?
With that one, I get about 220.000 million kernel calls/second.
But that is still not what I expect; see below.
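For anyone comparing the two variants, here is a minimal sketch of the difference, assuming a single counter in global memory that every thread increments once. The kernel and helper names (count_plain, count_atomic, run_counter) are made up for illustration and are not the code from this thread. As far as I understand it, the plain (*iters)++ is an unsynchronized read-modify-write, so increments from different threads can overwrite each other and the final count can end up below the number of threads launched, while atomicAdd() counts every thread. Error checking is left out to keep it short.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernels for illustration only; not the original poster's code.

// Plain increment: an unsynchronized read-modify-write on shared global memory.
// Threads can read the same old value, so some increments may be lost.
__global__ void count_plain(unsigned long long *iters) {
    (*iters)++;
}

// Atomic increment: each thread's update is applied exactly once.
__global__ void count_atomic(unsigned long long *iters) {
    atomicAdd(iters, 1ULL);
}

// Launches one of the kernels and returns the final counter value.
unsigned long long run_counter(bool use_atomic, int blocks, int threads) {
    unsigned long long *d_iters = nullptr;
    unsigned long long h_iters = 0;
    cudaMalloc(&d_iters, sizeof(h_iters));
    cudaMemset(d_iters, 0, sizeof(h_iters));
    if (use_atomic) {
        count_atomic<<<blocks, threads>>>(d_iters);
    } else {
        count_plain<<<blocks, threads>>>(d_iters);
    }
    // cudaMemcpy waits for the kernel to finish before copying.
    cudaMemcpy(&h_iters, d_iters, sizeof(h_iters), cudaMemcpyDeviceToHost);
    cudaFree(d_iters);
    return h_iters;
}

int main() {
    const int blocks = 1024;
    const int threads = 256;
    printf("threads launched: %d\n", blocks * threads);
    printf("plain  (*iters)++ : %llu\n", run_counter(false, blocks, threads));
    printf("atomic atomicAdd(): %llu\n", run_counter(true, blocks, threads));
    return 0;
}
```

If the plain version reports a higher rate, part of that may simply be because it is not counting every increment, so the two numbers are not directly comparable.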

OK, “kernel calls/second” might not be the usual term (sorry, I’m new to CUDA :), but how should I phrase it correctly?

Have a look at this example I found:
https://forums.developer.nvidia.com/t/bitslice-des-optimization/38896/48

How is it possible that this software does 23750 MH/s while my simple atomicAdd() code comes up with 1900 (whatever I should call it)? The thread above is about DES, which is far more complicated than one line of atomicAdd() or (*iters)++.

I must be missing something!
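One possible explanation, sketched below under my own assumptions: a counter that every thread hits with atomicAdd() on each iteration forces all updates through a single memory address, whereas code that does its work in registers and touches global memory only once per thread avoids that contention. I have not checked how the linked DES code is structured, so treat this as a guess. All names here (count_every_iteration, count_locally, run_count) are hypothetical, and the trivial loop in count_locally is only a stand-in for real per-thread work; the compiler is free to collapse it. Error checking is omitted.

```cuda
#include <cstdio>
#include <cuda_runtime.h>

// Hypothetical kernels for illustration only.

// Every iteration of every thread updates the same global counter.
__global__ void count_every_iteration(unsigned long long *total, int iters) {
    for (int i = 0; i < iters; ++i) {
        atomicAdd(total, 1ULL);
    }
}

// Each thread counts in a register and issues one atomicAdd at the end.
// The loop is a placeholder for real work and may be optimized away.
__global__ void count_locally(unsigned long long *total, int iters) {
    unsigned long long local = 0;
    for (int i = 0; i < iters; ++i) {
        local++;
    }
    atomicAdd(total, local);
}

// Runs one kernel, times it with CUDA events, and returns the final count.
unsigned long long run_count(bool accumulate_locally, int blocks, int threads,
                             int iters, float *elapsed_ms) {
    unsigned long long *d_total = nullptr;
    unsigned long long h_total = 0;
    cudaEvent_t start, stop;
    cudaMalloc(&d_total, sizeof(h_total));
    cudaMemset(d_total, 0, sizeof(h_total));
    cudaEventCreate(&start);
    cudaEventCreate(&stop);

    cudaEventRecord(start);
    if (accumulate_locally) {
        count_locally<<<blocks, threads>>>(d_total, iters);
    } else {
        count_every_iteration<<<blocks, threads>>>(d_total, iters);
    }
    cudaEventRecord(stop);
    cudaEventSynchronize(stop);
    cudaEventElapsedTime(elapsed_ms, start, stop);

    cudaMemcpy(&h_total, d_total, sizeof(h_total), cudaMemcpyDeviceToHost);
    cudaEventDestroy(start);
    cudaEventDestroy(stop);
    cudaFree(d_total);
    return h_total;
}

int main() {
    const int blocks = 256, threads = 256, iters = 1000;
    float ms = 0.0f;
    unsigned long long n = run_count(false, blocks, threads, iters, &ms);
    printf("atomic per iteration: %llu increments in %.3f ms\n", n, ms);
    n = run_count(true, blocks, threads, iters, &ms);
    printf("atomic per thread   : %llu increments in %.3f ms\n", n, ms);
    return 0;
}
```

Both variants should report the same total; only the elapsed time differs. The number this measures is better described as increments (or atomic updates) per second than kernel calls per second, since each run is a single kernel launch.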
