I was researching how many threads per block and blocks per grid are used in actual training. I tried reading the TensorFlow core sources and plan to read the TensorRT white papers, but I wasn't sure if I was on the right track.
Is there a way I could find at least a hint?
The optimal launch configuration depends on the specifics of the compute kernel, so there is no single value. You'll need to identify a particular operation of interest to investigate. As for the sources, some TF native kernels use functions from tensorflow/core/util/gpu_launch_config.h to determine their launch configuration. For XLA, you might start in tensorflow/compiler/xla/service/gpu/partition_assignment.h.
Another approach is to run your network through the nvprof or Nsight Systems profilers. In the timeline view, the kernel properties will show the actual launch configuration used.
Thanks a lot. I'll dig into those :)
Is there a method to find these numbers (thread blocks and threads per block) for a model compiled with TensorRT?
If you profile the application with Nsight Compute, you can get the CUDA block and grid launch configurations and much more. https://developer.nvidia.com/blog/using-nsight-compute-to-inspect-your-kernels/
Is there a way to change these preset values?