Mixed-Precision Training of Deep Neural Networks

jwitsoe · October 11, 2017, 4:59am

Originally published at: Mixed-Precision Training of Deep Neural Networks | NVIDIA Technical Blog

Deep Neural Networks (DNNs) have lead to breakthroughs in a number of areas, including image processing and understanding, language modeling, language translation, speech processing, game playing, and many others. DNN complexity has been increasing to achieve these results, which in turn has increased the computational resources required to train these networks. Mixed-precision training lowers the required…

anon44000663 · October 12, 2017, 9:39am

May be I missed it - but what is the real speedup achieved for these models?

anon84876766 · October 15, 2017, 4:14pm

What is the accuracy drop if FP8 or FP10 are used ?

anon203379 · July 13, 2018, 2:33pm

Have not tried the mixed precision in training but we did so with simulations so can share few remarks about the overall speedup here https://medium.com/@marcroj...

anon46986622 · September 26, 2018, 3:16pm

Hi, where I can find the pre-trained models for Faster R-CNN and Multibox SSD (FP16 and FP32)?

anon136802 · October 10, 2019, 2:38pm

What's the `VGG-D` model in Table.1?

Topic		Replies	Views
Mixed-Precision Training of Deep Neural Networks Technical Blog	0	259	August 21, 2022
Boosting NVIDIA MLPerf Training v1.1 Performance with Full Stack Optimization Technical Blog	2	1211	April 3, 2022
Mixed-Precision Programming with CUDA 8 Technical Blog	1	387	February 23, 2017
The Full Stack Optimization Powering NVIDIA MLPerf Training v2.0 Performance Technical Blog	0	403	June 29, 2022
Accelerating AI Training with NVIDIA TF32 Tensor Cores Technical Blog	1	546	January 29, 2021
Perception Model Training for Autonomous Vehicles with Tensor Parallelism Technical Blog	1	185	May 2, 2024
Getting Immediate Speedups with NVIDIA A100 TF32 Technical Blog	1	460	November 15, 2020
NVIDIA's 2017 Open-Source Deep Learning Frameworks Contributions Technical Blog	0	221	August 21, 2022
Setting New Records at Data Center Scale Using NVIDIA H100 GPUs and NVIDIA Quantum-2 InfiniBand Technical Blog	0	314	November 8, 2023
Profiling and Optimizing Deep Neural Networks with DLProf and PyProf Technical Blog	13	1402	August 11, 2021

Mixed-Precision Training of Deep Neural Networks

Related topics