Using Nvidia apex on models ensemble

alex.spivakovsky · June 23, 2021, 5:18pm

Hello,
I’m trying to use the apex automatic mixed precision on an ensemble of 2 models connected serially.
I’m testing the opt_level = 02 and indeed I observe the input and output of the model internally converted to half precision.
However, after the forward step the input/output tensors used to calculate the loss are again converted to single precision.
I expect the data tensors and weights of the model to be half precision all the way, such that the optimizer works on 16 bit tensors. Is this the correct behavior?

Thank you,
Alex.

Robert_Crovella · June 23, 2021, 5:37pm

certain aspects of model training will still be done using FP32

You may wish to review this blog:

quoting from there:

In brief, the methodology is:

Ensuring that weight updates are carried out in FP32.

Topic		Replies	Views
NVIDIA Apex: Tools for Easy Mixed-Precision Training in PyTorch Technical Blog	0	427	August 25, 2020
NVIDIA's apex only for inferences Deep Learning (Training & Inference)	0	660	March 19, 2020
Is training a requirement to use apex O1 mode? Deep Learning (Training & Inference) mixed-precision	0	506	March 16, 2020
Introducing Apex: PyTorch Extension with Tools to Realize the Power of Tensor Cores Technical Blog	0	271	August 21, 2022
Use Automatic Mixed Precision on Tensor Cores in Frameworks Today Technical Blog	0	230	August 21, 2022
Mixed Precision Models Jetson AGX Orin onnx	2	621	August 30, 2022
Mixed-Precision ResNet-50 Using Tensor Cores with TensorFlow Technical Blog	2	418	March 7, 2019
Mixed-Precision Training of Deep Neural Networks Technical Blog	5	393	October 10, 2019
Mixed Precision (Tensor) vs raw FP16 / raw FP32 Compute Metrics Jetson AGX Xavier tensorrt , hw , cuda , jetson-inference	4	740	October 18, 2021
End-to-End AI for NVIDIA-Based PCs: Optimizing AI by Transitioning from FP32 to FP16 Technical Blog	0	377	April 27, 2023

Using Nvidia apex on models ensemble

Related topics