Is FP16 running only on Volta?

Hi,

In the TensorRT documentation there is no stated condition for using FP16; only INT8 has a GPU compute-capability requirement. I tried FP16 on a TITAN Xp and a 1080 Ti, but it failed on both. While searching, I came across the statement "You need Volta for FP16." Is this right, and does it mean I cannot use FP16 in my environment?

$ ./sample_int8 mnist

FP32 run:400 batches of size 100 starting at 100
........................................
Top1: 0.9904, Top5: 1
Processing 40000 images averaged 0.00181013 ms/image and 0.181013 ms/batch.

FP16 run:400 batches of size 100 starting at 100
Engine could not be created at this precision

INT8 run:400 batches of size 100 starting at 100
........................................
Top1: 0.9908, Top5: 1
Processing 40000 images averaged 0.00140439 ms/image and 0.140439 ms/batch.

Thanks!

Looking into this now. What version of TRT are you using?

I’ve tried TRT 4.0.1.6 on the TITAN Xp and TRT 3.0.4 on the 1080 Ti, and I got the same result on both.

Hi, I am getting similar issues running the Python example. Does TensorRT 4 with FP16 work on the Tesla P4? I ran the Python test from https://devblogs.nvidia.com/tensorrt-integration-speeds-tensorflow-inference and got the error below:

For FP16: DefaultLogger Half2 support requested on hardware without native FP16 support, performance will be negatively affected.

I am using the docker image nvcr.io/nvidia/tensorflow:18.07-py3 (TensorRT 4)

Hello,

The following GPUs currently support FP16: Quadro RTX 8000, Tesla V100, Tesla P100, and NVIDIA Jetson Xavier.

regards,
NVIDIA Enterprise Support
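The pattern behind that list follows CUDA compute capability: the usual CUDA tables give native FP16 arithmetic to sm_53, sm_60, sm_62, and everything from sm_70 (Volta) up, while consumer Pascal parts such as the GTX 1080 Ti, TITAN Xp, and Tesla P4 are sm_61 and lack it. A minimal illustrative sketch of that rule (the capability sets below are assumptions drawn from the public CUDA tables, not an official TensorRT list, and `has_native_fp16` is a hypothetical helper):

```python
# Illustrative mapping from CUDA compute capability to native FP16 support.
# Assumption: sm_53 (Tegra X1), sm_60 (Tesla P100), sm_62 (Jetson TX2),
# and sm_70+ (Volta and newer) have native FP16 arithmetic units.
NATIVE_FP16_CAPS = {(5, 3), (6, 0), (6, 2)}

def has_native_fp16(major: int, minor: int) -> bool:
    """Return True if an sm_{major}{minor} GPU has native FP16 arithmetic."""
    return (major, minor) in NATIVE_FP16_CAPS or major >= 7

# sm_61 parts (GTX 1080 Ti, TITAN Xp, Tesla P4) lack native FP16,
# which matches the failed/warned FP16 engine builds reported above.
print(has_native_fp16(6, 1))  # → False  (1080 Ti / TITAN Xp / Tesla P4)
print(has_native_fp16(6, 0))  # → True   (Tesla P100)
print(has_native_fp16(7, 0))  # → True   (Tesla V100, Volta)
```

In practice, rather than hard-coding capabilities, you can ask TensorRT directly via `IBuilder::platformHasFastFp16()` in C++ (exposed as `builder.platform_has_fast_fp16` in the Python API) before requesting an FP16 engine.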

Excuse me, does the GTX 1080 Ti GPU support FP16 now?

@lpkhappy,

please refer to: https://docs.nvidia.com/deeplearning/sdk/tensorrt-support-matrix/index.html#hardware-precision-matrix

Thanks very much :)