Replicate 2.2ms inference time on BERT

This topic is a duplicate; it was originally posted in the wrong category. The new topic is here: https://devtalk.nvidia.com/default/topic/1061766/tensorrt/replicate-2-2ms-inference-time-on-bert/

Please delete this topic if possible.