Accelerating Transformers with NVIDIA cuDNN 9

Originally published at: https://developer.nvidia.com/blog/accelerating-transformers-with-nvidia-cudnn-9/

The NVIDIA CUDA Deep Neural Network library (cuDNN) is a GPU-accelerated library for accelerating deep learning primitives with state-of-the-art performance. cuDNN is integrated with popular deep learning frameworks like PyTorch, TensorFlow, and XLA (Accelerated Linear Algebra). These frameworks abstract the complexities of direct GPU programming, enabling you to focus on designing and training their models…