Quadro RTX 6000 does not handle BF16? Please provide an update

Hello,
I am using a Quadro RTX 6000, and I have a problem with generative AI.
I have run into at least two cases where my card could not handle workloads that other cards with less VRAM can handle.
My card has 24 GB of VRAM, yet it cannot handle BF16 and cannot run some scripts.

I am specifically using the ComfyUI program.
Many of the models in use are BF16 or require BF16, for example the PuLID FLUX model and the latest AI video generation model, Mochi.
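For context, from what I have read, native BF16 support arrived with the Ampere architecture (compute capability 8.0), while the Quadro RTX 6000 is Turing (compute capability 7.5). You can check what PyTorch reports for a card with something like this (a minimal sketch, assuming a recent PyTorch build; newer versions may report BF16 as available via software emulation, without hardware acceleration):

import torch

print(torch.cuda.get_device_name(0))        # e.g. "Quadro RTX 6000"
print(torch.cuda.get_device_capability(0))  # (7, 5) for Turing; (8, 0) or higher for Ampere+
print(torch.cuda.is_bf16_supported())       # whether PyTorch will run bf16 ops on this GPU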

I talked with the creator of ComfyUI, and he suggested modifying the ComfyUI code to allow float16, so we commented out a line of code to let the program accept float16:

#supported_inference_dtypes = [torch.bfloat16, torch.float32]  # original line, commented out
supported_inference_dtypes = [torch.float16, torch.bfloat16, torch.float32]  # float16 added

The result was that ComfyUI was able to load some BF16 models in float16 mode, but the generated video output was corrupted: it was pure noise.
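If I understand it correctly (this is my assumption, not something I verified in the ComfyUI code), the noise comes from a numeric-range problem: BF16 keeps the same exponent range as FP32, while FP16 tops out around 65504, so weights or activations that are fine in BF16 overflow to inf when cast to FP16, and that turns into NaN noise in the output. A tiny demonstration:

import torch

x = torch.tensor([70000.0], dtype=torch.bfloat16)  # representable in bf16 (fp32-like range)
print(x.to(torch.float16))                         # prints inf: the value overflows fp16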

__
My theory: the card fails to handle BF16, so it falls back to FP32, which makes generation much slower and far less efficient, and ultimately the card runs out of memory and fails at generation, whereas a lower-VRAM card that can handle BF16 does not fail.
Which is a shame, isn't it?
__
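To put rough numbers on that theory (the parameter count here is purely illustrative, not Mochi's actual size): each weight takes 2 bytes in BF16 but 4 bytes in FP32, so an FP32 fallback alone can blow past 24 GB before activations are even counted:

params = 10e9                # hypothetical 10-billion-parameter model
print(params * 2 / 1024**3)  # ~18.6 GB of weights in bf16
print(params * 4 / 1024**3)  # ~37.3 GB of weights in fp32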

Is there an update, driver, or workaround that can make my card work with BF16? Can NVIDIA do that, please?
This is not a bad card, and it is still in use. It has 24 GB of VRAM, after all. It deserves an update that addresses this problem.
__

I also got other errors such as:

RuntimeError: expected scalar type Half but found BFloat16
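In case it helps to reproduce: as far as I can tell, this error appears whenever a model's weights and its inputs are in different dtypes, and the usual fix is to cast one side to match the other. A minimal sketch (the Linear layer is just an illustration, not the actual ComfyUI code):

import torch

layer = torch.nn.Linear(4, 4).to("cuda", torch.float16)  # weights in fp16 ("Half")
x = torch.randn(1, 4, device="cuda", dtype=torch.bfloat16)  # input in bf16
# layer(x) raises a dtype-mismatch RuntimeError like the one above
y = layer(x.to(layer.weight.dtype))  # casting the input to match the weights fixes it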


Please help.

By the way, I do not know whether this is the right place to post this.

Can NVIDIA help me find a quick solution, please? I need this; I need to be able to run BF16 models.
