CLBlast could not produce FP16 tuning results with NVIDIA GPUs like RTX4090 on Windows

Hi @tjcgy, welcome to the NVIDIA developer forums.

I don’t think I can add much beyond what Robert already posted in reply to the related post over in the CUDA category.

The release notes state that the NVVM compiler was upgraded and now allows use of the cl_khr_fp16 extension.

But actual support for that feature will vary with the specific hardware.
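
If it is useful, one way to see whether a given device advertises the extension is to look at its extension string, which OpenCL exposes through `clGetDeviceInfo` with `CL_DEVICE_EXTENSIONS` as a space-separated list of names. Below is a minimal sketch of just the string check. The helper name `supports_extension` and both sample strings are made up for illustration and are not output from any real GPU.

```python
def supports_extension(extensions: str, name: str = "cl_khr_fp16") -> bool:
    """Return True if `name` is listed as a whole token in a
    space-separated OpenCL extension string."""
    # Split on whitespace so that a longer name that merely contains
    # `name` as a substring does not count as a match.
    return name in extensions.split()


# Hypothetical extension strings, for illustration only.
sample_with_fp16 = "cl_khr_global_int32_base_atomics cl_khr_fp64 cl_khr_fp16"
sample_without_fp16 = "cl_khr_global_int32_base_atomics cl_khr_fp64"

print(supports_extension(sample_with_fp16))
print(supports_extension(sample_without_fp16))
```

On a real system you would pass in the string reported by the device instead of the samples. If cl_khr_fp16 is not listed there, kernels that enable it are not expected to build on that device.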

I hope that helps clarify the situation.

Thanks!