I am implementing vector bit quantization and have reached the point where I need to handle negative zero. What should an equality comparison between negative zero and positive zero return in a custom BFloat16 implementation? The CUDA documentation does not make clear what __heq() returns when the operands are CUDART_NEG_ZERO_BF16 and CUDART_ZERO_BF16. Please let me know!
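
For reference, IEEE 754 comparison semantics treat +0 and -0 as equal, and any equality test involving NaN is false. Below is a minimal C++ sketch of that behavior on raw bfloat16 bit patterns, assuming a custom BFloat16 is meant to follow IEEE 754. The names BF16_POS_ZERO, BF16_NEG_ZERO, bf16_is_nan and bf16_eq are hypothetical and not part of the CUDA API, and the sketch does not by itself establish what __heq() returns.

```cpp
#include <cstdint>

// Hypothetical names for illustration only; none of these are CUDA API.
// bfloat16 layout: 1 sign bit, 8 exponent bits, 7 mantissa bits.
constexpr std::uint16_t BF16_POS_ZERO = 0x0000;  // +0.0
constexpr std::uint16_t BF16_NEG_ZERO = 0x8000;  // -0.0 (only the sign bit set)

// NaN: exponent bits all ones and a non-zero mantissa.
// Masking off the sign bit, any value above 0x7F80 (infinity) is a NaN.
constexpr bool bf16_is_nan(std::uint16_t bits) {
    return (bits & 0x7FFF) > 0x7F80;
}

// Equality under the assumption of IEEE 754 semantics:
//   - false if either operand is NaN
//   - true if both operands are zero, regardless of sign
//   - otherwise true only when the bit patterns match
constexpr bool bf16_eq(std::uint16_t a, std::uint16_t b) {
    return !bf16_is_nan(a) && !bf16_is_nan(b) &&
           ((((a | b) & 0x7FFF) == 0) || a == b);
}
```

Under this assumption, bf16_eq(BF16_NEG_ZERO, BF16_POS_ZERO) is true even though the two bit patterns differ, so a plain comparison of the raw 16-bit values would not be sufficient.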