do you know if there is an extension for double precision floating point support on textures for CUDA and/or OpenCL?
I have some single precision code that uses textures and I would like to compare performance with its double precision counterpart
double2 x = as_double2(read_imageui(img, sample, i));
This works as CL_RGBA = 4 components, CL_UNSIGNED_INT32 = 32 bits, so uint4 = 4 * 32 = 128 bits
A double = 64 bits, double2 = 2 * 64 = 128 bits so the requirement of as_typen that the input and output have the same bit size is met.
(Note I have not tested it on a GPU yet)
Also:
Would be nice to see them add the ability for image2df to return doubles, maybe an image2dd for images with type CL_DOUBLE? It would complement the existing support for CL_HALF_FLOAT.
Perhaps a suggestion for 1.2? (Where do we suggest for 1.2 bty?)