Fast 3D arrays in OpenCL? In CUDA i would use cudaMalloc3D? How to do that in OpenCL?

In order to have fast memory access it must be aligned at 64-byte adresses?

CUDA offers cudaMalloc3D to allocate 3D arrays at these adresses.

How it is done with OpenCL or is it done somehow automatically?

Image3D maybe?