Good day,
As mentioned in the topic, the global memory of my Tesla C2075 is reported as about 1.3 gigabytes, which is far less than the product information states.
Product information can be found here: NVIDIA DGX Station A100 | NVIDIA
My setup:
Tesla C2075 driver version 311.50, Release date 2013.04.17
CUDA 5.0
This is the code snippet I used to display the global memory:
void PrintDeviceProperties(cudaDeviceProp devProp)
{
    // Append the properties to a text file; skip the report if it cannot be opened.
    FILE *deviceProperties = fopen("DeviceProperties.txt", "a+");
    if (deviceProperties == NULL)
        return;

    fprintf(deviceProperties, "Major revision number: %d\n", devProp.major);
    fprintf(deviceProperties, "Minor revision number: %d\n", devProp.minor);
    fprintf(deviceProperties, "Name: %s\n", devProp.name);
    fprintf(deviceProperties, "Total global memory: %u\n", devProp.totalGlobalMem);
    fprintf(deviceProperties, "Total shared memory per block: %u\n", devProp.sharedMemPerBlock);
    fprintf(deviceProperties, "Total registers per block: %d\n", devProp.regsPerBlock);
    fprintf(deviceProperties, "Warp size: %d\n", devProp.warpSize);
    fprintf(deviceProperties, "Maximum memory pitch: %u\n", devProp.memPitch);
    fprintf(deviceProperties, "Maximum threads per block: %d\n", devProp.maxThreadsPerBlock);
    // Per-axis limits (x, y, z) for blocks and grids.
    for (int i = 0; i < 3; ++i)
        fprintf(deviceProperties, "Maximum dimension %d of block: %d\n", i, devProp.maxThreadsDim[i]);
    for (int i = 0; i < 3; ++i)
        fprintf(deviceProperties, "Maximum dimension %d of grid: %d\n", i, devProp.maxGridSize[i]);
    fprintf(deviceProperties, "Clock rate: %d\n", devProp.clockRate);
    fprintf(deviceProperties, "Total constant memory: %u\n", devProp.totalConstMem);
    fprintf(deviceProperties, "Texture alignment: %u\n", devProp.textureAlignment);
    fprintf(deviceProperties, "Concurrent copy and execution: %s\n", (devProp.deviceOverlap ? "Yes" : "No"));
    fprintf(deviceProperties, "Number of multiprocessors: %d\n", devProp.multiProcessorCount);
    fprintf(deviceProperties, "Kernel execution timeout: %s\n",
            (devProp.kernelExecTimeoutEnabled ? "Yes" : "No"));
    fclose(deviceProperties);
}
And the result is as follows:
Major revision number: 2
Minor revision number: 0
Name: Tesla C2075
Total global memory: 1341849600
Total shared memory per block: 49152
Total registers per block: 32768
Warp size: 32
Maximum memory pitch: 2147483647
Maximum threads per block: 1024
Maximum dimension 0 of block: 1024
Maximum dimension 1 of block: 1024
Maximum dimension 2 of block: 64
Maximum dimension 0 of grid: 65535
Maximum dimension 1 of grid: 65535
Maximum dimension 2 of grid: 65535
Clock rate: 1147000
Total constant memory: 65536
Texture alignment: 512
Concurrent copy and execution: Yes
Number of multiprocessors: 14
Kernel execution timeout: No
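One possible explanation, offered as an assumption rather than a confirmed diagnosis: totalGlobalMem is a size_t, while %u expects an unsigned int. In a 64-bit build, fprintf may therefore show only the low 32 bits of the value. The plain C sketch below illustrates that effect and one way around it. The helper names print_size_field and low_32_bits are hypothetical and not part of the CUDA API, and the 64-bit value mentioned afterwards is an illustrative number, not a measurement from this card.

```c
#include <stdio.h>

/* Hypothetical helper (not a CUDA function): prints a size_t-valued field
   by widening it to unsigned long long, which matches %llu on both
   32-bit and 64-bit builds. Returns whatever fprintf returns. */
int print_size_field(FILE *out, const char *label, size_t value)
{
    return fprintf(out, "%s: %llu\n", label, (unsigned long long)value);
}

/* Hypothetical helper: returns the low 32 bits of a value, which is what
   remains if a 64-bit quantity is read as a 32-bit unsigned int. */
unsigned long long low_32_bits(unsigned long long value)
{
    return value & 0xFFFFFFFFULL;
}
```

For example, low_32_bits(5636816896ULL) yields 1341849600, the number in the output above, so a true value of roughly 5.25 GiB would be consistent with what was printed. Calling print_size_field(deviceProperties, "Total global memory", devProp.totalGlobalMem) in place of the %u line should show whether that is what happens here. The same applies to the other size_t fields printed with %u (sharedMemPerBlock, memPitch, totalConstMem, textureAlignment).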
Any kind of help is appreciated!
Best regards,
Jonne