./cupti_query
Assuming default device id 0
CUDA Device Id : 0
CUDA Device Name: Tesla K80
Assuming default domain id 401

Event# 1
Id = 2082
Name = tex0_cache_sector_queries
Shortdesc = Tex0 cache sector queries
Longdesc = Number of texture cache 0 requests. This increments by 1 for each 32-byte access.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 2
Id = 2083
Name = tex1_cache_sector_queries
Shortdesc = Tex1 cache sector queries
Longdesc = Number of texture cache 1 requests. This increments by 1 for each 32-byte access.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 3
Id = 2084
Name = tex2_cache_sector_queries
Shortdesc = Tex2 cache sector queries
Longdesc = Number of texture cache 2 requests. This increments by 1 for each 32-byte access. Value will be 0 for devices that contain only 2 texture units.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 4
Id = 2085
Name = tex3_cache_sector_queries
Shortdesc = Tex3 cache sector queries
Longdesc = Number of texture cache 3 requests. This increments by 1 for each 32-byte access. Value will be 0 for devices that contain only 2 texture units.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 5
Id = 2086
Name = tex0_cache_sector_misses
Shortdesc = Tex0 cache sector misses
Longdesc = Number of texture cache 0 misses. This increments by 1 for each 32-byte access.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 6
Id = 2087
Name = tex1_cache_sector_misses
Shortdesc = Tex1 cache sector misses
Longdesc = Number of texture cache 1 misses. This increments by 1 for each 32-byte access.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 7
Id = 2088
Name = tex2_cache_sector_misses
Shortdesc = Tex2 cache sector misses
Longdesc = Number of texture cache 2 misses. This increments by 1 for each 32-byte access. Value will be 0 for devices that contain only 2 texture units.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 8
Id = 2089
Name = tex3_cache_sector_misses
Shortdesc = Tex3 cache sector misses
Longdesc = Number of texture cache 3 misses. This increments by 1 for each 32-byte access. Value will be 0 for devices that contain only 2 texture units.

Event# 9
Id = 2165
Name = rocache_subp0_gld_warp_count_32b
Shortdesc = rocache_subp0_gld_warp_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 0 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 10
Id = 2166
Name = rocache_subp1_gld_warp_count_32b
Shortdesc = rocache_subp1_gld_warp_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 1 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 11
Id = 2167
Name = rocache_subp2_gld_warp_count_32b
Shortdesc = rocache_subp2_gld_warp_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 2 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 12
Id = 2168
Name = rocache_subp3_gld_warp_count_32b
Shortdesc = rocache_subp3_gld_warp_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 3 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 13
Id = 2169
Name = rocache_subp0_gld_warp_count_64b
Shortdesc = rocache_subp0_gld_warp_count_64b
Longdesc = Number of 64-bit global load requests via slice 0 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 14
Id = 2170
Name = rocache_subp1_gld_warp_count_64b
Shortdesc = rocache_subp1_gld_warp_count_64b
Longdesc = Number of 64-bit global load requests via slice 1 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 15
Id = 2171
Name = rocache_subp2_gld_warp_count_64b
Shortdesc = rocache_subp2_gld_warp_count_64b
Longdesc = Number of 64-bit global load requests via slice 2 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 16
Id = 2172
Name = rocache_subp3_gld_warp_count_64b
Shortdesc = rocache_subp3_gld_warp_count_64b
Longdesc = Number of 64-bit global load requests via slice 3 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 17
Id = 2173
Name = rocache_subp0_gld_warp_count_128b
Shortdesc = rocache_subp0_gld_warp_count_128b
Longdesc = Number of 128-bit global load requests via slice 0 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 18
Id = 2174
Name = rocache_subp1_gld_warp_count_128b
Shortdesc = rocache_subp1_gld_warp_count_128b
Longdesc = Number of 128-bit global load requests via slice 1 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 19
Id = 2175
Name = rocache_subp2_gld_warp_count_128b
Shortdesc = rocache_subp2_gld_warp_count_128b
Longdesc = Number of 128-bit global load requests via slice 2 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 20
Id = 2176
Name = rocache_subp3_gld_warp_count_128b
Shortdesc = rocache_subp3_gld_warp_count_128b
Longdesc = Number of 128-bit global load requests via slice 3 of read-only data cache. Increments per warp.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 21
Id = 2153
Name = rocache_subp0_gld_thread_count_32b
Shortdesc = rocache_subp0_gld_thread_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 0 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 22
Id = 2154
Name = rocache_subp1_gld_thread_count_32b
Shortdesc = rocache_subp1_gld_thread_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 1 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 23
Id = 2155
Name = rocache_subp2_gld_thread_count_32b
Shortdesc = rocache_subp2_gld_thread_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 2 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 24
Id = 2156
Name = rocache_subp3_gld_thread_count_32b
Shortdesc = rocache_subp3_gld_thread_count_32b
Longdesc = Number of 8-bit, 16-bit, and 32-bit global load requests via slice 3 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 25
Id = 2157
Name = rocache_subp0_gld_thread_count_64b
Shortdesc = rocache_subp0_gld_thread_count_64b
Longdesc = Number of 64-bit global load requests via slice 0 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 26
Id = 2158
Name = rocache_subp1_gld_thread_count_64b
Shortdesc = rocache_subp1_gld_thread_count_64b
Longdesc = Number of 64-bit global load requests via slice 1 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 27
Id = 2159
Name = rocache_subp2_gld_thread_count_64b
Shortdesc = rocache_subp2_gld_thread_count_64b
Longdesc = Number of 64-bit global load requests via slice 2 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 28
Id = 2160
Name = rocache_subp3_gld_thread_count_64b
Shortdesc = rocache_subp3_gld_thread_count_64b
Longdesc = Number of 64-bit global load requests via slice 3 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 29
Id = 2161
Name = rocache_subp0_gld_thread_count_128b
Shortdesc = rocache_subp0_gld_thread_count_128b
Longdesc = Number of 128-bit global load requests via slice 0 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 30
Id = 2162
Name = rocache_subp1_gld_thread_count_128b
Shortdesc = rocache_subp1_gld_thread_count_128b
Longdesc = Number of 128-bit global load requests via slice 1 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 31
Id = 2163
Name = rocache_subp2_gld_thread_count_128b
Shortdesc = rocache_subp2_gld_thread_count_128b
Longdesc = Number of 128-bit global load requests via slice 2 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 32
Id = 2164
Name = rocache_subp3_gld_thread_count_128b
Shortdesc = rocache_subp3_gld_thread_count_128b
Longdesc = Number of 128-bit global load requests via slice 3 of read-only data cache. For each instruction it increments by the number of threads in the warp that execute the instruction.
Category = CUPTI_EVENT_CATEGORY_CACHE

Event# 33
Id = 2193
Name = elapsed_cycles_sm
Shortdesc = Elapsed clocks
Longdesc = Elapsed clocks
Category = CUPTI_EVENT_CATEGORY_INSTRUCTION
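
To work with a listing like this programmatically (for example, to look up an event Id by Name), the records can be split on the "Event#" marker and the "key = value" fields pulled out of each one. The following is a minimal Python sketch, not part of cupti_query itself. The function name parse_events and the two-record sample are assumptions made for illustration; the field names (Id, Name, Shortdesc, Longdesc, Category) are taken from the listing. It tolerates records whose fields sit on one line or on separate lines, and records with a missing field.

```python
import re

# Field names as they appear in the listing.
_KEYS = r"(?:Id|Name|Shortdesc|Longdesc|Category)"

# A field is "<key> = <value>", where the value runs until the next
# "<key> =" or the end of the record.
FIELD_RE = re.compile(
    r"\b(" + _KEYS[3:-1] + r")\s*=\s*(.*?)(?=\s+" + _KEYS + r"\s*=|\Z)",
    re.DOTALL,
)


def parse_events(text):
    """Return one dict per "Event# N" record found in text (hypothetical helper)."""
    # Splitting on a pattern with a capture group yields
    # [header, number1, body1, number2, body2, ...].
    parts = re.split(r"Event#\s*(\d+)", text)
    events = []
    for number, body in zip(parts[1::2], parts[2::2]):
        record = {"Event": int(number)}
        for key, value in FIELD_RE.findall(body):
            # Collapse any internal line breaks or runs of spaces.
            record[key] = " ".join(value.split())
        if "Id" in record:
            record["Id"] = int(record["Id"])
        events.append(record)
    return events


# Two records copied from the listing: one on a single line, one line-per-field.
sample = """CUDA Device Name: Tesla K80
Event# 1 Id = 2082 Name = tex0_cache_sector_queries Shortdesc = Tex0 cache sector queries Longdesc = Number of texture cache 0 requests. This increments by 1 for each 32-byte access. Category = CUPTI_EVENT_CATEGORY_CACHE
Event# 33
Id = 2193
Name = elapsed_cycles_sm
Shortdesc = Elapsed clocks
Longdesc = Elapsed clocks
Category = CUPTI_EVENT_CATEGORY_INSTRUCTION
"""

events = parse_events(sample)
ids_by_name = {e["Name"]: e["Id"] for e in events}
for name, event_id in ids_by_name.items():
    print(name, event_id)
```

The header lines before the first "Event#" marker are skipped, so "CUDA Device Name:" is not mistaken for a Name field. A record that lacks a field, as Event# 8 does for Category here, simply yields a dict without that key.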