Extracting Cycle-Accurate Timing Diagrams in GPGPU-Sim

I’ve been working on extracting a cycle-accurate timing diagram from GPGPU-Sim while simulating a CUDA vector addition kernel on an NVIDIA Turing RTX 2060 model. My goal was to analyze warp execution timing, instruction issue cycles, and memory access latencies to create a pipeline timing diagram similar to what you’d see in architectural studies.

I wanted to ask how should be gpgpusim.config set for this?

GPGPU-Sim is not a NVIDIA product or maintained by NVIDIA.

Looking at their github page, I see this:

If you have questions, please sign up for the google groups page (see gpgpu-sim.org), but note that use of this simulator does not imply any level of support. Questions answered on a best effort basis.

You may get better help there.