Hi! I have been profiling my LLM training using Nsight Systems to better understand the communication patterns. When I hover over the ncclAllReduce event it shows some additional information such as Message Size. What exactly does the Message size refer to in this context?
Is it #GPUs x #elements/GPU x Bytes/element?
Thanks for your help :)
