Differences between cuDNN's graph API and the previous API

As I understand it, the graph API now selects and calls kernels based on a graph structure. The documentation mentions both pre-compiled and runtime-compiled engines. Does "pre-compiled" mean that the binaries are built in advance and the matching one is selected at run time, while "runtime-compiled" means the kernel has to be compiled at run time? If so, what exactly gets compiled? And did earlier versions of cuDNN, before the graph API existed, use only pre-compiled kernels?