After configuring the workload manager, running sinfo shows every node in the defq partition in the unk* state. When I log in to a node, the munge daemon is reported as failed, with this message:

node001 munged[4473]: Failed to find keyfile "/cm/shared/apps/slurm/var/munge/keys/munge.key": No such file or directory (Did you run mungekey?)
I think slurmd cannot run properly because MUNGE is not running, and that this is why the nodes show up as unk*.
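One way to narrow this down might be to run a small check on both the head node and node001 and compare the results. The sketch below walks the key file path from the log message and reports the first component that does not exist. It assumes /cm/shared is a directory shared between the nodes, which is not confirmed here, and the helper name first_missing_component is made up for this illustration. It should be run as root, since a munge key directory is often readable only by the munge user and an unreadable directory would look the same as a missing one.

```python
import os

# Path copied from the munged log message above.
KEYFILE = "/cm/shared/apps/slurm/var/munge/keys/munge.key"


def first_missing_component(path):
    """Return the shortest prefix of `path` that does not exist,
    or None if the whole path exists.

    Hypothetical helper for illustration only. os.path.exists() also
    returns False when a parent directory cannot be read, so run as root.
    """
    current = os.sep
    for part in os.path.abspath(path).split(os.sep):
        if not part:
            continue  # skip the empty piece before the leading separator
        current = os.path.join(current, part)
        if not os.path.exists(current):
            return current
    return None


if __name__ == "__main__":
    missing = first_missing_component(KEYFILE)
    if missing is None:
        print("key file exists:", KEYFILE)
    else:
        print("first missing path component:", missing)
```

If the two nodes report different results, the shared directory is probably not visible on one of them. If both stop at munge.key itself, the key has likely never been created, which is what the "Did you run mungekey?" hint in the log suggests.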
Do I need to fix this on the head node or on the compute node?
Any help would be appreciated!