I see “PCIE: Response decoding error” errors. This can happen if the NVMe device does not keep the CLKREQ signal asserted the whole time.
Could you please add the “nvidia,disable-clock-request” property to the root port nodes of the PCIe controllers?
Also, if you can rebuild the kernel quickly, please try the following kernel change as an experiment.
I’m so sorry for the late reply. After I changed the kernel code, the SSD is identified only some of the time. OK log: nvme_ok.log (58.0 KB). Error log: nvme_err.log (57.4 KB).
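To compare the two attached logs quickly, something like the following could be used. It is only a minimal sketch: it assumes both logs are plain text files in the current directory and that the error appears in them with exactly the wording quoted earlier in this thread. The function name `count_error_lines` is made up for illustration.

```python
from pathlib import Path

# Error text as quoted in this thread; the exact wording in the logs is an assumption.
ERROR_TEXT = "PCIE: Response decoding error"


def count_error_lines(log_path, needle=ERROR_TEXT):
    """Return the number of lines in log_path that contain needle."""
    text = Path(log_path).read_text(errors="replace")
    return sum(1 for line in text.splitlines() if needle in line)


if __name__ == "__main__":
    # Log names taken from the attachments above.
    for name in ("nvme_ok.log", "nvme_err.log"):
        path = Path(name)
        if path.exists():
            print(name, count_error_lines(path))
        else:
            print(name, "not found")
```

If the error count is non-zero in both logs, the error alone probably does not explain why detection fails only on some boots.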
There has been no update from you for a while, so we assume this is no longer an issue.
Hence we are closing this topic. If you need further support, please open a new one.
Thanks
Did you confirm that the change is actually taking effect? I’m afraid we can’t do much beyond this point other than connecting a PCIe analyzer to find out what is going wrong.
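One way to check whether the device tree part of the change reached the running system is to look for the property in the live device tree. The sketch below assumes the kernel exposes the device tree under /proc/device-tree, where each property shows up as a file inside its node directory. The function name `nodes_with_property` is made up for illustration, and this only covers the device tree property, not the kernel code change.

```python
import os

# Property name as given earlier in this thread.
PROP = "nvidia,disable-clock-request"


def nodes_with_property(root="/proc/device-tree", prop=PROP):
    """Return sorted node paths (relative to root) that contain the property."""
    found = []
    for dirpath, _dirnames, filenames in os.walk(root):
        # In the live device tree every property is a file in its node directory.
        if prop in filenames:
            found.append(os.path.relpath(dirpath, root))
    return sorted(found)


if __name__ == "__main__":
    hits = nodes_with_property()
    if hits:
        for node in hits:
            print("property present in node:", node)
    else:
        print("property not found in the running device tree")
```

If no node is listed after a reboot, the updated device tree is most likely not the one being loaded at boot.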
Also, do you see this issue only with this endpoint, or with other endpoints as well?