Kernel panic due to incompatibility between freeCudaHost and free_pages

Hi there! This topic is a continuation of this topic.

So, I tested the code for 16 256K buffers, and it’s true: NO ISSUES. But, when I tried to run with 16 256M buffers, the error returned. The same kernel panic, but, if I comment the cudaFreeHost, the program runs as expected. Than:

  1. 16x 256K + cudaFreeHost = OK
  2. 16x 256M - cudaFreeHost = OK
  3. 16x 256M + cudaFreeHost = FAIL

I will share my code here. There is a run script in the root directory.
To change the number and size of the buffers: change the L43 - main_cuda_buffer.cu. To comment the cudaFreeHost: change the L101 - main_cuda_buffer.cu.
driver.zip (6.2 KB)

Thanks a lot!

Hi,

Thanks for the update and source.
We will test it and provide more info to you later.

Thanks.

Thanks AastaLLL! Looking forward to hearing from you.

Hi there! Any news?

Thanks.

Hi,

We are still discussing this issue internally.
In our testing, the system reboots when running the app (the cause of kernel panic).

However, we are not able to collect related crash logs on the kernel or uart log.
Do you have the error log related to the panic so we can share it with our internal team?

Thanks.

Yes, I’m facing the same behavior. The log that I achieved was the attached in the previous ticket. However, it’s here the debug UART output: debug.txt (6.6 KB)
.

Hi,

Thanks for the info.
We have shared the logs with our internal team and will let you know once we have more info about this issue.