JP 5.0.2 GA Correct way to build kernel and modules? nvgpu.ko errors!

seeky15 · September 23, 2022, 9:29am

Hello everyone,

I have followed this guide:

https://docs.nvidia.com/jetson/archives/r35.1/DeveloperGuide/text/SD/Kernel/KernelCustomization.html

I am using the manual sources.
Building with nvbuild.sh works and a kernel is compiled.
Instead of creating a kernel supplements tar as described in step 7, I install the modules as follows:

rm -rf $ROOT_DIR/usr/lib/modules/$KERNEL_VERSION/kernel
make -C $SOURCE_DIR/kernel/kernel-* INSTALL_MOD_STRIP=1 LOCALVERSION="-tegra" ARCH=arm64 O=$KERNEL_DIR modules_install INSTALL_MOD_PATH=$ROOT_DIR/usr

This all works perfectly fine. We added custom kernel modules and everything runs properly.
Since we noticed that a lot of unnecessary stuff is included in the kernel config, I disabled, for example, the “Network Device Support → Wireless LAN” option in the kernel. Everything else is kept the same.

As soon as I run the generated kernel and modules on the device I get the following warnings from nvgpu.ko:

[   10.174511] nvidia: loading out-of-tree module taints kernel.
[   10.177739] nvidia: module verification failed: signature and/or required key missing - tainting kernel
[   10.178623] nvidia: disagrees about version of symbol nvhost_get_default_device
[   10.178784] nvidia: Unknown symbol nvhost_get_default_device (err -22)
[   10.178974] nvidia: disagrees about version of symbol fget
[   10.179093] nvidia: Unknown symbol fget (err -22)
[   10.179252] nvidia: disagrees about version of symbol fd_install
[   10.179397] nvidia: Unknown symbol fd_install (err -22)
[   10.179674] nvidia: disagrees about version of symbol wake_up_process
[   10.179815] nvidia: Unknown symbol wake_up_process (err -22)
[   10.180092] nvidia: disagrees about version of symbol iterate_fd
[   10.180257] nvidia: Unknown symbol iterate_fd (err -22)
[   10.180563] nvidia: disagrees about version of symbol __close_fd
[   10.180745] nvidia: Unknown symbol __close_fd (err -22)
[   10.181399] nvidia: disagrees about version of symbol nvhost_syncpt_unit_interface_get_aperture
[   10.181644] nvidia: Unknown symbol nvhost_syncpt_unit_interface_get_aperture (err -22)

Is there anything special about the build of nvgpu.ko that it does not get informed about my kernel config changes?
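
For context: the kernel prints "disagrees about version of symbol" when CONFIG_MODVERSIONS is enabled and the CRC a module recorded for an imported symbol differs from the CRC the kernel exports. Those CRCs are derived from each symbol's type signature, so a config change that alters a structure can change them. A minimal sketch for listing the mismatches offline, assuming the module's CRCs were dumped with modprobe --dump-modversions and that Module.symvers is taken from the kernel output directory; compare_crcs is a hypothetical helper name, not part of any NVIDIA tooling:

```shell
#!/bin/sh
# compare_crcs MODULE_CRC_DUMP MODULE_SYMVERS
# MODULE_CRC_DUMP: text saved from `modprobe --dump-modversions <module>.ko`
#                  (one "CRC<TAB>symbol" pair per line).
# MODULE_SYMVERS:  Module.symvers from the kernel build output directory
#                  (CRC in column 1, symbol name in column 2).
# Prints every symbol whose CRC differs between the module and the kernel.
compare_crcs() {
    awk '
        # First file on the awk command line is Module.symvers: remember kernel CRCs.
        NR == FNR { kernel_crc[$2] = $1; next }
        # Second file is the module dump: report symbols with a different CRC.
        ($2 in kernel_crc) && kernel_crc[$2] != $1 {
            printf "%s module=%s kernel=%s\n", $2, $1, kernel_crc[$2]
        }
    ' "$2" "$1"
}

# Example usage (paths are assumptions):
#   modprobe --dump-modversions nvidia.ko > nvidia.crc
#   compare_crcs nvidia.crc "$KERNEL_DIR/Module.symvers"
```

An empty result would suggest the module was built against the same kernel configuration. Any listed symbol is one the loader would be expected to reject.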

WayneWWW · September 26, 2022, 2:35am

Hi,

Did nvgpu.ko also get rebuilt and replaced on your side?

seeky15 · September 26, 2022, 4:03am

Hey @WayneWWW

Yes, it seems to be rebuilt. The file date changes, and I noticed that step 5 of the guide mentions this:

Replace Linux_for_Tegra/rootfs/usr/lib/modules/$(uname -r)/kernel/drivers/gpu/nvgpu/nvgpu.ko with a copy of this file:

$kernel_out/drivers/gpu/nvgpu/nvgpu.ko

Initially I had a step at the end of my script to copy it, but when the file was still 245MB afterwards I noticed that the module install step:

make -C $SOURCE_DIR/kernel/kernel-* INSTALL_MOD_STRIP=1 LOCALVERSION="-tegra" ARCH=arm64 O=$KERNEL_DIR modules_install INSTALL_MOD_PATH=$ROOT_DIR/usr

…copies it too, so I tested with copying and without copying, and also without the INSTALL_MOD_STRIP=1 option. No matter what I do, as soon as I remove the wifi modules the nvgpu.ko complains that there are missing symbols.

WayneWWW · September 26, 2022, 4:08am

Could you first build the kernel + nvgpu without any of your patches and see if nvgpu works fine in that situation?

seeky15 · September 26, 2022, 4:09am

Hey @WayneWWW will do that. Give me a few minutes. Will report back.

WayneWWW · September 26, 2022, 4:09am

BTW, 5.0.2 is no longer a DP version. Not sure why the topic still has DP in it.

seeky15 · September 26, 2022, 4:10am

@kayccc Seems I can’t edit it, can you please do that? Got used to the DP

seeky15 · September 26, 2022, 5:40am

@WayneWWW

Unfortunately still the same result with the plain kernel from the public sources.
I entirely removed the output and source directory for the build to make sure that there are no leftovers.

Here is the part of the script that does the build. I hope the variable names are self-explanatory:

echo "Copying kernel config"
cp $CONFIG_DIR/$KERNELCONFIG_NAME $SOURCE_DIR/kernel/kernel-*/arch/arm64/configs/tegra_defconfig

export CROSS_COMPILE=$TOOLCHAIN_DIR/$TOOLCHAIN_VERSION/bin/aarch64-buildroot-linux-gnu-
export CROSS_COMPILE_AARCH64_PATH=$TOOLCHAIN_DIR/$TOOLCHAIN_VERSION

echo "Building Kernel"
$SOURCE_DIR/nvbuild.sh -o $KERNEL_DIR

echo "Replacing kernel"
cp $KERNEL_DIR/arch/arm64/boot/Image $L4T_DIR/kernel/Image

echo "Replacing kernel modules"
echo "Cleaning modules"
rm -rf $ROOT_DIR/usr/lib/modules/$KERNEL_VERSION/kernel

echo "Installing modules"
make -C $SOURCE_DIR/kernel/kernel-* INSTALL_MOD_STRIP=1 LOCALVERSION="-tegra" ARCH=arm64 O=$KERNEL_DIR modules_install INSTALL_MOD_PATH=$ROOT_DIR/usr

And here is the config I used this time. It is the most minimal one I had in use, and with it even more symbols are missing.
kernel_config (165.5 KB)

WayneWWW · September 26, 2022, 6:02am

Does your “plain kernel” mean this is pure from source code tarball even without changing anything in defconfig?

seeky15 · September 26, 2022, 6:07am

@WayneWWW

To answer your previous question: plain kernel means no patches, just the NVIDIA source code, but with my config. As I mentioned before, everything is all right with the default config, but as soon as the kernel config is modified the issue appears. This should make it clearer:

I made an error during my initial analysis.
Since I did not know which kernel module the “nvidia” message belonged to, I deleted nvgpu.ko because it described itself as nvidia. That was a mistake. The actual module is in fact called “nvidia.ko”.

It is placed here:
Linux_for_Tegra/rootfs/usr/lib/modules/5.10.104-tegra/extra/opensrc-disp/nvidia.ko

I do not have that module anywhere in my build folder except in the rootfs. I assume it gets created by the apply_binaries.sh? It is already present in my minimal rootfs which is derived from ubuntu base + apply_binaries.sh

Of course, none of the modules in that folder match the kernel I built. They do not get built by nvbuild.sh.
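
A rough way to spot such leftovers is to list every .ko file in the rootfs module tree that has no same-named counterpart in the kernel output directory. This is only a sketch: find_unbuilt_modules is a hypothetical helper, and matching by file name alone is an assumption that may miss modules built elsewhere.

```shell
#!/bin/sh
# find_unbuilt_modules ROOTFS_MODULE_DIR KERNEL_BUILD_DIR
# Prints each .ko under ROOTFS_MODULE_DIR whose file name does not exist
# anywhere under KERNEL_BUILD_DIR, i.e. modules the kernel build did not produce.
find_unbuilt_modules() {
    rootfs_mod_dir=$1
    build_dir=$2
    find "$rootfs_mod_dir" -name '*.ko' | sort | while IFS= read -r ko; do
        name=$(basename "$ko")
        # Look for a module with the same file name in the build output.
        match=$(find "$build_dir" -name "$name" | head -n 1)
        if [ -z "$match" ]; then
            printf '%s\n' "$ko"
        fi
    done
}

# Example usage (paths are assumptions):
#   find_unbuilt_modules "$ROOT_DIR/usr/lib/modules/$KERNEL_VERSION" "$KERNEL_DIR"
```

Anything it prints came from somewhere other than the current kernel build and would need to be rebuilt separately against the same kernel.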

WayneWWW · September 26, 2022, 6:14am

The source code of nvidia.ko and nvidia-modeset.ko is in the public source code tarball.

Also, when I said “pure jetpack”, I mean that your whole BSP comes from JetPack. Build the kernel in that environment.

Make sure this base case can pass first.

seeky15 · September 26, 2022, 6:23am

Again: Everything works as long as I do not edit the kernel config.

Seems we are back to this older question from the DP 5.0.1:

We already identified the issue. My rootfs includes the extra folder from the l4t but my kernel does not match it.
As long as I use the wrong nvidia.ko I’ll have that issue.

If the source for the nvidia.ko is included in the public sources, how can I build the modules in the extra folder? nvbuild.sh doesn’t.

WayneWWW · September 26, 2022, 6:33am

Have you identified which line you added in kernel defconfig is causing the problem?

Please follow the guidance here. This part is missing from the documentation. We will add it back.
The toolchain version may need correction.

The NVIDIA-kernel-module-source-<Version>.tar.xz source code is supplied as a .xz file. Untar the file. Its location is <NV_WORKSPACE>/drive-linux_src/.
Export the required variables as per the Linux kernel compilation steps:

export ARCH=arm64
export LOCALVERSION="-tegra"
export CROSS_COMPILE=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-

cd <NV_WORKSPACE>/drive-linux_src/NVIDIA-kernel-module-source-<Version>
export IGNORE_PREEMPT_RT_PRESENCE=1

Make the modules with the following command:

make \
    modules \
    SYSSRC=<KERNELSRC> \
    SYSOUT=<KERNELSRC_OUTDIR> \
    CC=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-gcc \
    LD=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-ld.bfd \
    AR=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-ar \
    CXX=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-g++ \
    OBJCOPY=<NV_WORKSPACE>/toolchains/aarch64--glibc--stable-2020.08-1/bin/aarch64-linux-objcopy \
    TARGET_ARCH=aarch64 \
    ARCH=arm64

Modules are built under <NV_WORKSPACE>/drive-linux_src/NVIDIA-kernel-module-source-<Version>/kernel-open.
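
The instructions above stop at the build. A minimal sketch of the remaining install step, assuming the rebuilt modules belong in the same extra/opensrc-disp directory of the rootfs where the stock nvidia.ko was found earlier in this thread; install_extra_modules is a hypothetical helper and the destination path is an assumption:

```shell
#!/bin/sh
# install_extra_modules SRC_DIR DEST_DIR
# Copies every .ko found under SRC_DIR (e.g. the kernel-open build directory)
# into DEST_DIR (e.g. <rootfs>/usr/lib/modules/<version>/extra/opensrc-disp).
install_extra_modules() {
    src_dir=$1
    dest_dir=$2
    mkdir -p "$dest_dir" || return 1
    count=$(find "$src_dir" -name '*.ko' | wc -l)
    if [ "$count" -eq 0 ]; then
        echo "no .ko files found under $src_dir" >&2
        return 1
    fi
    # Overwrite the stock modules with the freshly built ones.
    find "$src_dir" -name '*.ko' -exec cp {} "$dest_dir"/ \;
}

# Example usage (paths are assumptions):
#   install_extra_modules \
#       "<NV_WORKSPACE>/drive-linux_src/NVIDIA-kernel-module-source-<Version>/kernel-open" \
#       "$ROOT_DIR/usr/lib/modules/$KERNEL_VERSION/extra/opensrc-disp"
# Refreshing the module dependency files afterwards is probably needed as well,
# for example: depmod -b "$ROOT_DIR/usr" "$KERNEL_VERSION"
```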

seeky15 · September 26, 2022, 6:50am

I assume you mean this file inside the public sources?

It contains folders with the names of the three modules.

The NVIDIA-kernel-module-source-<Version>.tar.xz does not exist in the source tar.

WayneWWW · September 26, 2022, 6:52am

Hi,

The folder name does not matter. Please read the readme file inside this tarball and modify the make parameters as described in my comments.

seeky15 · September 26, 2022, 6:55am

I just wanted to make sure that I use the correct tar file as source for building and not some other since you mentioned a totally different file.

WayneWWW · September 26, 2022, 6:58am

Yes, that tarball is correct.

seeky15 · September 26, 2022, 10:43am

@WayneWWW thanks for the help.

With the self-compiled extra modules all warnings are gone!

system · October 19, 2022, 2:11am

This topic was automatically closed 14 days after the last reply. New replies are no longer allowed.
