Startup service nv_target_docker never exits

DRIVE OS Version: Provide DRIVE OS version: 7.0.3

Issue Description:

This is pretty obscure but if we install the clang-17 package on our drive units the nv_target_docker startup service never exits and puts a considerable load on the unit.

It doesn’t appear to make it out of the glob.glob(path, recursive=True) call so maybe it’s symbolic link related.

Dear @rich.frederiksen ,
Could you please share the repro steps?

I believe you can reproduce it if you take a base drive 7.0.3 install and:

  • apt install clang-17
  • reboot
  • run top as su (to see all processes) and you should see the nv_target_docker process running at 100% (usually it runs and exits in less than 1 sec during boot)

I notice issues when try to install on Thor with DOS 7.0.3. Did you just run apt install clang-17 just after flashing DOS 7.0.3 or performed any other steps?

nvidia@tegra-ubuntu:~$ sudo apt install clang-17
Reading package lists... Done
Building dependency tree... Done
Reading state information... Done
Some packages could not be installed. This may mean that you have
requested an impossible situation or if you are using the unstable
distribution that some required packages have not yet been created
or been moved out of Incoming.
The following information may help to resolve the situation:

The following packages have unmet dependencies:
 libobjc4 : Depends: gcc-14-base (= 14-20240412-0ubuntu1) but 14.2.0-4ubuntu2~24.04 is to be installed
E: Unable to correct problems, you have held broken packages.

Sorry, I left out a step that might be what’s causing your upstream issue. I did run the unminimize script on the system.

I tried sudo apt update && sudo apt install -y unminimize && sudo unminimize and still notice same issue. We have not tested installing clang on DRIVE. Did you check to build from source?

No I just did a straight install. Although I do think at some point I had trouble running the unminimize script (bad authentication) so I added an --allow-unauthenticated argument to the dpkg commands like below:

    # Reinstallation takes place in two steps because a single dpkg --verified
    # command generates very long parameter list for "xargs dpkg -S" and may go
    # over ARG_MAX. Since many packages have man pages the second download
    # handles a much smaller amount of packages.
    dpkg -S /usr/share/man/ |sed 's|, |\n|g;s|: [^:]*$||' | DEBIAN_FRONTEND=noninteractive xargs apt-get install --reinstall -y --allow-unauthenticated
    echo "Reinstalling packages with system documentation in /usr/share/doc/ .."
    # This step processes the packages which still have missing documentation
    dpkg --verify --verify-format rpm | awk '$2 ~ /\/usr\/share\/doc/ {print $2}' | sed 's|/[^/]*$||' | sort | uniq \
         | xargs dpkg -S | sed 's|, |\n|g;s|: [^:]*$||' | uniq | DEBIAN_FRONTEND=noninteractive xargs apt-get install --reinstall -y --allow-unauthenticated
    echo "Restoring system translations..."
    # This step processes the packages which still have missing translations
    dpkg --verify --verify-format rpm | awk '$2 ~ /\/usr\/share\/locale/ {print $2}' | sed 's|/[^/]*$||' | sort | uniq \
         | xargs dpkg -S | sed 's|, |\n|g;s|: [^:]*$||' | uniq | DEBIAN_FRONTEND=noninteractive xargs apt-get install --reinstall -y --allow-unauthenticated

I’m including the list of packages I’ve got installed when the problem happens installed-packages-broken.txt and when it’s fixed installed-packages-ok.txt. To go from OK → broken I do sudo apt install clang-17 and to go from broken → OK I do sudo apt purge clang-17 && sudo apt autoremove to remove everything installed with clang-17.

installed-packages-broken.txt (129.5 KB)
installed-packages-ok.txt (128.0 KB)

Could you please provide any update for this topic? Did you try installation from source?