not able to update Tesla P100 driver 384 to 418

rajaya · September 2, 2019, 4:05am

Sorry for the delay… Thanks for the steps but the users have said they can’t allow maintenance window this weekend for me to carry out these steps. I will have to wait till they give me the server to carry out these tasks. I really want to thank you for being extremely helpful with my issue. If not for your guidance I would never get the GUI back online. Once I get the chance to carry out the steps I will post the output but thanks a million for your timely help. Appreciate it very much.

rajaya · September 17, 2019, 9:30pm

I got a question… End users are saying Cuda is not installed. I thought I followed the steps
yum clean all
yum install cuda-drivers
reboot

but when I run ‘nvcc -V’ I get
bash: nvcc: command not found…

±----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+

Should I run the command from the nvidia site to install cuda?

https://developer.nvidia.com/cuda-downloads?target_os=Linux&target_arch=x86_64&target_distro=RHEL&target_version=7&target_type=rpmlocal

Please let me know. Thanks!

generix · September 17, 2019, 11:07pm

Do the first three steps but not the last (sudo yum -y install nvidia-driver-latest-dkms cuda), instead run
sudo yum install cuda-toolkit-10-1
otherwise you’ll kill your already installed driver.

rajaya · September 18, 2019, 1:03am

Thanks for your prompt reply. I will schedule time to do it and post the result. Appreciate it very much.

rajaya · September 18, 2019, 10:12pm

I have a little different question and want your opinion. The users want to plot images (3D acceleration graphics) but are not able to plot. The hardware rendering use to work before with MayaVi but not anymore with MayaVi 2 because the support for Matrox card has been deprecated by the Mesa 3D graphics library. The server vendor, Dell, connected me to an NVIDIA rep who said that P100 will do graphics but it’s for VDI. The end users think that hardware is the problem and changing or putting an appropriate graphics card should take care of the rendering issue. I want to know if putting an additional graphics card will fix the hardware rendering issue and if so then which one should I go with. The NVIDIA asked me to let them know which card is compatible so they can send me the pricing. Thanks!

generix · September 19, 2019, 8:11am

Putting in another graphics card won’t change the fact that you still need to set up virtualgl for your xrdp users to get hw accel. It just doesn’t work without it.

rajaya · September 21, 2019, 2:24am

So I ran the first 3 commands as you said from below:

$ wget http://developer.download.nvidia.com/compute/cuda/10.1/Prod/local_installers/cuda-repo-rhel7-10-1-local-10.1.243-418.87.00-1.0-1.x86_64.rpm
$ sudo rpm -i cuda-repo-rhel7-10-1-local-10.1.243-418.87.00-1.0-1.x86_64.rpm
$ sudo yum clean all
$ sudo yum -y install nvidia-driver-latest-dkms cuda (but not this one)

and then sudo yum install cuda-toolkit-10-1

±----------------------------------------------------------------------------+
| Processes: GPU Memory |
| GPU PID Type Process name Usage |
|=============================================================================|
| No running processes found |
±----------------------------------------------------------------------------+

Also, $ nvcc --version
bash: nvcc: command not found…
And
$ which nvcc
/usr/bin/which: no nvcc in (/usr/bin/anaconda2/bin:/usr/bin/anaconda2/condabin:/usr/lib64/qt-3.3/bin:/usr/local/bin:/usr/bin:/usr/local/sbin:/usr/sbin:/home/kiran/.local/bin:/home/kiran/bin)

Am I missing something?

rajaya · September 21, 2019, 2:47am

Also, $ sudo rpm -i cuda-repo-rhel7-10-1-local-10.1.243-418.87.00-1.0-1.x86_64.rpm gives
package cuda-repo-rhel7-10-1-local-10.1.243-418.87.00-1.0-1.x86_64 is already installed

rajaya · September 21, 2019, 3:37am

OK before I do the post installation steps suggested here: https://docs.nvidia.com/cuda/cuda-installation-guide-linux/index.html#post-installation-actions, I want to confirm with you if I should do this on PATH export as well - :/usr/local/cuda-10.1/NsightCompute-2019.1${PATH:+:${PATH}}? I don’t see NsightCompute-2019 in /usr/local/cuda-10.1

$ls /usr/local/cuda-10.1/
bin extras lib64 libnvvp nsightee_plugins nvvm samples src tools
doc include libnsight LICENSE nvml README share targets version.txt

Please let me know. Thanks!

generix · September 21, 2019, 7:17pm

NsightCompute is a separate download, so there’s no need to add that path.

rajaya · September 24, 2019, 2:40am

Thanks… That worked. Appreciate it.

rajaya · October 1, 2019, 8:18pm

I want to know if this is possible… Can the Tesla P100 be used for parallel computing and graphics simultaneously by setting another X server and installing VirtualGL?

generix · October 1, 2019, 8:41pm

Yes, with some limitations:
[url]USING CUDA AND X | NVIDIA

rajaya · October 7, 2019, 5:27pm

Thanks. The NVIDIA reps suggested to go ahead with another graphics card.

rajaya · October 9, 2019, 2:59pm

NVIDIA and Dell representatives recommend NVIDIA Quadro P4000 GPU card for graphics. I’m hoping this should be good to install Virtual GL and another X server.

rajaya · October 10, 2019, 2:27pm

Question… Once I get the card and install it, do you want me to follow the steps you mentioned in post # 40 or should the hardware rendering work with the new Quadro graphics card? Since there will be 2 graphical displays I want the Quadro configured to be used for XRDP and locally and disable Matrox if possible. Please let me know. Thanks!

generix · October 11, 2019, 9:01am

I would not recommend it for the following reason:
virtualgl is using the Xserver at :0 for rendering. This is the GDM screen. As soon as you log in locally, it will spawn a second Xserver :1 for the user session, resulting in a vt switch so the Xserver at :0 will be inaccessible for virtualgl. So as soon as you log in locally, the rendering from xrdp will stop working.

rajaya · October 11, 2019, 4:05pm

Thanks. The server is accessed over XRDP by users exclusively and I access it locally during a maintenance window only after I reboot the server to make sure it is online.
I had posted previously the issue that Mayavi application after upgrade to version 2 does not support hardware rendering because of the native Matrox card being deprecated. So, we decided to test by installing Virtual GL on a laptop with Intel graphics card to see how to configure hardware rendering over XRDP and be prepared whenever we get the NVIDIA Quadro card.
After we installed Virtual GL on the laptop and tried to run mayavi2 issuing ‘vglrun mayavi2’ command we got segmentation fault core dump error (over XRDP). When we issue mayavi2 command we get the same update OpenGL driver error. We even tried uninstalling Virtual GL and get the same error when trying to launch Mayavi2. glxinfo shows Mayavi still uses llvm instead of the native Intel card for rendering. Locally Mayavi works fine but it doesn’t work over XRDP on the Intel graphics card laptop. What/where am I missing the point?

rajaya · October 13, 2019, 1:23pm

Is it not possible to achieve hardware rendering over XRDP? Do I have to use another way to get this application to work? Please let me know your suggestion/opinion.

generix · October 14, 2019, 8:34pm

For mayavi2 to work with virtualgl and the nvidia driver, min. virtualgl 2.6 is required: [url]https://github.com/VirtualGL/virtualgl/releases[/url]
Though I doubt that was the problem on your test install.

An Xserver has to be running on the notebook, the normal login suffices.
The user connecting over xrdp has to have access to it:
When connected over xrdp, what’s the output of
DISPLAY=:0 glxinfo
Does glxgears run?
vglrun glxgears

Topic		Replies	Views
Can I do remote direct rendering with Tesla P4 on CentOS 7? Linux	11	3550	March 14, 2018
X Server does not start with 9x Tesla P100 + 1x Quadro - Ubuntu 16.04 & 18.04 driver 375, 390, 396 Linux	4	1722	October 14, 2021
Tesla P4 PCI pass through from RHOSP 13 to RHEL 7.6/Windows VMs issues Linux	16	1689	October 12, 2021
Installation of CUDA / on RHEL 6 with TurboVNC and VirtualGL CUDA Setup and Installation	3	5963	May 12, 2015
Several issues with 396.24 under CentOS 7 Linux	29	5029	August 28, 2018
OpenGL, NVIDIA and Ubuntu 14.04 issues Linux	28	17477	September 22, 2017
CentOS 7 headless with nVidia drivers installed, OpenGL not using nVidia drivers, only llvmpipe Linux opengl , linux	44	5351	May 10, 2022
Tesla C870 and Linux RHEL 4.5 CUDA Programming and Performance	13	28895	February 28, 2008
2 Tesla C1060s with a legacy GeForce FX 5200 card Need help editing the xorg.conf file for multiple CUDA Programming and Performance	28	35576	January 29, 2009
Ubuntu 19.04 Driver Installed but not Used Linux	102	16282	October 12, 2021

not able to update Tesla P100 driver 384 to 418

Related topics