Hi,
Thanks for any help in advance.
I want to be able to install and use the Apex package (https://github.com/NVIDIA/apex) in order to use this (https://github.com/kaushaltrivedi/fast-bert) BERT implementation.
However, I’ve been running into many problems with this on various AMI’s and I’d really appreciate help with getting it installed.
My systems team have now set me up with an NVIDIA 19.05 AMI (running Ubuntu 18.04.3 LTS), but I have run into a problem where I don’t seem to be able to access cuda-10.1’s functionality.
Running nvidia-smi, I can see that the driver is there. When I check /usr/local, however, the cuda folder is not there.
I tried installing the cuda driver as per the official documentation. Several problems arose.
- First, though the cuda folder appeared in /usr/local, I wasn't able to access it and nvcc --version gave me nothing (other than a message telling me that the cuda toolkit could be downloaded via sudo apt-get install).
- Second, this totally messed up the NVIDIA driver and nvidia-smi no longer worked. I then had to completely relaunch the instance to return it to its original state.
I also seemed to run into trouble getting awscli installed so I can access data in an S3 bucket. Any ideas why that might be the case?
Again, thanks for any help in advance.
Darren