Hi.
Trying to run multi-threading application (same process, multi-threads using same CUDA -resources) using CUDA 10.1.
Nothing special set unsing “nvidia-smi”
In that, started “nvidia-cuda-mps-control -f” to see possible errors.
Reason to use MPS-server is to avoid resource-conflicts causing “flickering” in images, when simultaneous threads are using CUDA.
But I don’t have luck on starting MPS-server as control daemon reports this behavior of “virtual memory” as shown below,.
Any hints about this ?
-niilo
[2019-06-10 16:54:21.601 Control 22509] Accepting connection…
[2019-06-10 16:54:21.601 Control 22509] User did not send valid credentials
[2019-06-10 16:54:21.601 Control 22509] Accepting connection…
[2019-06-10 16:54:21.601 Control 22509] NEW CLIENT 22635 from user 0: Server is not ready, push client to pending list
[2019-06-10 16:54:21.601 Control 22509] Starting new server 22721 for user 0
[2019-06-10 16:54:21.623 Other 22721] Start
[2019-06-10 16:54:21.825 Control 22509] Accepting connection…
[2019-06-10 16:54:21.825 Control 22509] NEW SERVER 22721: Ready
[2019-06-10 16:54:21.825 Other 22721] MPS Server: Received new client request
[2019-06-10 16:54:21.825 Other 22721] MPS Server: worker created
[2019-06-10 16:54:21.958 Control 22509] Accepting connection…
[2019-06-10 16:54:21.958 Control 22509] User did not send valid credentials
[2019-06-10 16:54:21.958 Control 22509] Accepting connection…
[2019-06-10 16:54:21.958 Control 22509] NEW CLIENT 22635 from user 0: Server already exists
[2019-06-10 16:54:21.958 Other 22721] MPS Server: Received new client request
[2019-06-10 16:54:21.958 Other 22721] MPS Server: worker created
[2019-06-10 16:54:21.959 Client 22635] MPS client failed to reserve virtual memory range at address (nil)
Systems :
uname -r
4.15.0-51-lowlatency
lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description: Ubuntu 16.04.6 LTS
Release: 16.04
Codename: xenial
model name : Intel(R) Xeon(R) E-2176G CPU @ 3.70GHz