Nsight VSCode: breakpoints ignored, Python exception in debug console

I just installed Nsight for VSCode and cannot get the debugger to stop at breakpoints (Ubuntu 20.04, most recent versions of everything: VSCode, all CUDA tools and libraries, etc.). I followed the configuration instructions “to the letter” using the matrixMul sample. The build works fine, and the application runs and passes, but my breakpoints are ignored: when I launch, the application just runs and completes “normally”. The debug console output follows:

NVIDIA (R) CUDA Debugger
10.1 release
Portions Copyright (C) 2007-2019 NVIDIA Corporation
GNU gdb (GDB) 7.12
Copyright (C) 2016 Free Software Foundation, Inc.
License GPLv3+: GNU GPL version 3 or later
This is free software: you are free to change and redistribute it.
There is NO WARRANTY, to the extent permitted by law. Type "show copying"
and "show warranty" for details.
This GDB was configured as "x86_64-pc-linux-gnu".
Type "show configuration" for configuration details.
For bug reporting instructions, please see:
Find the GDB manual and other documentation resources online at:
For help, type "help".
Type "apropos word" to search for commands related to "word".
Python Exception <type 'exceptions.NameError'> Installation error: gdb.execute_unwinders function is missing:
[Thread debugging using libthread_db enabled]
Using host libthread_db library "/lib/x86_64-linux-gnu/libthread_db.so.1".
7
Python Exception <type 'exceptions.NameError'> Installation error: gdb.execute_unwinders function is missing:
[New Thread 0x7fffdffff000 (LWP 189489)]
[New Thread 0x7fffdf7fe000 (LWP 189490)]
Python Exception <type 'exceptions.NameError'> Installation error: gdb.execute_unwinders function is missing:
[New Thread 0x7fffdef7c000 (LWP 189491)]
9
Python Exception <type 'exceptions.NameError'> Installation error: gdb.execute_unwinders function is missing:
[Matrix Multiply Using CUDA] - Starting...
GPU Device 0: "Ampere" with compute capability 8.6
MatrixA(320,320), MatrixB(640,320)
Computing result using CUDA Kernel...
done
Performance= 2308.11 GFlop/s, Time= 0.057 msec, Size= 131072000 Ops, WorkgroupSize= 1024 threads/block
Checking computed result for correctness: Result = PASS
NOTE: The CUDA Samples are not meant for performance measurements. Results may vary when GPU Boost is enabled.
[Thread 0x7fffdef7c000 (LWP 189491) exited]
[Thread 0x7fffdf7fe000 (LWP 189490) exited]
[Thread 0x7fffdffff000 (LWP 189489) exited]
[Inferior 1 (process 189478) exited normally]

Not sure if the Python errors are “normal” or if they are contributing to the problem.
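
One thing that stands out in the banner above: it reports a “10.1 release” with GDB 7.12, which does not look like the latest toolkit. Support for compute capability 8.6 arrived in CUDA 11.1, so an older cuda-gdb sitting earlier on PATH might explain the ignored breakpoints, and possibly the Python errors too, though that is a guess. Below is a minimal sketch for checking which cuda-gdb binaries are visible and what release each one reports. The helper name banner_release is made up for illustration, and the sketch assumes the banner carries a line of the form “<major>.<minor> release” as in the log above.

```shell
#!/bin/sh
# Hypothetical helper: read a cuda-gdb banner on stdin and print the first
# line that contains "<major>.<minor> release" (format assumed from the log).
banner_release() {
  grep -m1 -E '[0-9]+\.[0-9]+ release'
}

# Walk every cuda-gdb found on PATH, in lookup order, and show its release.
# If none is found, the loop body simply never runs.
for gdb in $(which -a cuda-gdb 2>/dev/null); do
  printf '%s: ' "$gdb"
  "$gdb" --version 2>/dev/null | banner_release || echo "release line not found"
done
```

If the first path listed reports 10.1 while a newer toolkit is installed elsewhere, the debugger being launched is probably not the one that matches the installed CUDA version.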

launch.json:

{
    // Use IntelliSense to learn about possible attributes.
    // Hover to view descriptions of existing attributes.
    // For more information, visit: Debugging in Visual Studio Code
    "version": "0.2.0",
    "configurations": [
        {
            "name": "CUDA C++: Launch",
            "type": "cuda-gdb",
            "request": "launch",
            "program": "${workspaceFolder}/matrixMul"
        }
    ]
}

tasks.json:

{
    "version": "2.0.0",
    "tasks": [
        {
            "label": "Cuda Build",
            "type": "shell",
            "command": "make dbg=1",
            "problemMatcher": ["$nvcc"],
            "group": {
                "kind": "build",
                "isDefault": true
            }
        }
    ]
}

Help!