Hi, I have a TF 2.4.1 module for Python 3.8, compiled from source. The trouble is that the first time I import tensorflow after opening the python3 CLI and give it something simple to calculate, it takes more than 10 minutes to return the result. After that, calculations run pretty fast. I'll show you in the screenshots:
Did you compile the package for the Nano GPU architecture (sm_53)?
If not, some GPU code has to be recompiled for the correct architecture at initialization, which would explain the long delay on the first run.
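
One way to check this from Python is to look at the build info that TensorFlow reports. The following is only a rough sketch: `covers_arch` is a hypothetical helper written for this post, and it assumes that `tf.sysconfig.get_build_info()` in your build exposes a `cuda_compute_capabilities` entry (it may be missing, in which case the list comes back empty).

```python
def covers_arch(build_capabilities, target="sm_53"):
    """Return True if `target` is among the capabilities the build lists.

    Hypothetical helper: entries are compared by their digits only, so
    "sm_53", "compute_53" and "5.3" are all treated as the same architecture.
    """
    def digits(cap):
        return cap.replace(".", "").split("_")[-1]

    return digits(target) in {digits(cap) for cap in build_capabilities}


if __name__ == "__main__":
    try:
        import tensorflow as tf
    except ImportError:
        print("tensorflow is not importable in this environment")
    else:
        info = tf.sysconfig.get_build_info()
        # Assumption: this key is present in the build; default to an empty list.
        caps = list(info.get("cuda_compute_capabilities", []))
        print("compute capabilities in this build:", caps)
        print("built for sm_53:", covers_arch(caps, "sm_53"))
```

If sm_53 is not in the list, rebuilding with the `TF_CUDA_COMPUTE_CAPABILITIES` environment variable set to `5.3` before running `./configure` should include it.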