TensorFlow not working after update

dezotti.alexandre · July 29, 2020, 4:29pm

Not sure where to post this issue. I have recently upgraded my system and the following packages changed versions:

CUDA 10.2 → 11.0
cuDNN 7.6 → 8.0
TensorFlow-CUDA 2.2 → 2.3
Before that everything seemed to work as it should (I have been using TensorFlow with no apparent issue). Now TensorFlow doesn’t seem to work anymore. For example the code:

import tensorflow as tf
m = tf.keras.Sequential()

Will produce the following error:

Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
  File "/usr/lib/python3.8/site-packages/tensorflow/python/training/tracking/base.py", line 457, in _method_wrapper
    result = method(self, *args, **kwargs)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/keras/engine/sequential.py", line 116, in __init__
    super(functional.Functional, self).__init__(  # pylint: disable=bad-super-call
  File "/usr/lib/python3.8/site-packages/tensorflow/python/training/tracking/base.py", line 457, in _method_wrapper
    result = method(self, *args, **kwargs)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/keras/engine/training.py", line 308, in __init__
    self._init_batch_counters()
  File "/usr/lib/python3.8/site-packages/tensorflow/python/training/tracking/base.py", line 457, in _method_wrapper
    result = method(self, *args, **kwargs)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/keras/engine/training.py", line 317, in _init_batch_counters
    self._train_counter = variables.Variable(0, dtype='int64', aggregation=agg)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/variables.py", line 262, in __call__
    return cls._variable_v2_call(*args, **kwargs)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/variables.py", line 244, in _variable_v2_call
    return previous_getter(
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/variables.py", line 237, in <lambda>
    previous_getter = lambda **kws: default_variable_creator_v2(None, **kws)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/variable_scope.py", line 2633, in default_variable_creator_v2
    return resource_variable_ops.ResourceVariable(
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/variables.py", line 264, in __call__
    return super(VariableMetaclass, cls).__call__(*args, **kwargs)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/resource_variable_ops.py", line 1507, in __init__
    self._init_from_args(
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/resource_variable_ops.py", line 1661, in _init_from_args
    handle = eager_safe_variable_handle(
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/resource_variable_ops.py", line 242, in eager_safe_variable_handle
    return _variable_handle_from_shape_and_dtype(
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/resource_variable_ops.py", line 174, in _variable_handle_from_shape_and_dtype
    gen_logging_ops._assert(  # pylint: disable=protected-access
  File "/usr/lib/python3.8/site-packages/tensorflow/python/ops/gen_logging_ops.py", line 49, in _assert
    _ops.raise_from_not_ok_status(e, name)
  File "/usr/lib/python3.8/site-packages/tensorflow/python/framework/ops.py", line 6843, in raise_from_not_ok_status
    six.raise_from(core._status_to_exception(e.code, message), None)
  File "<string>", line 3, in raise_from
tensorflow.python.framework.errors_impl.InvalidArgumentError: assertion failed: [0] [Op:Assert] name: EagerVariableNameReuse

The above code works fine when the GPU is deactivated. Not sure what to do now. Is this a bug with CUDA or cuDNN or the Nvidia driver? What should I do to solve the issue? Should I submit a bug report to Nvidia?

scarcia7 · August 20, 2020, 9:42pm

I I am stuck with the same issue from days. People on github say that it’s generated when executing different processes that use python/tensorflow, but checking the nvidia-smi I get no other processes running. Did you find a solution for the problem?

dezotti.alexandre · August 20, 2020, 9:58pm

None beside downgrading the packages. This might be a bug with either tensorflow, cuda or cudnn. There’s an open issue at: tf.keras.Sequential() fails · Issue #41855 · tensorflow/tensorflow · GitHub. The issue I have has nothing to do with other processes using the GPU (see discussion in the link on how to check that).

Topic		Replies	Views
Python crashes after cudnn update cuDNN	13	4301	May 17, 2022
Downgrading CUDA 10.1 to 10 in windows because TF2.0.0 doesnt work with 10.1 CUDA Setup and Installation	9	22614	August 18, 2022
Rollback from tensorflow 1.13.1’ to 1.11.0’ CUDA Setup and Installation	2	522	June 27, 2019
Update to tensorflow? Frameworks (archived) tensorflow	2	466	June 27, 2019
Use gpu for tensorflow, crashes CUDA Setup and Installation tensorflow	14	6819	March 7, 2024
cuDNN/CUDA/TensorFlow setup prroblem CUDA Setup and Installation	2	1168	March 17, 2020
CUDA, cudNN, GPU and TENSORFLOW VERSİONS CUDA Setup and Installation	4	4641	February 10, 2023
Fail to load tensorflow Jetson Nano tensorflow	2	436	October 18, 2021
Read tensorflow version failed Frameworks (archived) tensorflow	1	740	July 3, 2020
Tensorflow is not upgrading Frameworks (archived) tensorflow	4	957	July 26, 2019

TensorFlow not working after update

Related topics