TLT3 NHWC error on 940MX, error while training on TLT3

• Hardware (940MX)
• Network Type (Detectnet_v2/Yolo_4/any)
Configuration of the TLT Instance
docker:
nvidia/tlt-streamanalytics:
docker_registry: nvcr.io
docker_tag: v3.0-py3
tasks:

  1. augment
  2. bpnet
  3. classification
  4. detectnet_v2
  5. dssd
  6. emotionnet
  7. faster_rcnn
  8. fpenet
  9. gazenet
  10. gesturenet
  11. heartratenet
  12. lprnet
  13. mask_rcnn
  14. multitask_classification
  15. retinanet
  16. ssd
  17. unet
  18. yolo_v3
  19. yolo_v4
  20. tlt-converter
    nvidia/tlt-pytorch:
    docker_registry: nvcr.io
    docker_tag: v3.0-py3
    tasks:
  21. speech_to_text
  22. speech_to_text_citrinet
  23. text_classification
  24. question_answering
  25. token_classification
  26. intent_slot_classification
  27. punctuation_and_capitalization
    format_version: 1.0
    tlt_version: 3.0
    published_date: 04/16/2021
    docker_tag: nvcr.io/nvidia/tlt-streamanalytics:v3.0-py3

Error Log:
Epoch 1/80
Traceback (most recent call last):
File “/opt/tlt/.cache/dazel/_dazel_tlt/2b81a5aac84a1d3b7a324f2a7a6f400b/execroot/ai_infra/bazel-out/k8-fastbuild/bin/magnet/packages/iva/build_wheel.runfiles/ai_infra/iva/yolo_v4/scripts/train.py”, line 221, in
File “/opt/tlt/.cache/dazel/_dazel_tlt/2b81a5aac84a1d3b7a324f2a7a6f400b/execroot/ai_infra/bazel-out/k8-fastbuild/bin/magnet/packages/iva/build_wheel.runfiles/ai_infra/iva/common/utils.py”, line 494, in return_func
File “/opt/tlt/.cache/dazel/_dazel_tlt/2b81a5aac84a1d3b7a324f2a7a6f400b/execroot/ai_infra/bazel-out/k8-fastbuild/bin/magnet/packages/iva/build_wheel.runfiles/ai_infra/iva/common/utils.py”, line 482, in return_func
File “/opt/tlt/.cache/dazel/_dazel_tlt/2b81a5aac84a1d3b7a324f2a7a6f400b/execroot/ai_infra/bazel-out/k8-fastbuild/bin/magnet/packages/iva/build_wheel.runfiles/ai_infra/iva/yolo_v4/scripts/train.py”, line 217, in main
File “/opt/tlt/.cache/dazel/_dazel_tlt/2b81a5aac84a1d3b7a324f2a7a6f400b/execroot/ai_infra/bazel-out/k8-fastbuild/bin/magnet/packages/iva/build_wheel.runfiles/ai_infra/iva/yolo_v4/scripts/train.py”, line 173, in run_experiment
File “/usr/local/lib/python3.6/dist-packages/keras/legacy/interfaces.py”, line 91, in wrapper
return func(*args, **kwargs)
File “/usr/local/lib/python3.6/dist-packages/keras/engine/training.py”, line 1418, in fit_generator
initial_epoch=initial_epoch)
File “/usr/local/lib/python3.6/dist-packages/keras/engine/training_generator.py”, line 217, in fit_generator
class_weight=class_weight)
File “/usr/local/lib/python3.6/dist-packages/keras/engine/training.py”, line 1217, in train_on_batch
outputs = self.train_function(ins)
File “/usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py”, line 2715, in call
return self._call(inputs)
File “/usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py”, line 2671, in _call
session)
File “/usr/local/lib/python3.6/dist-packages/keras/backend/tensorflow_backend.py”, line 2623, in _make_callable
callable_fn = session._make_callable_from_options(callable_opts)
File “/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/client/session.py”, line 1505, in _make_callable_from_options
return BaseSession._Callable(self, callable_options)
File “/usr/local/lib/python3.6/dist-packages/tensorflow_core/python/client/session.py”, line 1460, in init
session._session, options_ptr)
tensorflow.python.framework.errors_impl.InvalidArgumentError: Default MaxPoolingOp only supports NHWC on device type CPU
[[{{node yolo_spp_pool_1_1/MaxPool}}]]
2021-07-08 08:06:03,035 [INFO] tlt.components.docker_handler.docker_handler: Stopping container.

940MX has compute capability of 5.0.
The same issue as TLT Detectnet TrafficCamNet training not working - #9 by Morganh
Training on Custom Dataset using TLT
Error with tlt train in official Jupyter notebook TLT 3.0 - #5 by Morganh
Training Peoplent on custom data - #20 by abhigoku10

This topic was automatically closed 60 days after the last reply. New replies are no longer allowed.