Hardware Platform: GPU (NVIDIA RTX A6000 48GB)
DeepStream Version: 9.0
JetPack Version: N/A (not Jetson)
TensorRT Version: N/A
NVIDIA GPU Driver Version: 580.126.09
Issue Type: Questions
How to reproduce the issue:
I am using AutoMagicCalib 2.0.0 (nvcr.io/nvidia/auto-magic-calib:2.0.0) to calibrate two store surveillance cameras. My store layout is a long narrow space (A->B->C), with Camera 0 mounted at position A and Camera 1 at position B, both pointing toward C. The cameras have limited spatial overlap (only the B->C zone).
Calibration consistently fails with the following error in multi_view_calib_*.log:
INFO - tracklet_matches: [((0, 55), (1, 162)), ((0, 10), (1, 63))]
ERROR - Number of matching tracklet candidates is less than 3
Only 2 tracklet pairs are matched across cameras. The RANSAC step (min_samples: 3) requires at least 3 matches — a geometric minimum for R,t estimation — so calibration cannot proceed even after I lowered min_matched_tracklets to 2 in mv_amc_config.yaml.
I also enabled enable_manual_adjustment: true and provided 4+ manual alignment point sets in Step 4, but calibration still fails because the base tracklet matching is insufficient.
Single-view outputs are produced (per-camera camInfo_hyper_00.yaml and camInfo_hyper_00_opencv.yaml), but multi-view relative pose is not computed.
Background context:
My ultimate goal is to adapt the official deepstream-tracker-3d-multi-view 4-camera sample to my own 2-camera setup. I am trying to produce the equivalent input files for a 2-camera configuration. Looking at the sample dataset structure, MV3DT requires:
-
camInfo/— per-camera YAML files containingprojectionMatrix_3x4_w2p -
transforms.yml— coordinate transformation definitions -
videos/— input video files
I chose AutoMagicCalib as the calibration tool, but I am not sure if it is the right tool for this purpose, or if there is a better recommended workflow.
My questions:
-
Is there a recommended workflow for cameras with very limited overlap (both cameras facing the same direction in a narrow space), where automatic tracklet matching cannot find ≥3 candidates?
-
Can the manually provided alignment points (Step 4) substitute for tracklet matching to bootstrap the R,t estimation, even when tracklet matching fails entirely?
-
My downstream goal is MV3DT. The single-view
camInfo_hyper_00.yamloutputs contain per-camera intrinsics and extrinsics. Are these outputs compatible with MV3DT’s requiredprojectionMatrix_3x4_w2pformat? If so, what additional steps are needed to generate the multi-view relative pose? -
How is
transforms.ymlgenerated for MV3DT? Is it an output of AutoMagicCalib, or must it be written manually from the calibration results? -
Is AutoMagicCalib the intended/recommended tool for generating MV3DT-compatible
camInfofiles, or is there another calibration approach (e.g., OpenCV checkerboard, manual matrix construction) that is more reliable for this use case?