As stated in this link
True: 3d bounding box data (NPY) are output.
Note, ground truth visuals are not supported."
My question is, why? Is there any specific reason for that? I see other users having problems with regenerating the 3D BBoxes too. Like this one: Insight into world to camera transform for 3D bounding box