Training data preparation for 3D object detection networks

I’ve been trying out two different networks in the TAO Toolkit: PointPillars and CenterPose.
PointPillars takes point cloud data and KITTI-formatted annotations as inputs, while CenterPose takes a 2D image plus a .json file containing the necessary training information; the intrinsic matrix of the camera is also required.

I’m thinking about the possibility of training both networks for 3D virtual fences, in which people or certain other objects such as cars need to be annotated.

  1. I’ve downloaded some open datasets for 3D object detection besides the KITTI dataset. If I want to add them to the training set, conversion is inevitable. What needs to be taken care of when doing this?

  2. It seems that the only available dataset is Objectron, which contains only 8 classes. Is there an annotation tool for creating my own dataset for training CenterPose?


You can take a look at Data Annotation Format - NVIDIA Docs.

This is similar to the question in Video 3D Bounding Box Annotation tool for Objectron · Issue #61 · google-research-datasets/Objectron · GitHub. You can ask there again, and also search the web. Maybe you can take a look at GitHub - walzimmer/3d-bat: 3D Bounding Box Annotation Tool (3D-BAT) Point cloud and Image Labeling.

Thanks as always.

As for the first question, I do know how the training data needs to be laid out when training CenterPose. It’s just that I have no idea how to produce the .json file containing all the required information about the corresponding object.
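In case it helps others, here is a minimal sketch of what such a per-image .json might look like. The field names below (camera_data, objects, projected_cuboid, etc.) are assumptions modeled on the Objectron/DOPE style of annotation, not the authoritative TAO CenterPose schema, so please verify them against the TAO docs:

```python
import json

# Hypothetical per-image annotation for CenterPose training.
# All field names are assumptions in the Objectron/DOPE style;
# check the TAO CenterPose documentation for the real schema.
annotation = {
    "camera_data": {
        # intrinsics of the capturing camera
        "intrinsics": {"fx": 600.0, "fy": 600.0, "cx": 320.0, "cy": 240.0},
        "width": 640,
        "height": 480,
    },
    "objects": [
        {
            "class": "chair",
            # 3D pose in the camera frame
            "location": [0.1, -0.2, 1.5],
            "quaternion_xyzw": [0.0, 0.0, 0.0, 1.0],
            # object dimensions in meters
            "scale": [0.5, 0.9, 0.5],
            # 2D projections of the 8 cuboid corners plus the center
            "projected_cuboid": [[310, 200], [330, 200], [330, 260],
                                 [310, 260], [305, 195], [335, 195],
                                 [335, 265], [305, 265], [320, 230]],
        }
    ],
}

with open("000001.json", "w") as f:
    json.dump(annotation, f, indent=2)
```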

On the other hand, I’ve also been working with other available point cloud data and have converted some of it into .bin files, but I guess I still need to work on producing the corresponding .txt files containing the KITTI-formatted info.
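For reference, here is a minimal sketch of such a conversion, assuming the source clouds are .pcd files readable by open3d. Since many sources carry no intensity channel, this zero-fills it:

```python
import numpy as np
import open3d as o3d

def pcd_to_kitti_bin(pcd_path: str, bin_path: str) -> None:
    """Convert a .pcd point cloud to a KITTI-style .bin file.

    KITTI .bin files store an Nx4 float32 array: x, y, z, intensity.
    If the source cloud has no intensity channel, we zero-fill it
    (an assumption -- substitute real intensities if you have them).
    """
    pcd = o3d.io.read_point_cloud(pcd_path)
    xyz = np.asarray(pcd.points, dtype=np.float32)         # (N, 3)
    intensity = np.zeros((xyz.shape[0], 1), dtype=np.float32)
    np.hstack([xyz, intensity]).tofile(bin_path)

pcd_to_kitti_bin("scan.pcd", "000000.bin")
```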

For instance:
Mask 0 0 0.0 156 279 451 590 0 0 0 0 0 0 0

The line above is an example annotation of a mask containing ONLY 2D bbox info; all the values other than the class name and the 2D bbox are set to 0.
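For context, a standard KITTI label line has 15 whitespace-separated fields: the class name, truncation, occlusion, alpha, the 2D bbox (left, top, right, bottom in pixels), the 3D dimensions (height, width, length in meters), the 3D location (x, y, z in camera coordinates), and rotation_y. A small helper that names them:

```python
# Standard KITTI label fields, in order (15 values per line):
KITTI_FIELDS = [
    "type",                      # object class, e.g. 'Car', 'Pedestrian'
    "truncated", "occluded", "alpha",
    "bbox_left", "bbox_top", "bbox_right", "bbox_bottom",  # 2D box (px)
    "dim_height", "dim_width", "dim_length",               # 3D size (m)
    "loc_x", "loc_y", "loc_z",   # 3D center in camera coords (m)
    "rotation_y",                # yaw around the camera Y axis (rad)
]

def parse_kitti_label(line: str) -> dict:
    """Parse one KITTI label line into a field -> value dict."""
    parts = line.split()
    return {k: (parts[i] if i == 0 else float(parts[i]))
            for i, k in enumerate(KITTI_FIELDS)}

print(parse_kitti_label("Mask 0 0 0.0 156 279 451 590 0 0 0 0 0 0 0"))
```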

I’d like to know a few things regarding training PointPillarNets:

  1. Is the 2D bbox info, i.e., the 4 non-zero values shown above, unnecessary?
  2. I know that the last 7 zeroes indicate the 3D object info, but are they all necessary when training PointPillarNets, or do I just need some of them?

I’ll look into 3D-BAT. Thanks.

In TAO CenterPose, tao_tutorials/notebooks/tao_launcher_starter_kit/centerpose/centerpose.ipynb at main · NVIDIA/tao_tutorials · GitHub, it uses the Objectron dataset.
For annotation, maybe you can refer to Annotation tool and Synthetic dataset Generation · Issue #6 · google-research-datasets/Objectron · GitHub.

They are needed. You can refer to PointPillars - NVIDIA Docs.

It seems that the Objectron annotation tool isn’t open-sourced, so currently we can’t really train CenterPose on a custom dataset as we have no way to produce the annotations.

I later managed to convert another point cloud dataset into KITTI format, and it looks like the following:

obj_type 0 0 0 0 0 0 0 h w l x y z yaw

I managed to draw the bboxes on the corresponding point cloud data using open3d, and the bboxes are correct.
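For anyone wanting to sanity-check labels the same way, here is a minimal sketch using open3d’s OrientedBoundingBox. It assumes the box parameters (h, w, l, x, y, z, yaw) are already expressed in the same frame as the points; raw KITTI labels are in the camera frame and need the calibration transform first:

```python
import numpy as np
import open3d as o3d

def draw_boxes_on_cloud(bin_path: str, labels: list) -> None:
    """Visualize 3D boxes (h, w, l, x, y, z, yaw) on a KITTI-style .bin.

    Assumes the box parameters are already in the same (LiDAR) frame
    as the points. Raw KITTI labels are in the camera frame and must
    be transformed with the calibration matrices first.
    """
    pts = np.fromfile(bin_path, dtype=np.float32).reshape(-1, 4)[:, :3]
    cloud = o3d.geometry.PointCloud(o3d.utility.Vector3dVector(pts))

    geoms = [cloud]
    for h, w, l, x, y, z, yaw in labels:
        R = o3d.geometry.get_rotation_matrix_from_axis_angle(
            np.array([0.0, 0.0, yaw]))           # yaw about the up axis
        box = o3d.geometry.OrientedBoundingBox(
            center=np.array([x, y, z]),
            R=R,
            extent=np.array([l, w, h]))          # local x/y/z extents
        box.color = (1.0, 0.0, 0.0)
        geoms.append(box)
    o3d.visualization.draw_geometries(geoms)

draw_boxes_on_cloud("000000.bin", [(1.5, 1.6, 3.9, 5.0, 2.0, -0.8, 0.3)])
```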

I downloaded some open synthetic data, PreSIL, which contains both point clouds and annotations. However, the intensity value is 0 for ALL points according to their paper, and I verified this using open3d. I wonder if that makes a difference, since adding the PreSIL data to the existing KITTI dataset in the training set didn’t improve the results as expected…
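For reference, the all-zero intensity channel can be verified in a couple of lines of numpy (the file name here is a placeholder):

```python
import numpy as np

# Check whether the intensity channel (4th column) of a KITTI-style
# .bin point cloud is all zeros, as reported for PreSIL.
pts = np.fromfile("presil_000000.bin", dtype=np.float32).reshape(-1, 4)
print("intensity min/max:", pts[:, 3].min(), pts[:, 3].max())
print("all zero:", not np.any(pts[:, 3]))
```

If the network consumes intensity as an input feature, mixing all-zero intensities with real KITTI intensities could plausibly introduce a domain gap between the two parts of the training set.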

There has been no update from you for a while, so we assume this is no longer an issue and are closing this topic. If you need further support, please open a new one.
Thanks

You may also take a look at the nuScenes dataset:
nuScenes Dataset | Papers With Code,
nuscenes_tutorial,
https://www.nuscenes.org/nuscenes#data-annotation.
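As a sketch of reading nuScenes annotations programmatically, assuming the nuscenes-devkit package is installed and the v1.0-mini split has been downloaded to a local folder:

```python
from nuscenes.nuscenes import NuScenes

# Load the mini split (assumes nuscenes-devkit is installed and the
# v1.0-mini data lives under ./data/nuscenes).
nusc = NuScenes(version='v1.0-mini', dataroot='./data/nuscenes')

# Walk the annotations of the first sample: each one carries a 3D box
# (center, size, rotation quaternion) plus a category name.
sample = nusc.sample[0]
for token in sample['anns']:
    ann = nusc.get('sample_annotation', token)
    print(ann['category_name'], ann['translation'],
          ann['size'], ann['rotation'])
```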

Additionally, you can use Omniverse Isaac Sim to create a synthetic dataset to train the model.

Please see the following documentation: 10.10. Object Detection Synthetic Data Generation — Omniverse IsaacSim latest documentation

The user might need to collect their USD assets for their custom dataset and create the synthetic dataset in Omniverse. The tool will output 3D labels, which can be used to train the CenterPose model.
