I hope you are all doing well.
I have trained an SSD model on my own dataset for object detection.
I would like to know how to add voice output, so that when the model detects an object during inference, the speaker says that object's label.
This is the command I currently run in a bash shell for inference. Is there an argument I can add to it for this?
detectnet --model=models/dir/ssd-mobilenet.onnx --labels=models/dir/labels.txt --input-blob=input_0 --output-cvg=scores --output-bbox=boxes /dev/video0