following speech-to-text-training.ipynb
I want to do inference (i.e. !tao speech_to_text infer …)
with a list of wav files given in a file.
How does $path in
file_paths=[$path]
has to look like??
file_paths=["/data/an4_converted/wavs/an406-fcaw-b.wav","/data/an4_converted/wavs/cen8-fjlp-b.wav"]
works, but only if NO blank is after the Komma (strange).
I tried many things like
filelist=open(“an4_test.inlist”).read().replace("\n",",")
or
filelist=open(“an4_test.inlist”).read().split()
together with
file_paths=[filelist]
or
file_paths=filelist
What does it look like when you run above? Is it the same as ["/data/an4_converted/wavs/an406-fcaw-b.wav","/data/an4_converted/wavs/cen8-fjlp-b.wav"] ?
I still have another question concerning
a) training / fine-tuning
Could you give hints what is an appropriate value for max_epochs ??
b) evaluation:
What is a reasonable result for this an4 train/test? I got (so far)
{‘test_loss’: 86.24714660644531, ‘test_wer’: 0.8680465817451477}
is this ok?
Is it possible to output detailed results (per utterance, Insertions,Deletions,Substitutions) when doing
!tao speech_to_text evaluate …
many thanks!