The issue is in /usr/local/lib/python3.8/dist-packages/nemo2riva/patches/mtencdec.py where change_tokenizer_names is missing **kwargs, i.e. it shout be
The mentioned fix allows me to safely deploy en_de_24x6.nemo, but I have problems deploying a custom nmt model built with nemo:23.03.
Riva does not start up, it lists the following error:
UNAVAILABLE: Invalid argument: model 'nmt-classifier', tensor 'log_probs': the model expects 3 dimensions (shape [-1,-1,64000]) but the model configuration specifies 3 dimensions (an initial batch dimension because max_batch_size > 0 followed by the explicit tensor shape, making complete shape [-1,-1,32000])
I’ve tracked the issue down to line 96 in /usr/local/lib/python3.8/dist-packages/servicemaker/triton/nmt.py, which has a hardcoded value [-1,32000] for the classifier output.
However, I can not find why my custom built model has a value of 64000 (if I modify the classifier .pbtxt by hand the model loads and works as it should). Is this the size of the target language tokenizer (I use 64000 tokens).