Looking at the forum, it seems you need to implement yourself a custom op or probably easier, split the model in different parts (I would advise using netron to know where to split)
Converting onnx to trt: [8] No importer registered for op: OneHot - #3 by francesco.ciannella and
execute just a part of the model in tensorrt (easy to plug the whole thing together on Triton). To help you in splitting the model, I would recommend polygraphy tool.
Probably even easier, don’t know if possible for you, but you may want to use albert v2 from HF hub, it seems to work out of the box (without requiring any custom plugin).