Triton infererence server example 'simple_grpc_infer_client.py'

h9945394143 · March 3, 2022, 5:03am

im running through docker container tritonserver.21.01 py3 sdk

could some one tell me the parameters to be passed to run simple_grpc_infer_client.py

also could you let know the best sample usecase python code for inferencing Video

kayccc · March 15, 2022, 1:44am

Please provide complete information as applicable to your setup.

• Hardware Platform (Jetson / GPU)
• DeepStream Version
• JetPack Version (valid for Jetson only)
• TensorRT Version
• NVIDIA GPU Driver Version (valid for GPU only)
• Issue Type( questions, new requirements, bugs)
• How to reproduce the issue ? (This is for bugs. Including which sample app is using, the configuration files content, the command line used and other details for reproducing)
• Requirement details( This is for new requirement. Including the module name-for which plugin or for which sample application, the function description)

Fiona.Chen · March 18, 2022, 2:10am

Where did you get tritonserver:21.01-py3-sdk container. We only have nvcr.io/nvidia/tritonserver:21.02-py3-sdk

h9945394143 · March 21, 2022, 7:55am

sorry its 21.02 only . i was able to find sample examples , Could you help me out with the path of triton server logs .
when-ever i change/unload/relead the models , where can i see the complete logs

fanzh · March 22, 2022, 3:50am

As you know, triton is client server architecture, client sends command to server, server does inferrence.

1 triton sdk does not include inference server, it dose not have triton server logs, please refer to triton docker introdcution Triton Inference Server | NVIDIA NGC

2 client need to send messge to server if need infomation, you can call API triton_client.get_inference_statistics to get module infomation, please refer to demo simple_grpc_infer_client.py, and
here is all API introdcution: GitHub - triton-inference-server/client: Triton Python, C++ and Java client libraries, and GRPC-generated client examples for go, java and scala.

h9945394143 · March 22, 2022, 5:27am

i have a deepstream 6.0 triton docker . and i load the models and start the triton server through

tritonserver --model-repository=model-folder-path

and the model get loaded , and can see the logs whenever they is an change in the model as below ,

I0322 05:41:04.478350 73 server.cc:586] 
+----------------------------------+---------+--------+
| Model                            | Version | Status |
+----------------------------------+---------+--------+
| Fire_model                       | 1       | READY  |
| Fire_onnx_model                  | 1       | READY  |
| Helmet_model                     | 1       | READY  |
| IndianVehicle_model              | 1       | READY  |
| PPEKit_ONNX                      | 1       | READY  |
| PPEKit_model                     | 1       | READY  |
| Primary_Detector                 | 1       | READY  |
| Secondary_CarColor               | 1       | READY  |
| Secondary_CarMake                | 1       | READY  |
| Secondary_VehicleTypes           | 1       | READY  |
| Segmentation_Industrial          | 1       | READY  |
| Segmentation_Semantic            | 1       | READY  |
| TripleRiding_model               | 1       | READY  |
| densenet_onnx                    | 1       | READY  |
| inception_graphdef               | 1       | READY  |
| mobilenet_v1                     | 1       | READY  |
| ssd_inception_v2_coco_2018_01_28 | 1       | READY  |
| ssd_mobilenet_v1_coco_2018_01_28 | 1       | READY  |
+----------------------------------+---------+--------+

I0322 05:41:04.478471 73 tritonserver.cc:1718] 
+----------------------------------+--------------------------------------------------------------------------------------------------------------------------------------+
| Option                           | Value                                                                                                                                |
+----------------------------------+--------------------------------------------------------------------------------------------------------------------------------------+
| server_id                        | triton                                                                                                                               |
| server_version                   | 2.13.0                                                                                                                               |
| server_extensions                | classification sequence model_repository model_repository(unload_dependents) schedule_policy model_configuration system_shared_memor |
|                                  | y cuda_shared_memory binary_tensor_data statistics                                                                                   |
| model_repository_path[0]         | /opt/nvidia/deepstream/deepstream-6.0/samples/triton_model_repo/                                                                     |
| model_control_mode               | MODE_POLL                                                                                                                            |
| strict_model_config              | 1                                                                                                                                    |
| pinned_memory_pool_byte_size     | 268435456                                                                                                                            |
| cuda_memory_pool_byte_size{0}    | 67108864                                                                                                                             |
| min_supported_compute_capability | 6.0                                                                                                                                  |
| strict_readiness                 | 1                                                                                                                                    |
| exit_timeout                     | 30                                                                                                                                   |
+----------------------------------+--------------------------------------------------------------------------------------------------------------------------------------+

I0322 05:41:04.820839 73 grpc_server.cc:4111] Started GRPCInferenceService at 0.0.0.0:8001
I0322 05:41:04.832203 73 http_server.cc:2803] Started HTTPService at 0.0.0.0:8000
I0322 05:41:05.005487 73 http_server.cc:162] Started Metrics Service at 0.0.0.0:8002

if i close the terminal , it would be running at the backend , now how it see the logs without restarting tritonserver --model-repository=model-folder-path again

fanzh · March 22, 2022, 6:28am

1 can you see the printing above if docker attach the contianer again?
2 if can’t, what is your full docker start command?

h9945394143 · March 22, 2022, 6:32am

when i try to re-attach , i dont have control over tritonserver logs , but the models would be up and running at the backend .

i want to know if any new changes are done , wheather i can see logs . Such as 'model loaded successfully; or ’ failed to load model ’ or 'particular errors ’

fanzh · March 22, 2022, 8:14am

1 about " have control over tritonserver logs", do you mean the logs dose not update ?
2 you can use API triton_client.is_model_ready to get model status, please refer to simple_grpc_model_control.py。

h9945394143 · March 22, 2022, 8:57am

how do i view the logs …?? Is there a logfile to view it ?

Also can i get the complete command of ‘triton_client.is_model_ready’

fanzh · March 23, 2022, 4:11am

1 no, Triton logs to the console. You can save the logs in a file by redirecting standard output and standard error.

2 please refer to the links below:
protocol

github.com

triton-inference-server/server/blob/main/docs/protocol/extension_model_repository.md

<!--
# Copyright 2020-2022, NVIDIA CORPORATION & AFFILIATES. All rights reserved.
#
# Redistribution and use in source and binary forms, with or without
# modification, are permitted provided that the following conditions
# are met:
#  * Redistributions of source code must retain the above copyright
#    notice, this list of conditions and the following disclaimer.
#  * Redistributions in binary form must reproduce the above copyright
#    notice, this list of conditions and the following disclaimer in the
#    documentation and/or other materials provided with the distribution.
#  * Neither the name of NVIDIA CORPORATION nor the names of its
#    contributors may be used to endorse or promote products derived
#    from this software without specific prior written permission.
#
# THIS SOFTWARE IS PROVIDED BY THE COPYRIGHT HOLDERS ``AS IS'' AND ANY
# EXPRESS OR IMPLIED WARRANTIES, INCLUDING, BUT NOT LIMITED TO, THE
# IMPLIED WARRANTIES OF MERCHANTABILITY AND FITNESS FOR A PARTICULAR
# PURPOSE ARE DISCLAIMED.  IN NO EVENT SHALL THE COPYRIGHT OWNER OR
# CONTRIBUTORS BE LIABLE FOR ANY DIRECT, INDIRECT, INCIDENTAL, SPECIAL,

This file has been truncated. show original

API introduction

github.com

triton-inference-server/client/blob/main/src/python/library/tritonclient/grpc/init.py#L341


      
          bool
              True if server is ready, False if server is not ready.
          
          
Raises
          ------
          InferenceServerException
              If unable to get readiness.
          
          
"""
          if headers is not None:
              metadata = headers.items()
          else:
              metadata = ()
          try:
              request = service_pb2.ServerReadyRequest()
              if self._verbose:
                  print("is_server_ready, metadata {}\n{}".format(
                      metadata, request))
              response = self._client_stub.ServerReady(request=request,
                                                       metadata=metadata)
              if self._verbose:

is_model_ready usage

github.com

triton-inference-server/client/blob/main/src/python/examples/simple_grpc_model_control.py#L66


      
          
          
model_name = 'simple'
          
          
# There are eight models in the repository directory
          if len(triton_client.get_model_repository_index().models) != 8:
              print('FAILED : Repository Index')
              sys.exit(1)
          
          
triton_client.load_model(model_name)
          if not triton_client.is_model_ready(model_name):
              print('FAILED : Load Model')
              sys.exit(1)
          
          
triton_client.unload_model(model_name)
          if triton_client.is_model_ready(model_name):
              print('FAILED : Unload Model')
              sys.exit(1)
          
          
# Trying to load wrong model name should emit exception
          try:
              triton_client.load_model("wrong_model_name")

Topic		Replies	Views
Triton server logs DeepStream SDK	7	5557	May 16, 2022
Triton server inference model placement TAO Toolkit	7	1087	February 23, 2022
Deepstream triton server config_infer.txt file DeepStream SDK	5	558	May 16, 2022
Deepstream with triton DeepStream SDK	12	795	October 9, 2023
DeepStream Triton gRPC example does not run with Deepstream Triton Docker images DeepStream SDK	12	1448	January 17, 2023
Triton and deepstream Triton Inference Server (archived)	0	517	March 11, 2021
Regarding when we execute triton server on jetson orin getting an error unable to load model DeepStream SDK cuda	19	1320	July 30, 2024
Serving Peoplenet model using Triton gRPC Inference Server and make calls to it from outside the container DeepStream SDK tensorrt , gstreamer , python , inference-server-triton , tao , deepstream	14	1237	February 2, 2023
Error while running Tritron server DeepStream SDK	2	373	January 12, 2024
Inferencing on DINO in triton inference server TensorRT inference-server-triton	1	190	August 29, 2024

Triton infererence server example 'simple_grpc_infer_client.py'

Related topics