Optimising AI reasoning pipeline – Layer 2 + 3 merge for CPU efficiency

Environment

TensorRT Version: 8.8
GPU Type: NVIDIA A100 40GB
Nvidia Driver Version: 535.54
CUDA Version: 12.1
CUDNN Version: 8.9
Operating System + Version: Ubuntu 22.04 LTS
Python Version: 3.11
TensorFlow Version: 2.14 (if applicable)
PyTorch Version: 2.1 (if applicable)
Baremetal or Container: Container (nvidia/cuda:12.1-cudnn8-runtime-ubuntu22.04)


Relevant Files


Steps To Reproduce

  1. Input scenario: multi-layer AI reasoning pipeline with redundant Layer 2 (Core) + Layer 3 (Redundant Checks)
  2. Apply short pseudo-code (example inline):
def layer2_optimized(input):
    return merge_logic(process_base(input))

Hi @mohammad.manuar, Please check the shared github repo link. It’s broken. The url has your-repo/ . Share the correct URL and PDF link AI_CPU_LayerMerge_Optimisation_v1.pdf to let us help you.

Thank You.