ONNX & TensorRT

Exporting and optimizing deep learning models for production

Overview

ONNX (Open Neural Network Exchange)

An open standard format for representing ML models, allowing interoperability between PyTorch, TensorFlow, etc.

TensorRT

NVIDIA’s SDK for high-performance deep learning inference, performing graph optimizations and layer fusion.

TODO: Add examples of exporting PyTorch to ONNX and compiling to TensorRT.