Overview
ONNX (Open Neural Network Exchange)
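An open standard format for representing machine learning models as a computation graph of standardized operators, so that a model trained in one framework (e.g. PyTorch or TensorFlow) can be loaded and run by other tools and runtimes.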
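As a rough illustration of the interoperability described above, the sketch below exports a PyTorch model to an ONNX file with `torch.onnx.export`. The helper names (`batch_dynamic_axes`, `export_to_onnx`), the tensor names "input" and "output", and the file name `model.onnx` are assumptions made for this example, not part of any existing project. Depending on the installed PyTorch version, the exporter may also need the `onnx` or `onnxscript` packages.

```python
from pathlib import Path


def batch_dynamic_axes(names):
    """Mark dimension 0 of every named tensor as a variable-size batch axis.

    The result has the shape expected by the `dynamic_axes` argument of
    torch.onnx.export: {tensor_name: {axis_index: axis_label}}.
    """
    return {name: {0: "batch"} for name in names}


def export_to_onnx(model, example_input, onnx_path="model.onnx"):
    """Export `model` to ONNX using `example_input` as the sample input.

    Hypothetical helper for illustration; assumes a model that takes one
    tensor and returns one tensor.
    """
    # Imported lazily so this module can be loaded on machines without PyTorch.
    import torch

    model.eval()  # switch dropout / batch norm layers to inference behaviour
    torch.onnx.export(
        model,
        (example_input,),          # positional inputs passed to the model
        str(onnx_path),
        input_names=["input"],     # assumed names, chosen for this sketch
        output_names=["output"],
        dynamic_axes=batch_dynamic_axes(["input", "output"]),
    )
    return Path(onnx_path)


# Example usage (requires PyTorch):
#   import torch
#   net = torch.nn.Sequential(torch.nn.Linear(4, 8), torch.nn.ReLU(),
#                             torch.nn.Linear(8, 2))
#   export_to_onnx(net, torch.randn(1, 4), "model.onnx")
```

Declaring the batch dimension as dynamic is a design choice for this sketch: it lets the exported graph accept batches of any size rather than only the size of the example input.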
TensorRT
NVIDIA’s SDK for high-performance deep learning inference on NVIDIA GPUs. It builds an optimized inference engine from a trained model by applying graph optimizations such as layer fusion.
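One way to compile an ONNX file into a TensorRT engine is the `trtexec` command-line tool that ships with TensorRT. The sketch below wraps it from Python. The helper names (`build_trtexec_command`, `compile_engine`) and the file names are hypothetical, and the code assumes `trtexec` is on the PATH, which is not the case on every installation.

```python
import shutil
import subprocess


def build_trtexec_command(onnx_path, engine_path, fp16=False):
    """Return the trtexec argument list for building an engine from ONNX."""
    cmd = [
        "trtexec",
        f"--onnx={onnx_path}",          # input model in ONNX format
        f"--saveEngine={engine_path}",  # where to write the serialized engine
    ]
    if fp16:
        cmd.append("--fp16")            # allow half-precision kernels
    return cmd


def compile_engine(onnx_path, engine_path, fp16=False):
    """Run trtexec to build the engine; raises if the tool is unavailable."""
    if shutil.which("trtexec") is None:
        raise RuntimeError("trtexec not found on PATH (it ships with TensorRT)")
    # check=True raises CalledProcessError if the build fails.
    subprocess.run(build_trtexec_command(onnx_path, engine_path, fp16), check=True)
    return engine_path


# Example usage (requires TensorRT and an NVIDIA GPU):
#   compile_engine("model.onnx", "model.engine", fp16=True)
```

A built engine is generally specific to the GPU model and TensorRT version it was built with, so it is usually rebuilt on the target machine rather than copied between systems.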
TODO: Add examples of exporting PyTorch to ONNX and compiling to TensorRT.