PyTorch ExecuTorch Quantization Overview

Properties
authors	PyTorch Quantization for TensorRT
year	2024
url	https://pytorch.org/executorch/main/quantization-overview.html

Pasted image 20240925193351.png { width="400" }

Quantization is usually tied to execution backends that have quantized operators implemented. Thus each backend is opinionated about how the model should be quantized, expressed in a backend specific Quantizer class.