PyTorch ExecuTorch Quantization Overview
| Properties | |
|---|---|
| authors | PyTorch Quantization for TensorRT |
| year | 2024 |
| url | https://pytorch.org/executorch/main/quantization-overview.html |
{ width="400" }
Quantization is usually tied to execution backends that have quantized operators implemented. Thus each backend is opinionated about how the model should be quantized, expressed in a backend specific
Quantizerclass.