PyTorch ExecuTorch Quantization Overview
Properties | |
---|---|
authors | PyTorch Quantization for TensorRT |
year | 2024 |
url | https://pytorch.org/executorch/main/quantization-overview.html |
Quantization is usually tied to execution backends that have quantized operators implemented. Thus, each backend is opinionated about how the model should be quantized, and it expresses those choices in a backend-specific `Quantizer` class.
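To make the idea of a backend-specific quantizer concrete, here is a minimal, dependency-free sketch. It does not use the ExecuTorch or PyTorch API: `ToyQuantizer`, `QuantSpec`, the two backend classes, and the operator names are all hypothetical and chosen only for illustration. The sketch assumes that a backend's "opinion" boils down to which operators it quantizes and which integer range and scheme it expects.

```python
from abc import ABC, abstractmethod
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class QuantSpec:
    """Integer range and scheme a backend expects (hypothetical)."""
    qmin: int
    qmax: int
    symmetric: bool


class ToyQuantizer(ABC):
    """Toy stand-in for a backend-specific quantizer, not the real class."""

    @abstractmethod
    def spec_for(self, op_name: str) -> Optional[QuantSpec]:
        """Return the spec for an operator, or None to leave it in float."""


class Int8SymmetricQuantizer(ToyQuantizer):
    """Hypothetical backend: signed 8-bit, symmetric, few operators."""
    supported_ops = frozenset({"linear", "conv2d"})

    def spec_for(self, op_name: str) -> Optional[QuantSpec]:
        if op_name in self.supported_ops:
            return QuantSpec(qmin=-127, qmax=127, symmetric=True)
        return None


class UInt8AffineQuantizer(ToyQuantizer):
    """Hypothetical backend: unsigned 8-bit, affine, also quantizes add."""
    supported_ops = frozenset({"linear", "conv2d", "add"})

    def spec_for(self, op_name: str) -> Optional[QuantSpec]:
        if op_name in self.supported_ops:
            return QuantSpec(qmin=0, qmax=255, symmetric=False)
        return None


def choose_qparams(values: List[float], spec: QuantSpec) -> Tuple[float, int]:
    """Pick (scale, zero_point) from observed values; values must be non-empty."""
    # Extend the observed range to include 0.0 so zero is representable.
    lo = min(min(values), 0.0)
    hi = max(max(values), 0.0)
    if spec.symmetric:
        bound = max(abs(lo), abs(hi))
        scale = bound / spec.qmax if bound > 0 else 1.0
        return scale, 0
    scale = (hi - lo) / (spec.qmax - spec.qmin) or 1.0
    zero_point = spec.qmin - round(lo / scale)
    # Keep the zero point inside the integer range.
    return scale, max(spec.qmin, min(spec.qmax, zero_point))


def quantize(values: List[float], scale: float, zero_point: int,
             spec: QuantSpec) -> List[int]:
    """Map floats to integers and clamp to the backend's range."""
    return [
        max(spec.qmin, min(spec.qmax, round(v / scale) + zero_point))
        for v in values
    ]


if __name__ == "__main__":
    activations = [-1.0, 0.0, 1.0]
    for quantizer in (Int8SymmetricQuantizer(), UInt8AffineQuantizer()):
        for op in ("linear", "add"):
            spec = quantizer.spec_for(op)
            if spec is None:
                print(type(quantizer).__name__, op, "stays in float")
                continue
            scale, zero_point = choose_qparams(activations, spec)
            print(type(quantizer).__name__, op,
                  quantize(activations, scale, zero_point, spec))
```

The same operator and the same observed values produce different integer encodings depending on which quantizer is used, and one backend leaves `add` in floating point entirely. That is the sense in which the choice of quantizer encodes what a backend can execute.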