PyTorch PyTorch 2 Export Post Training Quantization
| Properties | |
|---|---|
| authors | Jerry Zhang |
| year | 2024 |
| url | https://pytorch.org/tutorials/prototype/pt2e_quant_ptq.html |
Uses prepare_pt2e and convert_pt2e.
float_model(Python) Example Input
\ /
\ /
—-------------------------------------------------------
| export |
—-------------------------------------------------------
|
FX Graph in ATen Backend Specific Quantizer
| /
—--------------------------------------------------------
| prepare_pt2e |
—--------------------------------------------------------
|
Calibrate/Train
|
—--------------------------------------------------------
| convert_pt2e |
—--------------------------------------------------------
|
Quantized Model
|
—--------------------------------------------------------
| Lowering |
—--------------------------------------------------------
|
Executorch, Inductor or <Other Backends>