Meituan Technology Team
Sep 22, 2022 · Artificial Intelligence
Quantization Deployment Scheme for YOLOv6: Methods, Optimizations, and Performance Evaluation
The paper proposes a full quantization pipeline for YOLOv6 that combines a re‑parameterization optimizer, partial PTQ, channel‑wise distillation, graph‑scale merging, and GPU‑offloaded preprocessing. The resulting INT8 model retains ~42% mAP while delivering a throughput increase of over 200% and a 40% QPS gain versus FP16.
Channel Distillation · Model deployment · PTQ
