In our experience, EfficientNet-B0 is a somewhat difficult model to quantize. Could you try applying the AdaRound technique in AIMET and check the accuracy afterwards?
Then please also try AIMET's QAT (quantization-aware training) feature.
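For reference, the AdaRound step might look roughly like the sketch below. This is an untested outline based on AIMET's documented PyTorch API; the exact module paths, function signatures, and the `eval_fn` callback are assumptions and may differ between AIMET releases.

```python
def quantize_with_adaround(model, data_loader, dummy_input, eval_fn):
    """Rough sketch: AdaRound + QuantizationSimModel with AIMET (PyTorch).

    model      -- fp32 EfficientNet-B0
    eval_fn    -- assumed helper: eval_fn(model) -> accuracy
    Signatures follow the AIMET docs but are unverified.
    """
    from aimet_common.defs import QuantScheme
    from aimet_torch.quantsim import QuantizationSimModel
    from aimet_torch.adaround.adaround_weight import Adaround, AdaroundParameters

    # Run AdaRound to learn per-weight rounding on a small calibration set.
    params = AdaroundParameters(data_loader=data_loader, num_batches=16)
    ada_model = Adaround.apply_adaround(
        model, dummy_input, params,
        path='./adaround', filename_prefix='effnet_b0',
        default_param_bw=8,
        default_quant_scheme=QuantScheme.post_training_tf_enhanced)

    # Build a quantization simulation model around the AdaRound-ed weights.
    sim = QuantizationSimModel(
        ada_model, dummy_input=dummy_input,
        quant_scheme=QuantScheme.post_training_tf_enhanced,
        default_param_bw=8, default_output_bw=8)

    # Freeze the AdaRound-optimized parameter encodings, then calibrate
    # the activation encodings.
    sim.set_and_freeze_param_encodings(
        encoding_path='./adaround/effnet_b0.encodings')
    sim.compute_encodings(lambda m, _: eval_fn(m), None)

    print('Simulated INT8 accuracy after AdaRound:', eval_fn(sim.model))
    # For the QAT step, sim.model can then be fine-tuned with an ordinary
    # training loop before exporting via sim.export(...).
    return sim
```

If AdaRound alone does not close the gap, fine-tuning `sim.model` (QAT) on top of the frozen AdaRound encodings is the natural next experiment.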
I have done some experiments on EfficientNet-B0 and found that quantization of the SE (squeeze-and-excitation) layers is the main cause of the accuracy drop for the whole model. We then switched to EfficientNet-Lite0, which removes the SE layers and replaces SiLU with ReLU, and found that it can be retrained to near its original accuracy. Does your observation echo this? Thank you for your attention!