SNPE onnx-to-dlc quantizer reads and uses provided quantization parameters?

Qianhao · July 15, 2020, 3:18am

Hi, I know this is related to another Qualcomm tool (SNPE), but I would love to know whether there is a way to load the quantization parameters from AIMET and use them in SNPE.
Or is there a way to modify the quantization parameters in the DLC so that I can manually make use of the .encodings file exported from AIMET’s quantization simulation? (I heard in a technical meeting with Qualcomm developers that there might be a script for this.)

To make things more concrete, I have a quantized model stored in two files, .onnx and .encodings, both exported from AIMET’s quantization simulation. I want to load the model directly into SNPE, without SNPE redoing the quantization from the ONNX model and a separate set of input data.
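
For reference, here is a minimal sketch of how the exported .encodings file could be inspected before handing it to another tool. It assumes the file is JSON with top-level sections named `activation_encodings` and `param_encodings`, each mapping a tensor name to one entry or a list of entries with `bitwidth`, `min` and `max` fields. That layout is an assumption and may differ between AIMET versions, and the helper names `load_encodings` and `summarize_encodings` are hypothetical, not part of AIMET or SNPE.

```python
import json
from pathlib import Path


def load_encodings(path):
    """Parse an .encodings file, assuming it is plain JSON."""
    return json.loads(Path(path).read_text())


def summarize_encodings(encodings):
    """Flatten parsed encodings into (section, tensor, bitwidth, min, max) rows.

    Assumed layout (not confirmed in this thread): sections
    "activation_encodings" and "param_encodings", each mapping a tensor
    name to a dict, or a list of dicts, with "bitwidth", "min", "max".
    """
    rows = []
    for section in ("activation_encodings", "param_encodings"):
        for tensor, entries in encodings.get(section, {}).items():
            # Accept either a single entry or a list of per-channel entries.
            if isinstance(entries, dict):
                entries = [entries]
            for entry in entries:
                rows.append((section, tensor, entry.get("bitwidth"),
                             entry.get("min"), entry.get("max")))
    return rows
```

Calling `summarize_encodings(load_encodings("model.encodings"))`, where the file name is only a placeholder, would list each tensor with its range, which makes it easier to check that the names match the tensors in the exported ONNX graph.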

https://developer.qualcomm.com/docs/snpe/tools.html#tools_snpe-dlc-quantize
According to the documentation, the command supports overriding the quantization parameters for TensorFlow models, but I am using PyTorch and its exported ONNX models.

I would greatly appreciate any insight you can provide!

akhobare · November 6, 2020, 11:30pm

@Qianhao Sorry for the late response.

This question has already been answered here:
https://github.com/quic/aimet/issues/168

Let me know if you have further questions.
