Quantizing the DETR model from the Facebook repository throws the following error in compute_encodings().
The self.self_attn() call refers to torch.nn.MultiheadAttention().
Any suggestions on how to overcome this issue?
Hi @mcw_qc_aimet… Transformer models are not supported at this time. We hope to work on adding support for transformer models later this year.
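Since torch.nn.MultiheadAttention is the module the tool trips over, one workaround people sometimes use (this is not official AIMET guidance, just a sketch under the assumption that the quantizer handles plain nn.Linear layers) is to swap the fused attention module for a mathematically equivalent one built from separate q/k/v/output projections, copying the trained weights across:

```python
import torch
import torch.nn as nn

class DecomposedSelfAttention(nn.Module):
    """Self-attention rebuilt from plain nn.Linear layers so every
    projection is visible to layer-by-layer quantization tooling.
    (Illustrative workaround only; class name is hypothetical.)"""

    def __init__(self, embed_dim, num_heads):
        super().__init__()
        self.embed_dim = embed_dim
        self.num_heads = num_heads
        self.head_dim = embed_dim // num_heads
        self.q_proj = nn.Linear(embed_dim, embed_dim)
        self.k_proj = nn.Linear(embed_dim, embed_dim)
        self.v_proj = nn.Linear(embed_dim, embed_dim)
        self.out_proj = nn.Linear(embed_dim, embed_dim)

    @classmethod
    def from_multihead_attention(cls, mha: nn.MultiheadAttention):
        """Copy weights out of a trained nn.MultiheadAttention.
        PyTorch stacks the q/k/v weights in mha.in_proj_weight."""
        m = cls(mha.embed_dim, mha.num_heads)
        E = mha.embed_dim
        w, b = mha.in_proj_weight, mha.in_proj_bias
        with torch.no_grad():
            m.q_proj.weight.copy_(w[:E]);     m.q_proj.bias.copy_(b[:E])
            m.k_proj.weight.copy_(w[E:2*E]);  m.k_proj.bias.copy_(b[E:2*E])
            m.v_proj.weight.copy_(w[2*E:]);   m.v_proj.bias.copy_(b[2*E:])
            m.out_proj.weight.copy_(mha.out_proj.weight)
            m.out_proj.bias.copy_(mha.out_proj.bias)
        return m

    def forward(self, x):
        # x: (seq_len, batch, embed_dim) — nn.MultiheadAttention's default layout
        L, B, E = x.shape
        H, D = self.num_heads, self.head_dim
        q = self.q_proj(x).view(L, B * H, D).transpose(0, 1)  # (B*H, L, D)
        k = self.k_proj(x).view(L, B * H, D).transpose(0, 1)
        v = self.v_proj(x).view(L, B * H, D).transpose(0, 1)
        attn = torch.softmax(q @ k.transpose(1, 2) / D ** 0.5, dim=-1)
        out = (attn @ v).transpose(0, 1).reshape(L, B, E)
        return self.out_proj(out)
```

With weights copied this way the decomposed module reproduces the original attention output (for self-attention with no masks or dropout), so it can be substituted into the encoder before building the quantization sim. Whether AIMET then produces usable encodings for transformer blocks is a separate question; as noted above, transformer support is not there yet.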