An Attention Module for Convolutional Neural Networks

Baozhou Zhu*, Peter Hofstee, Jinho Lee, Zaid Al-Ars

*Corresponding author for this work

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review


Abstract

The attention mechanism is regarded as an advanced technique to capture long-range feature interactions and to boost the representation capability of convolutional neural networks. However, we identify two overlooked problems in current attentional-activation-based models: the approximation problem and the insufficient-capacity problem of the attention maps. To solve both problems together, we propose an attention module for convolutional neural networks by developing an AW-convolution, where the shape of the attention maps matches that of the weights rather than the activations. Our proposed attention module is complementary to previous attention-based schemes, such as those that apply the attention mechanism to explore the relationship between channel-wise and spatial features. Experiments on several datasets for image classification and object detection tasks show the effectiveness of our proposed attention module. In particular, it achieves a 1.00% Top-1 accuracy improvement on ImageNet classification over a ResNet101 baseline, and a 0.63 COCO-style Average Precision improvement on COCO object detection on top of a Faster R-CNN baseline with a ResNet101-FPN backbone. When integrated with previous attentional-activation-based models, our attention module further increases their Top-1 accuracy on ImageNet classification by up to 0.57% and their COCO-style Average Precision on COCO object detection by up to 0.45. Code and pre-trained models will be publicly available.
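The core idea stated in the abstract, namely that the attention map has the same shape as the convolution weights instead of the activations, can be sketched as follows. This is a minimal illustrative sketch in NumPy, not the paper's actual implementation: the function name `aw_convolution` and the assumption that the attention map simply rescales the weights element-wise before a standard convolution are hypothetical simplifications for exposition.

```python
import numpy as np

def aw_convolution(x, weights, attention):
    """Sketch of a weight-shaped attention convolution (hypothetical).

    x:         input activations, shape (C_in, H, W)
    weights:   convolution kernel, shape (C_out, C_in, k, k)
    attention: attention map with the SAME shape as `weights`
               (the key difference from activation-shaped attention)
    """
    # Modulate the weights with the attention map, then convolve normally.
    aw = weights * attention
    c_out, c_in, k, _ = aw.shape
    h, w = x.shape[1], x.shape[2]
    out = np.zeros((c_out, h - k + 1, w - k + 1))  # valid convolution, no padding
    for co in range(c_out):
        for i in range(out.shape[1]):
            for j in range(out.shape[2]):
                # Inner product of the input patch with the attended kernel.
                out[co, i, j] = np.sum(x[:, i:i + k, j:j + k] * aw[co])
    return out

# Example: 2 input channels, 3 output channels, 3x3 kernel on a 5x5 input.
x = np.ones((2, 5, 5))
w = np.ones((3, 2, 3, 3))
att = np.ones_like(w)  # identity attention leaves the weights unchanged
y = aw_convolution(x, w, att)  # shape (3, 3, 3)
```

Because the attention map matches the weight shape, its capacity scales with the kernel (C_out x C_in x k x k) rather than with the activation resolution, which is the intuition behind the capacity argument in the abstract.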

Original language: English
Title of host publication: Artificial Neural Networks and Machine Learning – ICANN 2021 - 30th International Conference on Artificial Neural Networks, Proceedings
Editors: Igor Farkaš, Paolo Masulli, Sebastian Otte, Stefan Wermter
Publisher: Springer
Pages: 167-178
Number of pages: 12
Volume: 12891
ISBN (Print): 9783030863616
DOIs
Publication status: Published - 2021
Event: 30th International Conference on Artificial Neural Networks, ICANN 2021 - Virtual, Online at Bratislava, Slovakia
Duration: 14 Sep 2021 – 17 Sep 2021

Publication series

Name: Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics)
Volume: 12891 LNCS
ISSN (Print): 0302-9743
ISSN (Electronic): 1611-3349

Conference

Conference: 30th International Conference on Artificial Neural Networks, ICANN 2021
Country/Territory: Slovakia
City: Virtual, Online at Bratislava
Period: 14/09/21 – 17/09/21

Keywords

  • Attention mechanism
  • Convolution
  • Representation
