Investigating transformers in the decomposition of polygonal shapes as point collections

Andrea Alfieri; Yancong Lin; Jan C. van Gemert

doi:10.1109/ICCVW54120.2021.00235

Pattern Recognition and Bioinformatics

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Transformers can generate predictions in two ways: 1. auto-regressively, by conditioning each sequence element on the previous ones, or 2. directly, by producing an output sequence in parallel. While research has mostly explored this difference on sequential tasks in NLP, we study the difference between auto-regressive and parallel prediction on visual set prediction tasks, and in particular on polygonal shapes in images, because polygons are representative of numerous types of objects, such as buildings or obstacles for aerial vehicles. This is challenging for deep learning architectures, as a polygon can consist of a varying cardinality of points. We provide evidence on the importance of natural orders for Transformers, and show the benefit of decomposing complex polygons into collections of points in an auto-regressive manner.

Original language	English
Title of host publication	Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021
Subtitle of host publication	Proceedings
Editors	L. O'Conner
Place of Publication	Piscataway
Publisher	IEEE
Pages	2076-2085
Number of pages	10
ISBN (Electronic)	978-1-6654-0191-3
ISBN (Print)	978-1-6654-0192-0
DOIs	https://doi.org/10.1109/ICCVW54120.2021.00235
Publication status	Published - 2021
Event	2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) - Virtual at Montreal, Canada Duration: 11 Oct 2021 → 17 Oct 2021

Publication series

Name	Proceedings of the IEEE International Conference on Computer Vision
Volume	2021-October
ISSN (Print)	1550-5499

Conference

Conference	2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)
Country/Territory	Canada
City	Virtual at Montreal
Period	11/10/21 → 17/10/21

Access to Document

10.1109/ICCVW54120.2021.00235

Cite this

Alfieri, A., Lin, Y., & van Gemert, J. C. (2021). Investigating transformers in the decomposition of polygonal shapes as point collections. In L. O'Conner (Ed.), Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021: Proceedings (pp. 2076-2085). Article 9607442 (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2021-October). IEEE. https://doi.org/10.1109/ICCVW54120.2021.00235

Alfieri, Andrea ; Lin, Yancong ; van Gemert, Jan C. / Investigating transformers in the decomposition of polygonal shapes as point collections. Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021: Proceedings. editor / L. O'Conner. Piscataway : IEEE, 2021. pp. 2076-2085 (Proceedings of the IEEE International Conference on Computer Vision).

@inproceedings{1b27030490604f71a07f6e90cdd18087,

title = "Investigating transformers in the decomposition of polygonal shapes as point collections",

abstract = "Transformers can generate predictions in two ways: 1. auto-regressively, by conditioning each sequence element on the previous ones, or 2. directly, by producing an output sequence in parallel. While research has mostly explored this difference on sequential tasks in NLP, we study the difference between auto-regressive and parallel prediction on visual set prediction tasks, and in particular on polygonal shapes in images, because polygons are representative of numerous types of objects, such as buildings or obstacles for aerial vehicles. This is challenging for deep learning architectures, as a polygon can consist of a varying cardinality of points. We provide evidence on the importance of natural orders for Transformers, and show the benefit of decomposing complex polygons into collections of points in an auto-regressive manner.",

author = "Andrea Alfieri and Yancong Lin and {van Gemert}, {Jan C.}",

year = "2021",

doi = "10.1109/ICCVW54120.2021.00235",

language = "English",

isbn = "978-1-6654-0192-0",

series = "Proceedings of the IEEE International Conference on Computer Vision",

publisher = "IEEE",

pages = "2076--2085",

editor = "L. O'Conner",

booktitle = "Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021",

address = "United States",

note = "2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW) ; Conference date: 11-10-2021 Through 17-10-2021",

}

Alfieri, A, Lin, Y & van Gemert, JC 2021, Investigating transformers in the decomposition of polygonal shapes as point collections. in L O'Conner (ed.), Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021: Proceedings., 9607442, Proceedings of the IEEE International Conference on Computer Vision, vol. 2021-October, IEEE, Piscataway, pp. 2076-2085, 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW), Virtual at Montreal, Canada, 11/10/21. https://doi.org/10.1109/ICCVW54120.2021.00235

Investigating transformers in the decomposition of polygonal shapes as point collections. / Alfieri, Andrea ; Lin, Yancong ; van Gemert, Jan C.
Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021: Proceedings. ed. / L. O'Conner. Piscataway: IEEE, 2021. p. 2076-2085 9607442 (Proceedings of the IEEE International Conference on Computer Vision; Vol. 2021-October).

TY - GEN

T1 - Investigating transformers in the decomposition of polygonal shapes as point collections

AU - Alfieri, Andrea

AU - Lin, Yancong

AU - van Gemert, Jan C.

PY - 2021

Y1 - 2021

N2 - Transformers can generate predictions in two ways: 1. auto-regressively, by conditioning each sequence element on the previous ones, or 2. directly, by producing an output sequence in parallel. While research has mostly explored this difference on sequential tasks in NLP, we study the difference between auto-regressive and parallel prediction on visual set prediction tasks, and in particular on polygonal shapes in images, because polygons are representative of numerous types of objects, such as buildings or obstacles for aerial vehicles. This is challenging for deep learning architectures, as a polygon can consist of a varying cardinality of points. We provide evidence on the importance of natural orders for Transformers, and show the benefit of decomposing complex polygons into collections of points in an auto-regressive manner.

AB - Transformers can generate predictions in two ways: 1. auto-regressively, by conditioning each sequence element on the previous ones, or 2. directly, by producing an output sequence in parallel. While research has mostly explored this difference on sequential tasks in NLP, we study the difference between auto-regressive and parallel prediction on visual set prediction tasks, and in particular on polygonal shapes in images, because polygons are representative of numerous types of objects, such as buildings or obstacles for aerial vehicles. This is challenging for deep learning architectures, as a polygon can consist of a varying cardinality of points. We provide evidence on the importance of natural orders for Transformers, and show the benefit of decomposing complex polygons into collections of points in an auto-regressive manner.

UR - http://www.scopus.com/inward/record.url?scp=85123046335&partnerID=8YFLogxK

U2 - 10.1109/ICCVW54120.2021.00235

DO - 10.1109/ICCVW54120.2021.00235

M3 - Conference contribution

SN - 978-1-6654-0192-0

T3 - Proceedings of the IEEE International Conference on Computer Vision

SP - 2076

EP - 2085

BT - Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021

A2 - O'Conner, L.

PB - IEEE

CY - Piscataway

T2 - 2021 IEEE/CVF International Conference on Computer Vision Workshops (ICCVW)

Y2 - 11 October 2021 through 17 October 2021

ER -

Alfieri A, Lin Y, van Gemert JC. Investigating transformers in the decomposition of polygonal shapes as point collections. In O'Conner L, editor, Proceedings - 2021 IEEE/CVF International Conference on Computer Vision Workshops, ICCVW 2021: Proceedings. Piscataway: IEEE. 2021. p. 2076-2085. 9607442. (Proceedings of the IEEE International Conference on Computer Vision). doi: 10.1109/ICCVW54120.2021.00235
