Abstract
Street-level imagery contains a variety of visual information about the facades of Points of Interest (POIs). In addition to general mor- phological features, signs on the facades of, primarily, business-related POIs could be a valuable source of information about the type and iden- tity of a POI. Recent advancements in computer vision could leverage visual information from street-level imagery, and contribute to the classification of POIs. However, there is currently a gap in existing literature regarding the use of visual labels contained in street-level imagery, where their value as indicators of POI categories is assessed. This paper presents Scene-Text Semantics (ST-Sem), a novel method that leverages visual la- bels (e.g., texts, logos) from street-level imagery as complementary in- formation for the categorization of business-related POIs. Contrary to existing methods that fuse visual and textual information at a feature- level, we propose a late fusion approach that combines visual and textual cues after resolving issues of incorrect digitization and semantic ambiguity of the retrieved textual components. Experiments on two existing and a newly-created datasets show that ST-Sem can outperform visual-only approaches by 80% and related multimodal approaches by 4%.
Original language | English |
---|---|
Title of host publication | Web Engineering - 19th International Conference, ICWE 2019, Proceedings |
Editors | Maxim Bakaev, In-Young Ko, Flavius Frasincar |
Place of Publication | Cham |
Publisher | Springer |
Pages | 32-46 |
Number of pages | 15 |
Volume | 11496 |
ISBN (Electronic) | 978-3-030-19274-7 |
ISBN (Print) | 978-3-030-19273-0 |
DOIs | |
Publication status | Published - 26 Apr 2019 |
Event | 19th International Conference on Web Engineering, ICWE 2019 - Daejeon Convention Center (DCC), Daejeon, Korea, Republic of Duration: 11 Jun 2019 → 14 Jun 2019 Conference number: 19 |
Publication series
Name | Lecture Notes in Computer Science |
---|---|
Volume | 11496 |
ISSN (Print) | 0302-9743 |
ISSN (Electronic) | 1611-3349 |
Conference
Conference | 19th International Conference on Web Engineering, ICWE 2019 |
---|---|
Abbreviated title | ICWE 2019 |
Country/Territory | Korea, Republic of |
City | Daejeon |
Period | 11/06/19 → 14/06/19 |
Keywords
- Convolutional neural networks
- Points of Interest
- Semantic similarity
- Street-level imagery
- Word embeddings