RGB-Depth cross-modal person re-identification

Frank M. Hafner; Amran Bhuiyan; Julian F.P. Kooij; Eric Granger

doi:10.1109/AVSS.2019.8909838

Intelligent Vehicles

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models for visual recognition, and inexpensive RGB-D cameras and sensor-rich mobile robotic platforms, e.g., self-driving vehicles, we investigate the relatively unexplored problem of cross-modal re-identification of persons between RGB (color) and depth images. The considerable divergence in data distributions across different sensor modalities adds further challenges to the typical difficulties, such as distinct viewpoints, occlusions, and pose and illumination variation. While some work has investigated re-identification across RGB and infrared, we take inspiration from successes in transfer learning from RGB to depth in object detection tasks. Our main contribution is a novel cross-modal distillation network for robust person re-identification, which learns a shared feature representation space of a person's appearance in both RGB and depth images. The proposed network was compared to conventional and deep learning approaches proposed for other cross-domain re-identification tasks. Results obtained on the public BIWI and RobotPKU datasets indicate that the proposed method can significantly outperform state-of-the-art approaches by up to 10.5% mAP, demonstrating the benefit of the proposed distillation paradigm.

Original language	English
Title of host publication	Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2019)
Place of Publication	Piscataway, NJ, USA
Publisher	IEEE
Number of pages	8
ISBN (Electronic)	978-1-7281-0990-9
DOIs	https://doi.org/10.1109/AVSS.2019.8909838
Publication status	Published - 2019
Event	16th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2019 - Taipei, Taiwan Duration: 18 Sept 2019 → 21 Sept 2019

Conference

Conference	16th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2019
Country/Territory	Taiwan
City	Taipei
Period	18/09/19 → 21/09/19

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Access to Document

10.1109/AVSS.2019.8909838

RGB-Depth_Cross-Modal_Person_Re-identification: Final published version, 1.48 MB

Cite this

@inproceedings{261bdf5c4e9c4d37bc55ecb93181a4be,
  title = "RGB-Depth cross-modal person re-identification",
  abstract = "Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models for visual recognition, and inexpensive RGB-D cameras and sensor-rich mobile robotic platforms, e.g., self-driving vehicles, we investigate the relatively unexplored problem of cross-modal re-identification of persons between RGB (color) and depth images. The considerable divergence in data distributions across different sensor modalities adds further challenges to the typical difficulties, such as distinct viewpoints, occlusions, and pose and illumination variation. While some work has investigated re-identification across RGB and infrared, we take inspiration from successes in transfer learning from RGB to depth in object detection tasks. Our main contribution is a novel cross-modal distillation network for robust person re-identification, which learns a shared feature representation space of a person's appearance in both RGB and depth images. The proposed network was compared to conventional and deep learning approaches proposed for other cross-domain re-identification tasks. Results obtained on the public BIWI and RobotPKU datasets indicate that the proposed method can significantly outperform state-of-the-art approaches by up to 10.5% mAP, demonstrating the benefit of the proposed distillation paradigm.",
  author = "Hafner, {Frank M.} and Amran Bhuiyan and Kooij, {Julian F.P.} and Eric Granger",
  note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.; 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2019 ; Conference date: 18-09-2019 Through 21-09-2019",
  year = "2019",
  doi = "10.1109/AVSS.2019.8909838",
  language = "English",
  booktitle = "Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2019)",
  publisher = "IEEE",
  address = "United States",
}

Hafner, FM, Bhuiyan, A, Kooij, JFP & Granger, E 2019, RGB-Depth cross-modal person re-identification. in Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2019). IEEE, Piscataway, NJ, USA, 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2019, Taipei, Taiwan, 18/09/19. https://doi.org/10.1109/AVSS.2019.8909838

RGB-Depth cross-modal person re-identification. / Hafner, Frank M.; Bhuiyan, Amran; Kooij, Julian F.P. et al.
Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2019). Piscataway, NJ, USA: IEEE, 2019.

TY - GEN
T1 - RGB-Depth cross-modal person re-identification
AU - Hafner, Frank M.
AU - Bhuiyan, Amran
AU - Kooij, Julian F.P.
AU - Granger, Eric
N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
PY - 2019
Y1 - 2019
AB - Person re-identification is a key challenge for surveillance across multiple sensors. Prompted by the advent of powerful deep learning models for visual recognition, and inexpensive RGB-D cameras and sensor-rich mobile robotic platforms, e.g., self-driving vehicles, we investigate the relatively unexplored problem of cross-modal re-identification of persons between RGB (color) and depth images. The considerable divergence in data distributions across different sensor modalities adds further challenges to the typical difficulties, such as distinct viewpoints, occlusions, and pose and illumination variation. While some work has investigated re-identification across RGB and infrared, we take inspiration from successes in transfer learning from RGB to depth in object detection tasks. Our main contribution is a novel cross-modal distillation network for robust person re-identification, which learns a shared feature representation space of a person's appearance in both RGB and depth images. The proposed network was compared to conventional and deep learning approaches proposed for other cross-domain re-identification tasks. Results obtained on the public BIWI and RobotPKU datasets indicate that the proposed method can significantly outperform state-of-the-art approaches by up to 10.5% mAP, demonstrating the benefit of the proposed distillation paradigm.
UR - http://www.scopus.com/inward/record.url?scp=85076368191&partnerID=8YFLogxK
U2 - 10.1109/AVSS.2019.8909838
DO - 10.1109/AVSS.2019.8909838
M3 - Conference contribution
BT - Proceedings of the 16th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS 2019)
PB - IEEE
CY - Piscataway, NJ, USA
T2 - 16th IEEE International Conference on Advanced Video and Signal Based Surveillance, AVSS 2019
Y2 - 18 September 2019 through 21 September 2019
ER -
