Multiple object tracking using a transform space

M. Li; J. Li; A. Tamayo; L. Nan

doi:10.5194/isprs-Annals-V-4-2022-137-2022

Multiple object tracking using a transform space

M. Li^*, J. Li, A. Tamayo, L. Nan

^*Corresponding author for this work

Urban Data Science

Research output: Contribution to journal › Conference article › Scientific › peer-review

39 Downloads (Pure)

Abstract

This paper presents a method for multiple object tracking (MOT) in video streams. The method incorporates the prediction of physical locations of people into a tracking-by-detection paradigm. We predict the trajectories of people on an estimated ground plane and apply a learning-based network to extract the appearance features across frames. The method transforms the detected object locations from image space to an estimated ground space to refine the tracking trajectories. This transform space allows the objects detected from multi-view images to be associated under one coordinate system. Besides, the occluded pedestrians in image space can be well separated in a rectified ground plane where the motion models of the pedestrians are estimated. The effectiveness of this method is evaluated on different datasets by extensive comparisons with state-of-The-Art techniques. Experimental results show that the proposed method improves MOT tasks in terms of the number of identity switches (IDSW) and the fragmentations (Frag).

Original language	English
Pages (from-to)	137-143
Number of pages	7
Journal	ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences
Volume	5
Issue number	4
DOIs	https://doi.org/10.5194/isprs-Annals-V-4-2022-137-2022
Publication status	Published - 2022
Event	2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV - Nice, France Duration: 6 Jun 2022 → 11 Jun 2022

Keywords

Data Association
Deep Features
Multiple Object Tracking
Tracking-by-Detection
Transform Space.

Access to Document

10.5194/isprs-Annals-V-4-2022-137-2022

isprs-annals-V-4-2022-137-2022Final published version, 1.32 MBLicence: CC BY

Cite this

@article{307ea4e9254f4ab386dfd0890d2fa8cb,

title = "Multiple object tracking using a transform space",

abstract = "This paper presents a method for multiple object tracking (MOT) in video streams. The method incorporates the prediction of physical locations of people into a tracking-by-detection paradigm. We predict the trajectories of people on an estimated ground plane and apply a learning-based network to extract the appearance features across frames. The method transforms the detected object locations from image space to an estimated ground space to refine the tracking trajectories. This transform space allows the objects detected from multi-view images to be associated under one coordinate system. Besides, the occluded pedestrians in image space can be well separated in a rectified ground plane where the motion models of the pedestrians are estimated. The effectiveness of this method is evaluated on different datasets by extensive comparisons with state-of-The-Art techniques. Experimental results show that the proposed method improves MOT tasks in terms of the number of identity switches (IDSW) and the fragmentations (Frag). ",

keywords = "Data Association, Deep Features, Multiple Object Tracking, Tracking-by-Detection, Transform Space.",

author = "M. Li and J. Li and A. Tamayo and L. Nan",

year = "2022",

doi = "10.5194/isprs-Annals-V-4-2022-137-2022",

language = "English",

volume = "5",

pages = "137--143",

journal = "ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences",

issn = "2194-9042",

publisher = "Copernicus",

number = "4",

note = "2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV ; Conference date: 06-06-2022 Through 11-06-2022",

}

TY - JOUR

T1 - Multiple object tracking using a transform space

AU - Li, M.

AU - Li, J.

AU - Tamayo, A.

AU - Nan, L.

PY - 2022

Y1 - 2022

N2 - This paper presents a method for multiple object tracking (MOT) in video streams. The method incorporates the prediction of physical locations of people into a tracking-by-detection paradigm. We predict the trajectories of people on an estimated ground plane and apply a learning-based network to extract the appearance features across frames. The method transforms the detected object locations from image space to an estimated ground space to refine the tracking trajectories. This transform space allows the objects detected from multi-view images to be associated under one coordinate system. Besides, the occluded pedestrians in image space can be well separated in a rectified ground plane where the motion models of the pedestrians are estimated. The effectiveness of this method is evaluated on different datasets by extensive comparisons with state-of-The-Art techniques. Experimental results show that the proposed method improves MOT tasks in terms of the number of identity switches (IDSW) and the fragmentations (Frag).

AB - This paper presents a method for multiple object tracking (MOT) in video streams. The method incorporates the prediction of physical locations of people into a tracking-by-detection paradigm. We predict the trajectories of people on an estimated ground plane and apply a learning-based network to extract the appearance features across frames. The method transforms the detected object locations from image space to an estimated ground space to refine the tracking trajectories. This transform space allows the objects detected from multi-view images to be associated under one coordinate system. Besides, the occluded pedestrians in image space can be well separated in a rectified ground plane where the motion models of the pedestrians are estimated. The effectiveness of this method is evaluated on different datasets by extensive comparisons with state-of-The-Art techniques. Experimental results show that the proposed method improves MOT tasks in terms of the number of identity switches (IDSW) and the fragmentations (Frag).

KW - Data Association

KW - Deep Features

KW - Multiple Object Tracking

KW - Tracking-by-Detection

KW - Transform Space.

UR - http://www.scopus.com/inward/record.url?scp=85132011923&partnerID=8YFLogxK

U2 - 10.5194/isprs-Annals-V-4-2022-137-2022

DO - 10.5194/isprs-Annals-V-4-2022-137-2022

M3 - Conference article

AN - SCOPUS:85132011923

SN - 2194-9042

VL - 5

SP - 137

EP - 143

JO - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

JF - ISPRS Annals of the Photogrammetry, Remote Sensing and Spatial Information Sciences

IS - 4

T2 - 2022 24th ISPRS Congress on Imaging Today, Foreseeing Tomorrow, Commission IV

Y2 - 6 June 2022 through 11 June 2022

ER -

Multiple object tracking using a transform space

Abstract

Keywords

Access to Document

Other files and links

Fingerprint

Cite this