Crafted vs. Learned Representations in Predictive Models - A Case Study on Cyclist Path Prediction

Research output: Contribution to journalArticleScientificpeer-review

Abstract

This paper compares two models for context-based path prediction of objects with switching dynamics: a Dynamic Bayesian Network (DBN) and a Recurrent Neural Network (RNN). These models are instances of two larger model categories, distinguished by whether expert knowledge is explicitly crafted into the state representation (and thus is interpretable) or whether the representation is learned from data, respectively. They have shown state-of-the-art performance in previous work. In order to provide a fair comparison, we ensure that both models are treated similarly with respect to the use of context cues and parameter estimation. Specifically, we describe (1) how to integrate the context cues (used previously by the DBN) into the RNN, and (2) how to optimize the DBN with backpropagation similar to the RNN, while keeping an interpretable state representation. Experiments are performed on a scenario where a cyclist might turn left at an intersection in front of the ego-vehicle. Results show that the RNN successfully leverages the context cues, and that optimizing the DBN improves its performance with respect to existing work. While the RNN outperforms the optimized DBN in predictive log-likelihood by a significant margin, both models attain similar average Euclidean distance errors (23-39 cm for DBN and 31-34 cm for RNN, predicting 1 s ahead).

Original languageEnglish
JournalIEEE Transactions on Intelligent Vehicles
DOIs
Publication statusAccepted/In press - 1 Jan 2021

Keywords

  • Active safety
  • Context modeling
  • Dynamics
  • Intelligent vehicles
  • motion prediction
  • Predictive models
  • Recurrent neural networks
  • Roads
  • Vehicle dynamics
  • vulnerable road users (VRUs)

Fingerprint Dive into the research topics of 'Crafted vs. Learned Representations in Predictive Models - A Case Study on Cyclist Path Prediction'. Together they form a unique fingerprint.

Cite this