Deep Reinforcement Learning With Dynamic Graphs for Adaptive Informative Path Planning

Apoorva Vashisth, Julius Ruckin, Federico Magistri, Cyrill Stachniss, Marija Popovic*

*Corresponding author for this work

Research output: Contribution to journalArticleScientificpeer-review

Abstract

Autonomousrobots are often employed for data collection due to their efficiency and low labour costs. A key task in robotic data acquisition is planning paths through an initially unknown environment to collect observations given platform-specific resource constraints, such as limited battery life. Adaptive online path planning in 3D environments is challenging due to the large set of valid actions and the presence of unknown occlusions. To address these issues, we propose a novel deep reinforcement learning approach for adaptively replanning robot paths to map targets of interest in unknown 3D environments. A key aspect of our approach is a dynamically constructed graph that restricts planning actions local to the robot, allowing us to react to newly discovered static obstacles and targets of interest. For replanning, we propose a new reward function that balances between exploring the unknown environment and exploiting online-discovered targets of interest. Our experiments show that our method enables more efficient target discovery compared to state-of-the-art learning and non-learning baselines. We also showcase our approach for orchard monitoring using an unmanned aerial vehicle in a photorealistic simulator.

Original languageEnglish
Pages (from-to)7747-7754
Number of pages8
JournalIEEE Robotics and Automation Letters
Volume9
Issue number9
DOIs
Publication statusPublished - 2024

Bibliographical note

Green Open Access added to TU Delft Institutional Repository ‘You share, we take care!’ – Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

  • Motion and path planning
  • reinforcement learning
  • robotics and automation in agriculture and forestry

Fingerprint

Dive into the research topics of 'Deep Reinforcement Learning With Dynamic Graphs for Adaptive Informative Path Planning'. Together they form a unique fingerprint.

Cite this