Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning

M. Lodel; B.F. Ferreira de Brito; A. Serra Gomez; L. Ferranti; R. Babuska; J. Alonso-Mora

doi:10.1109/ICRA46639.2022.9812190

Learning & Autonomous Control

Research output: Chapter in Book/Conference proceedings/Edited volume › Conference contribution › Scientific › peer-review

Abstract

Search missions require motion planning and navigation methods for information gathering that continuously replan based on new observations of the robot's surroundings. Current methods for information gathering, such as Monte Carlo Tree Search, are capable of reasoning over long horizons, but they are computationally expensive. An alternative for fast online execution is to train, offline, an information-gathering policy, which indirectly reasons about the information value of new observations. However, these policies lack safety guarantees and do not account for the robot dynamics. To overcome these limitations, we train an information-aware policy via deep reinforcement learning that guides a receding-horizon trajectory optimization planner. In particular, the policy continuously recommends a reference viewpoint to the local planner, such that the resulting dynamically feasible and collision-free trajectories lead to observations that maximize the information gain and reduce the uncertainty about the environment. In simulation tests in previously unseen environments, our method consistently outperforms greedy next-best-view policies and achieves competitive performance compared to Monte Carlo Tree Search, in terms of information gains and coverage time, with a reduction in execution time by three orders of magnitude.

Original language	English
Title of host publication	Proceedings of the International Conference on Robotics and Automation (ICRA 2022)
Editors	George J. Pappas, Vijay Kumar
Publisher	IEEE
Pages	4466-4472
ISBN (Electronic)	978-1-7281-9681-7
ISBN (Print)	978-1-7281-9680-0
DOIs	https://doi.org/10.1109/ICRA46639.2022.9812190
Publication status	Published - 2022
Event	2022 International Conference on Robotics and Automation (ICRA), Philadelphia, United States. Duration: 23 May 2022 → 27 May 2022. Conference number: 39

Conference

Conference	2022 International Conference on Robotics and Automation (ICRA)
Abbreviated title	ICRA 2022
Country/Territory	United States
City	Philadelphia
Period	23/05/22 → 27/05/22

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care
Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Monte Carlo methods
Uncertainty
Trajectory planning
Navigation
Reinforcement learning
Safety
Planning

Access to Document

10.1109/ICRA46639.2022.9812190

Where_to_Look_Next_Learning_Viewpoint_Recommendations_for_Informative_Trajectory_Planning (Final published version, 462 KB)

Cite this

Lodel, M., Ferreira de Brito, B. F., Serra Gomez, A., Ferranti, L., Babuska, R., & Alonso-Mora, J. (2022). Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning. In G. J. Pappas, & V. Kumar (Eds.), Proceedings of the International Conference on Robotics and Automation (ICRA 2022) (pp. 4466-4472). IEEE. https://doi.org/10.1109/ICRA46639.2022.9812190

@inproceedings{df38e0369dee4b21b6d8a5f823af19a5,

title = "Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning",

abstract = "Search missions require motion planning and navigation methods for information gathering that continuously replan based on new observations of the robot's surroundings. Current methods for information gathering, such as Monte Carlo Tree Search, are capable of reasoning over long horizons, but they are computationally expensive. An alternative for fast online execution is to train, offline, an information gathering policy, which indirectly reasons about the information value of new observations. However, these policies lack safety guarantees and do not account for the robot dynamics. To overcome these limitations we train an information-aware policy via deep reinforcement learning, that guides a receding-horizon trajectory optimization planner. In particular, the policy continuously recommends a reference viewpoint to the local planner, such that the resulting dynamically feasible and collision-free trajectories lead to observations that maximize the information gain and reduce the uncertainty about the environment. In simulation tests in previously unseen environments, our method consistently outperforms greedy next-best-view policies and achieves competitive performance compared to Monte Carlo Tree Search, in terms of information gains and coverage time, with a reduction in execution time by three orders of magnitude.",

keywords = "Monte Carlo methods, Uncertainty, Trajectory planning, Navigation, Reinforcement learning, Safety, Planning",

author = "M. Lodel and {Ferreira de Brito}, B.F. and {Serra Gomez}, A. and L. Ferranti and R. Babuska and J. Alonso-Mora",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.; 2022 International Conference on Robotics and Automation (ICRA), ICRA 2022; Conference date: 23-05-2022 through 27-05-2022",

year = "2022",

doi = "10.1109/ICRA46639.2022.9812190",

language = "English",

isbn = "978-1-7281-9680-0",

pages = "4466--4472",

editor = "Pappas, {George J.} and Vijay Kumar",

booktitle = "Proceedings of the International Conference on Robotics and Automation (ICRA 2022)",

publisher = "IEEE",

address = "United States",

}

Lodel, M, Ferreira de Brito, BF, Serra Gomez, A, Ferranti, L, Babuska, R & Alonso-Mora, J 2022, Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning. in GJ Pappas & V Kumar (eds), Proceedings of the International Conference on Robotics and Automation (ICRA 2022). IEEE, pp. 4466-4472, 2022 International Conference on Robotics and Automation (ICRA), Philadelphia, United States, 23/05/22. https://doi.org/10.1109/ICRA46639.2022.9812190

Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning. / Lodel, M.; Ferreira de Brito, B.F.; Serra Gomez, A. et al.
Proceedings of the International Conference on Robotics and Automation (ICRA 2022). ed. / George J. Pappas; Vijay Kumar. IEEE, 2022. p. 4466-4472.

TY - GEN

T1 - Where to Look Next: Learning Viewpoint Recommendations for Informative Trajectory Planning

AU - Lodel, M.

AU - Ferreira de Brito, B.F.

AU - Serra Gomez, A.

AU - Ferranti, L.

AU - Babuska, R.

AU - Alonso-Mora, J.

N1 - Conference code: 39

PY - 2022

Y1 - 2022

AB - Search missions require motion planning and navigation methods for information gathering that continuously replan based on new observations of the robot's surroundings. Current methods for information gathering, such as Monte Carlo Tree Search, are capable of reasoning over long horizons, but they are computationally expensive. An alternative for fast online execution is to train, offline, an information gathering policy, which indirectly reasons about the information value of new observations. However, these policies lack safety guarantees and do not account for the robot dynamics. To overcome these limitations we train an information-aware policy via deep reinforcement learning, that guides a receding-horizon trajectory optimization planner. In particular, the policy continuously recommends a reference viewpoint to the local planner, such that the resulting dynamically feasible and collision-free trajectories lead to observations that maximize the information gain and reduce the uncertainty about the environment. In simulation tests in previously unseen environments, our method consistently outperforms greedy next-best-view policies and achieves competitive performance compared to Monte Carlo Tree Search, in terms of information gains and coverage time, with a reduction in execution time by three orders of magnitude.

KW - Monte Carlo methods

KW - Uncertainty

KW - Trajectory planning

KW - Navigation

KW - Reinforcement learning

KW - Safety

KW - Planning

UR - http://www.scopus.com/inward/record.url?scp=85136338584&partnerID=8YFLogxK

U2 - 10.1109/ICRA46639.2022.9812190

DO - 10.1109/ICRA46639.2022.9812190

M3 - Conference contribution

SN - 978-1-7281-9680-0

SP - 4466

EP - 4472

BT - Proceedings of the International Conference on Robotics and Automation (ICRA 2022)

A2 - Pappas, George J.

A2 - Kumar, Vijay

PB - IEEE

T2 - 2022 International Conference on Robotics and Automation (ICRA)

Y2 - 23 May 2022 through 27 May 2022

ER -
