Abstract
Long-term inspection and maintenance (I&M) planning, a multi-stage stochastic optimization problem, can be efficiently formulated as a partially observable Markov decision process (POMDP). However, within this context, single-agent approaches do not scale well for large multi-component systems since the joint state, action and observation spaces grow exponentially with the number of components. To alleviate this curse of dimensionality, cooperative decentralized approaches, known as decentralized POMDPs, are often adopted and solved using multi-agent deep reinforcement learning (MADRL) algorithms. This paper examines the centralization vs. decentralization performance of MADRL formulations in I&M planning of multi-component systems. Towards this, we set up a comprehensive computational experimental program focused on k-out-of-n system configurations, a common and broadly applicable archetype of deteriorating engineering systems, to highlight the manifestations of MADRL strengths and pathologies when optimizing global returns under varying decentralization relaxations.
Original language | English |
---|---|
Title of host publication | Artificial Intelligence and Machine Learning |
Subtitle of host publication | 35th Benelux Conference, BNAIC/Benelearn 2023, Delft, The Netherlands, November 8–10, 2023, Revised Selected Papers |
Editors | Frans A. Oliehoek, Manon Kok, Sicco Verwer |
Place of Publication | Cham |
Publisher | Springer |
Pages | 236-254 |
Number of pages | 19 |
ISBN (Electronic) | 978-3-031-74650-5 |
ISBN (Print) | 978-3-031-74649-9 |
DOIs | |
Publication status | Published - 2025 |
Event | 35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023 - TU Delft, Delft, Netherlands Duration: 8 Nov 2023 → 10 Nov 2023 https://bnaic2023.tudelft.nl/ |
Publication series
Name | Communications in Computer and Information Science |
---|---|
Publisher | Springer |
Volume | 2187 CCIS |
ISSN (Print) | 1865-0929 |
ISSN (Electronic) | 1865-0937 |
Conference
Conference | 35th Benelux Conference on Artificial Intelligence and Machine Learning, BNAIC/Benelearn 2023 |
---|---|
Country/Territory | Netherlands |
City | Delft |
Period | 8/11/23 → 10/11/23 |
Internet address |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-careOtherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Keywords
- Actor-critic methods
- Decentralized partially observable Markov decision processes
- Inspection and maintenance planning
- Multi-agent deep reinforcement learning
- Stochastic deterioration