Abstract
Effective transportation network management systems should consider safety and sustainability objectives. Existing research on large-scale transportation network management often employs the assumption that bridges can be considered individually under these objectives. However, this simplification misses accurate system-level representations, induced by multiple components, network topology, and global maintenance actions. To address these limitations, this paper presents a deep reinforcement learning (DRL) framework that draws inspiration from biological learning behaviors to determine optimal life-cycle management policies. It incorporates synergetic branches and hierarchical rewards, factorizing the action space and, thereby, diminishing system complexity from exponential to linear with respect to the number of bridges. Extensive experiments based on a realistic case study demonstrate that the proposed method outperforms expert maintenance strategies and state-of-the-art decision-making methods. Overall, the proposed DRL framework can assist engineers by offering adaptive solutions to maintenance planning. It also provides solutions that address large action spaces within complex systems.
Original language | English |
---|---|
Article number | 105302 |
Number of pages | 19 |
Journal | Automation in Construction |
Volume | 160 |
DOIs | |
Publication status | Published - 2024 |
Bibliographical note
Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-careOtherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.
Funding
This study has been supported by the Research Grants Council of Hong Kong (PolyU 15225722), the Innovation and Technology Commission of Hong Kong SAR Government to the Hong Kong Branch of National Engineering Research Center on Rail Transit Electrification and Automation (K-BBY1), the Research Institute for Sustainable Urban Development , the Hong Kong Polytechnic University (PolyU 1-BBWM), and Centrally Funded Postdoctoral Fellowship Scheme (PolyU 1-YXB5), the TU Delft AI Labs program. The support is gratefully acknowledged. The opinions and conclusions presented in this study are those of the authors and do not necessarily reflect the views of the sponsoring organizations.Keywords
- deep reinforcement learning
- infrastructure management
- maintenance optimization
- hierarchical reward
- life-cycle analysis
- large discrete action spaces