Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

C. P. Andriotis; K. G. Papakonstantinou

doi:10.1016/j.ress.2021.107551

Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

C. P. Andriotis^*, K. G. Papakonstantinou

^*Corresponding author for this work

Structural Design & Mechanics

Research output: Contribution to journal › Article › Scientific › peer-review

49 Citations (Scopus)

33 Downloads (Pure)

Abstract

Determination of inspection and maintenance policies for minimizing long-term risks and costs in deteriorating engineering environments constitutes a complex optimization problem. Major computational challenges include the (i) curse of dimensionality, due to exponential scaling of state/action set cardinalities with the number of components; (ii) curse of history, related to exponentially growing decision-trees with the number of decision-steps; (iii) presence of state uncertainties, induced by inherent environment stochasticity and variability of inspection/monitoring measurements; (iv) presence of constraints, pertaining to stochastic long-term limitations, due to resource scarcity and other infeasible/undesirable system responses. In this work, these challenges are addressed within a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL). POMDPs optimally tackle (ii)-(iii), combining stochastic dynamic programming with Bayesian inference principles. Multi-agent DRL addresses (i), through deep function parametrizations and decentralized control assumptions. Challenge (iv) is herein handled through proper state augmentation and Lagrangian relaxation, with emphasis on life-cycle risk-based constraints and budget limitations. The underlying algorithmic steps are provided, and the proposed framework is found to outperform well-established policy baselines and facilitate adept prescription of inspection and intervention actions, in cases where decisions must be made in the most resource- and risk-aware manner.

Original language	English
Article number	107551
Number of pages	16
Journal	Reliability Engineering and System Safety
Volume	212
DOIs	https://doi.org/10.1016/j.ress.2021.107551
Publication status	Published - 2021

Bibliographical note

Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care

Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

Keywords

Constrained stochastic optimization
Decentralized multi-agent control
Deep reinforcement learning
Inspection and maintenance planning
Partially observable Markov decision processes
System risk and reliability

Access to Document

10.1016/j.ress.2021.107551

1-s2.0-S095183202100106X-mainFinal published version, 2.83 MB

Cite this

@article{0713e10e31f1499d89dc940e50faef3a,

title = "Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints",

abstract = "Determination of inspection and maintenance policies for minimizing long-term risks and costs in deteriorating engineering environments constitutes a complex optimization problem. Major computational challenges include the (i) curse of dimensionality, due to exponential scaling of state/action set cardinalities with the number of components; (ii) curse of history, related to exponentially growing decision-trees with the number of decision-steps; (iii) presence of state uncertainties, induced by inherent environment stochasticity and variability of inspection/monitoring measurements; (iv) presence of constraints, pertaining to stochastic long-term limitations, due to resource scarcity and other infeasible/undesirable system responses. In this work, these challenges are addressed within a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL). POMDPs optimally tackle (ii)-(iii), combining stochastic dynamic programming with Bayesian inference principles. Multi-agent DRL addresses (i), through deep function parametrizations and decentralized control assumptions. Challenge (iv) is herein handled through proper state augmentation and Lagrangian relaxation, with emphasis on life-cycle risk-based constraints and budget limitations. The underlying algorithmic steps are provided, and the proposed framework is found to outperform well-established policy baselines and facilitate adept prescription of inspection and intervention actions, in cases where decisions must be made in the most resource- and risk-aware manner.",

keywords = "Constrained stochastic optimization, Decentralized multi-agent control, Deep reinforcement learning, Inspection and maintenance planning, Partially observable Markov decision processes, System risk and reliability",

author = "Andriotis, {C. P.} and Papakonstantinou, {K. G.}",

note = "Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.",

year = "2021",

doi = "10.1016/j.ress.2021.107551",

language = "English",

volume = "212",

journal = "Reliability Engineering and System Safety",

issn = "0951-8320",

publisher = "Elsevier",

}

TY - JOUR

T1 - Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

AU - Andriotis, C. P.

AU - Papakonstantinou, K. G.

N1 - Green Open Access added to TU Delft Institutional Repository 'You share, we take care!' - Taverne project https://www.openaccess.nl/en/you-share-we-take-care Otherwise as indicated in the copyright section: the publisher is the copyright holder of this work and the author uses the Dutch legislation to make this work public.

PY - 2021

Y1 - 2021

N2 - Determination of inspection and maintenance policies for minimizing long-term risks and costs in deteriorating engineering environments constitutes a complex optimization problem. Major computational challenges include the (i) curse of dimensionality, due to exponential scaling of state/action set cardinalities with the number of components; (ii) curse of history, related to exponentially growing decision-trees with the number of decision-steps; (iii) presence of state uncertainties, induced by inherent environment stochasticity and variability of inspection/monitoring measurements; (iv) presence of constraints, pertaining to stochastic long-term limitations, due to resource scarcity and other infeasible/undesirable system responses. In this work, these challenges are addressed within a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL). POMDPs optimally tackle (ii)-(iii), combining stochastic dynamic programming with Bayesian inference principles. Multi-agent DRL addresses (i), through deep function parametrizations and decentralized control assumptions. Challenge (iv) is herein handled through proper state augmentation and Lagrangian relaxation, with emphasis on life-cycle risk-based constraints and budget limitations. The underlying algorithmic steps are provided, and the proposed framework is found to outperform well-established policy baselines and facilitate adept prescription of inspection and intervention actions, in cases where decisions must be made in the most resource- and risk-aware manner.

AB - Determination of inspection and maintenance policies for minimizing long-term risks and costs in deteriorating engineering environments constitutes a complex optimization problem. Major computational challenges include the (i) curse of dimensionality, due to exponential scaling of state/action set cardinalities with the number of components; (ii) curse of history, related to exponentially growing decision-trees with the number of decision-steps; (iii) presence of state uncertainties, induced by inherent environment stochasticity and variability of inspection/monitoring measurements; (iv) presence of constraints, pertaining to stochastic long-term limitations, due to resource scarcity and other infeasible/undesirable system responses. In this work, these challenges are addressed within a joint framework of constrained Partially Observable Markov Decision Processes (POMDP) and multi-agent Deep Reinforcement Learning (DRL). POMDPs optimally tackle (ii)-(iii), combining stochastic dynamic programming with Bayesian inference principles. Multi-agent DRL addresses (i), through deep function parametrizations and decentralized control assumptions. Challenge (iv) is herein handled through proper state augmentation and Lagrangian relaxation, with emphasis on life-cycle risk-based constraints and budget limitations. The underlying algorithmic steps are provided, and the proposed framework is found to outperform well-established policy baselines and facilitate adept prescription of inspection and intervention actions, in cases where decisions must be made in the most resource- and risk-aware manner.

KW - Constrained stochastic optimization

KW - Decentralized multi-agent control

KW - Deep reinforcement learning

KW - Inspection and maintenance planning

KW - Partially observable Markov decision processes

KW - System risk and reliability

UR - http://www.scopus.com/inward/record.url?scp=85104958870&partnerID=8YFLogxK

U2 - 10.1016/j.ress.2021.107551

DO - 10.1016/j.ress.2021.107551

M3 - Article

AN - SCOPUS:85104958870

SN - 0951-8320

VL - 212

JO - Reliability Engineering and System Safety

JF - Reliability Engineering and System Safety

M1 - 107551

ER -

Deep reinforcement learning driven inspection and maintenance planning under incomplete information and constraints

Abstract

Bibliographical note

Keywords

Access to Document

Other files and links

Fingerprint

Cite this