The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty

Mohammad Saifullah; Charalampos Andriotis; Konstantinos G. Papakonstantinou

The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty

Mohammad Saifullah, Charalampos Andriotis, Konstantinos G. Papakonstantinou

Architectural Technology

Research output: Contribution to conference › Paper › peer-review

14 Downloads (Pure)

Abstract

To preserve structural safety of deteriorating engineering systems through optimal maintenance, it is imperative to efficiently integrate structural health information with decision-making optimization frameworks. Although there may be abundance of available data, these are often uncertain and incomplete. In addition, joint inspection and maintenance (I&M) optimization is inherently complex due to high-dimensional state and action spaces, stochastic objectives, long planning horizons, and various constraints, among others. As shown recently, these computational challenges can be effectively addressed through optimization principles of Partially Observable Markov Decision Processes (POMDPs) and constrained Deep Reinforcement Learning (DRL). The POMDP framework provides a way of updating the decision-maker's perception about the system state by naturally incorporating the Value of Information (VoI) in the optimality equations. As such, optimal observation-gathering actions are those which guide maintenance decisions towards reduced life-cycle costs and risks. The role of VoI in DRL-driven I&M has also been shown to be central to the formation of policy gradients, which are necessary to obtain the optimal I&M plan with deep learning actor-critic architectures. Leveraging this property, a recently devised DRL architecture is further examined in this work, consisting of fully decoupled 'maintainer' and 'inspector' actors, which allow for greater efficacy and interpretability in multi-agent DRL settings. Several numerical analyses are carried out to assess the performance of the relevant architectures on stochastic systems with a varying number of components, multiple maintenance-inspection actions per component, and system-level failure risks.

Original language	English
Number of pages	8
Publication status	Published - 2023
Event	14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023 - Trinity College Dublin, Dublin, Ireland Duration: 9 Jul 2023 → 13 Jul 2023 https://icasp14.com/

Conference

Conference	14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023
Abbreviated title	ICASP14
Country/Territory	Ireland
City	Dublin
Period	9/07/23 → 13/07/23
Internet address	https://icasp14.com/

Access to Document

The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertaintyFinal published version, 1.19 MBLicence: CC BY-NC-SA

http://hdl.handle.net/2262/103618

Cite this

@conference{11db84c989de4f0e9cfd29bae7277040,

title = "The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty",

abstract = "To preserve structural safety of deteriorating engineering systems through optimal maintenance, it is imperative to efficiently integrate structural health information with decision-making optimization frameworks. Although there may be abundance of available data, these are often uncertain and incomplete. In addition, joint inspection and maintenance (I&M) optimization is inherently complex due to high-dimensional state and action spaces, stochastic objectives, long planning horizons, and various constraints, among others. As shown recently, these computational challenges can be effectively addressed through optimization principles of Partially Observable Markov Decision Processes (POMDPs) and constrained Deep Reinforcement Learning (DRL). The POMDP framework provides a way of updating the decision-maker's perception about the system state by naturally incorporating the Value of Information (VoI) in the optimality equations. As such, optimal observation-gathering actions are those which guide maintenance decisions towards reduced life-cycle costs and risks. The role of VoI in DRL-driven I&M has also been shown to be central to the formation of policy gradients, which are necessary to obtain the optimal I&M plan with deep learning actor-critic architectures. Leveraging this property, a recently devised DRL architecture is further examined in this work, consisting of fully decoupled 'maintainer' and 'inspector' actors, which allow for greater efficacy and interpretability in multi-agent DRL settings. Several numerical analyses are carried out to assess the performance of the relevant architectures on stochastic systems with a varying number of components, multiple maintenance-inspection actions per component, and system-level failure risks.",

author = "Mohammad Saifullah and Charalampos Andriotis and Papakonstantinou, {Konstantinos G.}",

year = "2023",

language = "English",

note = "14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023, ICASP14 ; Conference date: 09-07-2023 Through 13-07-2023",

url = "https://icasp14.com/",

}

Saifullah, M, Andriotis, C & Papakonstantinou, KG 2023, 'The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty', Paper presented at 14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023, Dublin, Ireland, 9/07/23 - 13/07/23. <http://hdl.handle.net/2262/103618>

The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty. / Saifullah, Mohammad; Andriotis, Charalampos; Papakonstantinou, Konstantinos G.
2023. Paper presented at 14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023, Dublin, Ireland.

Research output: Contribution to conference › Paper › peer-review

TY - CONF

T1 - The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty

AU - Saifullah, Mohammad

AU - Andriotis, Charalampos

AU - Papakonstantinou, Konstantinos G.

PY - 2023

Y1 - 2023

N2 - To preserve structural safety of deteriorating engineering systems through optimal maintenance, it is imperative to efficiently integrate structural health information with decision-making optimization frameworks. Although there may be abundance of available data, these are often uncertain and incomplete. In addition, joint inspection and maintenance (I&M) optimization is inherently complex due to high-dimensional state and action spaces, stochastic objectives, long planning horizons, and various constraints, among others. As shown recently, these computational challenges can be effectively addressed through optimization principles of Partially Observable Markov Decision Processes (POMDPs) and constrained Deep Reinforcement Learning (DRL). The POMDP framework provides a way of updating the decision-maker's perception about the system state by naturally incorporating the Value of Information (VoI) in the optimality equations. As such, optimal observation-gathering actions are those which guide maintenance decisions towards reduced life-cycle costs and risks. The role of VoI in DRL-driven I&M has also been shown to be central to the formation of policy gradients, which are necessary to obtain the optimal I&M plan with deep learning actor-critic architectures. Leveraging this property, a recently devised DRL architecture is further examined in this work, consisting of fully decoupled 'maintainer' and 'inspector' actors, which allow for greater efficacy and interpretability in multi-agent DRL settings. Several numerical analyses are carried out to assess the performance of the relevant architectures on stochastic systems with a varying number of components, multiple maintenance-inspection actions per component, and system-level failure risks.

AB - To preserve structural safety of deteriorating engineering systems through optimal maintenance, it is imperative to efficiently integrate structural health information with decision-making optimization frameworks. Although there may be abundance of available data, these are often uncertain and incomplete. In addition, joint inspection and maintenance (I&M) optimization is inherently complex due to high-dimensional state and action spaces, stochastic objectives, long planning horizons, and various constraints, among others. As shown recently, these computational challenges can be effectively addressed through optimization principles of Partially Observable Markov Decision Processes (POMDPs) and constrained Deep Reinforcement Learning (DRL). The POMDP framework provides a way of updating the decision-maker's perception about the system state by naturally incorporating the Value of Information (VoI) in the optimality equations. As such, optimal observation-gathering actions are those which guide maintenance decisions towards reduced life-cycle costs and risks. The role of VoI in DRL-driven I&M has also been shown to be central to the formation of policy gradients, which are necessary to obtain the optimal I&M plan with deep learning actor-critic architectures. Leveraging this property, a recently devised DRL architecture is further examined in this work, consisting of fully decoupled 'maintainer' and 'inspector' actors, which allow for greater efficacy and interpretability in multi-agent DRL settings. Several numerical analyses are carried out to assess the performance of the relevant architectures on stochastic systems with a varying number of components, multiple maintenance-inspection actions per component, and system-level failure risks.

M3 - Paper

T2 - 14th International Conference on Applications of Statistics and Probability in Civil Engineering 2023

Y2 - 9 July 2023 through 13 July 2023

ER -

The role of value of information in multi-agent deep reinforcement learning for optimal decision-making under uncertainty

Abstract

Conference

Access to Document

Fingerprint

Cite this