TY - JOUR
T1 - Application of clustering algorithms for dimensionality reduction in infrastructure resilience prediction models
AU - Balakrishnan, S.
AU - Cassottana, Beatrice
AU - Verma, Arun
PY - 2024
Y1 - 2024
N2 - Recent studies increasingly adopt simulation-based machine learning (ML) models to analyse critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the ’curse of dimensionality’. A clustering-based method is presented that simultaneously minimises the problem of high-dimensionality and improves the prediction accuracy of ML models developed for resilience analysis in large-scale interdependent infrastructure networks. The methodology has three parts: (a) generation of simulation dataset, (b) network component clustering, and (c) dimensionality reduction and development of prediction models. First, an interdependent infrastructure simulation model simulates the network-wide consequences of various disruptive events. The component-level features are extracted from the simulated data. Next, clustering algorithms are used to derive the cluster-level features by grouping component-level features based on their topological and functional characteristics. Finally, ML algorithms are used to develop models that predict the network-wide impacts of disruptive events using the cluster-level features. The applicability of the method is demonstrated using an interdependent power-water-transport testbed. The proposed method can be used to develop decision-support tools for post-disaster recovery of infrastructure networks.
AB - Recent studies increasingly adopt simulation-based machine learning (ML) models to analyse critical infrastructure system resilience. For realistic applications, these ML models consider the component-level characteristics that influence the network response during emergencies. However, such an approach could result in a large number of features and cause ML models to suffer from the ’curse of dimensionality’. A clustering-based method is presented that simultaneously minimises the problem of high-dimensionality and improves the prediction accuracy of ML models developed for resilience analysis in large-scale interdependent infrastructure networks. The methodology has three parts: (a) generation of simulation dataset, (b) network component clustering, and (c) dimensionality reduction and development of prediction models. First, an interdependent infrastructure simulation model simulates the network-wide consequences of various disruptive events. The component-level features are extracted from the simulated data. Next, clustering algorithms are used to derive the cluster-level features by grouping component-level features based on their topological and functional characteristics. Finally, ML algorithms are used to develop models that predict the network-wide impacts of disruptive events using the cluster-level features. The applicability of the method is demonstrated using an interdependent power-water-transport testbed. The proposed method can be used to develop decision-support tools for post-disaster recovery of infrastructure networks.
U2 - 10.1080/15732479.2024.2366958
DO - 10.1080/15732479.2024.2366958
M3 - Article
SN - 1744-8980
SP - 1
EP - 13
JO - Structure and Infrastructure Engineering
JF - Structure and Infrastructure Engineering
ER -