TY - JOUR
T1 - Generating quality datasets for real-time security assessment
T2 - Balancing historically relevant and rare feasible operating conditions
AU - Bugaje, Al Amin B.
AU - Cremer, Jochen L.
AU - Strbac, Goran
PY - 2023
Y1 - 2023
N2 - This paper presents a novel, unified approach for generating high-quality datasets for training machine-learned models for real-time security assessment in power systems. Synthetic data generation methods that extrapolate beyond historical data can be inefficient in generating feasible and rare operating conditions (OCs). The proposed approach balances the trade-off between historically relevant OCs and rare but feasible OCs. Unlike conventional methods that rely on historical records or generic sampling, our approach results in datasets that generalise well beyond similar distributions. The proposed approach is validated through experiments on the IEEE 118-bus system, where a decision tree model trained on data generated using our approach achieved 97% accuracy in predicting the security label of rare OCs, outperforming baseline approaches by 41% and 20%. This work is crucial for deploying reliable machine-learned models for real-time security assessment in power systems undergoing decarbonisation and integrating renewable energy sources.
AB - This paper presents a novel, unified approach for generating high-quality datasets for training machine-learned models for real-time security assessment in power systems. Synthetic data generation methods that extrapolate beyond historical data can be inefficient in generating feasible and rare operating conditions (OCs). The proposed approach balances the trade-off between historically relevant OCs and rare but feasible OCs. Unlike conventional methods that rely on historical records or generic sampling, our approach results in datasets that generalise well beyond similar distributions. The proposed approach is validated through experiments on the IEEE 118-bus system, where a decision tree model trained on data generated using our approach achieved 97% accuracy in predicting the security label of rare OCs, outperforming baseline approaches by 41% and 20%. This work is crucial for deploying reliable machine-learned models for real-time security assessment in power systems undergoing decarbonisation and integrating renewable energy sources.
KW - Data generation
KW - Dynamic security assessment
KW - Machine learning
KW - Power system operation
UR - http://www.scopus.com/inward/record.url?scp=85167990335&partnerID=8YFLogxK
U2 - 10.1016/j.ijepes.2023.109427
DO - 10.1016/j.ijepes.2023.109427
M3 - Article
AN - SCOPUS:85167990335
VL - 154
JO - International Journal of Electrical Power & Energy Systems
JF - International Journal of Electrical Power & Energy Systems
SN - 0142-0615
M1 - 109427
ER -