TY - JOUR
T1 - Ozone exceedance forecasting with enhanced extreme instance augmentation
T2 - A case study in Germany
AU - Deng, Tuo
AU - Manders, Astrid
AU - Segers, Arjo
AU - Heemink, Arnold Willem
AU - Lin, Hai Xiang
PY - 2024
Y1 - 2024
N2 - Accurately forecasting ozone levels that exceed specific thresholds is pivotal for mitigating adverse effects on both the environment and public health. However, predicting such ozone exceedances remains challenging due to the infrequent occurrence of high-concentration ozone data. This research, leveraging data from 57 German monitoring stations from 1999 to 2018, introduces an Enhanced Extreme Instance Augmentation Random Forest (EEIA-RF) approach that significantly improves the prediction of days when the maximum daily 8-hour average ozone concentrations exceed 120μg/m3. A pre-trained machine learning model is used to generate additional high-concentration data, which, combined with selectively reduced low-concentration data, forms a new dataset for training a refined model. This method achieved an improvement of at least 8% in the accuracy of predicting days with ozone exceedances across Germany. Our experiment underscores the approach's value in enhancing atmospheric modeling and supporting public health advisories and environmental policy-making related to ozone pollution.
AB - Accurately forecasting ozone levels that exceed specific thresholds is pivotal for mitigating adverse effects on both the environment and public health. However, predicting such ozone exceedances remains challenging due to the infrequent occurrence of high-concentration ozone data. This research, leveraging data from 57 German monitoring stations from 1999 to 2018, introduces an Enhanced Extreme Instance Augmentation Random Forest (EEIA-RF) approach that significantly improves the prediction of days when the maximum daily 8-hour average ozone concentrations exceed 120μg/m3. A pre-trained machine learning model is used to generate additional high-concentration data, which, combined with selectively reduced low-concentration data, forms a new dataset for training a refined model. This method achieved an improvement of at least 8% in the accuracy of predicting days with ozone exceedances across Germany. Our experiment underscores the approach's value in enhancing atmospheric modeling and supporting public health advisories and environmental policy-making related to ozone pollution.
KW - Air quality
KW - Extreme instance augmentation
KW - Ozone prediction
KW - Random forest
UR - http://www.scopus.com/inward/record.url?scp=85200254837&partnerID=8YFLogxK
U2 - 10.1016/j.envsoft.2024.106162
DO - 10.1016/j.envsoft.2024.106162
M3 - Article
AN - SCOPUS:85200254837
SN - 1364-8152
VL - 181
JO - Environmental Modelling and Software
JF - Environmental Modelling and Software
M1 - 106162
ER -