Spatial hazard assessment of the PM10 using machine learning models in Barcelona, Spain.
Air quality
Bagged classification and regression trees
Hazard assessment
Mixture discriminate analysis
Particulate matter
Random forest
Simulated annealing
Journal
The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500
Informations de publication
Date de publication:
20 Jan 2020
20 Jan 2020
Historique:
received:
10
08
2019
revised:
13
09
2019
accepted:
14
09
2019
pubmed:
11
11
2019
medline:
11
11
2019
entrez:
10
11
2019
Statut:
ppublish
Résumé
Air pollution, and especially atmospheric particulate matter (PM), has a profound impact on human mortality and morbidity, environment, and ecological system. Accordingly, it is very relevant predicting air quality. Although the application of the machine learning (ML) models for predicting air quality parameters, such as PM concentrations, has been evaluated in previous studies, those on the spatial hazard modeling of them are very limited. Due to the high potential of the ML models, the spatial modeling of PM can help managers to identify the pollution hotspots. Accordingly, this study aims at developing new ML models, such as Random Forest (RF), Bagged Classification and Regression Trees (Bagged CART), and Mixture Discriminate Analysis (MDA) for the hazard prediction of PM10 (particles with a diameter less than 10 µm) in the Barcelona Province, Spain. According to the annual PM10 concentration in 75 stations, the healthy and unhealthy locations are determined, and a ratio 70/30 (53/22 stations) is applied for calibrating and validating the ML models to predict the most hazardous areas for PM10. In order to identify the influential variables of PM modeling, the simulated annealing (SA) feature selection method is used. Seven features, among the thirteen features, are selected as critical features. According to the results, all the three-machine learning (ML) models achieve an excellent performance (Accuracy > 87% and precision > 86%). However, the Bagged CART and RF models have the same performance and higher than the MDA model. Spatial hazard maps predicted by the three models indicate that the high hazardous areas are located in the middle of the Barcelona Province more than in the Barcelona's Metropolitan Area.
Identifiants
pubmed: 31704408
pii: S0048-9697(19)34465-1
doi: 10.1016/j.scitotenv.2019.134474
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
134474Informations de copyright
Copyright © 2019 Elsevier B.V. All rights reserved.