Spatial hazard assessment of the PM10 using machine learning models in Barcelona, Spain.

Air quality Bagged classification and regression trees Hazard assessment Mixture discriminate analysis Particulate matter Random forest Simulated annealing

Journal

The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500

Informations de publication

Date de publication:
20 Jan 2020
Historique:
received: 10 08 2019
revised: 13 09 2019
accepted: 14 09 2019
pubmed: 11 11 2019
medline: 11 11 2019
entrez: 10 11 2019
Statut: ppublish

Résumé

Air pollution, and especially atmospheric particulate matter (PM), has a profound impact on human mortality and morbidity, environment, and ecological system. Accordingly, it is very relevant predicting air quality. Although the application of the machine learning (ML) models for predicting air quality parameters, such as PM concentrations, has been evaluated in previous studies, those on the spatial hazard modeling of them are very limited. Due to the high potential of the ML models, the spatial modeling of PM can help managers to identify the pollution hotspots. Accordingly, this study aims at developing new ML models, such as Random Forest (RF), Bagged Classification and Regression Trees (Bagged CART), and Mixture Discriminate Analysis (MDA) for the hazard prediction of PM10 (particles with a diameter less than 10 µm) in the Barcelona Province, Spain. According to the annual PM10 concentration in 75 stations, the healthy and unhealthy locations are determined, and a ratio 70/30 (53/22 stations) is applied for calibrating and validating the ML models to predict the most hazardous areas for PM10. In order to identify the influential variables of PM modeling, the simulated annealing (SA) feature selection method is used. Seven features, among the thirteen features, are selected as critical features. According to the results, all the three-machine learning (ML) models achieve an excellent performance (Accuracy > 87% and precision > 86%). However, the Bagged CART and RF models have the same performance and higher than the MDA model. Spatial hazard maps predicted by the three models indicate that the high hazardous areas are located in the middle of the Barcelona Province more than in the Barcelona's Metropolitan Area.

Identifiants

pubmed: 31704408
pii: S0048-9697(19)34465-1
doi: 10.1016/j.scitotenv.2019.134474
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

134474

Informations de copyright

Copyright © 2019 Elsevier B.V. All rights reserved.

Auteurs

Bahram Choubin (B)

Soil Conservation and Watershed Management Research Department, West Azarbaijan Agricultural and Natural Resources Research and Education Center, AREEO, Urmia, Iran.

Mahsa Abdolshahnejad (M)

Department of Reclamation of Arid and Mountainous Regions, Faculty of Natural Resources, University of Tehran, Karaj, Iran.

Ehsan Moradi (E)

Department of Reclamation of Arid and Mountainous Regions, Faculty of Natural Resources, University of Tehran, Karaj, Iran.

Xavier Querol (X)

Institute of Environmental Assessment and Water Research (IDAEA), Spanish Council for Scientific Research (CSIC), Barcelona, Spain.

Amir Mosavi (A)

School of the Built Environment, Oxford Brookes University, Oxford, UK; Institute of Automation, Kando Kalman Faculty of Electrical Engineering, Obuda University, 1034 Budapest, Hungary.

Shahaboddin Shamshirband (S)

Institute of Research and Development, Duy Tan University, Da Nang 550000, Viet Nam. Electronic address: shamshirbandshahaboddin@duytan.edu.vn.

Pedram Ghamisi (P)

Exploration Devision, Helmholtz Institute Freiberg for Resource Technology, Helmholtz-Zentrum Dresden-Rossendorf, Dresden, Germany.

Classifications MeSH