Using machine learning approach to reproduce the measured feature and understand the model-to-measurement discrepancy of atmospheric formaldehyde.
FTIR
GEOS-Chem
HCHO
Machine learning
Remote sensing
Journal
The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500
Informations de publication
Date de publication:
10 Dec 2022
10 Dec 2022
Historique:
received:
21
06
2022
revised:
10
08
2022
accepted:
21
08
2022
pubmed:
27
8
2022
medline:
21
10
2022
entrez:
26
8
2022
Statut:
ppublish
Résumé
The solar absorption spectrometry in the infrared spectral region, using high-resolution Fourier transform infrared (FTIR) spectrometer, has been established as a powerful tool in atmospheric science. These observations cannot be performed continuously, for example, clouds prevent observations. On the other hand, chemical transport models give continuously data. Their results depend on the knowledge of emission inventories, the chemistry involved, and the meteorological fields, yielding to potential biases between measurements and simulations. In our study we concentrated on Formaldehyde (HCHO) and used machine learning approach to fill the gap between the observations, performed on an irregular time scale and having their measurement lacks, and model data, giving continuous data, but having potential variable biases. The proposed machine learning approach is based on the Light Gradient Boosting Machine (LightGBM) algorithm and created by using GEOS-Chem simulations, meteorological fields, emission inventory, and is referred to as the GEOS-Chem-LightGBM model. The results of established GEOS-Chem-LightGBM model have generated consistent HCHO predictions with the ground-based FTIR and satellite (OMI and TROPOMI) observations. In order to understand the GEOS-Chem model to measurement discrepancy, we have investigated the contribution of each input variable to GEOS-Chem-LightGBM model HCHO predictions through the SHapely Additive exPlanations (SHAP) approach. We found that the GEOS-Chem model underestimates the sensitivities of HCHO total column to most photochemical variables, contributing to lower amplitudes of diurnal cycle and seasonal cycle by the GEOS-Chem model. By correcting the model-to-measurement discrepancy, the sensitivities of HCHO total column to all variables by the GEOS-Chem-LightGBM became to be in good agreement with the FTIR observations. As a result, GEOS-Chem-LightGBM model has significantly improved the performance of HCHO predictions compared to the GEOS-Chem alone. The proposed GEOS-Chem-LightGBM model can be extendible to other atmospheric constituents obtained by various measurement techniques and platforms, and is expected to have wide applications.
Identifiants
pubmed: 36028030
pii: S0048-9697(22)05370-0
doi: 10.1016/j.scitotenv.2022.158271
pii:
doi:
Substances chimiques
Air Pollutants
0
Formaldehyde
1HG84L3525
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
158271Informations de copyright
Copyright © 2022 Elsevier B.V. All rights reserved.
Déclaration de conflit d'intérêts
Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.