Predicting carbon and water vapor fluxes using machine learning and novel feature ranking algorithms.

Eddy covariance Feature ranking Machine learning Remote sensing Support vector machine

Journal

The Science of the total environment
ISSN: 1879-1026
Titre abrégé: Sci Total Environ
Pays: Netherlands
ID NLM: 0330500

Informations de publication

Date de publication:
25 Jun 2021
Historique:
received: 27 07 2020
revised: 15 12 2020
accepted: 08 01 2021
pubmed: 23 2 2021
medline: 23 2 2021
entrez: 22 2 2021
Statut: ppublish

Résumé

Gap-filling eddy covariance flux data using quantitative approaches has increased over the past decade. Numerous methods have been proposed previously, including look-up table approaches, parametric methods, process-based models, and machine learning. Particularly, the REddyProc package from the Max Planck Institute for Biogeochemistry and ONEFlux package from AmeriFlux have been widely used in many studies. However, there is no consensus regarding the optimal model and feature selection method that could be used for predicting different flux targets (Net Ecosystem Exchange, NEE; or Evapotranspiration -ET), due to the limited systematic comparative research based on the identical site-data. Here, we compared NEE and ET gap-filling/prediction performance of the least-square-based linear model, artificial neural network, random forest (RF), and support vector machine (SVM) using data obtained from four major row-crop and forage agroecosystems located in the subtropical or the climate-transition zones in the US. Additionally, we tested the impacts of different training-testing data partitioning settings, including a 10-fold time-series sequential (10FTS), a 10-fold cross validation (CV) routine with single data point (10FCV), daily (10FCVD), weekly (10FCVW) and monthly (10FCVM) gap length, and a 7/14-day flanking window (FW) approach; and implemented a novel Sliced Inverse Regression-based Recursive Feature Elimination algorithm (SIRRFE). We benchmarked the model performance against REddyProc and ONEFlux-produced results. Our results indicated that accurate NEE and ET prediction models could be systematically constructed using SVM/RF and only a few top informative features. The gap-filling performance of ONEFlux is generally satisfactory (R

Identifiants

pubmed: 33618314
pii: S0048-9697(21)00196-0
doi: 10.1016/j.scitotenv.2021.145130
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

145130

Informations de copyright

Copyright © 2021 Elsevier B.V. All rights reserved.

Déclaration de conflit d'intérêts

Declaration of competing interest The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.

Auteurs

Xia Cui (X)

Key Laboratory of Western China's Environmental Systems (Ministry of Education), College of Earth and Environmental Sciences, Lanzhou University, Lanzhou 730000, China. Electronic address: xiacui@lzu.edu.cn.

Thomas Goff (T)

Center for Computational Science, Middle Tennessee State University, Murfreesboro, TN 37132, USA.

Song Cui (S)

School of Agriculture, Middle Tennessee State University, Murfreesboro, TN 37132, USA.

Dorothy Menefee (D)

Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843, USA.

Qiang Wu (Q)

Department of Mathematical Sciences, Middle Tennessee State University, Murfreesboro, TN 37132, USA.

Nithya Rajan (N)

Department of Soil and Crop Sciences, Texas A&M University, College Station, TX, 77843, USA.

Shyam Nair (S)

Department of Agricultural Sciences and Engineering Technology, Sam Houston State University, Huntsville, TX 77341, USA.

Nate Phillips (N)

School of Agriculture, Middle Tennessee State University, Murfreesboro, TN 37132, USA.

Forbes Walker (F)

Department of Biosystems Engineering and Soil Science, University of Tennessee, Knoxville, TN 37996, USA.

Classifications MeSH