Towards the Improvement of Soil Salinity Mapping in a Data-Scarce Context Using Sentinel-2 Images in Machine-Learning Models.
Random-Forest
Sentinel-2
Support Vector Machine
data scarcity
machine-learning
soil salinity
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
22 Nov 2023
22 Nov 2023
Historique:
received:
28
09
2023
revised:
10
11
2023
accepted:
20
11
2023
medline:
9
12
2023
pubmed:
9
12
2023
entrez:
9
12
2023
Statut:
epublish
Résumé
Several recent studies have evidenced the relevance of machine-learning for soil salinity mapping using Sentinel-2 reflectance as input data and field soil salinity measurement (i.e., Electrical Conductivity-EC) as the target. As soil EC monitoring is costly and time consuming, most learning databases used for training/validation rely on a limited number of soil samples, which can affect the model consistency. Based on the low soil salinity variation at the Sentinel-2 pixel resolution, this study proposes to increase the learning database's number of observations by assigning the EC value obtained on the sampled pixel to the eight neighboring pixels. The method allowed extending the original learning database made up of 97 field EC measurements (OD) to an enhanced learning database made up of 691 observations (ED). Two classification machine-learning models (i.e., Random Forest-RF and Support Vector Machine-SVM) were trained with both OD and ED to assess the efficiency of the proposed method by comparing the models' outcomes with EC observations not used in the models´ training. The use of ED led to a significant increase in both models' consistency with the overall accuracy of the RF (SVM) model increasing from 0.25 (0.26) when using the OD to 0.77 (0.55) when using ED. This corresponds to an improvement of approximately 208% and 111%, respectively. Besides the improved accuracy reached with the ED database, the results showed that the RF model provided better soil salinity estimations than the SVM model and that feature selection (i.e., Variance Inflation Factor-VIF and/or Genetic Algorithm-GA) increase both models´ reliability, with GA being the most efficient. This study highlights the potential of machine-learning and Sentinel-2 image combination for soil salinity monitoring in a data-scarce context, and shows the importance of both model and features selection for an optimum machine-learning set-up.
Identifiants
pubmed: 38067701
pii: s23239328
doi: 10.3390/s23239328
pmc: PMC10708742
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : Agropolis Fondation
ID : 2001-032
Organisme : Centre National d'Etudes Spatiales (CNES) in the framework of the QUIMONOS project
ID : NA
Références
Sensors (Basel). 2017 Dec 22;18(1):
pubmed: 29271909
Sensors (Basel). 2023 Aug 07;23(15):
pubmed: 37571787
Sci Total Environ. 2020 Mar 10;707:136092
pubmed: 31972911
Sensors (Basel). 2019 Jan 30;19(3):
pubmed: 30704120
IEEE Trans Neural Netw. 1999;10(5):988-99
pubmed: 18252602
Plants (Basel). 2022 Mar 08;11(6):
pubmed: 35336599
Sensors (Basel). 2017 Jun 21;17(6):
pubmed: 28635625
Sensors (Basel). 2023 Sep 01;23(17):
pubmed: 37688040
Saudi J Biol Sci. 2015 Mar;22(2):123-31
pubmed: 25737642
Sensors (Basel). 2017 Aug 26;17(9):
pubmed: 28846601