Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring.
environmental monitoring
imputation
missing data
time series
wireless sensor networks
Journal
Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366
Informations de publication
Date de publication:
10 Apr 2024
10 Apr 2024
Historique:
received:
05
03
2024
revised:
04
04
2024
accepted:
08
04
2024
medline:
27
4
2024
pubmed:
27
4
2024
entrez:
27
4
2024
Statut:
epublish
Résumé
Over the past few years, the scale of sensor networks has greatly expanded. This generates extended spatiotemporal datasets, which form a crucial information resource in numerous fields, ranging from sports and healthcare to environmental science and surveillance. Unfortunately, these datasets often contain missing values due to systematic or inadvertent sensor misoperation. This incompleteness hampers the subsequent data analysis, yet addressing these missing observations forms a challenging problem. This is especially the case when both the temporal correlation of timestamps within a single sensor and the spatial correlation between sensors are important. Here, we apply and evaluate 12 imputation methods to complete the missing values in a dataset originating from large-scale environmental monitoring. As part of a large citizen science project, IoT-based microclimate sensors were deployed for six months in 4400 gardens across the region of Flanders, generating 15-min recordings of temperature and soil moisture. Methods based on spatial recovery as well as time-based imputation were evaluated, including Spline Interpolation, MissForest, MICE, MCMC, M-RNN, BRITS, and others. The performance of these imputation methods was evaluated for different proportions of missing data (ranging from 10% to 50%), as well as a realistic missing value scenario. Techniques leveraging the spatial features of the data tend to outperform the time-based methods, with matrix completion techniques providing the best performance. Our results therefore provide a tool to maximize the benefit from costly, large-scale environmental monitoring efforts.
Identifiants
pubmed: 38676032
pii: s24082416
doi: 10.3390/s24082416
pii:
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Subventions
Organisme : The project "CurieuzeNeuzen in de Tuin" was funded by a Citizen Science grant from the Department of Economy, Science and Innovation (Flanders) and through a partnership between UAntwerpen, De Standaard, Rabobank Belgium, Orange Belgium
Organisme : Flanders Environment Agency (VMM), Departement Omgeving (Flanders), and the Flemish institute for technological research (VITO), with additional financial contributions from Aquafin and Bioplanet.