Missing Value Imputation of Wireless Sensor Data for Environmental Monitoring.

environmental monitoring imputation missing data time series wireless sensor networks

Journal

Sensors (Basel, Switzerland)
ISSN: 1424-8220
Titre abrégé: Sensors (Basel)
Pays: Switzerland
ID NLM: 101204366

Informations de publication

Date de publication:
10 Apr 2024
Historique:
received: 05 03 2024
revised: 04 04 2024
accepted: 08 04 2024
medline: 27 4 2024
pubmed: 27 4 2024
entrez: 27 4 2024
Statut: epublish

Résumé

Over the past few years, the scale of sensor networks has greatly expanded. This generates extended spatiotemporal datasets, which form a crucial information resource in numerous fields, ranging from sports and healthcare to environmental science and surveillance. Unfortunately, these datasets often contain missing values due to systematic or inadvertent sensor misoperation. This incompleteness hampers the subsequent data analysis, yet addressing these missing observations forms a challenging problem. This is especially the case when both the temporal correlation of timestamps within a single sensor and the spatial correlation between sensors are important. Here, we apply and evaluate 12 imputation methods to complete the missing values in a dataset originating from large-scale environmental monitoring. As part of a large citizen science project, IoT-based microclimate sensors were deployed for six months in 4400 gardens across the region of Flanders, generating 15-min recordings of temperature and soil moisture. Methods based on spatial recovery as well as time-based imputation were evaluated, including Spline Interpolation, MissForest, MICE, MCMC, M-RNN, BRITS, and others. The performance of these imputation methods was evaluated for different proportions of missing data (ranging from 10% to 50%), as well as a realistic missing value scenario. Techniques leveraging the spatial features of the data tend to outperform the time-based methods, with matrix completion techniques providing the best performance. Our results therefore provide a tool to maximize the benefit from costly, large-scale environmental monitoring efforts.

Identifiants

pubmed: 38676032
pii: s24082416
doi: 10.3390/s24082416
pii:
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Subventions

Organisme : The project "CurieuzeNeuzen in de Tuin" was funded by a Citizen Science grant from the Department of Economy, Science and Innovation (Flanders) and through a partnership between UAntwerpen, De Standaard, Rabobank Belgium, Orange Belgium
Organisme : Flanders Environment Agency (VMM), Departement Omgeving (Flanders), and the Flemish institute for technological research (VITO), with additional financial contributions from Aquafin and Bioplanet.

Auteurs

Thomas Decorte (T)

Department of Mathematics, University of Antwerp-imec, Middelheimlaan 1, 2000 Antwerp, Belgium.

Steven Mortier (S)

IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium.

Jonas J Lembrechts (JJ)

Plants and Ecosystems, Department of Biology, University of Antwerp, Universiteitsplein 1, 2610 Antwerp, Belgium.

Filip J R Meysman (FJR)

Geobiology, Department of Biology, University of Antwerp, Universiteitsplein 1, 2610 Antwerp, Belgium.

Steven Latré (S)

IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium.

Erik Mannens (E)

IDLab, Department of Computer Science, University of Antwerp-imec, Sint-Pietersvliet 7, 2000 Antwerp, Belgium.

Tim Verdonck (T)

Department of Mathematics, University of Antwerp-imec, Middelheimlaan 1, 2000 Antwerp, Belgium.

Classifications MeSH