Missing data in amortized simulation-based neural posterior estimation.


Journal

PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922

Informations de publication

Date de publication:
17 Jun 2024
Historique:
received: 12 07 2023
accepted: 21 05 2024
medline: 17 6 2024
pubmed: 17 6 2024
entrez: 17 6 2024
Statut: aheadofprint

Résumé

Amortized simulation-based neural posterior estimation provides a novel machine learning based approach for solving parameter estimation problems. It has been shown to be computationally efficient and able to handle complex models and data sets. Yet, the available approach cannot handle the in experimental studies ubiquitous case of missing data, and might provide incorrect posterior estimates. In this work, we discuss various ways of encoding missing data and integrate them into the training and inference process. We implement the approaches in the BayesFlow methodology, an amortized estimation framework based on invertible neural networks, and evaluate their performance on multiple test problems. We find that an approach in which the data vector is augmented with binary indicators of presence or absence of values performs the most robustly. Indeed, it improved the performance also for the simpler problem of data sets with variable length. Accordingly, we demonstrate that amortized simulation-based inference approaches are applicable even with missing data, and we provide a guideline for their handling, which is relevant for a broad spectrum of applications.

Identifiants

pubmed: 38885265
doi: 10.1371/journal.pcbi.1012184
pii: PCOMPBIOL-D-23-01114
doi:

Types de publication

Journal Article

Langues

eng

Sous-ensembles de citation

IM

Pagination

e1012184

Informations de copyright

Copyright: © 2024 Wang et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.

Déclaration de conflit d'intérêts

The authors have declared that no competing interests exist.

Auteurs

Zijian Wang (Z)

University of Bonn, Life and Medical Sciences Institute, Bonn, Germany.

Jan Hasenauer (J)

University of Bonn, Life and Medical Sciences Institute, Bonn, Germany.
Helmholtz Center Munich, Computational Health Center, Neuherberg, Germany.

Yannik Schälte (Y)

University of Bonn, Life and Medical Sciences Institute, Bonn, Germany.
Helmholtz Center Munich, Computational Health Center, Neuherberg, Germany.
Technical University Munich, Center for Mathematics, Garching, Germany.

Classifications MeSH