Informative and adaptive distances and summary statistics in sequential approximate Bayesian computation.
Journal
PloS one
ISSN: 1932-6203
Titre abrégé: PLoS One
Pays: United States
ID NLM: 101285081
Informations de publication
Date de publication:
2023
2023
Historique:
received:
19
07
2022
accepted:
02
05
2023
medline:
24
5
2023
pubmed:
22
5
2023
entrez:
22
5
2023
Statut:
epublish
Résumé
Calibrating model parameters on heterogeneous data can be challenging and inefficient. This holds especially for likelihood-free methods such as approximate Bayesian computation (ABC), which rely on the comparison of relevant features in simulated and observed data and are popular for otherwise intractable problems. To address this problem, methods have been developed to scale-normalize data, and to derive informative low-dimensional summary statistics using inverse regression models of parameters on data. However, while approaches only correcting for scale can be inefficient on partly uninformative data, the use of summary statistics can lead to information loss and relies on the accuracy of employed methods. In this work, we first show that the combination of adaptive scale normalization with regression-based summary statistics is advantageous on heterogeneous parameter scales. Second, we present an approach employing regression models not to transform data, but to inform sensitivity weights quantifying data informativeness. Third, we discuss problems for regression models under non-identifiability, and present a solution using target augmentation. We demonstrate improved accuracy and efficiency of the presented approach on various problems, in particular robustness and wide applicability of the sensitivity weights. Our findings demonstrate the potential of the adaptive approach. The developed algorithms have been made available in the open-source Python toolbox pyABC.
Identifiants
pubmed: 37216372
doi: 10.1371/journal.pone.0285836
pii: PONE-D-22-20368
pmc: PMC10202307
doi:
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
e0285836Informations de copyright
Copyright: © 2023 Schälte, Hasenauer. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
Science. 2002 Mar 1;295(5560):1662-4
pubmed: 11872829
Biometrics. 2011 Mar;67(1):225-33
pubmed: 20345496
Mol Biol Evol. 1999 Dec;16(12):1791-8
pubmed: 10605120
IEEE Trans Neural Netw Learn Syst. 2022 Apr;33(4):1452-1466
pubmed: 33338021
Genetics. 1997 Feb;145(2):505-18
pubmed: 9071603
Genetics. 2002 Dec;162(4):2025-35
pubmed: 12524368
Stat Appl Genet Mol Biol. 2010;9:Article34
pubmed: 20887273
Stat Appl Genet Mol Biol. 2013 Mar 26;12(1):87-107
pubmed: 23502346
Brief Bioinform. 2022 Jan 17;23(1):
pubmed: 34619769
Proc Natl Acad Sci U S A. 2007 Feb 6;104(6):1760-5
pubmed: 17264216
Bioinformatics. 2010 Jan 1;26(1):104-10
pubmed: 19880371
Bioinformatics. 2018 Oct 15;34(20):3591-3593
pubmed: 29762723
Stat Appl Genet Mol Biol. 2013 Oct 1;12(5):603-18
pubmed: 24025688
PLoS One. 2013 Sep 30;8(9):e74335
pubmed: 24098642
PLoS One. 2020 Aug 6;15(8):e0236954
pubmed: 32760106
Bioinformatics. 2009 Aug 1;25(15):1923-9
pubmed: 19505944
Trends Ecol Evol. 2010 Jul;25(7):410-8
pubmed: 20488578