Distilling identifiable and interpretable dynamic models from biological data.
Journal
PLoS computational biology
ISSN: 1553-7358
Titre abrégé: PLoS Comput Biol
Pays: United States
ID NLM: 101238922
Informations de publication
Date de publication:
Oct 2023
Oct 2023
Historique:
received:
13
03
2023
accepted:
03
10
2023
revised:
30
10
2023
medline:
1
11
2023
pubmed:
18
10
2023
entrez:
18
10
2023
Statut:
epublish
Résumé
Mechanistic dynamical models allow us to study the behavior of complex biological systems. They can provide an objective and quantitative understanding that would be difficult to achieve through other means. However, the systematic development of these models is a non-trivial exercise and an open problem in computational biology. Currently, many research efforts are focused on model discovery, i.e. automating the development of interpretable models from data. One of the main frameworks is sparse regression, where the sparse identification of nonlinear dynamics (SINDy) algorithm and its variants have enjoyed great success. SINDy-PI is an extension which allows the discovery of rational nonlinear terms, thus enabling the identification of kinetic functions common in biochemical networks, such as Michaelis-Menten. SINDy-PI also pays special attention to the recovery of parsimonious models (Occam's razor). Here we focus on biological models composed of sets of deterministic nonlinear ordinary differential equations. We present a methodology that, combined with SINDy-PI, allows the automatic discovery of structurally identifiable and observable models which are also mechanistically interpretable. The lack of structural identifiability and observability makes it impossible to uniquely infer parameter and state variables, which can compromise the usefulness of a model by distorting its mechanistic significance and hampering its ability to produce biological insights. We illustrate the performance of our method with six case studies. We find that, despite enforcing sparsity, SINDy-PI sometimes yields models that are unidentifiable. In these cases we show how our method transforms their equations in order to obtain a structurally identifiable and observable model which is also interpretable.
Identifiants
pubmed: 37851682
doi: 10.1371/journal.pcbi.1011014
pii: PCOMPBIOL-D-23-00384
pmc: PMC10615316
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
e1011014Informations de copyright
Copyright: © 2023 Massonis et al. This is an open access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited.
Déclaration de conflit d'intérêts
The authors have declared that no competing interests exist.
Références
Proc Natl Acad Sci U S A. 2016 Apr 12;113(15):3932-7
pubmed: 27035946
Proc Math Phys Eng Sci. 2018 Sep;474(2217):20180305
pubmed: 30333709
Nature. 2006 Mar 23;440(7083):545-50
pubmed: 16554821
PLoS Comput Biol. 2020 Nov 3;16(11):e1008248
pubmed: 33141821
J Chem Phys. 2019 Jan 14;150(2):025101
pubmed: 30646700
Math Biosci. 2000 Dec;168(2):137-59
pubmed: 11121562
Curr Opin Biotechnol. 2008 Aug;19(4):360-8
pubmed: 18672061
J R Soc Interface. 2019 Jul 26;16(156):20190043
pubmed: 31266417
Phys Rev E Stat Nonlin Soft Matter Phys. 2015 Jul;92(1):012920
pubmed: 26274260
Cell Cycle. 2007 Sep 1;6(17):2106-12
pubmed: 17873520
Proc Natl Acad Sci U S A. 2007 Jun 12;104(24):9943-8
pubmed: 17553966
Sci Adv. 2020 Apr 15;6(16):eaay2631
pubmed: 32426452
BMC Syst Biol. 2011 Oct 28;5:177
pubmed: 22034917
J R Soc Interface. 2021 Sep;18(182):20210413
pubmed: 34493091
J R Soc Interface. 2013 Dec 04;11(91):20130505
pubmed: 24307566
Animal. 2018 Apr;12(4):701-712
pubmed: 29096725
Sci Signal. 2013 May 28;6(277):ra41
pubmed: 23716718
Nat Commun. 2021 May 28;12(1):3219
pubmed: 34050155
Risk Anal. 2020 Feb;40(2):352-369
pubmed: 31441953
Bioinformatics. 2023 Feb 3;39(2):
pubmed: 36721336
Science. 2020 Feb 28;367(6481):1026-1030
pubmed: 32001523
Nonlinear Dyn. 2022;110(3):2589-2609
pubmed: 36060282
Math Biosci. 2022 Dec;354:108926
pubmed: 36377100
Philos Trans A Math Phys Eng Sci. 2022 Aug 8;380(2229):20210213
pubmed: 35719077
Proc Math Phys Eng Sci. 2020 Oct;476(2242):20200279
pubmed: 33214760
Proc Math Phys Eng Sci. 2022 Apr;478(2260):20210904
pubmed: 35450025
Biochim Biophys Acta Gene Regul Mech. 2020 Jun;1863(6):194430
pubmed: 31678629
PLoS Comput Biol. 2022 Jan 31;18(1):e1009830
pubmed: 35100263
Bioinformatics. 2009 Aug 1;25(15):1923-9
pubmed: 19505944
PLoS Comput Biol. 2017 Nov 29;13(11):e1005878
pubmed: 29186132
Front Physiol. 2016 Dec 05;7:590
pubmed: 27994553
Biochem J. 2000 Jan 15;345 Pt 2:321-34
pubmed: 10702114
Sci Adv. 2020 Jan 31;6(5):eaav6971
pubmed: 32064326
Bioinformatics. 2023 Jan 1;39(1):
pubmed: 36398887
PLoS Comput Biol. 2022 Nov 16;18(11):e1010599
pubmed: 36383612
J Theor Biol. 2017 Oct 27;431:63-78
pubmed: 28733187
Comput Methods Programs Biomed. 2011 Nov;104(2):120-34
pubmed: 20851494
Philos Trans A Math Phys Eng Sci. 2022 Aug 8;380(2229):20210201
pubmed: 35719075
Proc Math Phys Eng Sci. 2017 Aug;473(2204):20170009
pubmed: 28878554
Proc Natl Acad Sci U S A. 2019 Apr 9;116(15):7226-7231
pubmed: 30902894