Impact and mitigation of sampling bias to determine viral spread: Evaluating discrete phylogeography through CTMC modeling and structured coalescent model approximations.
BASTA
Bayesian phylogeographic inference
CTMC
MASCOT
Markov chain monte carlo
pathogen spread
phylogeography
sampling bias
Journal
Virus evolution
ISSN: 2057-1577
Titre abrégé: Virus Evol
Pays: England
ID NLM: 101664675
Informations de publication
Date de publication:
2023
2023
Historique:
received:
15
08
2022
revised:
06
01
2023
accepted:
02
02
2023
entrez:
2
3
2023
pubmed:
3
3
2023
medline:
3
3
2023
Statut:
epublish
Résumé
Bayesian phylogeographic inference is a powerful tool in molecular epidemiological studies, which enables reconstruction of the origin and subsequent geographic spread of pathogens. Such inference is, however, potentially affected by geographic sampling bias. Here, we investigated the impact of sampling bias on the spatiotemporal reconstruction of viral epidemics using Bayesian discrete phylogeographic models and explored different operational strategies to mitigate this impact. We considered the continuous-time Markov chain (CTMC) model and two structured coalescent approximations (Bayesian structured coalescent approximation [BASTA] and marginal approximation of the structured coalescent [MASCOT]). For each approach, we compared the estimated and simulated spatiotemporal histories in biased and unbiased conditions based on the simulated epidemics of rabies virus (RABV) in dogs in Morocco. While the reconstructed spatiotemporal histories were impacted by sampling bias for the three approaches, BASTA and MASCOT reconstructions were also biased when employing unbiased samples. Increasing the number of analyzed genomes led to more robust estimates at low sampling bias for the CTMC model. Alternative sampling strategies that maximize the spatiotemporal coverage greatly improved the inference at intermediate sampling bias for the CTMC model, and to a lesser extent, for BASTA and MASCOT. In contrast, allowing for time-varying population sizes in MASCOT resulted in robust inference. We further applied these approaches to two empirical datasets: a RABV dataset from the Philippines and a SARS-CoV-2 dataset describing its early spread across the world. In conclusion, sampling biases are ubiquitous in phylogeographic analyses but may be accommodated by increasing the sample size, balancing spatial and temporal composition in the samples, and informing structured coalescent models with reliable case count data.
Identifiants
pubmed: 36860641
doi: 10.1093/ve/vead010
pii: vead010
pmc: PMC9969415
doi:
Types de publication
Journal Article
Langues
eng
Pagination
vead010Informations de copyright
© The Author(s) 2023. Published by Oxford University Press.
Références
Nat Commun. 2022 Mar 3;13(1):1152
pubmed: 35241661
PLoS Comput Biol. 2019 Apr 8;15(4):e1006650
pubmed: 30958812
Mol Biol Evol. 2021 May 4;38(5):1777-1791
pubmed: 33316067
Bioinformatics. 2018 Nov 15;34(22):3843-3848
pubmed: 29790921
Virus Evol. 2022 Mar 21;8(1):veac025
pubmed: 35371561
Infect Genet Evol. 2018 Oct;64:225-230
pubmed: 29991455
Nature. 2021 Jul;595(7866):205-213
pubmed: 34194045
Nat Commun. 2020 Oct 9;11(1):5110
pubmed: 33037213
Bioinformatics. 2010 Jun 15;26(12):1569-71
pubmed: 20421198
Mol Biol Evol. 2021 Apr 13;38(4):1537-1543
pubmed: 33295605
J Anim Ecol. 2013 Jan;82(1):6-14
pubmed: 23004351
PLoS Comput Biol. 2021 Feb 12;17(2):e1008618
pubmed: 33577550
Nat Commun. 2020 Nov 6;11(1):5620
pubmed: 33159066
Front Vet Sci. 2017 Jun 02;4:78
pubmed: 28626749
Mol Biol Evol. 2021 Apr 13;38(4):1608-1613
pubmed: 33316043
Virus Evol. 2018 Jun 08;4(1):vey016
pubmed: 29942656
Proc Natl Acad Sci U S A. 2021 Dec 28;118(52):
pubmed: 34930835
PLoS Genet. 2015 Aug 12;11(8):e1005421
pubmed: 26267488
J Mol Evol. 1985;22(2):160-74
pubmed: 3934395
PLOS Glob Public Health. 2022 Sep 28;2(9):e0000577
pubmed: 36962555
Syst Biol. 2018 Sep 1;67(5):901-904
pubmed: 29718447
PLoS Pathog. 2010 Oct 28;6(10):e1001166
pubmed: 21060816
Nat Ecol Evol. 2018 Sep;2(9):1468-1478
pubmed: 30038246
Nat Commun. 2021 Jan 19;12(1):434
pubmed: 33469026
PLoS Comput Biol. 2021 Jan 6;17(1):e1008561
pubmed: 33406072
Syst Biol. 2012 Jan;61(1):170-3
pubmed: 21963610
J Virol. 2020 Nov 9;94(23):
pubmed: 32938762
PLoS Negl Trop Dis. 2018 Sep 6;12(9):e0006367
pubmed: 30188896
PLoS Pathog. 2021 May 20;17(5):e1009571
pubmed: 34015049
Elife. 2018 Jan 16;7:
pubmed: 29336306
Nature. 2012 Feb 26;484(7392):96-100
pubmed: 22367540
Sci Transl Med. 2021 May 26;13(595):
pubmed: 33941621
Virus Evol. 2019 Aug 14;5(2):vez030
pubmed: 31428459
PLoS Biol. 2006 May;4(5):e88
pubmed: 16683862
Nat Commun. 2021 Sep 29;12(1):5705
pubmed: 34588460
BMC Genomics. 2013 Jul 04;14:444
pubmed: 23822119
Cell. 2019 Aug 22;178(5):1057-1071.e11
pubmed: 31442400
Mol Biol Evol. 2013 Mar;30(3):713-24
pubmed: 23180580
PLoS Negl Trop Dis. 2013 Aug 22;7(8):e2372
pubmed: 23991233
Nature. 2017 Jun 15;546(7658):406-410
pubmed: 28538727
Curr Protoc. 2021 Apr;1(4):e98
pubmed: 33836121
Mol Ecol. 2004 Apr;13(4):827-36
pubmed: 15012758
Nature. 2017 Apr 20;544(7650):309-315
pubmed: 28405027
PLoS Pathog. 2019 Dec 6;15(12):e1007976
pubmed: 31809523
J Mol Evol. 1994 Sep;39(3):306-14
pubmed: 7932792
PLoS Comput Biol. 2019 Aug 6;15(8):e1007189
pubmed: 31386651
Syst Biol. 2016 Nov;65(6):1041-1056
pubmed: 27368344
Mol Biol Evol. 2017 Nov 1;34(11):2970-2981
pubmed: 28666382
Science. 2020 Sep 4;369(6508):1255-1260
pubmed: 32703910
Proc Natl Acad Sci U S A. 2020 Apr 7;117(14):7897-7904
pubmed: 32229557
PLoS Pathog. 2014 Feb 20;10(2):e1003932
pubmed: 24586153
PLoS Negl Trop Dis. 2013 Apr 04;7(4):e2144
pubmed: 23593515
Epidemics. 2015 Mar;10:88-92
pubmed: 25843391
Nature. 2021 Mar;591(7848):30-33
pubmed: 33649511
Infect Genet Evol. 2016 Mar;38:22-28
pubmed: 26656835
Evol Bioinform Online. 2007 Feb 12;2:227-35
pubmed: 19455215
Nat Commun. 2021 Nov 23;12(1):6802
pubmed: 34815406
Acta Trop. 2017 Jan;165:133-140
pubmed: 26721555
PLoS Biol. 2009 Mar 10;7(3):e53
pubmed: 19278295
Sci Adv. 2018 Oct 17;4(10):eaat5869
pubmed: 30345355
Mol Biol Evol. 2017 Oct 1;34(10):2563-2571
pubmed: 28651357
Mol Biol Evol. 2022 Feb 3;39(2):
pubmed: 34951645
Infect Genet Evol. 2014 Apr;23:86-94
pubmed: 24512808
J Clin Invest. 2020 Apr 1;130(4):1699-1712
pubmed: 31910162
PLoS Comput Biol. 2009 Sep;5(9):e1000520
pubmed: 19779555
Bioinformatics. 2014 Aug 15;30(16):2272-9
pubmed: 24753484