Optimizing ancestral trait reconstruction of large HIV Subtype C datasets through multiple-trait subsampling.

HIV Subtype C ancestral trait reconstruction multiple-trait subsampling phylogenetic comparative methods subsampling approaches

Journal

Virus evolution
ISSN: 2057-1577
Titre abrégé: Virus Evol
Pays: England
ID NLM: 101664675

Informations de publication

Date de publication:
2023
Historique:
received: 26 06 2023
revised: 29 10 2023
accepted: 20 11 2023
medline: 4 12 2023
pubmed: 4 12 2023
entrez: 4 12 2023
Statut: epublish

Résumé

Large datasets along with sampling bias represent a challenge for phylodynamic reconstructions, particularly when the study data are obtained from various heterogeneous sources and/or through convenience sampling. In this study, we evaluate the presence of unbalanced sampled distribution by collection date, location, and risk group of human immunodeficiency virus Type 1 Subtype C using a comprehensive subsampling strategy and assess their impact on the reconstruction of the viral spatial and risk group dynamics using phylogenetic comparative methods. Our study shows that a most suitable dataset for ancestral trait reconstruction can be obtained through subsampling by all available traits, particularly using multigene datasets. We also demonstrate that sampling bias is inflated when considerable information for a given trait is unavailable or of poor quality, as we observed for the trait risk group. In conclusion, we suggest that, even if traits are not well recorded, including them deliberately optimizes the representativeness of the original dataset rather than completely excluding them. Therefore, we advise the inclusion of as many traits as possible with the aid of subsampling approaches in order to optimize the dataset for phylodynamic analysis while reducing the computational burden. This will benefit research communities investigating the evolutionary and spatio-temporal patterns of infectious diseases.

Identifiants

pubmed: 38046219
doi: 10.1093/ve/vead069
pii: vead069
pmc: PMC10691791
doi:

Types de publication

Journal Article

Langues

eng

Pagination

vead069

Informations de copyright

Published by Oxford University Press 2023. This work is written by (a) US Government employee(s) and is in the public domain in the US.

Déclaration de conflit d'intérêts

The authors declare no competing interests.

Références

Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
PLoS Comput Biol. 2021 Jan 6;17(1):e1008561
pubmed: 33406072
iScience. 2022 Mar 18;25(3):103880
pubmed: 35156006
BMC Bioinformatics. 2018 May 2;19(1):164
pubmed: 29716518
Front Public Health. 2022 Apr 07;10:844541
pubmed: 35480591
Science. 2014 Oct 3;346(6205):56-61
pubmed: 25278604
Viruses. 2010 Jan;2(1):33-54
pubmed: 21994599
Bioinformatics. 2020 Feb 1;36(3):945-947
pubmed: 31418766
Virus Evol. 2023 Feb 06;9(1):vead010
pubmed: 36860641
Influenza Other Respir Viruses. 2018 Sep;12(5):551-557
pubmed: 29722140
J AIDS Clin Res. 2012 Aug 4;3(7):
pubmed: 23516666
PLoS One. 2012;7(10):e47289
pubmed: 23056619
Biostatistics. 2015 Jul;16(3):565-79
pubmed: 25597489
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jan-Feb;9(1):150-60
pubmed: 21383415
PLoS Genet. 2015 Aug 12;11(8):e1005421
pubmed: 26267488
Virus Evol. 2016 Jun 22;2(1):vew016
pubmed: 28694998
mBio. 2021 Aug 31;12(4):e0114021
pubmed: 34465019
PLoS Genet. 2020 Nov 18;16(11):e1009175
pubmed: 33206635
Curr Opin HIV AIDS. 2019 Sep;14(5):337-353
pubmed: 31368909
Nature. 2022 Mar;603(7902):679-686
pubmed: 35042229
Virulence. 2017 Jul 4;8(5):497-503
pubmed: 27574950
AIDS Res Hum Retroviruses. 1995 Nov;11(11):1413-6
pubmed: 8573400
Bioinformatics. 2014 May 1;30(9):1312-3
pubmed: 24451623
EBioMedicine. 2020 Mar;53:102682
pubmed: 32114391
Viruses. 2019 Nov 06;11(11):
pubmed: 31698764
Syst Biol. 2002 Aug;51(4):588-98
pubmed: 12228001
Int J Infect Dis. 2021 Feb;103:305-307
pubmed: 33333251
Prev Med. 2012 Nov;55(5):362-70
pubmed: 21884721
IEEE/ACM Trans Comput Biol Bioinform. 2012 Jul-Aug;9(4):1014-22
pubmed: 22184263
Syst Biol. 2022 Aug 10;71(5):1255-1270
pubmed: 34963003
Bioinformatics. 2021 Nov 5;37(21):3983-3985
pubmed: 34096994
Syst Biol. 2009 Dec;58(6):586-94
pubmed: 20525611
BMC Bioinformatics. 2015 Jul 01;16:204
pubmed: 26130249
Am J Trop Med Hyg. 2020 Feb;102(2):408-414
pubmed: 31820709
Glob Health Epidemiol Genom. 2018 Feb 5;3:e3
pubmed: 29868228
Nucleic Acids Res. 2022 Jan 7;50(D1):D161-D164
pubmed: 34850943
Bioinformatics. 2020 Dec 22;36(20):5007-5013
pubmed: 32619004
Viruses. 2020 Feb 05;12(2):
pubmed: 32033422
Mol Biol Evol. 2005 May;22(5):1337-44
pubmed: 15746014
Virus Evol. 2022 Mar 17;8(1):veac020
pubmed: 35462736
Mol Biol Evol. 2015 Jan;32(1):268-74
pubmed: 25371430
J Appl Stat. 2021 Jul 11;48(13-15):2239-2258
pubmed: 35707089
Cell. 2021 Sep 16;184(19):4939-4952.e15
pubmed: 34508652
Nature. 2015 Jul 9;523(7559):217-20
pubmed: 26053121
Nature. 2016 Oct 12;538(7624):161-164
pubmed: 27734877
Virology. 2016 Aug;495:173-84
pubmed: 27228177
Virus Evol. 2016 Apr 09;2(1):vew007
pubmed: 27774300
Microbiol Spectr. 2023 Oct 6;:e0153923
pubmed: 37800925
Virus Evol. 2022 Jun 16;8(1):veac048
pubmed: 35769891
Glob Chall. 2017 Jan 10;1(1):33-46
pubmed: 31565258
Nat Genet. 2021 Jun;53(6):809-816
pubmed: 33972780
Sci Rep. 2020 Mar 20;10(1):5098
pubmed: 32198405
BMJ. 2005 Jul 23;331(7510):220-3
pubmed: 16037463
Bioinformatics. 2000 Apr;16(4):400-1
pubmed: 10869039
Front Genet. 2021 Dec 06;12:720750
pubmed: 34938310
Nat Commun. 2021 Oct 1;12(1):5769
pubmed: 34599175

Auteurs

Nídia S Trovão (NS)

Division of International Epidemiology and Population Studies, Fogarty International Center, National Institutes of Health, 31 Center Dr, Bethesda, MA 20892, USA.

Joel O Wertheim (JO)

Department of Medicine, University of California, La Jolla, San Diego, CA 92093, USA.

Guy Baele (G)

Department of Microbiology, Immunology and Transplantation, Rega Institute, KU Leuven, Leuven BE-3000, Belgium.

Adriano de Bernardi Schneider (A)

Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA.
Ningbo No.2 Hospital, Ningbo 315010, China.
Ningbo Institute of Life and Health Industry, University of Chinese Academy of Sciences, Ningbo 315000, China.

Classifications MeSH