Phylogenetic Permulations: A Statistically Rigorous Approach to Measure Confidence in Associations in a Phylogenetic Context.

comparative genomics convergent evolution evolutionary rate convergence genotype–phenotype association phylogenetic generalized least squares statistical phylogenetics

Journal

Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455

Informations de publication

Date de publication:
25 06 2021
Historique:
pubmed: 20 3 2021
medline: 22 9 2021
entrez: 19 3 2021
Statut: ppublish

Résumé

Many evolutionary comparative methods seek to identify associations between phenotypic traits or between traits and genotypes, often with the goal of inferring potential functional relationships between them. Comparative genomics methods aimed at this goal measure the association between evolutionary changes at the genetic level with traits evolving convergently across phylogenetic lineages. However, these methods have complex statistical behaviors that are influenced by nontrivial and oftentimes unknown confounding factors. Consequently, using standard statistical analyses in interpreting the outputs of these methods leads to potentially inaccurate conclusions. Here, we introduce phylogenetic permulations, a novel statistical strategy that combines phylogenetic simulations and permutations to calculate accurate, unbiased P values from phylogenetic methods. Permulations construct the null expectation for P values from a given phylogenetic method by empirically generating null phenotypes. Subsequently, empirical P values that capture the true statistical confidence given the correlation structure in the data are directly calculated based on the empirical null expectation. We examine the performance of permulation methods by analyzing both binary and continuous phenotypes, including marine, subterranean, and long-lived large-bodied mammal phenotypes. Our results reveal that permulations improve the statistical power of phylogenetic analyses and correctly calibrate statements of confidence in rejecting complex null distributions while maintaining or improving the enrichment of known functions related to the phenotype. We also find that permulations refine pathway enrichment analyses by correcting for nonindependence in gene ranks. Our results demonstrate that permulations are a powerful tool for improving statistical confidence in the conclusions of phylogenetic analysis when the parametric null is unknown.

Identifiants

pubmed: 33739420
pii: 6178799
doi: 10.1093/molbev/msab068
pmc: PMC8233500
doi:

Types de publication

Evaluation Study Journal Article Research Support, N.I.H., Extramural

Langues

eng

Sous-ensembles de citation

IM

Pagination

3004-3021

Subventions

Organisme : NEI NIH HHS
ID : R01 EY030546
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG009299
Pays : United States
Organisme : NIBIB NIH HHS
ID : T32 EB009403
Pays : United States

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.

Références

Mol Biol Evol. 2016 Apr;33(4):1110-25
pubmed: 26715628
BMC Bioinformatics. 2009 Feb 03;10:48
pubmed: 19192299
Nucleic Acids Res. 2015 Apr 20;43(7):e47
pubmed: 25605792
Bioinformatics. 2008 Jan 1;24(1):129-31
pubmed: 18006550
Cell Rep. 2012 Oct 25;2(4):817-23
pubmed: 23022484
Am Nat. 2004 Jan;163(1):84-96
pubmed: 14767838
Mol Biol Evol. 2015 Mar;32(3):820-32
pubmed: 25540451
Science. 2011 Oct 28;334(6055):521-4
pubmed: 21940861
Genome Res. 2012 Apr;22(4):714-20
pubmed: 22287101
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5
pubmed: 12883005
Biol Lett. 2018 Oct 3;14(10):
pubmed: 30282748
Mol Biol Evol. 2016 Aug;33(8):2135-50
pubmed: 27222536
PLoS Comput Biol. 2007 Mar 23;3(3):e39
pubmed: 17381235
Nature. 2007 Mar 29;446(7135):507-12
pubmed: 17392779
Science. 2018 Aug 10;361(6402):591-594
pubmed: 30093596
Mol Biol Evol. 2002 Dec;19(12):2142-9
pubmed: 12446806
Bioinformatics. 2004 Dec 12;20(18):3710-5
pubmed: 15297299
Proc Natl Acad Sci U S A. 2008 Jan 22;105(3):934-9
pubmed: 18199838
J Evol Biol. 2010 Mar;23(3):494-508
pubmed: 20070460
Nat Genet. 2015 Mar;47(3):272-5
pubmed: 25621460
Front Genet. 2017 Feb 15;8:16
pubmed: 28261263
Mol Biol Evol. 2007 Aug;24(8):1586-91
pubmed: 17483113
Philos Trans R Soc Lond B Biol Sci. 2011 May 12;366(1569):1410-24
pubmed: 21444315
Genetics. 2013 Feb;193(2):529-38
pubmed: 23183665
Mol Biol Evol. 2016 Sep;33(9):2182-92
pubmed: 27329977
Nature. 1999 Jan 28;397(6717):344-7
pubmed: 9950425
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
Mol Biol Evol. 2019 May 1;36(5):1086-1100
pubmed: 30851112
Am J Hum Genet. 2007 Apr;80(4):727-39
pubmed: 17357078
Philos Trans R Soc Lond B Biol Sci. 1989 Dec 21;326(1233):119-57
pubmed: 2575770
Front Cell Dev Biol. 2014 Apr 22;2:14
pubmed: 25364721
Am Nat. 2006 Jun;167(6):808-25
pubmed: 16685633
Elife. 2017 Oct 16;6:
pubmed: 29035697
Mol Biol Evol. 2019 Aug 1;36(8):1817-1830
pubmed: 31077321
Evolution. 1991 Aug;45(5):1065-1080
pubmed: 28564168
Blood. 2010 Aug 5;116(5):731-9
pubmed: 20445021
Bioinformatics. 2019 Nov 1;35(22):4815-4817
pubmed: 31192356
Elife. 2020 Feb 11;9:
pubmed: 32043462
Genetics. 2006 Jun;173(2):891-900
pubmed: 16547091
Trends Ecol Evol. 2000 Jul;15(7):296-299
pubmed: 10856957
Nucleic Acids Res. 2012 Sep 1;40(17):e133
pubmed: 22638577

Auteurs

Elysia Saputra (E)

Joint Carnegie Mellon University - University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, PA, USA.
Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.

Amanda Kowalczyk (A)

Joint Carnegie Mellon University - University of Pittsburgh PhD Program in Computational Biology, Pittsburgh, PA, USA.
Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.

Luisa Cusick (L)

Department of Biological Sciences, University of Pittsburgh, Pittsburgh, PA, USA.

Nathan Clark (N)

Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.
Department of Human Genetics, University of Utah, Salt Lake City, UT, USA.
Pittsburgh Center for Evolutionary Biology and Medicine, University of Pittsburgh, Pittsburgh, PA, USA.

Maria Chikina (M)

Department of Computational and Systems Biology, University of Pittsburgh, Pittsburgh, PA, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing

[Redispensing of expensive oral anticancer medicines: a practical application].

Lisanne N van Merendonk, Kübra Akgöl, Bastiaan Nuijen
1.00
Humans Antineoplastic Agents Administration, Oral Drug Costs Counterfeit Drugs

Smoking Cessation and Incident Cardiovascular Disease.

Jun Hwan Cho, Seung Yong Shin, Hoseob Kim et al.
1.00
Humans Male Smoking Cessation Cardiovascular Diseases Female
Humans United States Aged Cross-Sectional Studies Medicare Part C

Classifications MeSH