Phylogenetic Permulations: A Statistically Rigorous Approach to Measure Confidence in Associations in a Phylogenetic Context.
comparative genomics
convergent evolution
evolutionary rate convergence
genotype–phenotype association
phylogenetic generalized least squares
statistical phylogenetics
Journal
Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455
Informations de publication
Date de publication:
25 06 2021
25 06 2021
Historique:
pubmed:
20
3
2021
medline:
22
9
2021
entrez:
19
3
2021
Statut:
ppublish
Résumé
Many evolutionary comparative methods seek to identify associations between phenotypic traits or between traits and genotypes, often with the goal of inferring potential functional relationships between them. Comparative genomics methods aimed at this goal measure the association between evolutionary changes at the genetic level with traits evolving convergently across phylogenetic lineages. However, these methods have complex statistical behaviors that are influenced by nontrivial and oftentimes unknown confounding factors. Consequently, using standard statistical analyses in interpreting the outputs of these methods leads to potentially inaccurate conclusions. Here, we introduce phylogenetic permulations, a novel statistical strategy that combines phylogenetic simulations and permutations to calculate accurate, unbiased P values from phylogenetic methods. Permulations construct the null expectation for P values from a given phylogenetic method by empirically generating null phenotypes. Subsequently, empirical P values that capture the true statistical confidence given the correlation structure in the data are directly calculated based on the empirical null expectation. We examine the performance of permulation methods by analyzing both binary and continuous phenotypes, including marine, subterranean, and long-lived large-bodied mammal phenotypes. Our results reveal that permulations improve the statistical power of phylogenetic analyses and correctly calibrate statements of confidence in rejecting complex null distributions while maintaining or improving the enrichment of known functions related to the phenotype. We also find that permulations refine pathway enrichment analyses by correcting for nonindependence in gene ranks. Our results demonstrate that permulations are a powerful tool for improving statistical confidence in the conclusions of phylogenetic analysis when the parametric null is unknown.
Identifiants
pubmed: 33739420
pii: 6178799
doi: 10.1093/molbev/msab068
pmc: PMC8233500
doi:
Types de publication
Evaluation Study
Journal Article
Research Support, N.I.H., Extramural
Langues
eng
Sous-ensembles de citation
IM
Pagination
3004-3021Subventions
Organisme : NEI NIH HHS
ID : R01 EY030546
Pays : United States
Organisme : NHGRI NIH HHS
ID : R01 HG009299
Pays : United States
Organisme : NIBIB NIH HHS
ID : T32 EB009403
Pays : United States
Informations de copyright
© The Author(s) 2021. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Références
Mol Biol Evol. 2016 Apr;33(4):1110-25
pubmed: 26715628
BMC Bioinformatics. 2009 Feb 03;10:48
pubmed: 19192299
Nucleic Acids Res. 2015 Apr 20;43(7):e47
pubmed: 25605792
Bioinformatics. 2008 Jan 1;24(1):129-31
pubmed: 18006550
Cell Rep. 2012 Oct 25;2(4):817-23
pubmed: 23022484
Am Nat. 2004 Jan;163(1):84-96
pubmed: 14767838
Mol Biol Evol. 2015 Mar;32(3):820-32
pubmed: 25540451
Science. 2011 Oct 28;334(6055):521-4
pubmed: 21940861
Genome Res. 2012 Apr;22(4):714-20
pubmed: 22287101
Proc Natl Acad Sci U S A. 2003 Aug 5;100(16):9440-5
pubmed: 12883005
Biol Lett. 2018 Oct 3;14(10):
pubmed: 30282748
Mol Biol Evol. 2016 Aug;33(8):2135-50
pubmed: 27222536
PLoS Comput Biol. 2007 Mar 23;3(3):e39
pubmed: 17381235
Nature. 2007 Mar 29;446(7135):507-12
pubmed: 17392779
Science. 2018 Aug 10;361(6402):591-594
pubmed: 30093596
Mol Biol Evol. 2002 Dec;19(12):2142-9
pubmed: 12446806
Bioinformatics. 2004 Dec 12;20(18):3710-5
pubmed: 15297299
Proc Natl Acad Sci U S A. 2008 Jan 22;105(3):934-9
pubmed: 18199838
J Evol Biol. 2010 Mar;23(3):494-508
pubmed: 20070460
Nat Genet. 2015 Mar;47(3):272-5
pubmed: 25621460
Front Genet. 2017 Feb 15;8:16
pubmed: 28261263
Mol Biol Evol. 2007 Aug;24(8):1586-91
pubmed: 17483113
Philos Trans R Soc Lond B Biol Sci. 2011 May 12;366(1569):1410-24
pubmed: 21444315
Genetics. 2013 Feb;193(2):529-38
pubmed: 23183665
Mol Biol Evol. 2016 Sep;33(9):2182-92
pubmed: 27329977
Nature. 1999 Jan 28;397(6717):344-7
pubmed: 9950425
Proc Natl Acad Sci U S A. 2005 Oct 25;102(43):15545-50
pubmed: 16199517
Mol Biol Evol. 2019 May 1;36(5):1086-1100
pubmed: 30851112
Am J Hum Genet. 2007 Apr;80(4):727-39
pubmed: 17357078
Philos Trans R Soc Lond B Biol Sci. 1989 Dec 21;326(1233):119-57
pubmed: 2575770
Front Cell Dev Biol. 2014 Apr 22;2:14
pubmed: 25364721
Am Nat. 2006 Jun;167(6):808-25
pubmed: 16685633
Elife. 2017 Oct 16;6:
pubmed: 29035697
Mol Biol Evol. 2019 Aug 1;36(8):1817-1830
pubmed: 31077321
Evolution. 1991 Aug;45(5):1065-1080
pubmed: 28564168
Blood. 2010 Aug 5;116(5):731-9
pubmed: 20445021
Bioinformatics. 2019 Nov 1;35(22):4815-4817
pubmed: 31192356
Elife. 2020 Feb 11;9:
pubmed: 32043462
Genetics. 2006 Jun;173(2):891-900
pubmed: 16547091
Trends Ecol Evol. 2000 Jul;15(7):296-299
pubmed: 10856957
Nucleic Acids Res. 2012 Sep 1;40(17):e133
pubmed: 22638577