Evolutionary shift detection with ensemble variable selection.
ELPASO
Ensemble method
Evolutionary shift detection
LASSO
Ornstein-Uhlenbeck model
Phylogenetic comparative methods
Trait evolution
Journal
BMC ecology and evolution
ISSN: 2730-7182
Titre abrégé: BMC Ecol Evol
Pays: England
ID NLM: 101775613
Informations de publication
Date de publication:
20 Jan 2024
20 Jan 2024
Historique:
received:
19
01
2023
accepted:
10
01
2024
medline:
21
1
2024
pubmed:
21
1
2024
entrez:
20
1
2024
Statut:
epublish
Résumé
Abrupt environmental changes can lead to evolutionary shifts in trait evolution. Identifying these shifts is an important step in understanding the evolutionary history of phenotypes. The detection performances of different methods are influenced by many factors, including different numbers of shifts, shift sizes, where a shift occurs on a tree, and the types of phylogenetic structure. Furthermore, the model assumptions are oversimplified, so are likely to be violated in real data, which could cause the methods to fail. We perform simulations to assess the effect of these factors on the performance of shift detection methods. To make the comparisons more complete, we also propose an ensemble variable selection method (R package ELPASO) and compare it with existing methods (R packages [Formula: see text]1ou and PhylogeneticEM). The performances of methods are highly dependent on the selection criterion. [Formula: see text]1ou+pBIC is usually the most conservative method and it performs well when signal sizes are large. [Formula: see text]1ou+BIC is the least conservative method and it performs well when signal sizes are small. The ensemble method provides more balanced choices between those two methods. Moreover, the performances of all methods are heavily impacted by measurement error, tree reconstruction error and shifts in variance.
Identifiants
pubmed: 38245667
doi: 10.1186/s12862-024-02201-w
pii: 10.1186/s12862-024-02201-w
doi:
Types de publication
Journal Article
Langues
eng
Sous-ensembles de citation
IM
Pagination
11Subventions
Organisme : Natural Sciences and Engineering Research Council of Canada
ID : RGPIN/4945-2014
Organisme : Natural Sciences and Engineering Research Council of Canada
ID : RGPIN-2018-05447
Informations de copyright
© 2024. The Author(s).
Références
Felsenstein J. Phylogenies and the Comparative Method. Am Nat. 1985;125(1):1–15. https://doi.org/10.1086/284325 .
doi: 10.1086/284325
Davis CC, Latvis M, Nickrent DL, Wurdack KJ, Baum DA. Floral gigantism in Rafflesiaceae. Science. 2007;315(5820):1812. https://doi.org/10.1126/science.1135260 .
doi: 10.1126/science.1135260
pubmed: 17218493
Beaulieu JM, Jhwueng DC, Boettiger C, O’Meara BC. Modeling stabilizing selection: expanding the Ornstein-Uhlenbeck model of adaptive evolution. Evol Int J Org Evol. 2012;66(8):2369–83.
doi: 10.1111/j.1558-5646.2012.01619.x
Gill MS, Ho LST, Baele G, Lemey P, Suchard MA. A relaxed directional random walk model for phylogenetic trait evolution. Syst Biol. 2016. https://doi.org/10.1093/sysbio/syw093 .
doi: 10.1093/sysbio/syw093
pubmed: 27368344
pmcid: 6075548
Hassler G, Tolkoff MR, Allen WL, Ho LST, Lemey P, Suchard MA. Inferring phenotypic trait evolution on large trees with many incomplete measurements. J Am Stat Assoc. 2020:1–15. https://doi.org/10.1080/01621459.2020.1799812 .
Hansen TF. Stabilizing Selection and the Comparative Analysis of Adaptation. Evolution. 1997;51(5):1341. https://doi.org/10.2307/2411186 .
doi: 10.2307/2411186
pubmed: 28568616
Butler MA, King AA. Phylogenetic comparative analysis: a modeling approach for adaptive evolution. Am Nat. 2004;164(6):683–95. https://doi.org/10.1086/426002 .
doi: 10.1086/426002
pubmed: 29641928
Bastide P, Ho LST, Baele G, Lemey P, Suchard MA. Efficient Bayesian inference of general gaussian models on large phylogenetic trees. Ann Appl Stat. 2021;15(2). https://doi.org/10.1214/20-aoas1419 .
Losos JB. Lizards in an evolutionary tree: ecology and adaptive radiation of anoles. University of California Press; 2011.
Mahler DL, Ingram T, Revell LJ, Losos JB. Exceptional Convergence on the Macroevolutionary Landscape in Island Lizard Radiations. Science. 2013;341(6143):292–5. https://doi.org/10.1126/science.1232392 .
doi: 10.1126/science.1232392
pubmed: 23869019
Jaffe AL, Slater GJ, Alfaro ME. The evolution of island gigantism and body size variation in tortoises and turtles. Biol Lett. 2011;7(4):558–61. https://doi.org/10.1098/rsbl.2010.1084 .
doi: 10.1098/rsbl.2010.1084
pubmed: 21270022
pmcid: 3130210
Uyeda JC, Harmon LJ. A Novel Bayesian Method for Inferring and Interpreting the Dynamics of Adaptive Landscapes from Phylogenetic Comparative Data. Syst Biol. 2014;63(6):902–18. https://doi.org/10.1093/sysbio/syu057 .
doi: 10.1093/sysbio/syu057
pubmed: 25077513
Ho LST, Ané C. Intrinsic inference difficulties for trait evolution with Ornstein-Uhlenbeck models. Methods Ecol Evol. 2014;5(11):1133–46. https://doi.org/10.1111/2041-210x.12285 .
doi: 10.1111/2041-210x.12285
Zhang NR, Siegmund DO. A modified Bayes information criterion with applications to the analysis of comparative genomic hybridization data. Biometrics. 2006;63(1):22–32. https://doi.org/10.1111/j.1541-0420.2006.00662.x .
doi: 10.1111/j.1541-0420.2006.00662.x
Khabbazian M, Kriebel R, Rohe K, Ané C. Fast and accurate detection of evolutionary shifts in Ornstein-Uhlenbeck models. Methods Ecol Evol. 2016;7(7):811–24. https://doi.org/10.1111/2041-210x.12534 .
doi: 10.1111/2041-210x.12534
Bastide P, Mariadassou M, Robin S. Detection of adaptive shifts on phylogenies by using shifted stochastic processes on a tree. J R Stat Soc Ser B Stat Methodol. 2017;79(4):1067–93. https://doi.org/10.1111/rssb.12206 .
doi: 10.1111/rssb.12206
Lee CP, Leu Y. A novel hybrid feature selection method for microarray data analysis. Appl Soft Comput. 2011;11(1):208–13. https://doi.org/10.1016/j.asoc.2009.11.010 .
doi: 10.1016/j.asoc.2009.11.010
Mera-Gaona M, López DM, Vargas-Canas R. An ensemble feature selection approach to identify relevant features from EEG signals. Appl Sci. 2021;11(15):6983. https://doi.org/10.3390/app11156983 .
doi: 10.3390/app11156983
Piao Y, Piao M, Park K, Ryu KH. An ensemble correlation-based gene selection algorithm for cancer classification with Gene Expression Data. Bioinformatics. 2012;28(24):3306–15. https://doi.org/10.1093/bioinformatics/bts602 .
doi: 10.1093/bioinformatics/bts602
pubmed: 23060613
Pes B. Ensemble feature selection for high-dimensional data: a stability analysis across multiple domains. Neural Comput & Applic. 2019;32(10):5951–73. https://doi.org/10.1007/s00521-019-04082-3 .
doi: 10.1007/s00521-019-04082-3
Ho LST, Ané C. Asymptotic theory with hierarchical autocorrelation: Ornstein-Uhlenbeck tree models. Ann Stat. 2013;41(2):957–81.
doi: 10.1214/13-AOS1105
Ingram T, Mahler DL. Surface: detecting convergent evolution from comparative data by fitting Ornstein-Uhlenbeck models with stepwise Akaike Information Criterion. Methods Ecol Evol. 2013;4(5):416–25. https://doi.org/10.1111/2041-210x.12034 .
doi: 10.1111/2041-210x.12034
Bastide P, Ané C, Robin S, Mariadassou M. Inference of adaptive shifts for multivariate correlated traits. Syst Biol. 2018;67(4):662–80. https://doi.org/10.1093/sysbio/syy005 .
doi: 10.1093/sysbio/syy005
pubmed: 29385556
Dempster AP, Laird NM, Rubin DB. Maximum Likelihood from Incomplete Data Via the EM Algorithm. J R Stat Soc Ser B Methodol. 1977;39(1):1–22. https://doi.org/10.1111/j.2517-6161.1977.tb01600.x .
doi: 10.1111/j.2517-6161.1977.tb01600.x
Baraud Y, Giraud C, Huet S. Gaussian model selection with an unknown variance. Ann Stat. 2009;37(2). https://doi.org/10.1214/07-aos573 .
Bolón-Canedo V, Alonso-Betanzos A. Ensembles for Feature Selection. Intell Syst Ref Libr Recent Adv Ensembles Feature Sel. 2018:53–81. https://doi.org/10.1007/978-3-319-90080-3_4 .
Hubert L, Arabie P. Comparing partitions. J Classif. 1985;2(1):193–218. https://doi.org/10.1007/bf01908075 .
doi: 10.1007/bf01908075