Dissecting Incongruence between Concatenation- and Quartet-Based Approaches in Phylogenomic Data.


Journal

Systematic biology
ISSN: 1076-836X
Titre abrégé: Syst Biol
Pays: England
ID NLM: 9302532

Informations de publication

Date de publication:
11 08 2021
Historique:
received: 08 01 2020
revised: 10 02 2021
accepted: 17 02 2021
pubmed: 23 2 2021
medline: 15 12 2021
entrez: 22 2 2021
Statut: ppublish

Résumé

Topological conflict or incongruence is widespread in phylogenomic data. Concatenation- and coalescent-based approaches often result in incongruent topologies, but the causes of this conflict can be difficult to characterize. We examined incongruence stemming from conflict the between likelihood-based signal (quantified by the difference in gene-wise log-likelihood score or $\Delta $GLS) and quartet-based topological signal (quantified by the difference in gene-wise quartet score or $\Delta $GQS) for every gene in three phylogenomic studies in animals, fungi, and plants, which were chosen because their concatenation-based IQ-TREE (T1) and quartet-based ASTRAL (T2) phylogenies are known to produce eight conflicting internal branches (bipartitions). By comparing the types of phylogenetic signal for all genes in these three data matrices, we found that 30-36% of genes in each data matrix are inconsistent, that is, each of these genes has a higher log-likelihood score for T1 versus T2 (i.e., $\Delta $GLS $>$0) whereas its T1 topology has lower quartet score than its T2 topology (i.e., $\Delta $GQS $<$0) or vice versa. Comparison of inconsistent and consistent genes using a variety of metrics (e.g., evolutionary rate, gene tree topology, distribution of branch lengths, hidden paralogy, and gene tree discordance) showed that inconsistent genes are more likely to recover neither T1 nor T2 and have higher levels of gene tree discordance than consistent genes. Simulation analyses demonstrate that the removal of inconsistent genes from data sets with low levels of incomplete lineage sorting (ILS) and low and medium levels of gene tree estimation error (GTEE) reduced incongruence and increased accuracy. In contrast, removal of inconsistent genes from data sets with medium and high ILS levels and high GTEE levels eliminated or extensively reduced incongruence, but the resulting congruent species phylogenies were not always topologically identical to the true species trees.[Conflict; gene tree; phylogenetic signal; phylogenetics; phylogenomics; Tree of Life.].

Identifiants

pubmed: 33616672
pii: 6146422
doi: 10.1093/sysbio/syab011
doi:

Banques de données

Dryad
['10.5061/dryad.9p8cz8wc5']

Types de publication

Journal Article Research Support, Non-U.S. Gov't Research Support, U.S. Gov't, Non-P.H.S.

Langues

eng

Sous-ensembles de citation

IM

Pagination

997-1014

Subventions

Organisme : Advanced Computing Center for Research and Education (ACCRE) at Vanderbilt University
Organisme : National Natural Science Foundation of China
ID : 32071665
Organisme : Howard Hughes Medical Institute
Pays : United States

Informations de copyright

© The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists. All rights reserved. For permissions, please email: journals.permissions@oup.com.

Auteurs

Xing-Xing Shen (XX)

State Key Laboratory of Rice Biology and Ministry of Agriculture Key Lab of Molecular Biology of Crop Pathogens and Insects, Department of Plant Protection, Zhejiang University, 866 Yuhangtang Road, Hangzhou 310058, China.
Institute of Insect Sciences, Department of Plant Protection, Zhejiang University, 866 Yuhangtang Road, Hangzhou 310058, China.

Jacob L Steenwyk (JL)

Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.

Antonis Rokas (A)

Department of Biological Sciences, Vanderbilt University, VU Station B #35-1634, Nashville, TN 37235, USA.

Articles similaires

Genome, Chloroplast Phylogeny Genetic Markers Base Composition High-Throughput Nucleotide Sequencing
Robotic Surgical Procedures Animals Humans Telemedicine Models, Animal

Odour generalisation and detection dog training.

Lyn Caldicott, Thomas W Pike, Helen E Zulch et al.
1.00
Animals Odorants Dogs Generalization, Psychological Smell
Animals TOR Serine-Threonine Kinases Colorectal Neoplasms Colitis Mice

Classifications MeSH