Interrogating Genomic-Scale Data to Resolve Recalcitrant Nodes in the Spider Tree of Life.
Araneae
noncoding regions
phylogeny
target-capture
transcriptomics
Journal
Molecular biology and evolution
ISSN: 1537-1719
Titre abrégé: Mol Biol Evol
Pays: United States
ID NLM: 8501455
Informations de publication
Date de publication:
09 03 2021
09 03 2021
Historique:
pubmed:
29
9
2020
medline:
25
6
2021
entrez:
28
9
2020
Statut:
ppublish
Résumé
Genome-scale data sets are converging on robust, stable phylogenetic hypotheses for many lineages; however, some nodes have shown disagreement across classes of data. We use spiders (Araneae) as a system to identify the causes of incongruence in phylogenetic signal between three classes of data: exons (as in phylotranscriptomics), noncoding regions (included in ultraconserved elements [UCE] analyses), and a combination of both (as in UCE analyses). Gene orthologs, coded as amino acids and nucleotides (with and without third codon positions), were generated by querying published transcriptomes for UCEs, recovering 1,931 UCE loci (codingUCEs). We expected that congeners represented in the codingUCE and UCEs data would form clades in the presence of phylogenetic signal. Noncoding regions derived from UCE sequences were recovered to test the stability of relationships. Phylogenetic relationships resulting from all analyses were largely congruent. All nucleotide data sets from transcriptomes, UCEs, or a combination of both recovered similar topologies in contrast with results from transcriptomes analyzed as amino acids. Most relationships inferred from low-occupancy data sets, containing several hundreds of loci, were congruent across Araneae, as opposed to high occupancy data matrices with fewer loci, which showed more variation. Furthermore, we found that low-occupancy data sets analyzed as nucleotides (as is typical of UCE data sets) can result in more congruent relationships than high occupancy data sets analyzed as amino acids (as in phylotranscriptomics). Thus, omitting data, through amino acid translation or via retention of only high occupancy loci, may have a deleterious effect in phylogenetic reconstruction.
Identifiants
pubmed: 32986823
pii: 5912541
doi: 10.1093/molbev/msaa251
pmc: PMC7947752
doi:
Types de publication
Comparative Study
Journal Article
Research Support, Non-U.S. Gov't
Research Support, U.S. Gov't, Non-P.H.S.
Langues
eng
Sous-ensembles de citation
IM
Pagination
891-903Informations de copyright
© The Author(s) 2020. Published by Oxford University Press on behalf of the Society for Molecular Biology and Evolution.
Références
Appl Plant Sci. 2016 Jul 12;4(7):
pubmed: 27437175
Insect Sci. 2014 Dec;21(6):687-98
pubmed: 24167122
Curr Biol. 2014 Aug 4;24(15):1765-71
pubmed: 25042592
Nat Genet. 2017 Jun;49(6):895-903
pubmed: 28459453
Syst Biol. 2016 Jan;65(1):128-45
pubmed: 26330450
Mol Biol Evol. 2013 Sep;30(9):2145-56
pubmed: 23813979
Syst Biol. 2021 Jan 1;70(1):14-20
pubmed: 32497195
Syst Biol. 2014 Nov;63(6):919-32
pubmed: 25077515
Syst Biol. 2009 Feb;58(1):130-45
pubmed: 20525573
Genome Biol Evol. 2013;5(11):2082-92
pubmed: 24148944
Bioinformatics. 2012 Dec 1;28(23):3150-2
pubmed: 23060610
Nature. 2003 Oct 23;425(6960):798-804
pubmed: 14574403
Mol Biol Evol. 2013 Apr;30(4):772-80
pubmed: 23329690
Nat Biotechnol. 2011 May 15;29(7):644-52
pubmed: 21572440
Mol Biol Evol. 2013 Jan;30(1):197-214
pubmed: 22930702
Mol Ecol Resour. 2017 Jul;17(4):812-823
pubmed: 27768256
Curr Biol. 2014 Aug 4;24(15):1772-7
pubmed: 25042584
Nature. 2003 Oct 23;425(6960):782
pubmed: 14574398
Syst Biol. 2019 Nov 1;68(6):937-955
pubmed: 31135914
Proc Biol Sci. 2012 Apr 7;279(1732):1341-50
pubmed: 22048955
PeerJ. 2019 May 3;7:e6864
pubmed: 31110925
Bioinformatics. 2015 Oct 1;31(19):3210-2
pubmed: 26059717
Mol Biol Evol. 2010 Dec;27(12):2790-803
pubmed: 20616145
Nature. 2015 Oct 22;526(7574):569-73
pubmed: 26444237
Mol Ecol Resour. 2020 Jan;20(1):185-203
pubmed: 31599100
Mol Biol Evol. 2013 Sep;30(9):2134-44
pubmed: 23813978
Science. 2014 Dec 12;346(6215):1320-31
pubmed: 25504713
BMC Biol. 2017 Jul 31;15(1):62
pubmed: 28756775
Genome Biol Evol. 2018 Jun 1;10(6):1375-1388
pubmed: 29788113
Nat Methods. 2012 Mar 04;9(4):357-9
pubmed: 22388286
Mol Biol Evol. 2014 Nov;31(11):2963-84
pubmed: 25107551
Mol Phylogenet Evol. 2018 Oct;127:907-918
pubmed: 29966686
Syst Biol. 2020 Jan 1;69(1):17-37
pubmed: 31062852
Mol Biol Evol. 2018 Feb 1;35(2):518-522
pubmed: 29077904
Mol Biol Evol. 2015 Jan;32(1):268-74
pubmed: 25371430
Genome Res. 2002 Apr;12(4):656-64
pubmed: 11932250
Nat Commun. 2019 May 24;10(1):2295
pubmed: 31127117
Proc Natl Acad Sci U S A. 2002 May 14;99(10):6848-53
pubmed: 12011443
PLoS Comput Biol. 2014 Jan 30;10(1):e1003447
pubmed: 24499933
Bioinformatics. 2009 Aug 1;25(15):1972-3
pubmed: 19505945
Annu Rev Entomol. 2014;59:487-512
pubmed: 24160416
Mol Phylogenet Evol. 2020 Oct;151:106899
pubmed: 32590046
Evolution. 1985 Jul;39(4):783-791
pubmed: 28561359
Nature. 2011 Feb 10;470(7333):198-203
pubmed: 21307932
Nat Methods. 2017 Jun;14(6):587-589
pubmed: 28481363
Proc Natl Acad Sci U S A. 2014 Nov 11;111(45):E4859-68
pubmed: 25355905
Syst Biol. 2019 Nov 1;68(6):896-917
pubmed: 30917194
Nat Commun. 2014 May 06;5:3765
pubmed: 24801114
Nat Protoc. 2013 Aug;8(8):1494-512
pubmed: 23845962
Mol Biol Evol. 2020 Sep 1;37(9):2727-2733
pubmed: 32365179
Curr Biol. 2018 May 7;28(9):1489-1497.e5
pubmed: 29706520
Syst Biol. 2010 May;59(3):307-21
pubmed: 20525638
Gigascience. 2015 Oct 19;4:48
pubmed: 26500767
J Hered. 2013 Sep-Oct;104(5):595-600
pubmed: 23940263
Syst Biol. 2018 Sep 1;67(5):916-924
pubmed: 29893968
PeerJ. 2016 Feb 23;4:e1719
pubmed: 26925338
Mol Phylogenet Evol. 2019 Jan;130:121-131
pubmed: 30326287