A Comprehensive Phylogenomic Platform for Exploring the Angiosperm Tree of Life.
Journal
Systematic biology
ISSN: 1076-836X
Titre abrégé: Syst Biol
Pays: England
ID NLM: 9302532
Informations de publication
Date de publication:
10 02 2022
10 02 2022
Historique:
received:
23
02
2021
revised:
06
05
2021
accepted:
08
05
2021
pubmed:
14
5
2021
medline:
23
3
2022
entrez:
13
5
2021
Statut:
ppublish
Résumé
The tree of life is the fundamental biological roadmap for navigating the evolution and properties of life on Earth, and yet remains largely unknown. Even angiosperms (flowering plants) are fraught with data gaps, despite their critical role in sustaining terrestrial life. Today, high-throughput sequencing promises to significantly deepen our understanding of evolutionary relationships. Here, we describe a comprehensive phylogenomic platform for exploring the angiosperm tree of life, comprising a set of open tools and data based on the 353 nuclear genes targeted by the universal Angiosperms353 sequence capture probes. The primary goals of this article are to (i) document our methods, (ii) describe our first data release, and (iii) present a novel open data portal, the Kew Tree of Life Explorer (https://treeoflife.kew.org). We aim to generate novel target sequence capture data for all genera of flowering plants, exploiting natural history collections such as herbarium specimens, and augment it with mined public data. Our first data release, described here, is the most extensive nuclear phylogenomic data set for angiosperms to date, comprising 3099 samples validated by DNA barcode and phylogenetic tests, representing all 64 orders, 404 families (96$\%$) and 2333 genera (17$\%$). A "first pass" angiosperm tree of life was inferred from the data, which totaled 824,878 sequences, 489,086,049 base pairs, and 532,260 alignment columns, for interactive presentation in the Kew Tree of Life Explorer. This species tree was generated using methods that were rigorous, yet tractable at our scale of operation. Despite limitations pertaining to taxon and gene sampling, gene recovery, models of sequence evolution and paralogy, the tree strongly supports existing taxonomy, while challenging numerous hypothesized relationships among orders and placing many genera for the first time. The validated data set, species tree and all intermediates are openly accessible via the Kew Tree of Life Explorer and will be updated as further data become available. This major milestone toward a complete tree of life for all flowering plant species opens doors to a highly integrated future for angiosperm phylogenomics through the systematic sequencing of standardized nuclear markers. Our approach has the potential to serve as a much-needed bridge between the growing movement to sequence the genomes of all life on Earth and the vast phylogenomic potential of the world's natural history collections. [Angiosperms; Angiosperms353; genomics; herbariomics; museomics; nuclear phylogenomics; open access; target sequence capture; tree of life.].
Identifiants
pubmed: 33983440
pii: 6275244
doi: 10.1093/sysbio/syab035
pmc: PMC8830076
doi:
Banques de données
Dryad
['10.5061/dryad.ns1rn8ps7']
Types de publication
Journal Article
Research Support, Non-U.S. Gov't
Langues
eng
Sous-ensembles de citation
IM
Pagination
301-319Informations de copyright
© The Author(s) 2021. Published by Oxford University Press, on behalf of the Society of Systematic Biologists.
Références
Gigascience. 2019 Oct 1;8(10):
pubmed: 31644802
New Phytol. 2018 Oct;220(2):636-650
pubmed: 30016546
Front Plant Sci. 2020 Mar 20;11:258
pubmed: 32265950
Bioinformatics. 2014 Aug 1;30(15):2114-20
pubmed: 24695404
Cold Spring Harb Protoc. 2010 Jun;2010(6):pdb.prot5448
pubmed: 20516186
Front Plant Sci. 2019 Sep 20;10:1161
pubmed: 31616452
Syst Biol. 2019 Jul 1;68(4):594-606
pubmed: 30535394
IEEE Trans Vis Comput Graph. 2011 Dec;17(12):2301-9
pubmed: 22034350
Mol Phylogenet Evol. 2021 Apr;157:107067
pubmed: 33412273
Nat Plants. 2019 May;5(5):461-470
pubmed: 31061536
Appl Plant Sci. 2021 May 18;9(7):
pubmed: 34336401
Plant Commun. 2020 Feb 04;1(2):100027
pubmed: 33367231
Nature. 2019 Oct;574(7780):679-685
pubmed: 31645766
Gigascience. 2020 Feb 1;9(2):
pubmed: 32043527
Appl Plant Sci. 2021 Jan 24;9(1):e11406
pubmed: 33552748
Gigascience. 2018 Mar 1;7(3):1-9
pubmed: 29618049
New Phytol. 2018 Sep;219(4):1170-1187
pubmed: 29577323
Genome Biol. 2015 Jun 16;16:124
pubmed: 26076734
Appl Plant Sci. 2016 Jul 12;4(7):
pubmed: 27437175
Nat Commun. 2021 Jun 9;12(1):3498
pubmed: 34108452
New Phytol. 2019 May;222(3):1638-1651
pubmed: 30735246
Appl Plant Sci. 2014 Aug 29;2(9):
pubmed: 25225629
Nat Commun. 2019 Feb 25;10(1):934
pubmed: 30804347
Genome Biol. 2020 Sep 10;21(1):241
pubmed: 32912315
Am J Bot. 2018 Mar;105(3):291-301
pubmed: 29603143
Front Plant Sci. 2020 Jan 09;10:1655
pubmed: 31998342
Am J Bot. 2021 Jul;108(7):1166-1180
pubmed: 34250591
Appl Plant Sci. 2021 Jul 07;9(7):
pubmed: 34336398
PLoS One. 2014 Jul 07;9(7):e98986
pubmed: 24999823
Am J Bot. 2018 Mar;105(3):614-622
pubmed: 29603138
Appl Plant Sci. 2019 Jun 13;7(6):e11254
pubmed: 31236313
Syst Biol. 2012 Oct;61(5):727-44
pubmed: 22605266
Trends Plant Sci. 2019 Oct;24(10):887-891
pubmed: 31477409
Appl Plant Sci. 2020 May 09;8(5):e11345
pubmed: 32477841
PeerJ. 2016 Jan 28;4:e1660
pubmed: 26835189
Am J Bot. 2021 Jul;108(7):1059-1065
pubmed: 34293179
Bioinformatics. 2017 Sep 15;33(18):2946-2947
pubmed: 28525531
Mol Biol Evol. 2016 Jul;33(7):1654-68
pubmed: 27189547
Front Plant Sci. 2019 Jan 09;9:1941
pubmed: 30687347
Mol Biol Evol. 2020 May 1;37(5):1530-1534
pubmed: 32011700
Syst Biol. 2012 Oct;61(5):717-26
pubmed: 22232343
Bioinformatics. 2010 Jul 1;26(13):1669-70
pubmed: 20472542
Mol Biol Evol. 2018 Feb 1;35(2):518-522
pubmed: 29077904
Am J Bot. 2011 Apr;98(4):704-30
pubmed: 21613169
Appl Plant Sci. 2021 Jun 14;9(7):
pubmed: 34336399
Front Plant Sci. 2019 Jul 12;10:864
pubmed: 31396244
PeerJ. 2017 Jul 25;5:e3569
pubmed: 28761782
BMC Genomics. 2018 May 8;19(Suppl 5):272
pubmed: 29745847
Appl Plant Sci. 2018 Mar 31;6(3):e1032
pubmed: 29732262
Appl Plant Sci. 2014 Feb 06;2(2):
pubmed: 25202605
Proc Natl Acad Sci U S A. 2015 Oct 13;112(41):12764-9
pubmed: 26385966
Front Plant Sci. 2019 Sep 18;10:1102
pubmed: 31620145
Proc Natl Acad Sci U S A. 2018 Apr 24;115(17):4325-4333
pubmed: 29686065
Proc Natl Acad Sci U S A. 2014 Nov 11;111(45):E4859-68
pubmed: 25355905
BMC Bioinformatics. 2018 May 8;19(Suppl 6):153
pubmed: 29745866
Mol Phylogenet Evol. 2020 Mar;144:106668
pubmed: 31682924
BMC Bioinformatics. 2009 Dec 15;10:421
pubmed: 20003500
Mol Phylogenet Evol. 2021 Apr;157:107068
pubmed: 33422648
Nature. 2009 Sep 10;461(7261):168-70
pubmed: 19741685
Plants (Basel). 2020 Apr 01;9(4):
pubmed: 32244605
Ann Bot. 2019 Feb 15;123(3):491-503
pubmed: 30376040
Appl Plant Sci. 2020 Apr 14;8(4):e11337
pubmed: 32351798
Am J Bot. 2018 Mar;105(3):302-314
pubmed: 29746720