Revising transcriptome assemblies with phylogenetic information.


Journal

PloS one
ISSN: 1932-6203
Abbreviated title: PLoS One
Country: United States
NLM ID: 101285081

Publication information

Publication date:
2021
History:
received: 2020-07-06
accepted: 2020-12-04
entrez: 2021-01-12
pubmed: 2021-01-13
medline: 2021-04-24
Status: epublish

Abstract

A common transcriptome assembly error is to mistake different transcripts of the same gene for transcripts from multiple closely related genes. This error is difficult to identify during assembly, but in a phylogenetic analysis it can be diagnosed from gene phylogenies, where it appears as clades of tips from the same species with improbably short branch lengths. treeinform is a method that uses phylogenetic information across species to refine transcriptome assemblies within species. It identifies transcripts of the same gene that were incorrectly assigned to multiple genes and reassigns them as transcripts of the same gene. The treeinform method is implemented in Agalma, available at https://bitbucket.org/caseywdunn/agalma, and the general approach is relevant in a variety of other contexts.
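
The diagnostic described in the abstract can be illustrated with a minimal sketch. This is not the Agalma implementation. It assumes that a clade is flagged when all of its tips come from one species and the summed branch length below the clade's root falls under a user-chosen threshold. The `Node` class, the function names, the `merged_gene_` labels, the example tree, and the threshold value are all hypothetical and chosen only for illustration.

```python
from dataclasses import dataclass, field
from typing import Dict, List, Optional


@dataclass
class Node:
    """A node in a rooted gene tree; `length` is the branch leading to it."""
    length: float = 0.0
    species: Optional[str] = None      # set on tips only
    transcript: Optional[str] = None   # set on tips only
    children: List["Node"] = field(default_factory=list)


def tips(node: Node) -> List[Node]:
    """Collect all tips (leaves) at or below `node`."""
    if not node.children:
        return [node]
    return [t for child in node.children for t in tips(child)]


def subtree_length(node: Node) -> float:
    """Sum of branch lengths below `node`, excluding its own branch."""
    return sum(c.length + subtree_length(c) for c in node.children)


def candidate_clades(node: Node, threshold: float) -> List[Node]:
    """Return the largest single-species clades shorter than `threshold`."""
    if not node.children:
        return []
    species = {t.species for t in tips(node)}
    if len(species) == 1 and subtree_length(node) < threshold:
        return [node]
    return [c for child in node.children
            for c in candidate_clades(child, threshold)]


def reassign(root: Node, threshold: float) -> Dict[str, str]:
    """Map each transcript in a flagged clade to one shared gene label."""
    mapping: Dict[str, str] = {}
    for i, clade in enumerate(candidate_clades(root, threshold)):
        for tip in tips(clade):
            mapping[tip.transcript] = f"merged_gene_{i}"
    return mapping


# Hypothetical gene tree: two sp_A tips on very short branches, one sp_B tip.
tree = Node(children=[
    Node(length=0.30, children=[
        Node(length=0.001, species="sp_A", transcript="A_t1"),
        Node(length=0.002, species="sp_A", transcript="A_t2"),
    ]),
    Node(length=0.25, species="sp_B", transcript="B_t1"),
])

print(reassign(tree, threshold=0.01))
```

In this toy tree the two sp_A transcripts receive the same gene label, while the sp_B transcript is left untouched. The threshold is an arbitrary illustrative value; a sensible cutoff for real data would have to be chosen from the data themselves.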

Identifiers

pubmed: 33434218
doi: 10.1371/journal.pone.0244202
pii: PONE-D-20-20890
pmc: PMC7802918

Publication types

Journal Article; Research Support, N.I.H., Extramural; Research Support, U.S. Gov't, Non-P.H.S.

Languages

eng

Citation subsets

IM

Pagination

e0244202

Grants

Agency: NIGMS NIH HHS
ID: P20 GM109035
Country: United States

Conflict of interest statement

The authors have declared that no competing interests exist.

Authors

August Guang (A)

Center for Computational Biology of Human Disease, Brown University, Providence, RI, United States of America.
Center for Computation and Visualization, Brown University, Providence, RI, United States of America.

Mark Howison (M)

Research Improving People's Lives, Providence, RI, United States of America.

Felipe Zapata (F)

Department of Ecology & Evolutionary Biology, University of California-Los Angeles, Los Angeles, CA, United States of America.

Charles Lawrence (C)

Department of Applied Mathematics, Brown University, Providence, RI, United States of America.

Casey W Dunn (CW)

Department of Ecology & Evolutionary Biology, Yale University, New Haven, CT, United States of America.
