Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.
Journal
Bioinformatics (Oxford, England)
ISSN: 1367-4811
Abbreviated title: Bioinformatics
Country: England
NLM ID: 9808944
Publication information
Publication date:
2020-05-01
History:
received: 2019-11-28
revised: 2020-02-02
accepted: 2020-02-18
pubmed: 2020-02-26
medline: 2020-10-30
entrez: 2020-02-26
Status:
ppublish
Abstract
Proteins containing tandem repeats (TRs) are abundant, frequently fold into elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. Because the boundary between imperfect TR motifs and non-repetitive sequences is blurred, the detected TRs need to be validated. Tally-2.0 is a scoring tool based on a machine learning (ML) approach that validates the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 achieves 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%. Tally-2.0 is available as a web tool and as a standalone application, published under the Apache License 2.0, at https://bioinfo.crbm.cnrs.fr/index.php?route=tools&tool=27. It is supported on Linux. Source code is available upon request. Supplementary data are available at Bioinformatics online.
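For readers unfamiliar with the three metrics quoted in the abstract, the following is a minimal sketch of how sensitivity, specificity and the area under the ROC curve are generally computed for a binary scorer. It is not the Tally-2.0 implementation: the function names, the 0.5 threshold and the toy labels and scores are all hypothetical and chosen only for illustration.

```python
from typing import Sequence, Tuple


def confusion_counts(labels: Sequence[int], scores: Sequence[float],
                     threshold: float) -> Tuple[int, int, int, int]:
    """Count (tp, fp, tn, fn); a score >= threshold is called positive."""
    tp = fp = tn = fn = 0
    for y, s in zip(labels, scores):
        predicted_positive = s >= threshold
        if y and predicted_positive:
            tp += 1
        elif y and not predicted_positive:
            fn += 1
        elif not y and predicted_positive:
            fp += 1
        else:
            tn += 1
    return tp, fp, tn, fn


def sensitivity(tp: int, fn: int) -> float:
    """Fraction of true positives that are recovered."""
    return tp / (tp + fn)


def specificity(tn: int, fp: int) -> float:
    """Fraction of true negatives that are correctly rejected."""
    return tn / (tn + fp)


def roc_auc(labels: Sequence[int], scores: Sequence[float]) -> float:
    """AUC as the probability that a random positive outscores a random
    negative (ties count as one half)."""
    pos = [s for y, s in zip(labels, scores) if y]
    neg = [s for y, s in zip(labels, scores) if not y]
    wins = 0.0
    for p in pos:
        for n in neg:
            if p > n:
                wins += 1.0
            elif p == n:
                wins += 0.5
    return wins / (len(pos) * len(neg))


# Hypothetical toy data: 1 = genuine tandem repeat, 0 = non-repetitive.
toy_labels = [1, 1, 1, 1, 0, 0, 0, 0]
toy_scores = [0.9, 0.8, 0.7, 0.3, 0.6, 0.4, 0.2, 0.1]

tp, fp, tn, fn = confusion_counts(toy_labels, toy_scores, threshold=0.5)
toy_sensitivity = sensitivity(tp, fn)
toy_specificity = specificity(tn, fp)
toy_auc = roc_auc(toy_labels, toy_scores)
```

Sensitivity and specificity depend on the chosen score threshold, whereas the AUC summarizes performance over all thresholds, which is why the abstract reports all three.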
Identifiers
pubmed: 32096820
pii: 5756200
doi: 10.1093/bioinformatics/btaa121
pmc: PMC7214015
Chemical substances
Proteins
0
Publication types
Journal Article
Research Support, N.I.H., Extramural
Research Support, Non-U.S. Gov't
Languages
eng
Citation subsets
IM
Pagination
3260-3262
Copyright information
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.