Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.


Journal

Bioinformatics (Oxford, England)
ISSN: 1367-4811
Titre abrégé: Bioinformatics
Pays: England
ID NLM: 9808944

Informations de publication

Date de publication:
01 05 2020
Historique:
received: 28 11 2019
revised: 02 02 2020
accepted: 18 02 2020
pubmed: 26 2 2020
medline: 30 10 2020
entrez: 26 2 2020
Statut: ppublish

Résumé

Proteins containing tandem repeats (TRs) are abundant, frequently fold in elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. A blurred boundary between imperfect TR motifs and non-repetitive sequences gave rise to necessity to validate the detected TRs. Tally-2.0 is a scoring tool based on a machine learning (ML) approach, which allows to validate the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 performs at a level of 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%. Tally-2.0 is available, as a web tool and as a standalone application published under Apache License 2.0, on the URL https://bioinfo.crbm.cnrs.fr/index.php? route=tools&tool=27. It is supported on Linux. Source code is available upon request. Supplementary data are available at Bioinformatics online.

Identifiants

pubmed: 32096820
pii: 5756200
doi: 10.1093/bioinformatics/btaa121
pmc: PMC7214015
doi:

Substances chimiques

Proteins 0

Types de publication

Journal Article Research Support, N.I.H., Extramural Research Support, Non-U.S. Gov't

Langues

eng

Sous-ensembles de citation

IM

Pagination

3260-3262

Informations de copyright

© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Références

Bioinformatics. 2004 Aug 4;20 Suppl 1:i311-7
pubmed: 15262814
J Mol Biol. 1999 Oct 15;293(1):151-60
pubmed: 10512723
Bioinformatics. 2016 Jul 1;32(13):1952-8
pubmed: 27153701
Bioinformatics. 2009 Oct 15;25(20):2632-8
pubmed: 19671691
Nat Genet. 1995 Oct;11(2):115-6
pubmed: 7550332
Front Bioeng Biotechnol. 2015 Sep 24;3:143
pubmed: 26442257
Curr Opin Struct Biol. 2001 Dec;11(6):725-32
pubmed: 11751054
Curr Med Chem. 2007;14(4):441-53
pubmed: 17305545
Bioinformatics. 2007 Oct 1;23(19):2507-17
pubmed: 17720704
Protein Eng. 1988 Jul;2(2):93-100
pubmed: 3244698
Biochem Soc Trans. 2015 Oct;43(5):807-11
pubmed: 26517886
Bioinformatics. 2008 Mar 15;24(6):807-14
pubmed: 18245125
J Struct Biol. 2012 Sep;179(3):279-88
pubmed: 21884799

Auteurs

Vladimir Perovic (V)

Laboratory for Bioinformatics and Computational Chemistry, Institute of Nuclear Sciences VINCA, University of Belgrade, Belgrade 11001, Serbia.

Jeremy Y Leclercq (JY)

Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237 CNRS, Université de Montpellier, Montpellier 34293, France.

Neven Sumonja (N)

Laboratory for Bioinformatics and Computational Chemistry, Institute of Nuclear Sciences VINCA, University of Belgrade, Belgrade 11001, Serbia.

Francois D Richard (FD)

Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237 CNRS, Université de Montpellier, Montpellier 34293, France.
Laboratory for Translational Breast Cancer Research, Department of Oncology, KU Leuven, Leuven 3000, Belgium.

Nevena Veljkovic (N)

Laboratory for Bioinformatics and Computational Chemistry, Institute of Nuclear Sciences VINCA, University of Belgrade, Belgrade 11001, Serbia.

Andrey V Kajava (AV)

Centre de Recherche en Biologie cellulaire de Montpellier, UMR 5237 CNRS, Université de Montpellier, Montpellier 34293, France.

Articles similaires

Selecting optimal software code descriptors-The case of Java.

Yegor Bugayenko, Zamira Kholmatova, Artem Kruglov et al.
1.00
Software Algorithms Programming Languages
Databases, Protein Protein Domains Protein Folding Proteins Deep Learning
Animals Hemiptera Insect Proteins Phylogeny Insecticides

Exploring blood-brain barrier passage using atomic weighted vector and machine learning.

Yoan Martínez-López, Paulina Phoobane, Yanaima Jauriga et al.
1.00
Blood-Brain Barrier Machine Learning Humans Support Vector Machine Software

Classifications MeSH