Literature DB >> 32096820

Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.

Vladimir Perovic1, Jeremy Y Leclercq2, Neven Sumonja1, Francois D Richard2,3, Nevena Veljkovic1, Andrey V Kajava2.   

Abstract

MOTIVATION: Proteins containing tandem repeats (TRs) are abundant, frequently fold in elongated non-globular structures and perform vital functions. A number of computational tools have been developed to detect TRs in protein sequences. A blurred boundary between imperfect TR motifs and non-repetitive sequences gave rise to necessity to validate the detected TRs.
RESULTS: Tally-2.0 is a scoring tool based on a machine learning (ML) approach, which allows to validate the results of TR detection. It was upgraded by using improved training datasets and additional ML features. Tally-2.0 performs at a level of 93% sensitivity, 83% specificity and an area under the receiver operating characteristic curve of 95%.
AVAILABILITY AND IMPLEMENTATION: Tally-2.0 is available, as a web tool and as a standalone application published under Apache License 2.0, on the URL https://bioinfo.crbm.cnrs.fr/index.php? route=tools&tool=27. It is supported on Linux. Source code is available upon request. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2020. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32096820      PMCID: PMC7214015          DOI: 10.1093/bioinformatics/btaa121

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  13 in total

1.  A census of protein repeats.

Authors:  E M Marcotte; M Pellegrini; T O Yeates; D Eisenberg
Journal:  J Mol Biol       Date:  1999-10-15       Impact factor: 5.469

Review 2.  The leucine-rich repeat as a protein recognition motif.

Authors:  B Kobe; A V Kajava
Journal:  Curr Opin Struct Biol       Date:  2001-12       Impact factor: 6.809

3.  Tracking repeats using significance and transitivity.

Authors:  Radek Szklarczyk; Jaap Heringa
Journal:  Bioinformatics       Date:  2004-08-04       Impact factor: 6.937

Review 4.  Application of the EIIP/ISM bioinformatics concept in development of new drugs.

Authors:  V Veljkovic; N Veljkovic; J A Esté; A Hüther; U Dietrich
Journal:  Curr Med Chem       Date:  2007       Impact factor: 4.530

Review 5.  A review of feature selection techniques in bioinformatics.

Authors:  Yvan Saeys; Iñaki Inza; Pedro Larrañaga
Journal:  Bioinformatics       Date:  2007-08-24       Impact factor: 6.937

6.  T-REKS: identification of Tandem REpeats in sequences with a K-meanS based algorithm.

Authors:  Julien Jorda; Andrey V Kajava
Journal:  Bioinformatics       Date:  2009-08-11       Impact factor: 6.937

7.  Tally: a scoring tool for boundary determination between repetitive and non-repetitive protein sequences.

Authors:  François D Richard; Ronnie Alves; Andrey V Kajava
Journal:  Bioinformatics       Date:  2016-03-07       Impact factor: 6.937

8.  Cluster analysis of amino acid indices for prediction of protein structure and function.

Authors:  K Nakai; A Kidera; M Kanehisa
Journal:  Protein Eng       Date:  1988-07

9.  HEAT repeats in the Huntington's disease protein.

Authors:  M A Andrade; P Bork
Journal:  Nat Genet       Date:  1995-10       Impact factor: 38.330

Review 10.  Tandem Repeats in Proteins: Prediction Algorithms and Biological Role.

Authors:  Marco Pellegrini
Journal:  Front Bioeng Biotechnol       Date:  2015-09-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.