T-REKS: identification of Tandem REpeats in sequences with a K-meanS based algorithm.

Julien Jorda, Andrey V Kajava.

Abstract

MOTIVATION: Over recent years, evidence has accumulated that tandem repeats occur frequently in proteins that carry out fundamental biological functions and are linked to a number of human diseases. At the same time, protein repeats often degenerate strongly during evolution and therefore cannot be easily identified. To address this problem, several computer programs based on different algorithms have been developed. Nevertheless, our tests showed that there is still room for improvement in methods for accurate and rapid detection of tandem repeats in proteins.
RESULTS: We developed a new program, T-REKS, for ab initio identification of tandem repeats. It is based on clustering the lengths between identical short strings using a K-means algorithm. A benchmark of existing programs and T-REKS on several sequence datasets is presented. Linked to the Protein Repeat DataBase, our program opens the way for large-scale analysis of protein tandem repeats. T-REKS can also be applied to nucleotide sequences.
AVAILABILITY: The algorithm has been implemented in Java; the program is available upon request at http://bioinfo.montp.cnrs.fr/?r=t-reks. The Protein Repeat DataBase generated using T-REKS is accessible at http://bioinfo.montp.cnrs.fr/?r=repeatDB.
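
The RESULTS description gives the core idea in one sentence: cluster the lengths between identical short strings with K-means. The Python sketch below illustrates that idea only and is not the T-REKS implementation, which the abstract says is written in Java and whose details are not given here. The function names (`kmer_gaps`, `kmeans_1d`, `candidate_period`), the short-string length `k=2`, the cluster count, the evenly spaced initial centers, and the rule of taking the most populated cluster as the candidate repeat length are all assumptions made for illustration.

```python
def kmer_gaps(seq, k=2):
    """Distances between consecutive occurrences of each identical k-mer.

    Assumption: "short strings" are fixed-length k-mers; k=2 is arbitrary.
    """
    last_seen = {}
    gaps = []
    for i in range(len(seq) - k + 1):
        kmer = seq[i:i + k]
        if kmer in last_seen:
            gaps.append(i - last_seen[kmer])
        last_seen[kmer] = i
    return gaps


def kmeans_1d(values, n_clusters=2, n_iter=50):
    """Plain Lloyd's k-means on scalars; returns (centers, labels)."""
    lo, hi = min(values), max(values)
    # Deterministic initialisation: centers spread evenly over the value range.
    if n_clusters == 1:
        centers = [(lo + hi) / 2]
    else:
        centers = [lo + (hi - lo) * j / (n_clusters - 1)
                   for j in range(n_clusters)]
    labels = [0] * len(values)
    for _ in range(n_iter):
        # Assignment step: nearest center for every value.
        labels = [min(range(n_clusters), key=lambda j: abs(v - centers[j]))
                  for v in values]
        # Update step: mean of each cluster (empty clusters keep their center).
        new_centers = []
        for j in range(n_clusters):
            members = [v for v, lab in zip(values, labels) if lab == j]
            new_centers.append(sum(members) / len(members) if members else centers[j])
        if new_centers == centers:
            break
        centers = new_centers
    return centers, labels


def candidate_period(seq, k=2, n_clusters=2):
    """Hypothetical helper: center of the most populated gap cluster, rounded."""
    gaps = kmer_gaps(seq, k)
    if not gaps:
        return None
    n = min(n_clusters, len(set(gaps)))
    centers, labels = kmeans_1d(gaps, n)
    counts = [labels.count(j) for j in range(n)]
    best = max(range(n), key=counts.__getitem__)
    return round(centers[best])


if __name__ == "__main__":
    # A perfect tandem repeat of a 6-residue unit.
    print(candidate_period("ACDEFG" * 4))
```

For a perfect repeat every recurring 2-mer is separated by the same distance, so a single cluster results. In degenerate repeats the gaps scatter, and the clustering step is what groups the gaps that belong to the same underlying period; a real detector would still need to align and score the candidate units.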

Year:  2009        PMID: 19671691     DOI: 10.1093/bioinformatics/btp482

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  63 in total

1.  The Argonaute-binding platform of NRPE1 evolves through modulation of intrinsically disordered repeats.

Authors:  Joshua T Trujillo; Mark A Beilstein; Rebecca A Mosher
Journal:  New Phytol       Date:  2016-07-19       Impact factor: 10.151

2.  Genetic diversity of the Plasmodium vivax merozoite surface protein-5 locus from diverse geographic origins.

Authors:  Chaturong Putaporntip; Rachanee Udomsangpetch; Urassaya Pattanawong; Liwang Cui; Somchai Jongwutiwes
Journal:  Gene       Date:  2010-02-21       Impact factor: 3.688

3.  Comparison of Listeria monocytogenes Exoproteomes from biofilm and planktonic state: Lmo2504, a protein associated with biofilms.

Authors:  António Lourenço; Aitor de Las Heras; Mariela Scortti; Jose Vazquez-Boland; Joseph F Frank; Luisa Brito
Journal:  Appl Environ Microbiol       Date:  2013-07-26       Impact factor: 4.792

4.  A new way to visualize DNA's base succession: the Caenorhabditis elegans chromosome landscapes.

Authors:  Afef Elloumi Oueslati; Imen Messaoudi; Zied Lachiri; Noureddine Ellouze
Journal:  Med Biol Eng Comput       Date:  2015-05-24       Impact factor: 2.602

5.  Insights into the Evolution of Hydroxyproline-Rich Glycoproteins from 1000 Plant Transcriptomes.

Authors:  Kim L Johnson; Andrew M Cassin; Andrew Lonsdale; Gane Ka-Shu Wong; Douglas E Soltis; Nicholas W Miles; Michael Melkonian; Barbara Melkonian; Michael K Deyholos; James Leebens-Mack; Carl J Rothfels; Dennis W Stevenson; Sean W Graham; Xumin Wang; Shuangxiu Wu; J Chris Pires; Patrick P Edger; Eric J Carpenter; Antony Bacic; Monika S Doblin; Carolyn J Schultz
Journal:  Plant Physiol       Date:  2017-04-26       Impact factor: 8.340

6.  Characterization and DNA-binding specificities of Ralstonia TAL-like effectors.

Authors:  Lixin Li; Ahmed Atef; Agnieszka Piatek; Zahir Ali; Marek Piatek; Mustapha Aouida; Altanbadralt Sharakuu; Ali Mahjoub; Guangchao Wang; Suhail Khan; Nina V Fedoroff; Jian-Kang Zhu; Magdy M Mahfouz
Journal:  Mol Plant       Date:  2013-01-08       Impact factor: 13.164

7.  Modeling repetitive, non-globular proteins. (Review)

Authors:  Koli Basu; Robert L Campbell; Shuaiqi Guo; Tianjun Sun; Peter L Davies
Journal:  Protein Sci       Date:  2016-03-16       Impact factor: 6.725

8.  Search of tandem repeats with insertion and deletions in the A. thaliana genome.

Authors:  E V Korotkov; Yu M Suvorova; K G Skryabin
Journal:  Dokl Biochem Biophys       Date:  2018-01-03       Impact factor: 0.788

9.  Tally-2.0: upgraded validator of tandem repeat detection in protein sequences.

Authors:  Vladimir Perovic; Jeremy Y Leclercq; Neven Sumonja; Francois D Richard; Nevena Veljkovic; Andrey V Kajava
Journal:  Bioinformatics       Date:  2020-05-01       Impact factor: 6.937

10.  Effect of charged residues in the N-domain of Sup35 protein on prion [PSI+] stability and propagation.

Authors:  Stanislav A Bondarev; Vadim V Shchepachev; Andrey V Kajava; Galina A Zhouravleva
Journal:  J Biol Chem       Date:  2013-08-21       Impact factor: 5.157
