
Sequence clustering strategies improve remote homology recognitions while reducing search times.

Weizhong Li, Lukasz Jaroszewski, Adam Godzik.

Abstract

Sequence databases are growing rapidly, which increases the coverage of protein sequence space, but this coverage is uneven because most sequencing efforts have concentrated on a small number of organisms. The resulting granularity of sequence space creates many problems for profile-based sequence comparison programs. In this paper, we suggest several strategies that address these problems while also speeding up searches for homologous proteins and improving the ability of profile methods to recognize distant homologies. One of our strategies combines database clustering, which removes highly redundant sequences, with a two-step PSI-BLAST (PDB-BLAST), which separates the sequence space used for profile composition from the space used for homology searching. The combination of these strategies improves distant homology recognition by more than 100%, while using only 10% of the CPU time of the standard PSI-BLAST search. Another method, intermediate profile searches, allows for the exploration of additional search directions that are normally dominated by large protein sub-families within very diverse families. All methods are evaluated with a large fold-recognition benchmark.
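
The database-clustering idea mentioned in the abstract (collapsing highly redundant sequences to representatives before building profiles) can be illustrated with a toy sketch. The abstract does not state which clustering algorithm, similarity measure, or threshold the authors used, so everything below is an assumption made for illustration: the function names `similarity` and `greedy_cluster`, the 0.9 threshold, and the use of Python's `difflib.SequenceMatcher` ratio as a crude stand-in for alignment-based sequence identity.

```python
from difflib import SequenceMatcher
from typing import Dict, List


def similarity(a: str, b: str) -> float:
    """Crude stand-in for sequence identity (assumption: NOT an alignment score)."""
    return SequenceMatcher(None, a, b, autojunk=False).ratio()


def greedy_cluster(seqs: Dict[str, str], threshold: float = 0.9) -> Dict[str, List[str]]:
    """Greedy redundancy removal.

    Sequences are visited from longest to shortest. Each one joins the first
    existing representative it matches at or above `threshold`; otherwise it
    becomes a new representative. Returns {representative_id: [member_ids]}.
    """
    clusters: Dict[str, List[str]] = {}
    # Sort by descending length, then by id so the order is deterministic.
    for seq_id in sorted(seqs, key=lambda k: (-len(seqs[k]), k)):
        for rep_id in clusters:
            if similarity(seqs[seq_id], seqs[rep_id]) >= threshold:
                clusters[rep_id].append(seq_id)
                break
        else:
            clusters[seq_id] = [seq_id]
    return clusters


if __name__ == "__main__":
    # Hypothetical toy sequences, not taken from the paper.
    full = "MKTAYIAKQRQISFVKSHFSRQLEERLGLIEVQ"
    toy = {
        "a1": full,
        "a2": full[:-1],  # near-duplicate of a1, one residue shorter
        "b1": "GSHMLEDPVDAFKWWNNGGPPTTCC",  # unrelated sequence
    }
    for rep, members in greedy_cluster(toy).items():
        print(rep, members)
```

In this sketch only the representatives (the dictionary keys) would be kept in the reduced database. A real pipeline would use an alignment-based identity measure and a dedicated clustering tool rather than this quadratic toy loop.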


Year:  2002        PMID: 12364578     DOI: 10.1093/protein/15.8.643

Source DB:  PubMed          Journal:  Protein Eng        ISSN: 0269-2139


