Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Clustering of highly homologous sequences to reduce the size of large protein databases.

Literature DB >> 11294794

Clustering of highly homologous sequences to reduce the size of large protein databases.

Abstract

We present a fast and flexible program for clustering large protein databases at different sequence identity levels. It takes less than 2 h for the all-against-all sequence comparison and clustering of the non-redundant protein database of over 560,000 sequences on a high-end PC. The output database, including only the representative sequences, can be used for more efficient and sensitive database searches.

Mesh：

Substances：
Proteins

Year: 2001 PMID： 11294794 DOI： 10.1093/bioinformatics/17.3.282

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

312 in total

1. MitoProteome: mitochondrial protein sequence database and annotation system.

Authors: Dawn Cotter; Purnima Guda; Eoin Fahy; Shankar Subramaniam
Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971

2. The distribution and query systems of the RCSB Protein Data Bank.

Authors: Philip E Bourne; Kenneth J Addess; Wolfgang F Bluhm; Li Chen; Nita Deshpande; Zukang Feng; Ward Fleri; Rachel Green; Jeffrey C Merino-Ott; Wayne Townsend-Merino; Helge Weissig; John Westbrook; Helen M Berman
Journal: Nucleic Acids Res Date: 2004-01-01 Impact factor: 16.971

3. Loss of the anaphase-promoting complex in quiescent cells causes unscheduled hepatocyte proliferation.

Authors: Karin G Wirth; Romeo Ricci; Juan F Giménez-Abián; Shahryar Taghybeeglu; Nobuaki R Kudo; Wolfram Jochum; Mireille Vasseur-Cognet; Kim Nasmyth
Journal: Genes Dev Date: 2004-01-01 Impact factor: 11.361

4. UniqueProt: Creating representative protein sequence sets.

Authors: Sven Mika; Burkhard Rost
Journal: Nucleic Acids Res Date: 2003-07-01 Impact factor: 16.971

5. Sensitivity and selectivity in protein structure comparison.

Authors: Michael L Sierk; William R Pearson
Journal: Protein Sci Date: 2004-03 Impact factor: 6.725

6. Colonic microbiome is altered in alcoholism.

Authors: Ece A Mutlu; Patrick M Gillevet; Huzefa Rangwala; Masoumeh Sikaroodi; Ammar Naqvi; Phillip A Engen; Mary Kwasny; Cynthia K Lau; Ali Keshavarzian
Journal: Am J Physiol Gastrointest Liver Physiol Date: 2012-01-12 Impact factor: 4.052

7. TBC: a clustering algorithm based on prokaryotic taxonomy.

Authors: Jae-Hak Lee; Hana Yi; Yoon-Seong Jeon; Sungho Won; Jongsik Chun
Journal: J Microbiol Date: 2012-04-27 Impact factor: 3.422

8. A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons.

Authors: Veronika Steinbauerová; Pavel Neumann; Petr Novák; Jiří Macas
Journal: Genetica Date: 2012-04-29 Impact factor: 1.082

9. Spatial variability in airborne bacterial communities across land-use types and their relationship to the bacterial communities of potential source environments.

Authors: Robert M Bowers; Shawna McLetchie; Rob Knight; Noah Fierer
Journal: ISME J Date: 2010-11-04 Impact factor: 10.302

10. Internal duplications in α-helical membrane protein topologies are common but the nonduplicated forms are rare.

Authors: Aron Hennerdal; Jenny Falk; Erik Lindahl; Arne Elofsson
Journal: Protein Sci Date: 2010-12 Impact factor: 6.725