
Clustering of highly homologous sequences to reduce the size of large protein databases.

W Li, L Jaroszewski, A Godzik.

Abstract

We present a fast and flexible program for clustering large protein databases at different sequence identity levels. It takes less than 2 h for the all-against-all sequence comparison and clustering of the non-redundant protein database of over 560,000 sequences on a high-end PC. The output database, including only the representative sequences, can be used for more efficient and sensitive database searches.

Year:  2001        PMID: 11294794     DOI: 10.1093/bioinformatics/17.3.282

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  312 in total

1.  MitoProteome: mitochondrial protein sequence database and annotation system.

Authors:  Dawn Cotter; Purnima Guda; Eoin Fahy; Shankar Subramaniam
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

2.  The distribution and query systems of the RCSB Protein Data Bank.

Authors:  Philip E Bourne; Kenneth J Addess; Wolfgang F Bluhm; Li Chen; Nita Deshpande; Zukang Feng; Ward Fleri; Rachel Green; Jeffrey C Merino-Ott; Wayne Townsend-Merino; Helge Weissig; John Westbrook; Helen M Berman
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

3.  Loss of the anaphase-promoting complex in quiescent cells causes unscheduled hepatocyte proliferation.

Authors:  Karin G Wirth; Romeo Ricci; Juan F Giménez-Abián; Shahryar Taghybeeglu; Nobuaki R Kudo; Wolfram Jochum; Mireille Vasseur-Cognet; Kim Nasmyth
Journal:  Genes Dev       Date:  2004-01-01       Impact factor: 11.361

4.  UniqueProt: Creating representative protein sequence sets.

Authors:  Sven Mika; Burkhard Rost
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

5.  Sensitivity and selectivity in protein structure comparison.

Authors:  Michael L Sierk; William R Pearson
Journal:  Protein Sci       Date:  2004-03       Impact factor: 6.725

6.  Colonic microbiome is altered in alcoholism.

Authors:  Ece A Mutlu; Patrick M Gillevet; Huzefa Rangwala; Masoumeh Sikaroodi; Ammar Naqvi; Phillip A Engen; Mary Kwasny; Cynthia K Lau; Ali Keshavarzian
Journal:  Am J Physiol Gastrointest Liver Physiol       Date:  2012-01-12       Impact factor: 4.052

7.  TBC: a clustering algorithm based on prokaryotic taxonomy.

Authors:  Jae-Hak Lee; Hana Yi; Yoon-Seong Jeon; Sungho Won; Jongsik Chun
Journal:  J Microbiol       Date:  2012-04-27       Impact factor: 3.422

8.  A widespread occurrence of extra open reading frames in plant Ty3/gypsy retrotransposons.

Authors:  Veronika Steinbauerová; Pavel Neumann; Petr Novák; Jiří Macas
Journal:  Genetica       Date:  2012-04-29       Impact factor: 1.082

9.  Spatial variability in airborne bacterial communities across land-use types and their relationship to the bacterial communities of potential source environments.

Authors:  Robert M Bowers; Shawna McLetchie; Rob Knight; Noah Fierer
Journal:  ISME J       Date:  2010-11-04       Impact factor: 10.302

10.  Internal duplications in α-helical membrane protein topologies are common but the nonduplicated forms are rare.

Authors:  Aron Hennerdal; Jenny Falk; Erik Lindahl; Arne Elofsson
Journal:  Protein Sci       Date:  2010-12       Impact factor: 6.725
