Literature DB >> 18059312

Towards completion of the Earth's proteome.

Carolina Perez-Iratxeta1, Gareth Palidwor, Miguel A Andrade-Navarro.   

Abstract

New protein sequences are deposited in databases at an accelerating pace; however, many of these are homologous to known proteins and could be considered redundant. If all historical releases of the protein database are analysed using the original sequence-clustering procedure described here, the fraction of newly sequenced proteins that are redundant is increasing. We interpret this as an indication that the sequencing of the Earth's proteome--the complete set of proteins on Earth--is approaching completion. We estimate the approximate size of the Earth's proteome to be 5 million sequences, most of which will be identified during the next 5 years. As the Earth's proteome nears completion, cluster analysis of the protein database will become essential to identify under-explored taxa to which future sequencing efforts should be directed and to focus research on protein families without experimental characterization.

Mesh:

Substances:

Year:  2007        PMID: 18059312      PMCID: PMC2267224          DOI: 10.1038/sj.embor.7401117

Source DB:  PubMed          Journal:  EMBO Rep        ISSN: 1469-221X            Impact factor:   8.807


  32 in total

1.  Cloning the soil metagenome: a strategy for accessing the genetic and functional diversity of uncultured microorganisms.

Authors:  M R Rondon; P R August; A D Bettermann; S F Brady; T H Grossman; M R Liles; K A Loiacono; B A Lynch; I A MacNeil; C Minor; C L Tiong; M Gilman; M S Osburne; J Clardy; J Handelsman; R M Goodman
Journal:  Appl Environ Microbiol       Date:  2000-06       Impact factor: 4.792

2.  The early days of DNA sequences.

Authors:  F Sanger
Journal:  Nat Med       Date:  2001-03       Impact factor: 53.440

Review 3.  Industrial biocatalysis today and tomorrow.

Authors:  A Schmid; J S Dordick; B Hauer; A Kiener; M Wubbolts; B Witholt
Journal:  Nature       Date:  2001-01-11       Impact factor: 49.962

4.  Completeness in structural genomics.

Authors:  D Vitkup; E Melamud; J Moult; C Sander
Journal:  Nat Struct Biol       Date:  2001-06

5.  Prokaryotic diversity--magnitude, dynamics, and controlling factors.

Authors:  Vigdis Torsvik; Lise Øvreås; Tron Frede Thingstad
Journal:  Science       Date:  2002-05-10       Impact factor: 47.728

Review 6.  Chemical strategies for functional proteomics.

Authors:  Gregory C Adam; Erik J Sorensen; Benjamin F Cravatt
Journal:  Mol Cell Proteomics       Date:  2002-10       Impact factor: 5.911

7.  A combined transmembrane topology and signal peptide prediction method.

Authors:  Lukas Käll; Anders Krogh; Erik L L Sonnhammer
Journal:  J Mol Biol       Date:  2004-05-14       Impact factor: 5.469

8.  Community structure and metabolism through reconstruction of microbial genomes from the environment.

Authors:  Gene W Tyson; Jarrod Chapman; Philip Hugenholtz; Eric E Allen; Rachna J Ram; Paul M Richardson; Victor V Solovyev; Edward M Rubin; Daniel S Rokhsar; Jillian F Banfield
Journal:  Nature       Date:  2004-02-01       Impact factor: 49.962

9.  The SWISS-PROT protein sequence data bank.

Authors:  A Bairoch; B Boeckmann
Journal:  Nucleic Acids Res       Date:  1991-04-25       Impact factor: 16.971

10.  BiasViz: visualization of amino acid biased regions in protein alignments.

Authors:  Matthew R Huska; Henrik Buschmann; Miguel A Andrade-Navarro
Journal:  Bioinformatics       Date:  2007-10-06       Impact factor: 6.937

View more
  11 in total

1.  Highly accurate and high-resolution function prediction of RNA binding proteins by fold recognition and binding affinity prediction.

Authors:  Huiying Zhao; Yuedong Yang; Yaoqi Zhou
Journal:  RNA Biol       Date:  2011-11-01       Impact factor: 4.652

Review 2.  Minireview: applied structural bioinformatics in proteomics.

Authors:  Yee Siew Choong; Gee Jun Tye; Theam Soon Lim
Journal:  Protein J       Date:  2013-10       Impact factor: 2.371

3.  Prediction and validation of the unexplored RNA-binding protein atlas of the human proteome.

Authors:  Huiying Zhao; Yuedong Yang; Sarath Chandra Janga; C Cheng Kao; Yaoqi Zhou
Journal:  Proteins       Date:  2013-11-22

4.  Bioinformatics and genomic analysis of transposable elements in eukaryotic genomes.

Authors:  Mateusz Janicki; Rebecca Rooke; Guojun Yang
Journal:  Chromosome Res       Date:  2011-08       Impact factor: 4.620

5.  Validation of coevolving residue algorithms via pipeline sensitivity analysis: ELSC and OMES and ZNMI, oh my!

Authors:  Christopher A Brown; Kevin S Brown
Journal:  PLoS One       Date:  2010-06-01       Impact factor: 3.240

6.  Quantitative global studies of reactomes and metabolomes using a vectorial representation of reactions and chemical compounds.

Authors:  Juan C Triviño; Florencio Pazos
Journal:  BMC Syst Biol       Date:  2010-04-20

7.  Linking genes to diseases: it's all in the data.

Authors:  Nicki Tiffin; Miguel A Andrade-Navarro; Carolina Perez-Iratxeta
Journal:  Genome Med       Date:  2009-08-07       Impact factor: 11.117

8.  Génie: literature-based gene prioritization at multi genomic scale.

Authors:  Jean-Fred Fontaine; Florian Priller; Adriano Barbosa-Silva; Miguel A Andrade-Navarro
Journal:  Nucleic Acids Res       Date:  2011-05-23       Impact factor: 16.971

9.  Preimplantation development regulatory pathway construction through a text-mining approach.

Authors:  Elisa Donnard; Adriano Barbosa-Silva; Rafael L M Guedes; Gabriel R Fernandes; Henrique Velloso; Matthew J Kohn; Miguel A Andrade-Navarro; J Miguel Ortega
Journal:  BMC Genomics       Date:  2011-12-22       Impact factor: 3.969

10.  Functional and genomic analyses of alpha-solenoid proteins.

Authors:  David Fournier; Gareth A Palidwor; Sergey Shcherbinin; Angelika Szengel; Martin H Schaefer; Carol Perez-Iratxeta; Miguel A Andrade-Navarro
Journal:  PLoS One       Date:  2013-11-21       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.