Literature DB >> 15174140

Has the yo-yo stopped? An assessment of human protein-coding gene number.

Christopher Southan1.   

Abstract

Since the identification of approximately 25,000 proteins from the draft human genome assembly in 2001, estimates of the total have oscillated between 30,000 and 70,000. The recently announced genome closure has not generated a consensus gene count despite this being a key parameter for many areas of biology including drug target discovery and characterization of the human proteome. Contrary to earlier predictions of constitutive under-detection for eukaryotic genes, the latest model organism updates have produced minor increases in the worm but fly and yeast gene numbers have decreased. The postdraft, precompletion interval has produced large increases in human transcript coverage, continuous improvements in genome assembly and refinements in automated genomic annotation. Notably these enhancements have resulted in an Ensembl human protein-coding gene number of 22,184, a decrease of 1862 since the first release. Longitudinal database surveys indicate that redundancy-reduced human mRNA and protein collections are flattening out at approximately 28,000, although Ensembl maps approximately 20,000 known sequences. Observations suggest high-throughput cloning projects are predominantly extending known genes or sampling new splice forms and novel protein discovery has slowed to a trickle. The hypothesis that substantial numbers of short proteins remain experimentally and computationally undetected in mammalian genomes is neither supported by sequence data nor by the extensive homology between mouse and human proteins. Aggregating the independent annotations for complete transcripts from seven completed human chromosomes extrapolates to approximately 25,000 genes. The inclusion of partial putative genes would increase this to above 30,000 but recent data suggest these represent predominantly nonprotein-coding transcripts. Mass spectrometry-based proteomics has already verified more than 10% of human genes but has not identified significant numbers of unpredicted proteins. The available data are thus converging to a basal protein-coding gene number well below 30,000, which could even be as low as 25,000.

Entities:  

Mesh:

Substances:

Year:  2004        PMID: 15174140     DOI: 10.1002/pmic.200300700

Source DB:  PubMed          Journal:  Proteomics        ISSN: 1615-9853            Impact factor:   3.984


  19 in total

1.  A newly discovered human alpha-globin gene.

Authors:  Sung-Ho Goh; Y Terry Lee; Natarajan V Bhanu; Margaret C Cam; Richard Desper; Brian M Martin; Ramy Moharram; Robert B Gherman; Jeffery L Miller
Journal:  Blood       Date:  2005-04-26       Impact factor: 22.113

Review 2.  Genome and proteome annotation: organization, interpretation and integration.

Authors:  Gabrielle A Reeves; David Talavera; Janet M Thornton
Journal:  J R Soc Interface       Date:  2009-02-06       Impact factor: 4.118

3.  Host-microbial interactions and regulation of intestinal epithelial barrier function: From physiology to pathology.

Authors:  Linda Chia-Hui Yu; Jin-Town Wang; Shu-Chen Wei; Yen-Hsuan Ni
Journal:  World J Gastrointest Pathophysiol       Date:  2012-02-15

4.  Anti-neural antibody reactivity in patients with a history of Lyme borreliosis and persistent symptoms.

Authors:  Abhishek Chandra; Gary P Wormser; Mark S Klempner; Richard P Trevino; Mary K Crow; Norman Latov; Armin Alaedini
Journal:  Brain Behav Immun       Date:  2010-03-18       Impact factor: 7.217

Review 5.  Approaches for defining the Hsp90-dependent proteome.

Authors:  Steven D Hartson; Robert L Matts
Journal:  Biochim Biophys Acta       Date:  2011-08-27

6.  Ulysses - an application for the projection of molecular interactions across species.

Authors:  Danielle Kemmer; Yong Huang; Sohrab P Shah; Jonathan Lim; Jochen Brumm; Macaire M S Yuen; John Ling; Tao Xu; Wyeth W Wasserman; B F Francis Ouellette
Journal:  Genome Biol       Date:  2005-12-02       Impact factor: 13.583

Review 7.  LIM-homeodomain genes in mammalian development and human disease.

Authors:  Chad S Hunter; Simon J Rhodes
Journal:  Mol Biol Rep       Date:  2005-06       Impact factor: 2.316

8.  Penalized likelihood for sparse contingency tables with an application to full-length cDNA libraries.

Authors:  Corinne Dahinden; Giovanni Parmigiani; Mark C Emerick; Peter Bühlmann
Journal:  BMC Bioinformatics       Date:  2007-12-11       Impact factor: 3.169

9.  Schizophrenia susceptibility alleles are enriched for alleles that affect gene expression in adult human brain.

Authors:  A L Richards; L Jones; V Moskvina; G Kirov; P V Gejman; D F Levinson; A R Sanders; S Purcell; P M Visscher; N Craddock; M J Owen; P Holmans; M C O'Donovan
Journal:  Mol Psychiatry       Date:  2011-02-22       Impact factor: 15.992

Review 10.  Precision animal breeding.

Authors:  A P F Flint; J A Woolliams
Journal:  Philos Trans R Soc Lond B Biol Sci       Date:  2008-02-12       Impact factor: 6.237

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.