Literature DB >> 10835646

Gene index analysis of the human genome estimates approximately 120,000 genes.

F Liang1, I Holt, G Pertea, S Karamycheva, S L Salzberg, J Quackenbush.   

Abstract

Although sequencing of the human genome will soon be completed, gene identification and annotation remains a challenge. Early estimates suggested that there might be 60,000-100,000 (ref. 1) human genes, but recent analyses of the available data from EST sequencing projects have estimated as few as 45,000 (ref. 2) or as many as 140, 000 (ref. 3) distinct genes. The Chromosome 22 Sequencing Consortium estimated a minimum of 45,000 genes based on their annotation of the complete chromosome, although their data suggests there may be additional genes. The nearly 2,000,000 human ESTs in dbEST provide an important resource for gene identification and genome annotation, but these single-pass sequences must be carefully analysed to remove contaminating sequences, including those from genomic DNA, spurious transcription, and vector and bacterial sequences. We have developed a highly refined and rigorously tested protocol for cleaning, clustering and assembling EST sequences to produce high-fidelity consensus sequences for the represented genes (F.L. et al., manuscript submitted) and used this to create the TIGR Gene Indices-databases of expressed genes for human, mouse, rat and other species (http://www.tigr.org/tdb/tgi.html). Using highly refined and tested algorithms for EST analysis, we have arrived at two independent estimates indicating the human genome contains approximately 120,000 genes.

Entities:  

Mesh:

Year:  2000        PMID: 10835646     DOI: 10.1038/76126

Source DB:  PubMed          Journal:  Nat Genet        ISSN: 1061-4036            Impact factor:   38.330


  69 in total

1.  Determination of the number of conserved chromosomal segments between species.

Authors:  S Kumar; S R Gadagkar; A Filipski; X Gu
Journal:  Genetics       Date:  2001-03       Impact factor: 4.562

2.  Gene2EST: a BLAST2 server for searching expressed sequence tag (EST) databases with eukaryotic gene-sized queries.

Authors:  C Gemünd; C Ramu; B Altenberg-Greulich; T J Gibson
Journal:  Nucleic Acids Res       Date:  2001-03-15       Impact factor: 16.971

3.  Identification of human chromosome 22 transcribed sequences with ORF expressed sequence tags.

Authors:  S J de Souza; A A Camargo; M R Briones; F F Costa; M A Nagai; S Verjovski-Almeida; M A Zago; L E Andrade; H Carrer; H F El-Dorry; E M Espreafico; A Habr-Gama; D Giannella-Neto; G H Goldman; A Gruber; C Hackel; E T Kimura; R M Maciel; S K Marie; E A Martins; M P Nobrega; M L Paco-Larson; M I Pardini; G G Pereira; J B Pesquero; V Rodrigues; S R Rogatto; I D da Silva; M C Sogayar; M de Fátima Sonati; E H Tajara; S R Valentini; M Acencio; F L Alberto; M E Amaral; I Aneas; M H Bengtson; D M Carraro; A F Carvalho; L H Carvalho; J M Cerutti; M L Corrêa; M C Costa; C Curcio; T Gushiken; P L Ho; E Kimura; L C Leite; G Maia; P Majumder; M Marins; A Matsukuma; A S Melo; C A Mestriner; E C Miracca; D C Miranda; A N Nascimento; F G Nóbrega; E P Ojopi; J R Pandolfi; L G Pessoa; P Rahal; C A Rainho; N da Rós; R G de Sá; M M Sales; N P da Silva; T C Silva; W da Silva; D F Simão; J F Sousa; D Stecconi; F Tsukumo; V Valente; H Zalcbeg; R R Brentani; F L Reis; E Dias-Neto; A J Simpson
Journal:  Proc Natl Acad Sci U S A       Date:  2000-11-07       Impact factor: 11.205

4.  Phylogenetic analysis of T-Box genes demonstrates the importance of amphioxus for understanding evolution of the vertebrate genome.

Authors:  I Ruvinsky; L M Silver; J J Gibson-Brown
Journal:  Genetics       Date:  2000-11       Impact factor: 4.562

5.  Genes, isochores and bands in human chromosomes 21 and 22.

Authors:  S Saccone; A Pavlicek; C Federico; J Paces; G Bernard
Journal:  Chromosome Res       Date:  2001       Impact factor: 5.239

Review 6.  Molecular pathology of solid tumours: some practical suggestions for translating research into clinical practice.

Authors:  I P Tomlinson; M Ilyas
Journal:  Mol Pathol       Date:  2001-08

7.  Genome-wide detection of alternative splicing in expressed sequences of human genes.

Authors:  B Modrek; A Resch; C Grasso; C Lee
Journal:  Nucleic Acids Res       Date:  2001-07-01       Impact factor: 16.971

8.  The Gene Resource Locator: gene locus maps for transcriptome analysis.

Authors:  Toshihiko Honkura; Jun Ogasawara; Tomoyuki Yamada; Shinichi Morishita
Journal:  Nucleic Acids Res       Date:  2002-01-01       Impact factor: 16.971

9.  A question of size: the eukaryotic proteome and the problems in defining it.

Authors:  Paul M Harrison; Anuj Kumar; Ning Lang; Michael Snyder; Mark Gerstein
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

10.  A comparative molecular analysis of developing mouse forelimbs and hindlimbs using serial analysis of gene expression (SAGE).

Authors:  E H Margulies; S L Kardia; J W Innis
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.