Literature DB >> 11337484

Assembly, annotation, and integration of UNIGENE clusters into the human genome draft.

D Zhuo1, W D Zhao, F A Wright, H Y Yang, J P Wang, R Sears, T Baer, D H Kwon, D Gordon, S Gibbs, D Dai, Q Yang, J Spitzner, R Krahe, D Stredney, A Stutz, B Yuan.   

Abstract

The recent release of the first draft of the human genome provides an unprecedented opportunity to integrate human genes and their functions in a complete positional context. However, at least three significant technical hurdles remain: first, to assemble a complete and nonredundant human transcript index; second, to accurately place the individual transcript indices on the human genome; and third, to functionally annotate all human genes. Here, we report the extension of the UNIGENE database through the assembly of its sequence clusters into nonredundant sequence contigs. Each resulting consensus was aligned to the human genome draft. A unique location for each transcript within the human genome was determined by the integration of the restriction fingerprint, assembled genomic contig, and radiation hybrid (RH) maps. A total of 59,500 UNIGENE clusters were mapped on the basis of at least three independent criteria as compared with the 30,000 human genes/ESTs currently mapped in Genemap'99. Finally, the extension of the human transcript consensus in this study enabled a greater number of putative functional assignments than the 11,000 annotated entries in UNIGENE. This study reports a draft physical map with annotations for a majority of the human transcripts, called the Human Index of Nonredundant Transcripts (HINT). Such information can be immediately applied to the discovery of new genes and the identification of candidate genes for positional cloning.

Entities:  

Mesh:

Year:  2001        PMID: 11337484      PMCID: PMC311045          DOI: 10.1101/gr.gr-1645r

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  31 in total

1.  The Pfam protein families database.

Authors:  A Bateman; E Birney; R Durbin; S R Eddy; K L Howe; E L Sonnhammer
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

2.  Representation of functional information in the SWISS-PROT data bank.

Authors:  V L Junker; R Apweiler; A Bairoch
Journal:  Bioinformatics       Date:  1999-12       Impact factor: 6.937

3.  Gene index analysis of the human genome estimates approximately 120,000 genes.

Authors:  F Liang; I Holt; G Pertea; S Karamycheva; S L Salzberg; J Quackenbush
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

4.  Analysis of expressed sequence tags indicates 35,000 human genes.

Authors:  B Ewing; P Green
Journal:  Nat Genet       Date:  2000-06       Impact factor: 38.330

5.  Reliable identification of large numbers of candidate SNPs from public EST data.

Authors:  K H Buetow; M N Edmonson; A B Cassidy
Journal:  Nat Genet       Date:  1999-03       Impact factor: 38.330

6.  Repeat polymorphisms within gene regions: phenotypic and evolutionary implications.

Authors:  J D Wren; E Forgacs; J W Fondon; A Pertsemlidis; S Y Cheng; T Gallardo; R S Williams; R V Shohet; J D Minna; H R Garner
Journal:  Am J Hum Genet       Date:  2000-07-07       Impact factor: 11.025

7.  The SWISS-PROT protein sequence database and its supplement TrEMBL in 2000.

Authors:  A Bairoch; R Apweiler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

8.  The TIGR gene indices: reconstruction and representation of expressed gene sequences.

Authors:  J Quackenbush; F Liang; I Holt; G Pertea; J Upton
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

9.  Frequent alternative splicing of human genes.

Authors:  A A Mironov; J W Fickett; M S Gelfand
Journal:  Genome Res       Date:  1999-12       Impact factor: 9.043

10.  A comprehensive approach to clustering of expressed human gene sequence: the sequence tag alignment and consensus knowledge base.

Authors:  R T Miller; A G Christoffels; C Gopalakrishnan; J Burke; A A Ptitsyn; T R Broveak; W A Hide
Journal:  Genome Res       Date:  1999-11       Impact factor: 9.043

View more
  26 in total

1.  Oligo(dT) primer generates a high frequency of truncated cDNAs through internal poly(A) priming during reverse transcription.

Authors:  Douglas Kyung Nam; Sanggyu Lee; Guolin Zhou; Xiaohong Cao; Clarence Wang; Terry Clark; Jianjun Chen; Janet D Rowley; San Ming Wang
Journal:  Proc Natl Acad Sci U S A       Date:  2002-04-23       Impact factor: 11.205

Review 2.  Open-system approaches to gene expression in the CNS.

Authors:  J G Sutcliffe
Journal:  J Neurosci       Date:  2001-11-01       Impact factor: 6.167

3.  PipeOnline 2.0: automated EST processing and functional data sorting.

Authors:  Patricia Ayoubi; Xiaojing Jin; Saul Leite; Xianghui Liu; Jeson Martajaja; Abdurashid Abduraham; Qiaolan Wan; Wei Yan; Eduardo Misawa; Rolf A Prade
Journal:  Nucleic Acids Res       Date:  2002-11-01       Impact factor: 16.971

Review 4.  Application of DNA microarrays in pharmacogenomics and toxicogenomics.

Authors:  Khew-Voon Chin; A N Tony Kong
Journal:  Pharm Res       Date:  2002-12       Impact factor: 4.200

5.  ESTAnnotator: A tool for high throughput EST annotation.

Authors:  Agnes Hotz-Wagenblatt; Thomas Hankeln; Peter Ernst; Karl-Heinz Glatting; Erwin R Schmidt; Sándor Suhai
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

6.  Splice variation in mouse full-length cDNAs identified by mapping to the mouse genome.

Authors:  Mihaela Zavolan; Erik van Nimwegen; Terry Gaasterland
Journal:  Genome Res       Date:  2002-09       Impact factor: 9.043

7.  The Alternative Splicing Gallery (ASG): bridging the gap between genome and transcriptome.

Authors:  Jeremy Leipzig; Pavel Pevzner; Steffen Heber
Journal:  Nucleic Acids Res       Date:  2004-08-03       Impact factor: 16.971

8.  The multiassembly problem: reconstructing multiple transcript isoforms from EST fragment mixtures.

Authors:  Yi Xing; Alissa Resch; Christopher Lee
Journal:  Genome Res       Date:  2004-02-12       Impact factor: 9.043

9.  Comparing cDNA and oligonucleotide array data: concordance of gene expression across platforms for the NCI-60 cancer cells.

Authors:  Jae K Lee; Kimberly J Bussey; Fuad G Gwadry; William Reinhold; Gregory Riddick; Sandra L Pelletier; Satoshi Nishizuka; Gergely Szakacs; Jean-Phillipe Annereau; Uma Shankavaram; Samir Lababidi; Lawrence H Smith; Michael M Gottesman; John N Weinstein
Journal:  Genome Biol       Date:  2003-11-25       Impact factor: 13.583

10.  A conserved gene structure and expression regulation of miR-433 and miR-127 in mammals.

Authors:  Guisheng Song; Li Wang
Journal:  PLoS One       Date:  2009-11-25       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.