Literature DB >> 9092672

A comparison of expressed sequence tags (ESTs) to human genomic sequences.

T G Wolfsberg1, D Landsman.   

Abstract

The Expressed Sequence Tag (EST) division of GenBank, dbEST, is a large repository of the data being generated by human genome sequencing centers. ESTs are short, single pass cDNA sequences generated from randomly selected library clones. The approximately 415 000 human ESTs represent a valuable, low priced, and easily accessible biological reagent. As many ESTs are derived from yet uncharacterized genes, dbEST is a prime starting point for the identification of novel mRNAs. Conversely, other genes are represented by hundreds of ESTs, a redundancy which may provide data about rare mRNA isoforms. Here we present an analysis of >1000 ESTs generated by the WashU-Merck EST project. These ESTs were collected by querying dbEST with the genomic sequences of 15 human genes. When we aligned the matching ESTs to the genomic sequences, we found that in one gene, 73% of the ESTs which derive from spliced or partially spliced transcripts either contain intron sequences or are spliced at previously unreported sites; other genes have lower percentages of such ESTs, and some have none. This finding suggests that ESTs could provide researchers with novel information about alternative splicing in certain genes. In a related analysis of pairs of ESTs which are reported to derive from a single gene, we found that as many as 26% of the pairs do not BOTH align with the sequence of the same gene. We suspect that some of these unusual ESTs result from artifacts in EST generation, and caution researchers that they may find such clones while analyzing sequences in dbEST.

Entities:  

Mesh:

Substances:

Year:  1997        PMID: 9092672      PMCID: PMC146621          DOI: 10.1093/nar/25.8.1626

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  18 in total

Review 1.  A gene map of the human genome.

Authors:  G D Schuler; M S Boguski; E A Stewart; L D Stein; G Gyapay; K Rice; R E White; P Rodriguez-Tomé; A Aggarwal; E Bajorek; S Bentolila; B B Birren; A Butler; A B Castle; N Chiannilkulchai; A Chu; C Clee; S Cowles; P J Day; T Dibling; N Drouot; I Dunham; S Duprat; C East; C Edwards; J B Fan; N Fang; C Fizames; C Garrett; L Green; D Hadley; M Harris; P Harrison; S Brady; A Hicks; E Holloway; L Hui; S Hussain; C Louis-Dit-Sully; J Ma; A MacGilvery; C Mader; A Maratukulam; T C Matise; K B McKusick; J Morissette; A Mungall; D Muselet; H C Nusbaum; D C Page; A Peck; S Perkins; M Piercy; F Qin; J Quackenbush; S Ranby; T Reif; S Rozen; C Sanders; X She; J Silva; D K Slonim; C Soderlund; W L Sun; P Tabar; T Thangarajah; N Vega-Czarny; D Vollrath; S Voyticky; T Wilmer; X Wu; M D Adams; C Auffray; N A Walter; R Brandon; A Dehejia; P N Goodfellow; R Houlgatte; J R Hudson; S E Ide; K R Iorio; W Y Lee; N Seki; T Nagase; K Ishikawa; N Nomura; C Phillips; M H Polymeropoulos; M Sandusky; K Schmitt; R Berry; K Swanson; R Torres; J C Venter; J M Sikela; J S Beckmann; J Weissenbach; R M Myers; D R Cox; M R James; D Bentley; P Deloukas; E S Lander; T J Hudson
Journal:  Science       Date:  1996-10-25       Impact factor: 47.728

2.  Virtually sequenced: the next genomic generation.

Authors:  W Bains
Journal:  Nat Biotechnol       Date:  1996-06       Impact factor: 54.908

3.  Entrez: molecular biology database and retrieval system.

Authors:  G D Schuler; J A Epstein; H Ohkawa; J A Kans
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

4.  Generation and analysis of 280,000 human expressed sequence tags.

Authors:  L D Hillier; G Lennon; M Becker; M F Bonaldo; B Chiapelli; S Chissoe; N Dietrich; T DuBuque; A Favello; W Gish; M Hawkins; M Hultman; T Kucaba; M Lacy; M Le; N Le; E Mardis; B Moore; M Morris; J Parsons; C Prange; L Rifkin; T Rohlfing; K Schellenberg; M Bento Soares; F Tan; J Thierry-Meg; E Trevaskis; K Underwood; P Wohldman; R Waterston; R Wilson; M Marra
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

5.  Toward the development of a gene index to the human genome: an assessment of the nature of high-throughput EST sequence data.

Authors:  J S Aaronson; B Eckman; R A Blevins; J A Borkowski; J Myerson; S Imran; K O Elliston
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

Review 6.  Lessons learned, promises kept: a biologist's eye view of the Genome Project.

Authors:  S M Tilghman
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

7.  Normalization and subtraction: two approaches to facilitate gene discovery.

Authors:  M F Bonaldo; G Lennon; M B Soares
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

8.  Conserved initiator proteins in eukaryotes.

Authors:  K A Gavin; M Hidaka; B Stillman
Journal:  Science       Date:  1995-12-08       Impact factor: 47.728

Review 9.  A cell biological perspective on genome research.

Authors:  J R McIntosh; R R West
Journal:  J Cell Biol       Date:  1995-12       Impact factor: 10.539

10.  Budding yeast SKP1 encodes an evolutionarily conserved kinetochore protein required for cell cycle progression.

Authors:  C Connelly; P Hieter
Journal:  Cell       Date:  1996-07-26       Impact factor: 41.582

View more
  32 in total

1.  The intronerator: exploring introns and alternative splicing in Caenorhabditis elegans.

Authors:  W J Kent; A M Zahler
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Microarray data quality analysis: lessons from the AFGC project. Arabidopsis Functional Genomics Consortium.

Authors:  David Finkelstein; Rob Ewing; Jeremy Gollub; Fredrik Sterky; J Michael Cherry; Shauna Somerville
Journal:  Plant Mol Biol       Date:  2002-01       Impact factor: 4.076

3.  TissueInfo: high-throughput identification of tissue expression profiles and specificity.

Authors:  L Skrabanek; F Campagne
Journal:  Nucleic Acids Res       Date:  2001-11-01       Impact factor: 16.971

4.  The contribution of exon-skipping events on chromosome 22 to protein coding diversity.

Authors:  W A Hide; V N Babenko; P A van Heusden; C Seoighe; J F Kelso
Journal:  Genome Res       Date:  2001-11       Impact factor: 9.043

5.  Computational inference of homologous gene structures in the human genome.

Authors:  R F Yeh; L P Lim; C B Burge
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

6.  Gene structure prediction and alternative splicing analysis using genomically aligned ESTs.

Authors:  Z Kan; E C Rouchka; W R Gish; D J States
Journal:  Genome Res       Date:  2001-05       Impact factor: 9.043

7.  Selecting for functional alternative splices in ESTs.

Authors:  Zhengyan Kan; David States; Warren Gish
Journal:  Genome Res       Date:  2002-12       Impact factor: 9.043

8.  Comparison of whole genome assemblies of the human genome.

Authors:  Eric C Rouchka; Warren Gish; David J States
Journal:  Nucleic Acids Res       Date:  2002-11-15       Impact factor: 16.971

9.  Reevaluating human gene annotation: a second-generation analysis of chromosome 22.

Authors:  John E Collins; Melanie E Goward; Charlotte G Cole; Luc J Smink; Elizabeth J Huckle; Sarah Knowles; Jacqueline M Bye; David M Beare; Ian Dunham
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

10.  Increased noncanonical splicing of autoantigen transcripts provides the structural basis for expression of untolerized epitopes.

Authors:  Bernard Ng; Fan Yang; David P Huston; Yan Yan; Yu Yang; Zeyu Xiong; Leif E Peterson; Hong Wang; Xiao-Feng Yang
Journal:  J Allergy Clin Immunol       Date:  2004-12       Impact factor: 10.793

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.