Literature DB >> 10022984

An effective approach for analyzing "prefinished" genomic sequence data.

P M Kuehl1, J M Weisemann, J W Touchman, E D Green, M S Boguski.   

Abstract

Ongoing efforts to sequence the human genome are already generating large amounts of data, with substantial increases anticipated over the next few years. In most cases, a shotgun sequencing strategy is being used, which rapidly yields most of the primary sequence in incompletely assembled sequence contigs ("prefinished" sequence) and more slowly produces the final, completely assembled sequence ("finished" sequence). Thus, in general, prefinished sequence is produced in excess of finished sequence, and this trend is certain to continue and even accelerate over the next few years. Even at a prefinished stage, genomic sequence represents a rich source of important biological information that is of great interest to many investigators. However, analyzing such data is a challenging and daunting task, both because of its sheer volume and because it can change on a day-by-day basis. To facilitate the discovery and characterization of genes and other important elements within prefinished sequence, we have developed an analytical strategy and system that uses readily available software tools in new combinations. Implementation of this strategy for the analysis of prefinished sequence data from human chromosome 7 has demonstrated that this is a convenient, inexpensive, and extensible solution to the problem of analyzing the large amounts of preliminary data being produced by large-scale sequencing efforts. Our approach is accessible to any investigator who wishes to assimilate additional information about particular sequence data en route to developing richer annotations of a finished sequence.

Entities:  

Mesh:

Substances:

Year:  1999        PMID: 10022984      PMCID: PMC310715     

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  22 in total

Review 1.  A gene map of the human genome.

Authors:  G D Schuler; M S Boguski; E A Stewart; L D Stein; G Gyapay; K Rice; R E White; P Rodriguez-Tomé; A Aggarwal; E Bajorek; S Bentolila; B B Birren; A Butler; A B Castle; N Chiannilkulchai; A Chu; C Clee; S Cowles; P J Day; T Dibling; N Drouot; I Dunham; S Duprat; C East; C Edwards; J B Fan; N Fang; C Fizames; C Garrett; L Green; D Hadley; M Harris; P Harrison; S Brady; A Hicks; E Holloway; L Hui; S Hussain; C Louis-Dit-Sully; J Ma; A MacGilvery; C Mader; A Maratukulam; T C Matise; K B McKusick; J Morissette; A Mungall; D Muselet; H C Nusbaum; D C Page; A Peck; S Perkins; M Piercy; F Qin; J Quackenbush; S Ranby; T Reif; S Rozen; C Sanders; X She; J Silva; D K Slonim; C Soderlund; W L Sun; P Tabar; T Thangarajah; N Vega-Czarny; D Vollrath; S Voyticky; T Wilmer; X Wu; M D Adams; C Auffray; N A Walter; R Brandon; A Dehejia; P N Goodfellow; R Houlgatte; J R Hudson; S E Ide; K R Iorio; W Y Lee; N Seki; T Nagase; K Ishikawa; N Nomura; C Phillips; M H Polymeropoulos; M Sandusky; K Schmitt; R Berry; K Swanson; R Torres; J C Venter; J M Sikela; J S Beckmann; J Weissenbach; R M Myers; D R Cox; M R James; D Bentley; P Deloukas; E S Lander; T J Hudson
Journal:  Science       Date:  1996-10-25       Impact factor: 47.728

2.  Analysis of compositionally biased regions in sequence databases.

Authors:  J C Wootton; S Federhen
Journal:  Methods Enzymol       Date:  1996       Impact factor: 1.600

3.  The end of the beginning: the race to begin human genome sequencing.

Authors:  M Boguski; A Chakravarti; R Gibbs; E Green; R M Myers
Journal:  Genome Res       Date:  1996-09       Impact factor: 9.043

4.  Identification of protein coding regions in the human genome by quadratic discriminant analysis.

Authors:  M Q Zhang
Journal:  Proc Natl Acad Sci U S A       Date:  1997-01-21       Impact factor: 11.205

5.  Against a whole-genome shotgun.

Authors:  P Green
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

6.  Human whole-genome shotgun sequencing.

Authors:  J L Weber; E W Myers
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

7.  Prediction of complete gene structures in human genomic DNA.

Authors:  C Burge; S Karlin
Journal:  J Mol Biol       Date:  1997-04-25       Impact factor: 5.469

8.  A local alignment tool for very long DNA sequences.

Authors:  K M Chao; J Zhang; J Ostell; W Miller
Journal:  Comput Appl Biosci       Date:  1995-04

9.  A time to sequence.

Authors:  M V Olson
Journal:  Science       Date:  1995-10-20       Impact factor: 47.728

Review 10.  Issues in searching molecular sequence databases.

Authors:  S F Altschul; M S Boguski; W Gish; J C Wootton
Journal:  Nat Genet       Date:  1994-02       Impact factor: 38.330

View more
  1 in total

1.  Comparative analysis of the gene-dense ACHE/TFR2 region on human chromosome 7q22 with the orthologous region on mouse chromosome 5.

Authors:  M D Wilson; C Riemer; D W Martindale; P Schnupf; A P Boright; T L Cheung; D M Hardy; S Schwartz; S W Scherer; L C Tsui; W Miller; B F Koop
Journal:  Nucleic Acids Res       Date:  2001-03-15       Impact factor: 16.971

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.