Analysis of the quality and utility of random shotgun sequencing at low redundancies.

J Bouck, W Miller, J H Gorrell, D Muzny, R A Gibbs.

Abstract

The currently favored approach for sequencing the human genome involves selecting representative large-insert clones (100-200 kb), randomly shearing this DNA to construct shotgun libraries, and then sequencing many different isolates from the library. This method, entitled directed random shotgun sequencing, requires highly redundant sequencing to obtain a complete and accurate finished consensus sequence. Recently it has been suggested that a rapidly generated lower redundancy sequence might be of use to the scientific community. Low-redundancy sequencing has been examined previously using simulated data sets. Here we utilize trace data from a number of projects submitted to GenBank to perform reconstruction experiments that mimic low-redundancy sequencing. These low-redundancy sequences have been examined for the completeness and quality of the consensus product, information content, and usefulness for interspecies comparisons. The data presented here suggest three different sequencing strategies, each with different utilities. (1) Nearly complete sequence data can be obtained by sequencing a random shotgun library at sixfold redundancy. This may therefore represent a good point to switch from a random to directed approach. (2) Sequencing can be performed with as little as twofold redundancy to find most of the information about exons, EST hits, and putative exon similarity matches. (3) To obtain contiguity of coding regions, sequencing at three- to fourfold redundancy would be appropriate. From these results, we suggest that a useful intermediate product for genome sequencing might be obtained by three- to fourfold redundancy. Such a product would allow a large amount of biologically useful data to be extracted while postponing the majority of work involved in producing a high quality consensus sequence.
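The relationship between redundancy and completeness discussed above can be illustrated with the idealized Lander-Waterman (Poisson) model. This is a minimal sketch and not the reconstruction procedure used in the study, which worked from real trace data. It assumes reads land uniformly at random and ignores the minimum overlap needed to join reads. The 150 kb clone length is chosen from within the 100-200 kb range given above, and the 500-base read length is a hypothetical value.

```python
import math


def fraction_covered(redundancy: float) -> float:
    """Expected fraction of clone bases hit by at least one read.

    Idealized Poisson model: P(base uncovered) = exp(-redundancy).
    """
    if redundancy < 0:
        raise ValueError("redundancy must be non-negative")
    return 1.0 - math.exp(-redundancy)


def expected_contigs(redundancy: float, clone_length: int, read_length: int) -> float:
    """Expected number of contigs, N * exp(-c), with N = c * G / L reads.

    Ignores the minimum detectable overlap, so this is only a rough guide.
    """
    if read_length <= 0 or clone_length <= 0:
        raise ValueError("lengths must be positive")
    n_reads = redundancy * clone_length / read_length
    return n_reads * math.exp(-redundancy)


if __name__ == "__main__":
    CLONE_LENGTH = 150_000  # assumed; within the 100-200 kb range in the abstract
    READ_LENGTH = 500       # hypothetical read length, not taken from the source
    for c in (2, 3, 4, 6):
        covered = fraction_covered(c)
        contigs = expected_contigs(c, CLONE_LENGTH, READ_LENGTH)
        print(f"{c}-fold: {covered:.1%} of bases covered, ~{contigs:.0f} contigs expected")
```

Under these assumptions the expected coverage is roughly 86% at twofold, 95% at threefold, 98% at fourfold, and 99.8% at sixfold redundancy. Real shotgun libraries deviate from the model because of cloning bias, repeats, and variable read quality, which is why reconstruction from actual trace data is informative.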

Year: 1998; PMID: 9799794; PMCID: PMC310787; DOI: 10.1101/gr.8.10.1074

Source DB: PubMed; Journal: Genome Res; ISSN: 1088-9051; Impact factor: 9.043


