Literature DB >> 10077536

Analysis of sequence-tagged-connector strategies for DNA sequencing.

A F Siegel1, B Trask, J C Roach, G G Mahairas, L Hood, G van den Engh.   

Abstract

The BAC-end sequencing, or sequence-tagged-connector (STC), approach to genome sequencing involves sequencing the ends of BAC inserts to scatter sequence tags (STCs) randomly across the genome. Once any BAC or other large segment of DNA is sequenced to completion by conventional shotgun approaches, these STC tags can be used to identify a minimum tiling path of BAC clones overlapping the nucleation sequence for sequence extension. Here, we explore the properties of STC-sequencing strategies within a mathematical model of a random target with homologous repeats and imperfect sequencing technology to understand the consequences of varying various parameters on the incidence of problem clones and the cost of the sequencing project. Problem clones are defined as clones for which either (A) there is no identifiable overlapping STC to extend the sequence in a particular direction or (B) the identified STC with minimum overlap comes from a nonoverlapping clone, either owing to random false matches or repeat-family homology. Based on the minimum overlap, we estimate the number of clones to be entirely sequenced and, then, using cost estimates, identify the decision rule (the degree of sequence similarity required before a match is declared between an STC and a clone) to minimize overall sequencing cost. A method to optimize the overlap decision rule is highly desirable, because both the total cost and the number of problem clones are shown to be highly sensitive to this choice. For a target of 3 Gb containing approximately 800 Mb of repeats with 85%-90% identity, we expect <10 problem clones with 15 times coverage by 150-kb clones. We derive the optimal redundancy and insert sizes of clone libraries for sequencing genomes of various sizes, from microbial to human. We estimate that establishing the resource of STCs as a means of identifying minimally overlapping clones represents only 1%-3% of the total cost of sequencing the human genome, and, up to a point of diminishing returns, a larger STC resource is associated with a smaller total sequencing cost.

Entities:  

Mesh:

Year:  1999        PMID: 10077536      PMCID: PMC310733     

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  12 in total

1.  Sequencing the human genome.

Authors:  L Rowen; G Mahairas; L Hood
Journal:  Science       Date:  1997-10-24       Impact factor: 47.728

2.  Optimization of restriction fragment DNA mapping.

Authors:  A F Siegel; J C Roach; C Magness; E Thayer; G van den Engh
Journal:  J Comput Biol       Date:  1998       Impact factor: 1.479

3.  Against a whole-genome shotgun.

Authors:  P Green
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

4.  Multiple-complete-digest restriction fragment mapping: generating sequence-ready maps for large-scale DNA sequencing.

Authors:  G K Wong; J Yu; E C Thayer; M V Olson
Journal:  Proc Natl Acad Sci U S A       Date:  1997-05-13       Impact factor: 11.205

5.  Human whole-genome shotgun sequencing.

Authors:  J L Weber; E W Myers
Journal:  Genome Res       Date:  1997-05       Impact factor: 9.043

6.  Automated DNA sequencing of the human HPRT locus.

Authors:  A Edwards; H Voss; P Rice; A Civitello; J Stegemann; C Schwager; J Zimmermann; H Erfle; C T Caskey; W Ansorge
Journal:  Genomics       Date:  1990-04       Impact factor: 5.736

7.  Pairwise end sequencing: a unified approach to genomic mapping and sequencing.

Authors:  J C Roach; C Boysen; K Wang; L Hood
Journal:  Genomics       Date:  1995-03-20       Impact factor: 5.736

8.  Ordered shotgun sequencing, a strategy for integrated mapping and sequencing of YAC clones.

Authors:  E Y Chen; D Schlessinger; J Kere
Journal:  Genomics       Date:  1993-09       Impact factor: 5.736

9.  Correlations between isochores and chromosomal bands in the human genome.

Authors:  S Saccone; A De Sario; J Wiegant; A K Raap; G Della Valle; G Bernardi
Journal:  Proc Natl Acad Sci U S A       Date:  1993-12-15       Impact factor: 11.205

10.  Genomic sequence sampling: a strategy for high resolution sequence-based physical mapping of complex genomes.

Authors:  M W Smith; A L Holmsen; Y H Wei; M Peterson; G A Evans
Journal:  Nat Genet       Date:  1994-05       Impact factor: 38.330

View more
  7 in total

1.  Sequence-tagged connectors: a sequence approach to mapping and scanning the human genome.

Authors:  G G Mahairas; J C Wallace; K Smith; S Swartzell; T Holzman; A Keller; R Shaker; J Furlong; J Young; S Zhao; M D Adams; L Hood
Journal:  Proc Natl Acad Sci U S A       Date:  1999-08-17       Impact factor: 11.205

Review 2.  Search and discovery strategies for biotechnology: the paradigm shift.

Authors:  A T Bull; A C Ward; M Goodfellow
Journal:  Microbiol Mol Biol Rev       Date:  2000-09       Impact factor: 11.056

3.  Pooled genomic indexing of rhesus macaque.

Authors:  Aleksandar Milosavljevic; Ronald A Harris; Erica J Sodergren; Andrew R Jackson; Ken J Kalafus; Anne Hodgson; Andrew Cree; Weilie Dai; Miklos Csuros; Baoli Zhu; Pieter J de Jong; George M Weinstock; Richard A Gibbs
Journal:  Genome Res       Date:  2005-02       Impact factor: 9.043

4.  Rice transposable elements: a survey of 73,000 sequence-tagged-connectors.

Authors:  L Mao; T C Wood; Y Yu; M A Budiman; J Tomkins; S Woo; M Sasinowski; G Presting; D Frisch; S Goff; R A Dean; R A Wing
Journal:  Genome Res       Date:  2000-07       Impact factor: 9.043

5.  Construction and utility of 10-kb libraries for efficient clone-gap closure for rice genome sequencing.

Authors:  Tae-Jin Yang; Yeisoo Yu; Gyoungju Nah; Michael Atkins; Seunghee Lee; David A Frisch; Rod A Wing
Journal:  Theor Appl Genet       Date:  2003-06-03       Impact factor: 5.699

6.  Mapping of Mcs30, a new mammary carcinoma susceptibility quantitative trait locus (QTL30) on rat chromosome 12: identification of fry as a candidate Mcs gene.

Authors:  Xuefeng Ren; Jessica C Graham; Lichen Jing; Andrei M Mikheev; Yuan Gao; Jenny Pan Lew; Hong Xie; Andrea S Kim; Xiuling Shang; Cynthia Friedman; Graham Vail; Ming Zhu Fang; Yana Bromberg; Helmut Zarbl
Journal:  PLoS One       Date:  2013-09-02       Impact factor: 3.240

Review 7.  Generation of physical map contig-specific sequences.

Authors:  Yanliang Jiang; Peng Xu; Zhanjiang Liu
Journal:  Front Genet       Date:  2014-07-22       Impact factor: 4.599

  7 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.