Literature DB >> 20123915

A new strategy for genome assembly using short sequence reads and reduced representation libraries.

Andrew L Young1, Hatice Ozel Abaan, Daniel Zerbino, James C Mullikin, Ewan Birney, Elliott H Margulies.   

Abstract

We have developed a novel approach for using massively parallel short-read sequencing to generate fast and inexpensive de novo genomic assemblies comparable to those generated by capillary-based methods. The ultrashort (<100 base) sequences generated by this technology pose specific biological and computational challenges for de novo assembly of large genomes. To account for this, we devised a method for experimentally partitioning the genome using reduced representation (RR) libraries prior to assembly. We use two restriction enzymes independently to create a series of overlapping fragment libraries, each containing a tractable subset of the genome. Together, these libraries allow us to reassemble the entire genome without the need of a reference sequence. As proof of concept, we applied this approach to sequence and assembled the majority of the 125-Mb Drosophila melanogaster genome. We subsequently demonstrate the accuracy of our assembly method with meaningful comparisons against the current available D. melanogaster reference genome (dm3). The ease of assembly and accuracy for comparative genomics suggest that our approach will scale to future mammalian genome-sequencing efforts, saving both time and money without sacrificing quality.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 20123915      PMCID: PMC2813480          DOI: 10.1101/gr.097956.109

Source DB:  PubMed          Journal:  Genome Res        ISSN: 1088-9051            Impact factor:   9.043


  32 in total

1.  SSAHA: a fast search method for large DNA databases.

Authors:  Z Ning; A J Cox; J C Mullikin
Journal:  Genome Res       Date:  2001-10       Impact factor: 9.043

2.  An intermediate grade of finished genomic sequence suitable for comparative analyses.

Authors:  Robert W Blakesley; Nancy F Hansen; James C Mullikin; Pamela J Thomas; Jennifer C McDowell; Baishali Maskeri; Alice C Young; Beatrice Benjamin; Shelise Y Brooks; Bradley I Coleman; Jyoti Gupta; Shi-Ling Ho; Eric M Karlins; Quino L Maduro; Sirintorn Stantripop; Cyrus Tsurgeon; Jennifer L Vogt; Michelle A Walker; Catherine A Masiello; Xiaobin Guan; Gerard G Bouffard; Eric D Green
Journal:  Genome Res       Date:  2004-10-12       Impact factor: 9.043

3.  ABySS: a parallel assembler for short read sequence data.

Authors:  Jared T Simpson; Kim Wong; Shaun D Jackman; Jacqueline E Schein; Steven J M Jones; Inanç Birol
Journal:  Genome Res       Date:  2009-02-27       Impact factor: 9.043

4.  Construction of a general human chromosome jumping library, with application to cystic fibrosis.

Authors:  F S Collins; M L Drumm; J L Cole; W K Lockwood; G F Vande Woude; M C Iannuzzi
Journal:  Science       Date:  1987-02-27       Impact factor: 47.728

5.  Intraspecific nuclear DNA variation in Drosophila.

Authors:  E N Moriyama; J R Powell
Journal:  Mol Biol Evol       Date:  1996-01       Impact factor: 16.240

6.  The phusion assembler.

Authors:  James C Mullikin; Zemin Ning
Journal:  Genome Res       Date:  2003-01       Impact factor: 9.043

7.  A model of the statistical power of comparative genome sequence analysis.

Authors:  Sean R Eddy
Journal:  PLoS Biol       Date:  2005-01-04       Impact factor: 8.029

8.  FlyBase: genes and gene models.

Authors:  Rachel A Drysdale; Madeline A Crosby
Journal:  Nucleic Acids Res       Date:  2005-01-01       Impact factor: 16.971

9.  The genome sequence of Drosophila melanogaster.

Authors:  M D Adams; S E Celniker; R A Holt; C A Evans; J D Gocayne; P G Amanatides; S E Scherer; P W Li; R A Hoskins; R F Galle; R A George; S E Lewis; S Richards; M Ashburner; S N Henderson; G G Sutton; J R Wortman; M D Yandell; Q Zhang; L X Chen; R C Brandon; Y H Rogers; R G Blazej; M Champe; B D Pfeiffer; K H Wan; C Doyle; E G Baxter; G Helt; C R Nelson; G L Gabor; J F Abril; A Agbayani; H J An; C Andrews-Pfannkoch; D Baldwin; R M Ballew; A Basu; J Baxendale; L Bayraktaroglu; E M Beasley; K Y Beeson; P V Benos; B P Berman; D Bhandari; S Bolshakov; D Borkova; M R Botchan; J Bouck; P Brokstein; P Brottier; K C Burtis; D A Busam; H Butler; E Cadieu; A Center; I Chandra; J M Cherry; S Cawley; C Dahlke; L B Davenport; P Davies; B de Pablos; A Delcher; Z Deng; A D Mays; I Dew; S M Dietz; K Dodson; L E Doup; M Downes; S Dugan-Rocha; B C Dunkov; P Dunn; K J Durbin; C C Evangelista; C Ferraz; S Ferriera; W Fleischmann; C Fosler; A E Gabrielian; N S Garg; W M Gelbart; K Glasser; A Glodek; F Gong; J H Gorrell; Z Gu; P Guan; M Harris; N L Harris; D Harvey; T J Heiman; J R Hernandez; J Houck; D Hostin; K A Houston; T J Howland; M H Wei; C Ibegwam; M Jalali; F Kalush; G H Karpen; Z Ke; J A Kennison; K A Ketchum; B E Kimmel; C D Kodira; C Kraft; S Kravitz; D Kulp; Z Lai; P Lasko; Y Lei; A A Levitsky; J Li; Z Li; Y Liang; X Lin; X Liu; B Mattei; T C McIntosh; M P McLeod; D McPherson; G Merkulov; N V Milshina; C Mobarry; J Morris; A Moshrefi; S M Mount; M Moy; B Murphy; L Murphy; D M Muzny; D L Nelson; D R Nelson; K A Nelson; K Nixon; D R Nusskern; J M Pacleb; M Palazzolo; G S Pittman; S Pan; J Pollard; V Puri; M G Reese; K Reinert; K Remington; R D Saunders; F Scheeler; H Shen; B C Shue; I Sidén-Kiamos; M Simpson; M P Skupski; T Smith; E Spier; A C Spradling; M Stapleton; R Strong; E Sun; R Svirskas; C Tector; R Turner; E Venter; A H Wang; X Wang; Z Y Wang; D A Wassarman; G M Weinstock; J Weissenbach; S M Williams; K C Worley; D Wu; S Yang; Q A Yao; J Ye; R F Yeh; J S Zaveri; M Zhan; G Zhang; Q Zhao; L Zheng; X H Zheng; F N Zhong; W Zhong; X Zhou; S Zhu; X Zhu; H O Smith; R A Gibbs; E W Myers; G M Rubin; J C Venter
Journal:  Science       Date:  2000-03-24       Impact factor: 47.728

10.  Finishing a whole-genome shotgun: release 3 of the Drosophila melanogaster euchromatic genome sequence.

Authors:  Susan E Celniker; David A Wheeler; Brent Kronmiller; Joseph W Carlson; Aaron Halpern; Sandeep Patel; Mark Adams; Mark Champe; Shannon P Dugan; Erwin Frise; Ann Hodgson; Reed A George; Roger A Hoskins; Todd Laverty; Donna M Muzny; Catherine R Nelson; Joanne M Pacleb; Soo Park; Barret D Pfeiffer; Stephen Richards; Erica J Sodergren; Robert Svirskas; Paul E Tabor; Kenneth Wan; Mark Stapleton; Granger G Sutton; Craig Venter; George Weinstock; Steven E Scherer; Eugene W Myers; Richard A Gibbs; Gerald M Rubin
Journal:  Genome Biol       Date:  2002-12-23       Impact factor: 13.583

View more
  15 in total

1.  A strategy for direct mapping and identification of mutations by whole-genome sequencing.

Authors:  Steven Zuryn; Stéphanie Le Gras; Karine Jamet; Sophie Jarriault
Journal:  Genetics       Date:  2010-07-06       Impact factor: 4.562

Review 2.  Reconstructing ancient genomes and epigenomes.

Authors:  Ludovic Orlando; M Thomas P Gilbert; Eske Willerslev
Journal:  Nat Rev Genet       Date:  2015-06-09       Impact factor: 53.242

3.  Reference-guided assembly of four diverse Arabidopsis thaliana genomes.

Authors:  Korbinian Schneeberger; Stephan Ossowski; Felix Ott; Juliane D Klein; Xi Wang; Christa Lanz; Lisa M Smith; Jun Cao; Joffrey Fitz; Norman Warthmann; Stefan R Henz; Daniel H Huson; Detlef Weigel
Journal:  Proc Natl Acad Sci U S A       Date:  2011-06-06       Impact factor: 11.205

Review 4.  Challenges of sequencing human genomes.

Authors:  Daniel C Koboldt; Li Ding; Elaine R Mardis; Richard K Wilson
Journal:  Brief Bioinform       Date:  2010-06-02       Impact factor: 11.622

5.  Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus Saccharum.

Authors:  Clícia Grativol; Michael Regulski; Marcelo Bertalan; W Richard McCombie; Felipe Rodrigues da Silva; Adhemar Zerlotini Neto; Renato Vicentini; Laurent Farinelli; Adriana Silva Hemerly; Robert A Martienssen; Paulo Cavalcanti Gomes Ferreira
Journal:  Plant J       Date:  2014-06-17       Impact factor: 6.417

6.  Multiple changes in peptide and lipid expression associated with regeneration in the nervous system of the medicinal leech.

Authors:  Céline Meriaux; Karim Arafah; Aurélie Tasiemski; Maxence Wisztorski; Jocelyne Bruand; Céline Boidin-Wichlacz; Annie Desmons; Delphine Debois; Olivier Laprévote; Alain Brunelle; Terry Gaasterland; Eduardo Macagno; Isabelle Fournier; Michel Salzet
Journal:  PLoS One       Date:  2011-04-22       Impact factor: 3.240

7.  Positional information resolves structural variations and uncovers an evolutionarily divergent genetic locus in accessions of Arabidopsis thaliana.

Authors:  Alvina G Lai; Matthew Denton-Giles; Bernd Mueller-Roeber; Jos H M Schippers; Paul P Dijkwel
Journal:  Genome Biol Evol       Date:  2011-05-27       Impact factor: 3.416

8.  Evolution in agriculture: the application of evolutionary approaches to the management of biotic interactions in agro-ecosystems.

Authors:  Peter H Thrall; John G Oakeshott; Gary Fitt; Simon Southerton; Jeremy J Burdon; Andy Sheppard; Robyn J Russell; Myron Zalucki; Mikko Heino; R Ford Denison
Journal:  Evol Appl       Date:  2011-03       Impact factor: 5.183

9.  A combinatorial approach to the restriction of a mouse genome.

Authors:  Leonid V Bystrykh
Journal:  BMC Res Notes       Date:  2013-07-22

10.  Draft Genome Sequence of Kurthia huakuii LAM0618T, an Organic-Pollutant-Degrading Strain Isolated from Biogas Slurry.

Authors:  Zhiyong Ruan; Yanwei Wang; Jinlong Song; Yi Zhai; Chi Zhang; Chao Chen; Yanting Li; Bingqiang Zhao; Bin Zhao
Journal:  Genome Announc       Date:  2014-01-16
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.