Literature DB >> 21646520

Reference-guided assembly of four diverse Arabidopsis thaliana genomes.

Korbinian Schneeberger1, Stephan Ossowski, Felix Ott, Juliane D Klein, Xi Wang, Christa Lanz, Lisa M Smith, Jun Cao, Joffrey Fitz, Norman Warthmann, Stefan R Henz, Daniel H Huson, Detlef Weigel.   

Abstract

We present whole-genome assemblies of four divergent Arabidopsis thaliana strains that complement the 125-Mb reference genome sequence released a decade ago. Using a newly developed reference-guided approach, we assembled large contigs from 9 to 42 Gb of Illumina short-read data from the Landsberg erecta (Ler-1), C24, Bur-0, and Kro-0 strains, which have been sequenced as part of the 1,001 Genomes Project for this species. Using alignments against the reference sequence, we first reduced the complexity of the de novo assembly and later integrated reads without similarity to the reference sequence. As an example, half of the noncentromeric C24 genome was covered by scaffolds that are longer than 260 kb, with a maximum of 2.2 Mb. Moreover, over 96% of the reference genome was covered by the reference-guided assembly, compared with only 87% with a complete de novo assembly. Comparisons with 2 Mb of dideoxy sequence reveal that the per-base error rate of the reference-guided assemblies was below 1 in 10,000. Our assemblies provide a detailed, genomewide picture of large-scale differences between A. thaliana individuals, most of which are difficult to access with alignment-consensus methods only. We demonstrate their practical relevance in studying the expression differences of polymorphic genes and show how the analysis of sRNA sequencing data can lead to erroneous conclusions if aligned against the reference genome alone. Genome assemblies, raw reads, and further information are accessible through http://1001genomes.org/projects/assemblies.html.

Entities:  

Mesh:

Year:  2011        PMID: 21646520      PMCID: PMC3121819          DOI: 10.1073/pnas.1107739108

Source DB:  PubMed          Journal:  Proc Natl Acad Sci U S A        ISSN: 0027-8424            Impact factor:   11.205


  42 in total

1.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

2.  Fine-scale structural variation of the human genome.

Authors:  Eray Tuzun; Andrew J Sharp; Jeffrey A Bailey; Rajinder Kaul; V Anne Morrison; Lisa M Pertz; Eric Haugen; Hillary Hayden; Donna Albertson; Daniel Pinkel; Maynard V Olson; Evan E Eichler
Journal:  Nat Genet       Date:  2005-05-15       Impact factor: 38.330

3.  Short read fragment assembly of bacterial genomes.

Authors:  Mark J Chaisson; Pavel A Pevzner
Journal:  Genome Res       Date:  2007-12-14       Impact factor: 9.043

4.  Velvet: algorithms for de novo short read assembly using de Bruijn graphs.

Authors:  Daniel R Zerbino; Ewan Birney
Journal:  Genome Res       Date:  2008-03-18       Impact factor: 9.043

5.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data.

Authors:  Kai Wang; Mingyao Li; Dexter Hadley; Rui Liu; Joseph Glessner; Struan F A Grant; Hakon Hakonarson; Maja Bucan
Journal:  Genome Res       Date:  2007-10-05       Impact factor: 9.043

6.  Identification of somatically acquired rearrangements in cancer using genome-wide massively parallel paired-end sequencing.

Authors:  Peter J Campbell; Philip J Stephens; Erin D Pleasance; Sarah O'Meara; Heng Li; Thomas Santarius; Lucy A Stebbings; Catherine Leroy; Sarah Edkins; Claire Hardy; Jon W Teague; Andrew Menzies; Ian Goodhead; Daniel J Turner; Christopher M Clee; Michael A Quail; Antony Cox; Clive Brown; Richard Durbin; Matthew E Hurles; Paul A W Edwards; Graham R Bignell; Michael R Stratton; P Andrew Futreal
Journal:  Nat Genet       Date:  2008-04-27       Impact factor: 38.330

7.  Paired-end mapping reveals extensive structural variation in the human genome.

Authors:  Jan O Korbel; Alexander Eckehart Urban; Jason P Affourtit; Brian Godwin; Fabian Grubert; Jan Fredrik Simons; Philip M Kim; Dean Palejev; Nicholas J Carriero; Lei Du; Bruce E Taillon; Zhoutao Chen; Andrea Tanzer; A C Eugenia Saunders; Jianxiang Chi; Fengtang Yang; Nigel P Carter; Matthew E Hurles; Sherman M Weissman; Timothy T Harkins; Mark B Gerstein; Michael Egholm; Michael Snyder
Journal:  Science       Date:  2007-09-27       Impact factor: 47.728

8.  Common sequence polymorphisms shaping genetic diversity in Arabidopsis thaliana.

Authors:  Richard M Clark; Gabriele Schweikert; Christopher Toomajian; Stephan Ossowski; Georg Zeller; Paul Shinn; Norman Warthmann; Tina T Hu; Glenn Fu; David A Hinds; Huaming Chen; Kelly A Frazer; Daniel H Huson; Bernhard Schölkopf; Magnus Nordborg; Gunnar Rätsch; Joseph R Ecker; Detlef Weigel
Journal:  Science       Date:  2007-07-20       Impact factor: 47.728

9.  Detecting polymorphic regions in Arabidopsis thaliana with resequencing microarrays.

Authors:  Georg Zeller; Richard M Clark; Korbinian Schneeberger; Anja Bohlen; Detlef Weigel; Gunnar Rätsch
Journal:  Genome Res       Date:  2008-03-06       Impact factor: 9.043

10.  A robust framework for detecting structural variations in a genome.

Authors:  Seunghak Lee; Elango Cheran; Michael Brudno
Journal:  Bioinformatics       Date:  2008-07-01       Impact factor: 6.937

View more
  110 in total

1.  Analysis of Arabidopsis genome-wide variations before and after meiosis and meiotic recombination by resequencing Landsberg erecta and all four products of a single meiosis.

Authors:  Pingli Lu; Xinwei Han; Ji Qi; Jiange Yang; Asela J Wijeratne; Tao Li; Hong Ma
Journal:  Genome Res       Date:  2011-11-21       Impact factor: 9.043

Review 2.  Natural variation in Arabidopsis: from molecular genetics to ecological genomics.

Authors:  Detlef Weigel
Journal:  Plant Physiol       Date:  2011-12-06       Impact factor: 8.340

3.  Genome-wide genetic changes during modern breeding of maize.

Authors:  Yinping Jiao; Hainan Zhao; Longhui Ren; Weibin Song; Biao Zeng; Jinjie Guo; Baobao Wang; Zhipeng Liu; Jing Chen; Wei Li; Mei Zhang; Shaojun Xie; Jinsheng Lai
Journal:  Nat Genet       Date:  2012-06-03       Impact factor: 38.330

4.  Hybrid mimics and hybrid vigor in Arabidopsis.

Authors:  Li Wang; Ian K Greaves; Michael Groszmann; Li Min Wu; Elizabeth S Dennis; W James Peacock
Journal:  Proc Natl Acad Sci U S A       Date:  2015-08-17       Impact factor: 11.205

5.  Regulation of Parent-of-Origin Allelic Expression in the Endosperm.

Authors:  Karina S Hornslien; Jason R Miller; Paul E Grini
Journal:  Plant Physiol       Date:  2019-05-07       Impact factor: 8.340

6.  The high polyphenol content of grapevine cultivar tannat berries is conferred primarily by genes that are not shared with the reference genome.

Authors:  Cecilia Da Silva; Gianpiero Zamperin; Alberto Ferrarini; Andrea Minio; Alessandra Dal Molin; Luca Venturini; Genny Buson; Paola Tononi; Carla Avanzato; Elisa Zago; Eduardo Boido; Eduardo Dellacassa; Carina Gaggero; Mario Pezzotti; Francisco Carrau; Massimo Delledonne
Journal:  Plant Cell       Date:  2013-12-06       Impact factor: 11.277

7.  Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms.

Authors:  Luis Zapata; Jia Ding; Eva-Maria Willing; Benjamin Hartwig; Daniela Bezdan; Wen-Biao Jiao; Vipul Patel; Geo Velikkakam James; Maarten Koornneef; Stephan Ossowski; Korbinian Schneeberger
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-27       Impact factor: 11.205

8.  LR_Gapcloser: a tiling path-based gap closer that uses long reads to complete genome assembly.

Authors:  Gui-Cai Xu; Tian-Jun Xu; Rui Zhu; Yan Zhang; Shang-Qi Li; Hong-Wei Wang; Jiong-Tang Li
Journal:  Gigascience       Date:  2019-01-01       Impact factor: 6.524

9.  Genomic localization of AtRE1 and AtRE2, copia-type retrotransposons, in natural variants of Arabidopsis thaliana.

Authors:  Mari Yamada; Yumi Yamagishi; Masashi Akaoka; Hidetaka Ito; Atsushi Kato
Journal:  Mol Genet Genomics       Date:  2014-04-27       Impact factor: 3.291

Review 10.  Genomic variation in Arabidopsis: tools and insights from next-generation sequencing.

Authors:  Jesse D Hollister
Journal:  Chromosome Res       Date:  2014-06       Impact factor: 5.239

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.