Literature DB >> 27660776

Chloroplast Genome Sequence of Arabidopsis thaliana Accession Landsberg erecta, Assembled from Single-Molecule, Real-Time Sequencing Data.

Kai Bernd Stadermann1, Daniela Holtgräwe2, Bernd Weisshaar2.   

Abstract

A publicly available data set from Pacific Biosciences was used to create an assembly of the chloroplast genome sequence of the Arabidopsis thaliana genotype Landsberg erecta The assembly is solely based on single-molecule, real-time sequencing data and hence provides high resolution of the two inverted repeat regions typically contained in chloroplast genomes.
Copyright © 2016 Stadermann et al.

Entities:  

Year:  2016        PMID: 27660776      PMCID: PMC5034127          DOI: 10.1128/genomeA.00975-16

Source DB:  PubMed          Journal:  Genome Announc


GENOME ANNOUNCEMENT

Arabidopsis thaliana is the most used model organism in plant genetics. During the last decades, various genotypes of A. thaliana have been sequenced using different sequencing technologies (1, 2). With third-generation sequencing technologies entering the market, novel techniques for DNA sequence generation have become available. The Ler-0 genotype is currently the second most widely used accession of A. thaliana behind Columbia Col-0. The latest published Ler-0 genome assembly used reads created by third-generation sequencing along with reads from other sources to create a high-quality nuclear genome assembly (3). One of the new technologies is single-molecule, real-time (SMRT) sequencing (4) developed by Pacific Biosciences, which is sometimes referred to as “PacBio sequencing”. This sequencing technology allows long read lengths up to the 20- to 60-kb range, allowing single reads to span long repetitive elements or regions. The typical chloroplast genome contains two of these long (~26 kb) repeat regions, with the additional feature that the two regions are inverted repeats (IR) of each other. These IRs are hard to assemble using only short-read data from next-generation sequencing methods. Recently, we described an approach to assemble a chloroplast genome based on an SMRT sequencing data set originally dedicated to sequencing a nuclear genome (5). We applied this approach to data from a single SMRT cell obtained from a publically available SMRT sequencing data set for Arabidopsis thaliana Ler-0 provided by Pacific Biosciences (6). Sequence reads from sample number SAMN02724977 created by P5-C3 chemistry were used. We followed our protocol for identification of potential chloroplast reads, assembly, alignment, and polishing as described previously (5) with one exception: instead of using the spinach chloroplast genome sequence (7) as a template for read extraction, we used the chloroplast genome sequence of the Arabidopsis thaliana Col-0 genotype (8). We extracted 11,016 subreads summing up to 90,383,214 bp. The assembly resulted in a sequence with overlapping ends, and hence we assume that the chloroplast genome sequence is complete. We removed the additional overlap and aligned the sequence to the standard starting position of chloroplast genome sequences. The resulting assembly, designated Ler0_cp_smrt, has a total length of 154,515 bp and differs in 111 positions from the Col-0 chloroplast genome sequence (AP000423.1). We annotated the genome using CpAVAS (9) and identified 123 genes. Of these, 85 genes encode mRNA (i.e., proteins), 8 encode rRNA, and 30 encode tRNA.

Accession number(s).

The complete sequence of the Arabidopsis thaliana Ler-0 chloroplast genome was deposited in GenBank under accession number KX551970.
  8 in total

1.  Chromosome-level assembly of Arabidopsis thaliana Ler reveals the extent of translocation and inversion polymorphisms.

Authors:  Luis Zapata; Jia Ding; Eva-Maria Willing; Benjamin Hartwig; Daniela Bezdan; Wen-Biao Jiao; Vipul Patel; Geo Velikkakam James; Maarten Koornneef; Stephan Ossowski; Korbinian Schneeberger
Journal:  Proc Natl Acad Sci U S A       Date:  2016-06-27       Impact factor: 11.205

2.  Sequencing of natural strains of Arabidopsis thaliana with short reads.

Authors:  Stephan Ossowski; Korbinian Schneeberger; Richard M Clark; Christa Lanz; Norman Warthmann; Detlef Weigel
Journal:  Genome Res       Date:  2008-09-25       Impact factor: 9.043

3.  Complete structure of the chloroplast genome of Arabidopsis thaliana.

Authors:  S Sato; Y Nakamura; T Kaneko; E Asamizu; S Tabata
Journal:  DNA Res       Date:  1999-10-29       Impact factor: 4.458

4.  The plastid chromosome of spinach (Spinacia oleracea): complete nucleotide sequence and gene organization.

Authors:  C Schmitz-Linneweber; R M Maier; J P Alcaraz; A Cottet; R G Herrmann; R Mache
Journal:  Plant Mol Biol       Date:  2001-02       Impact factor: 4.076

5.  Whole-genome sequencing of multiple Arabidopsis thaliana populations.

Authors:  Jun Cao; Korbinian Schneeberger; Stephan Ossowski; Torsten Günther; Sebastian Bender; Joffrey Fitz; Daniel Koenig; Christa Lanz; Oliver Stegle; Christoph Lippert; Xi Wang; Felix Ott; Jonas Müller; Carlos Alonso-Blanco; Karsten Borgwardt; Karl J Schmid; Detlef Weigel
Journal:  Nat Genet       Date:  2011-08-28       Impact factor: 38.330

6.  Real-time DNA sequencing from single polymerase molecules.

Authors:  John Eid; Adrian Fehr; Jeremy Gray; Khai Luong; John Lyle; Geoff Otto; Paul Peluso; David Rank; Primo Baybayan; Brad Bettman; Arkadiusz Bibillo; Keith Bjornson; Bidhan Chaudhuri; Frederick Christians; Ronald Cicero; Sonya Clark; Ravindra Dalal; Alex Dewinter; John Dixon; Mathieu Foquet; Alfred Gaertner; Paul Hardenbol; Cheryl Heiner; Kevin Hester; David Holden; Gregory Kearns; Xiangxu Kong; Ronald Kuse; Yves Lacroix; Steven Lin; Paul Lundquist; Congcong Ma; Patrick Marks; Mark Maxham; Devon Murphy; Insil Park; Thang Pham; Michael Phillips; Joy Roy; Robert Sebra; Gene Shen; Jon Sorenson; Austin Tomaney; Kevin Travers; Mark Trulson; John Vieceli; Jeffrey Wegener; Dawn Wu; Alicia Yang; Denis Zaccarin; Peter Zhao; Frank Zhong; Jonas Korlach; Stephen Turner
Journal:  Science       Date:  2008-11-20       Impact factor: 47.728

7.  SMRT sequencing only de novo assembly of the sugar beet (Beta vulgaris) chloroplast genome.

Authors:  Kai Bernd Stadermann; Bernd Weisshaar; Daniela Holtgräwe
Journal:  BMC Bioinformatics       Date:  2015-09-16       Impact factor: 3.169

8.  CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences.

Authors:  Chang Liu; Linchun Shi; Yingjie Zhu; Haimei Chen; Jianhui Zhang; Xiaohan Lin; Xiaojun Guan
Journal:  BMC Genomics       Date:  2012-12-20       Impact factor: 3.969

  8 in total
  2 in total

1.  A chromosome-level sequence assembly reveals the structure of the Arabidopsis thaliana Nd-1 genome and its gene set.

Authors:  Boas Pucker; Daniela Holtgräwe; Kai Bernd Stadermann; Katharina Frey; Bruno Huettel; Richard Reinhardt; Bernd Weisshaar
Journal:  PLoS One       Date:  2019-05-21       Impact factor: 3.240

2.  ECuADOR-Easy Curation of Angiosperm Duplicated Organellar Regions, a tool for cleaning and curating plastomes assembled from next generation sequencing pipelines.

Authors:  Damien D Hinsinger; Joeri S Strijk; Angelo D Armijos Carrion
Journal:  PeerJ       Date:  2020-04-07       Impact factor: 2.984

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.