Literature DB >> 33365644

Characterization of the complete chloroplast genome of leek Allium porrum L. (Amaryllidaceae).

Mikhail A Filyushin1, Alexey V Beletsky1, Elena Z Kochieva1.   

Abstract

The complete chloroplast genome sequence of Allium porrum was determined by Illumina single-end sequencing. The complete plastid genome was 152,732 bp in length containing a large single copy (LSC) of 81,744 bp and a small single copy (SSC) of 17,910 bp, which were separated by a pair of 26,524 bp inverted repeats (IRs). A total of 133 genes were annotated, including 80 protein-coding genes, 38 tRNA genes, 8 rRNA genes, and 7 pseudogenes. The overall GC contents of the plastid genome were 36.7%. Unlike A. sativum and A. obliquum in the leek, plastome infA gene is absent, and rpl23 gene is a pseudogene due to a 4 bp deletion and the formation of a premature stop codon.
© 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Allium porrum; chloroplast genome; leek

Year:  2019        PMID: 33365644      PMCID: PMC7706495          DOI: 10.1080/23802359.2019.1640090

Source DB:  PubMed          Journal:  Mitochondrial DNA B Resour        ISSN: 2380-2359            Impact factor:   0.658


Leek (Allium porrum L., Amaryllidaceae) due to its taste and valuable dietary properties is a popular vegetable crop, the annual production of which is more than 2 million tons per year (http://www.fao.org/faostat/). In leek, the entire above-ground part is edible – the bleached false stem and green leaves, which are used in food both raw and cooked. At the same time, leek, like other Allium species, is a rich source of secondary metabolites (Soininen et al. 2014). The chloroplast genome A. porrum (cultivar ‘Premier’; seed from Federal Scientific Center of Vegetable Crops, Russia) was amplified via long range PCR using 11 pairs of primers developed on the basis of the Allium cepa plastid genome (Filyushin, Beletsky, et al. 2018), sequencing was conducted using the Illumina HiSeq 1500 Sequencing System with single-end 220 bp reads. Spades v.3.8 was used to assemble the high-quality short reads into contigs (Bankevich et al. 2012). Contigs were assembled against the complete chloroplast genome A. cepa (NC_024813) and A. sativum (NC_031829) as a reference. Gaps were closed using assembly graph in Bandage (Wick et al. 2015), reads were then mapped against the resulting single contig to ensure the correctness of the finished assembly. The plastid genome of A. porrum was annotated by using the DOGMA program (http://dogma.ccbb.utexas.edu). The start and stop codons for the genes were identified and corrected manually. All pseudogenes were additionally verified by Sanger sequencing from flanking primers. The assembled A. porrum plastid genome (Genbank accession no. MK820026) was 152,732 bp in length, showing a typical quadripartite structure including a pair of inverted repeats (IRs) of 26,524 bp separating one large single copy region (LSC) of 81,744 bp and one small single copy region (SSC) of 17,910 bp. GC contents of the genome were 36.7%. A total of 133 genes were identified that include 80 protein-coding genes, 38 tRNA genes, 8 rRNA genes, and 7 pseudogenes. Most of the genes are single copy, whereas 17 genes present in double copies, including 6 protein-coding genes (rps19, rpl2, ycf2, ndhB, rps7), 8 tRNA genes (trnR-ACG, trnL-CAA, trnV-GAC, trnH-GUG, trnI-CAU, trnI-GAU, trnA-UGC, trnN-GUU) and all 4 rRNA genes in IRs (rrn4.5, rrn5, rrn16 and rrn23). Intron sequences are found in 17 genes, 15 of which contain a single intron (atpF, rpoC1, ndhA, trnK-UUU, trnG-GCC, trnL-UAA, trnV-UAC; 4 genes in IRs: rpl2, ndhB, trnI-GAU, trnA-UGC), while two (clpP and ycf3) have two introns. Seven genes became pseudogenes due to internal stop codons (rps2, rps16 and ycf15, and rpl23 in IRs) or because of incomplete duplication in the IRB/SSC junction region (ycf1). Unlike the Allium sativum (Filyushin et al. 2016) and Allium obliquum plastomes (Filyushin, Mazur, et al. 2018), in the leek plastome infA gene is absent (additionally verified by Sanger sequencing). On the ML tree, A. porrum clustered with other Allium species, the closest to it is garlic A. sativum, with which they belong to the section Allium (Figure 1).
Figure 1.

Phylogenetic tree inferred by maximum-likelihood using 80 protein-coding gene sequences from plastomes of Allium species (Allium altaicum (NC_040972), Allium cepa (NC_024813), Allium fistulosum (NC_040222), Allium obliquum (NC_037199), Allium porrum (MK820026), Allium sativum (NC_031829)) and Asparagus officinalis (NC_034777) as an outgroup. PhyML 3.1 was used for the sequence alignment and construction of the tree. Bootstrap support values based on 1000 replicates are displayed on each node.

Phylogenetic tree inferred by maximum-likelihood using 80 protein-coding gene sequences from plastomes of Allium species (Allium altaicum (NC_040972), Allium cepa (NC_024813), Allium fistulosum (NC_040222), Allium obliquum (NC_037199), Allium porrum (MK820026), Allium sativum (NC_031829)) and Asparagus officinalis (NC_034777) as an outgroup. PhyML 3.1 was used for the sequence alignment and construction of the tree. Bootstrap support values based on 1000 replicates are displayed on each node.
  3 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Metabolite profiling of leek (Allium porrum L) cultivars by (1) H NMR and HPLC-MS.

Authors:  Tuula H Soininen; Niko Jukarainen; Pasi Soininen; Seppo O K Auriola; Riitta Julkunen-Tiitto; Wieslaw Oleszek; Anna Stochmal; Reijo O Karjalainen; Jouko J Vepsäläinen
Journal:  Phytochem Anal       Date:  2014-01-10       Impact factor: 3.373

3.  Bandage: interactive visualization of de novo genome assemblies.

Authors:  Ryan R Wick; Mark B Schultz; Justin Zobel; Kathryn E Holt
Journal:  Bioinformatics       Date:  2015-06-22       Impact factor: 6.937

  3 in total
  1 in total

1.  Gene Loss, Pseudogenization in Plastomes of Genus Allium (Amaryllidaceae), and Putative Selection for Adaptation to Environmental Conditions.

Authors:  Victoria A Scobeyeva; Ilya V Artyushin; Anastasiya A Krinitsina; Pavel A Nikitin; Maxim I Antipin; Sergei V Kuptsov; Maxim S Belenikin; Denis O Omelchenko; Maria D Logacheva; Evgenii A Konorov; Andrey E Samoilov; Anna S Speranskaya
Journal:  Front Genet       Date:  2021-07-08       Impact factor: 4.599

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.