Literature DB >> 34179497

Complete chloroplast genome sequence and phylogenetic analysis of Torenia fournieri.

Gang Chen1, Li-Guo Wang2, Ying-Hua Wang1.   

Abstract

Torenia fournieri belongs to the genus Torenia in the family Linderniaceae. The complete chloroplast genome of T. fournieri was sequenced and analyzed by Illumina sequencing in this study. The full length of the complete chloroplast genome is 153,938 bp, containing a pair of inverted repeat regions of 24,805 bp (IRa and IRb) separated by a large single copy region (LSC) of 85,498 bp and a small single copy region (SSC) of 18,830 bp. The T. fournieri chloroplast genome encodes 131 genes, comprising 87 protein-coding genes, 36 tRNA genes, 8 rRNA genes, without pseudogene. Phylogenetic analysis showed that T. fournieri was closely related to T. benthamiana and T. concolor within the genus Torenia in family Linderniaceae.
© 2021 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Chloroplast genome; Torenia fournieri; phylogenetic analysis

Year:  2021        PMID: 34179497      PMCID: PMC8204956          DOI: 10.1080/23802359.2021.1939179

Source DB:  PubMed          Journal:  Mitochondrial DNA B Resour        ISSN: 2380-2359            Impact factor:   0.658


Torenia fournieri is one species of the genus Torenia (Linderniaceae), which distributes predominantly in Fujian, Guangdong, Guangxi, Taiwan, Yunan and Zhejiang provinces in China (Wu and Raven 1998). Torenia fournieri is not only an important tropical and subtropical ornamental plant, but also used as a model plant widely applied for flower research (Nishihara et al., 2018). Torenia fournieri used in this paper was planted in the Biological Garden of College of Life Sciences of Zhaoqing University (N23°6′, E112°30′, Zhaoqing, China), and the specimens (No: BGCLSZU001) were deposited in GDZQU herbarium of Zhaoqing University. Firstly, the chloroplast genome DNA of T. fournieri was extracted from young leaves, and Covaris M220 (Covaris, Woburn, MA, USA) was used for breaking the DNA into about 300 bp fragments. Secondly, we constructed shotgun sequencing libraries according to the TruSeq™ DNA Sample Prep Kit for Illumina. Thirdly, whole genome sequencing was executed using the Illumina NovaSeq platform (Illumina, USA) (Genepioneer Biotechnologies Co. Ltd, Nanjing, China). Pair-end Illumina raw reads were cleaned from adaptors and barcodes and then quality filtered using Trimmomatic (Bolger et al. 2014). Then, reads were mapped to the chloroplast genome of the reference species (Genbank accession number: NC_045273.1), and the reads of nuclear and mitochondrial origins were excluded using Bowtie2 v2.2.4 (Langmead and Salzberg 2012). SPAdes 3.10.1 were used reconstruct the chloroplast genomes with de novo assembly method (Bankevich et al. 2012), and chloroplast contigs were concatenated into larger contigs using Sequencher 5.3.2 (Gene Codes Inc., Ann Arbor, MI, USA). A ‘genome walking’ technique, using the Unix ‘grep’ function, was used to find reads that could fill any gaps between contigs that did not assemble in the initial set of analyses (Souza et al. 2019). Jellyfish v.2.2.3 was used to correct misassembled contigs (Marcais and Kingsford 2011). Annotation of the chloroplast genomes were generated by CpGAVAS (Liu et al. 2012) and a circular representation was drawn with the online tool OGDRAW (Lohse et al. 2007). The complete chloroplast genome sequence has been submitted to Genbank with the accession number of MW307826. The length of chloroplast genome sequence of T. fournieri is 153,938 bp, including two inverted repeat regions (IRa and IRb, each 24,805 bp) separated by a LSC (85,498 bp) region and a SSC (18,830 bp) region. The GC content of the overall chloroplast genome, IR regions, LSC, and SSC are 37.57, 43.45, 35.39 and 32.00%, respectively. The chloroplast genome contains 131 genes in total, including 87 protein-coding genes, 36 tRNAs, 8 rRNAs, without pseudogene. The 15 whole genomes were used for phylogenetic tree analysis. First, we used MAFFT v7.427 (Katoh et al. 2005) -auto mode to align each sequence. The gaps in the alignment were removed using the program trimAl with ‘-nogaps’ v 1.4 (Capella-Gutierrez et al. 2009). Finally, Maximum-likelihood (ML) method was used to construct the phylogenetic tree with MEGA v7.0 (Kumar et al. 2016); the nucleotide substitution model used was the Kimura 2-parameter model with 1000 bootstrap replicates (Figure 1). We found that T. fournieri was closely related to T. benthamiana and T. concolor within the genus Torenia in family Linderniaceae.
Figure 1.

Phylogenetic tree reconstruction of 15 species based on sequences from whole chloroplast genomes. All the sequences were downloaded from NCBI Genbank.

Phylogenetic tree reconstruction of 15 species based on sequences from whole chloroplast genomes. All the sequences were downloaded from NCBI Genbank.
  11 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  A fast, lock-free approach for efficient parallel counting of occurrences of k-mers.

Authors:  Guillaume Marçais; Carl Kingsford
Journal:  Bioinformatics       Date:  2011-01-07       Impact factor: 6.937

3.  Fast gapped-read alignment with Bowtie 2.

Authors:  Ben Langmead; Steven L Salzberg
Journal:  Nat Methods       Date:  2012-03-04       Impact factor: 28.547

4.  MEGA7: Molecular Evolutionary Genetics Analysis Version 7.0 for Bigger Datasets.

Authors:  Sudhir Kumar; Glen Stecher; Koichiro Tamura
Journal:  Mol Biol Evol       Date:  2016-03-22       Impact factor: 16.240

5.  MAFFT version 5: improvement in accuracy of multiple sequence alignment.

Authors:  Kazutaka Katoh; Kei-ichi Kuma; Hiroyuki Toh; Takashi Miyata
Journal:  Nucleic Acids Res       Date:  2005-01-20       Impact factor: 16.971

6.  Application of the CRISPR/Cas9 system for modification of flower color in Torenia fournieri.

Authors:  Masahiro Nishihara; Atsumi Higuchi; Aiko Watanabe; Keisuke Tasaki
Journal:  BMC Plant Biol       Date:  2018-12-05       Impact factor: 4.215

7.  The complete chloroplast genome of Stryphnodendron adstringens (Leguminosae - Caesalpinioideae): comparative analysis with related Mimosoid species.

Authors:  Ueric José Borges de Souza; Rhewter Nunes; Cíntia Pelegrineti Targueta; José Alexandre Felizola Diniz-Filho; Mariana Pires de Campos Telles
Journal:  Sci Rep       Date:  2019-10-02       Impact factor: 4.379

8.  trimAl: a tool for automated alignment trimming in large-scale phylogenetic analyses.

Authors:  Salvador Capella-Gutiérrez; José M Silla-Martínez; Toni Gabaldón
Journal:  Bioinformatics       Date:  2009-06-08       Impact factor: 6.937

9.  CpGAVAS, an integrated web server for the annotation, visualization, analysis, and GenBank submission of completely sequenced chloroplast genome sequences.

Authors:  Chang Liu; Linchun Shi; Yingjie Zhu; Haimei Chen; Jianhui Zhang; Xiaohan Lin; Xiaojun Guan
Journal:  BMC Genomics       Date:  2012-12-20       Impact factor: 3.969

10.  Trimmomatic: a flexible trimmer for Illumina sequence data.

Authors:  Anthony M Bolger; Marc Lohse; Bjoern Usadel
Journal:  Bioinformatics       Date:  2014-04-01       Impact factor: 6.937

View more
  1 in total

1.  Complete chloroplast genome sequence of Pithecellobium clypearia (Jack) Benth.

Authors:  Ying-Hua Wang; Gang Chen
Journal:  Mitochondrial DNA B Resour       Date:  2022-04-29       Impact factor: 0.610

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.