Literature DB >> 33366270

The complete chloroplast genome sequence of Cautleya gracilis.

Yunqing Li1, Yi Wang1.   

Abstract

The first complete chloroplast genome (cpDNA) sequence of Cautleya gracilis was determined from Illumina HiSeq pair-end sequencing data in this study. The cpDNA is 164,001 bp in length, contains a large single-copy region (LSC) of 89,271 bp and a small single-copy region (SSC) of 15,984 bp, which were separated by a pair of inverted repeats (IR) regions of 29,373 bp. The genome contains 131 genes, including 85 protein-coding genes, 8 ribosomal RNA genes, and 38 transfer RNA genes. The overall GC content of the whole genome is 36.1% and the corresponding values of the LSC, SSC, and IR regions are 33.8, 29.4, and 41.3%, respectively. Further phylogenomic analysis showed that C. gracilis close to genus Curcuma in family Zingiberaceae.
© 2019 The Author(s). Published by Informa UK Limited, trading as Taylor & Francis Group.

Entities:  

Keywords:  Cautleya gracilis; Illumina sequencing; chloroplast; phylogenetic analysis

Year:  2019        PMID: 33366270      PMCID: PMC7707701          DOI: 10.1080/23802359.2019.1688713

Source DB:  PubMed          Journal:  Mitochondrial DNA B Resour        ISSN: 2380-2359            Impact factor:   0.658


Cautleya gracilis is the species of the genus Cautleya within the family Zingiberaceae. It distributes in Tibet, Yunnan, and Sichuan of China, India, and Nepal. Cautleya gracilis usually grows in the wet valley, sometimes epiphytic to trees (Tang et al. 2001). The extracts of C. gracilis showed anticancer activity on human medullary carcinoma cells. The extracts from C. gracilis had a strong inhibitory effect on tumour cell growth, disrupting the tumour spheroids, and induced tumour cell apoptosis (Li et al. 2008). Therefore, C. gracilis has huge potential medicinal value. However, there have been no genomic studies on C. gracilis. Herein, we reported and characterized the complete C. gracilis plastid genome (MN539264). One C. gracilis individual (specimen number: 5309270808) was collected from Puwen, Yunnan Province of China (23°20′9″ N, 99°14′16″ E). The specimen is stored at Yunnan Academy of Forestry Herbarium, Kunming, China and the accession number is YAFM20180726. DNA was extracted from its fresh leaves using DNA Plantzol Reagent (Invitrogen, Carlsbad, CA, USA). Paired-end reads were sequenced by using Illumina HiSeq system (Illumina, San Diego, CA, USA). In total, about 25.3 million high-quality clean reads were generated with adaptors trimmed. Aligning, assembly, and annotation were conducted by CLC de novo assembler (CLC Bio, Aarhus, Denmark), BLAST, GeSeq (Tillich et al. 2017), and Geneious version 11.0.5 (Biomatters Ltd, Auckland, New Zealand). To confirm the phylogenetic position of C. gracilis, other nine species of family Zingiberaceae from NCBI were aligned using MAFFT version 7 (Katoh and Standley 2013). The Auto algorithm in the MAFFT alignment software was used to align the eight complete genome sequences and the G-INS-i algorithm was used to align the partial complex sequecnces and maximum likelihood (ML) bootstrap analysis was conducted using RAxML (Stamatakis 2006); bootstrap probability values were calculated from 1000 replicates. Musella lasiocarpa (KY807173) and Musa balbisiana (KT595228) were served as the out-group. The complete C. gracilis plastid genome is a circular DNA molecule with the length of 164,001 bp, contains a large single-copy region (LSC) of 89,271 bp and a small single-copy region (SSC) of 15,984 bp, which were separated by a pair of inverted repeats (IR) regions of 29,373 bp. The overall GC content of the whole genome is 36.1%, and the corresponding values of the LSC, SSC, and IR regions are 33.8, 29.4, and 41.3%, respectively. The plastid genome contained 131 genes, including 85 protein-coding genes, 8 ribosomal RNA genes, and 38 transfer RNA genes. Further, phylogenomic analysis showed that C. gracilis close to the genus Curcuma in family Zingiberaceae (Figure 1). The determination of the complete plastid genome sequences provided new molecular data to illuminate the Zingiberaceae evolution.
Figure 1.

The maximum-likelihood tree based on the 10 chloroplast genomes of Zingiberaceae. The bootstrap value based on 1000 replicates is shown on each node.

The maximum-likelihood tree based on the 10 chloroplast genomes of Zingiberaceae. The bootstrap value based on 1000 replicates is shown on each node.
  4 in total

1.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models.

Authors:  Alexandros Stamatakis
Journal:  Bioinformatics       Date:  2006-08-23       Impact factor: 6.937

2.  MAFFT multiple sequence alignment software version 7: improvements in performance and usability.

Authors:  Kazutaka Katoh; Daron M Standley
Journal:  Mol Biol Evol       Date:  2013-01-16       Impact factor: 16.240

3.  Anticancer activity of novel extracts from Cautleya gracilis (Smith) Dandy: apoptosis in human medullary thyroid carcinoma cells.

Authors:  Zengxia Li; Sonja Sturm; Bernhard Svejda; Harald Höger; Elisabeth Schraml; Elisabeth Ingolic; Veronika Siegl; Hermann Stuppner; Roswitha Pfragner
Journal:  Anticancer Res       Date:  2008 Sep-Oct       Impact factor: 2.480

4.  GeSeq - versatile and accurate annotation of organelle genomes.

Authors:  Michael Tillich; Pascal Lehwark; Tommaso Pellizzer; Elena S Ulbricht-Jones; Axel Fischer; Ralph Bock; Stephan Greiner
Journal:  Nucleic Acids Res       Date:  2017-07-03       Impact factor: 16.971

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.