| Literature DB >> 30687376 |
Chunming Gao1,2,3, Yunfei Deng4,5, Jun Wang1,2,3.
Abstract
Among the four species of Echinacanthus (Acanthaceae), one distributed in the West Himalayan region and three restricted to the Sino-Vietnamese karst region. Because of its ecological significance, molecular markers are necessary for proper assessment of its genetic diversity and phylogenetic relationships. Herein, the complete chloroplast genomes of four Echinacanthus species were determined for the first time. The results indicated that all the chloroplast genomes were mapped as a circular structure and each genomes included 113 unique genes, of which 80 were protein-coding, 29 were tRNAs, and 4 were rRNAs. However, the four cp genomes ranged from 151,333 to 152,672 bp in length. Comparison of the four cp genomes showed that the divergence level was greater between geographic groups. We also analyzed IR expansion or contraction in the four cp genomes and the fifth type of the large single copy/inverted repeat region in Lamiales was suggested. Furthermore, based on the analyses of comparison and nucleotide variability, six most divergent sequences (rrn16, ycf1, ndhA, rps16-trnQ-UUG, trnS-GCU-trnG-UCC, and psaA-ycf3) were identified. A total of 37-45 simple sequence repeats were discovered in the four species and 22 SSRs were identified as candidate effective molecular markers for detecting interspecies polymorphisms. These SSRs and hotspot regions could be used as potential molecular markers for future study. Phylogenetic analysis based on Bayesian and parsimony methods did not support the monophyly of Echinacanthus. The phylogenetic relationships among the four species were clearly resolved and the results supported the recognition of the Sino-Vietnamese Echinacanthus species as a new genus. Based on the protein sequence evolution analysis, 12 genes (rpl14, rpl16, rps4, rps15, rps18, rps19, psbK, psbN, ndhC, ndhJ, rpoB, and infA) were detected under positive selection in branch of Sino-Vietnamese Echinacanthus species. These genes will lead to understanding the adaptation of Echinacanthus species to karst environment. The study will help to resolve the phylogenetic relationship and understand the adaptive evolution of Echinacanthus. It will also provide genomic resources and potential markers suitable for future species identification and speciation studies of the genus.Entities:
Keywords: Echinacanthus; adaptive evolution; chloroplast genome; molecular marker; phylogeny; sequence divergence
Year: 2019 PMID: 30687376 PMCID: PMC6335349 DOI: 10.3389/fpls.2018.01989
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
The basic characteristics of the chloroplast genomes of four Echinacanthus species.
| Characteristics | ||||
|---|---|---|---|---|
| Location | Hekou, Yunnan, China | Libo, Guizhou, China | Yangchun, Guangdong, China | Phidim, Mechi, Nepal |
| Longitude | 22.6858 | 25.3234 | 22.1874 | 27.14 |
| Latitude | 104.0199 | 108.0814 | 111.7408 | 87.7658 |
| Voucher | F. Peng P16102201(IBSC) | Y. F. Deng 26020 (IBSC) | Y. Tong and F. Peng 14082413 (IBSC) | Adhikari B. and Kandel D. R.79 (E) |
| LSC length (bp) | 83,875 | 82,561 | 83,947 | 83,610 |
| SSC length (bp) | 17,389 | 17,398 | 17,572 | 17,740 |
| IR length (bp) | 25,690 | 25,687 | 25,433 | 25,661 |
| Total length (bp) | 152,644 | 151,333 | 152,384 | 152,672 |
| Protein-coding genes | 80 | 80 | 80 | 80 |
| tRNA genes | 29 | 29 | 29 | 29 |
| rRNA genes | 4 | 4 | 4 | 4 |
| Total number of genes | 113 | 113 | 113 | 113 |
| GC content (%) | 38.62% | 38.74% | 38.64% | 38.26% |
| Clean reads | 2030586 | 1806880 | 1422624 | 1526374 |
| Clean base | 46145070000 | 41061352500 | 3232912500 | 34686855000 |
| Read length (bp) | 150 bp | 150 bp | 150 bp | 150 bp |
FIGURE 1The chloroplast genome map of four Echinacanthus species. Genes lying outside the circle are transcribed in the counter clockwise direction, while those inside are transcribed in clockwise direction. The colored bars indicted different functional group. The darker gray area in the inner circle denoted GC content while the lighter gray corresponds to the AT content of the genome. LSC, large single copy; SSC, small single copy; IR, inverted repeat.
Gene contents in four Echinacanthus species chloroplast genome.
| Classification | Genes |
|---|---|
| Large ribosomal subunits | |
| Small ribosomal subunits | |
| DNA dependent RNA polymerase | |
| Protease | |
| Maturase | |
| Ribosomal RNAs | |
| Transfer RNAs | |
| Photosystem I | |
| Photosystem II | |
| NAD(P)H dehydrogenase complex | |
| F-type ATP synthase | |
| Cytochrome b6/f complex | |
| Inner membrane protein | |
| Cytochrome c biogenesis protein | |
| Large subunit of Rubisco | |
| Subunit of acetyl-CoA-carboxylase | |
| Translation initiation factor | |
FIGURE 2The numbers of the simple sequence repeats (SSRs) in the chloroplast genomes of four Echinacanthus species. mono-, mononucleotides; di-, dinucleotides; tri-, trinucleotides; tetra-, tetranucleotides; penta-, pentanucleotides; hexa-, hexanucleotides.
FIGURE 3Comparison of four Echinacanthus chloroplast genomes using mVISTA program, taking the annotation of E. lofouensis as a reference. The top line shows the genes in order. A cut-off of 70% identity was used for the plots and the Y-scale represents the percent identity between 50 and 100%. Genome regions are color-coded as exon and conserved non-coding sequences (CNS).
FIGURE 4Comparative analysis of the nucleotide diversity (Pi) values among four Echinacanthus chloroplast genomes. (A) Coding regions. (B) Non-coding regions. X-axis: name of the regions; Y-axis: nucleotide diversity (Pi).
FIGURE 5Comparison of the LSC, IR, and SSC junction regions among four Echinacanthus chloroplast genomes. Genes are denoted by colored boxes. Ψ shows the pseudogene. The numbers above the gene features indicates the distance between the end of the gene and the borders sites. The slashes indicate the location of the distance. This figure is not to scale.
FIGURE 6Phylogenetic tree conducted using Bayesian Inference (BI) and most parsimonious (MP) methods based on whole complete genomes from different species. The numbers above branches represent posterior probability (PP)/bootstrap percentage (BP).
The polymorphic simple sequence repeats in Echinacanthus.
| Type | Location | Regions | |
|---|---|---|---|
| TA | 5/5/4/2 | LSC | |
| AT | 5/5/5/4 | LSC | |
| TA | 5/0/0/0 | LSC | |
| TA | 3/0/3/5 | LSC | |
| AT | 2/2/2/5 | LSC | |
| TTA | 4/4/4/0 | LSC | |
| ATT | 4/2/4/2 | LSC | |
| AAT | 6/1/2/3 | LSC | |
| ATA | 2/6/2/4 | LSC | |
| ATA | 4/3/3/4 | LSC | |
| TAT | 4/4/4/2 | LSC | |
| TCT | 4/4/4/3 | LSC | |
| AAGA | 3/3/2/2 | LSC | |
| CAAT | 3/3/3/2 | SSC | |
| AATC | 3/3/3/1 | SSC | |
| ATAG | 3/0/0/0 | LSC | |
| TTCT | 2/2/3/2 | SSC | |
| AATT | 2/2/2/3 | LSC | |
| CAAA | 2/2/2/3 | LSC | |
| TAAA | 2/2/2/3 | LSC | |
| TAGA | 2/2/2/3 | LSC | |
| AATTAA | 3/2/2/0 | LSC | |