| Literature DB >> 32071806 |
Sunan Huang1,2, Xuejun Ge1, Asunción Cano3, Betty Gaby Millán Salazar3, Yunfei Deng1.
Abstract
The genus Dicliptera (Justicieae, Acanthaceae) consists of approximately 150 species distributed throughout the tropical and subtropical regions of the world. Newly obtained chloroplast genomes (cp genomes) are reported for five species of Dilciptera (D. acuminata, D. peruviana, D. montana, D. ruiziana and D. mucronata) in this study. These cp genomes have circular structures of 150,689-150,811 bp and exhibit quadripartite organizations made up of a large single copy region (LSC, 82,796-82,919 bp), a small single copy region (SSC, 17,084-17,092 bp), and a pair of inverted repeat regions (IRs, 25,401-25,408 bp). Guanine-Cytosine (GC) content makes up 37.9%-38.0% of the total content. The complete cp genomes contain 114 unique genes, including 80 protein-coding genes, 30 transfer RNA (tRNA) genes, and four ribosomal RNA (rRNA) genes. Comparative analyses of nucleotide variability (Pi) reveal the five most variable regions (trnY-GUA-trnE-UUC, trnG-GCC, psbZ-trnG-GCC, petN-psbM, and rps4-trnL-UUA), which may be used as molecular markers in future taxonomic identification and phylogenetic analyses of Dicliptera. A total of 55-58 simple sequence repeats (SSRs) and 229 long repeats were identified in the cp genomes of the five Dicliptera species. Phylogenetic analysis identified a close relationship between D. ruiziana and D. montana, followed by D. acuminata, D. peruviana, and D. mucronata. Evolutionary analysis of orthologous protein-coding genes within the family Acanthaceae revealed only one gene, ycf15, to be under positive selection, which may contribute to future studies of its adaptive evolution. The completed genomes are useful for future research on species identification, phylogenetic relationships, and the adaptive evolution of the Dicliptera species. ©2020 Huang et al.Entities:
Keywords: Adaptive evolution; Chloroplast genome; Dicliptera; Molecular markers; Phylogeny; Species identification
Year: 2020 PMID: 32071806 PMCID: PMC7007973 DOI: 10.7717/peerj.8450
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Morphological differences among Dicliptera acuminata, D. peruviana, D. montana, D. mucronata and D. ruiziana.
| Ca. 60 cm | Ca. 60 cm | Ca. 50 cm | 60–130 cm | Ca. 30 cm | |
| Erect, branched, sulcate, hirsute | Erect, branched, sulcate, hirsute | Erect, branched, sulcate, pubescent | Erect, branched, sulcate, pubescent | Erect, branched, sulcate, pubescent | |
| Oblong-lanceolate, 3.5–7.0 × 1.5–2.5 cm, villous | Ovate, 3.5–6.0 × 2.5–4.0 cm, pubescent | Ovate, 1.0–1.5 × 0.8–1.0 cm, pilose when young, then glabrescent | Ovate, 3.0–3.5 × 1.5–2.0 cm, scarcely pilose | Ovate, 1.0–1.5 × 0.8–1.0 cm, pubescent | |
| Acuminate-acute | Acute | Acuminate | Acuminate-acute | Acute | |
| Verticillaster | Verticillaster | Spikelike thyrse | Verticillaster | Pedunculate cyme | |
| Lanceolate-linear, ciliate | Ovate, ciliate | Spatulate, gland-tipped pilose | Obovate-rhombic, pilose | Obovate, gland-tipped pilose | |
| Subulate, ciliate | Subulate, | Hyaline, asymmetrical, minute pilose | Linear-subulate, pilose | Subulate | |
| Subulate, ciliate | Linear, hirsute | Lanceolate, minute pilose | Linear-lanceolate, margin minutely pubescent | Lanceolate, gland-tipped pilose | |
| Purple, outside pubescent | Purple, outside pubescent | Pale purple, outside pilose | Purplish red, outside pilose | Pink, outside pubescent | |
| scarcely pilose | glabrous | scarcely pilose | glabrous | scarcely pilose |
Species information and genome features of the chloroplast genomes of five Dicliptera species.
| Location | 9.05°S, 77.81°W | 11.41°S, 77.23°W | 11.79°S, 77.05°W | 15.87°S, 74.15°W | 12.21°S, 76.82°W |
| Geographic region | Caraz, Paron, Peru | Lomas de lguanil, Huaral province, Peru | Lomas de Carabayllo, Lima province, Peru | Lomas de Cháparra, Caravelí province, Peru | Santuario del Amancay, Lima Province, Peru |
| Voucher specimens No. | P10-091 | P170099 | P170177 | P170209 | P170492 |
| Assemblied reads | 3745800 | 2103300 | 2272793 | 2108293 | 1927500 |
| Mean coverage | 3727.5 | 2092.0 | 2262.4 | 2097.8 | 1918.3 |
| Size (bp) | 150738 | 150811 | 150689 | 150750 | 150720 |
| LSC length (bp) | 82844 | 82919 | 82796 | 82843 | 82834 |
| SSC length (bp) | 17092 | 17090 | 17091 | 17091 | 17084 |
| IR length (bp) | 25401 | 25401 | 25401 | 25408 | 25401 |
| CDSs total length | 78714 | 78714 | 78714 | 78747 | 78717 |
| Number of total genes | 114 | 114 | 114 | 114 | 114 |
| Number of Protein-coding genes | 80 | 80 | 80 | 80 | 80 |
| Number of tRNA genes | 30 | 30 | 30 | 30 | 30 |
| Number of rRNA genes | 4 | 4 | 4 | 4 | 4 |
| Overall GC content (%) | 38.0% | 38.0% | 38.0% | 38.0% | 38.0% |
| GC content in LSC (%) | 36.0% | 36.0% | 36.0% | 36.0% | 36.0% |
| GC content in SSC (%) | 31.9% | 31.7% | 31.9% | 31.9% | 31.6% |
| GC content in IR (%) | 43.3% | 43.4% | 43.3% | 43.3% | 43.4% |
| Genbank accession number |
|
|
|
|
|
Figure 1Gene maps of chloroplast genomes.
(A) Dicliptera acuminata; (B) D. peruviana; (C) D. montana; (D) D. ruiziana; (E) D. mucronata. Genes shown outside of the circle are transcribed clockwise, whereas genes inside of the circle are transcribed counterclockwise. The colored bars indicate known protein-coding genes, tRNA and rRNA. The dark gray area in the inner circle indicates GC content, while the light gray area indicates AT content. LSC, large single copy; SSC, small single copy; IR, inverted repeats.
Figure 2Comparison of the border regions of the LSC, SSC and IR among six Acanthaceae chloroplast genomes.
The IRb/SSC junction extended into the ycf1 genes creating various lengths of ycf1 pseudogenes (Ψycf1) among the six cp genomes. The number above, below or adjacent to genes shows the distance between the ends of genes and the boundary sites. The figure features are not to scale.
Figure 3Comparison of the five Dicliptera chloroplast genomes using mVISTA.
CNS indicates conserved noncoding sequences. The Y-scale represents the percent identity between 50% and 100%.
Figure 4Comparative analysis of the nucleotide diversity (Pi) value among five Dicliptera chloroplast genomes.
(A) Coding regions. (B) Non-coding regions.
Figure 5Amino acid frequencies in five Dicliptera species protein-coding sequences.
As shown in the column diagram, Leucine was the most frequent amino acid (10.8%), Cysteine was the least (1.2%).
Figure 6The type and presence of simple sequence repeats (SSRs) and long repeated sequences in the chloroplast genomes of five Dicliptera species.
(A) Percentage of SSR types; (B) number of SSRs and their types; (C) percentage of five repeat types; (D) number of five repeats types.
The polymorphic SSRs among five Dicliptera species.
LSC, large single copy; SSC, small single copy.
| Type | Location | Regions | |
|---|---|---|---|
| A | 11/11/10/11/13 | LSC | |
| T | 0/0/0/10/0 | LSC | |
| T | 13/13/14/13/13 | LSC | |
| A | 12/11/11/10/11 | LSC | |
| A | 10/10/10/0/0 | LSC | |
| TA | 0/0/0/0/8 | LSC | |
| AT | 0/9/0/0/0 | LSC | |
| T | 13/10/12/16/14 | LSC | |
| G | 11/11/12/9/8 | LSC | |
| T | 10/10/10/10/11 | LSC | |
| ATAA | 3/3/3/6/3 | LSC | |
| TA | 6/7/7/7/7 | LSC | |
| T | 12/11/13/13/11 | LSC | |
| T | 12/11/11/12/11 | LSC | |
| T | 11/10/11/11/10 | LSC | |
| T | 11/10/10/10/0 | LSC | |
| TA | 7/6/6/6/6 | LSC | |
| T | 10/10/10/10/11 | SSC | |
| A | 10/11/10/10/10 | SSC | |
| G | 11/0/11/11/10 | SSC | |
| A | 11/10/10/10/0 | SSC |
Figure 7The maximum Likelihood (ML) tree of Acanthaceae.
Numbers associated with branches are ML bootstrap values, MP bootstrap values and Bayesian posterior probabilities, respecively. Hyphens indicate the bootstrap support or posterior probability lower than 50% or 0.5. Mentha spicata (NC_037247) and Sesamum indicum (NC_016433) were used as outgroups.
Figure 8Synonymous (KS) substitution rates and ω values (ω= KA/KS) among all Acanthaceae species.
As shown in the column diagram, the order of genes is alphabetical. The ω value of the ycf15 gene (1.4453) is clearly higher than other genes.