| Literature DB >> 29770326 |
Cornelius M Kyalo1,2,3, Andrew W Gichira1,2,3, Zhi-Zhong Li1,2,3, Josphat K Saina1,2,3, Itambo Malombe4, Guang-Wan Hu1,3, Qing-Feng Wang1,3.
Abstract
Streptocarpus teitensis (Gesneriaceae) is an endemic species listed as critically endangered in the International Union for Conservation of Nature (IUCN) red list of threatened species. However, the sequence and genome information of this species remains to be limited. In this article, we present the complete chloroplast genome structure of Streptocarpus teitensis and its evolution inferred through comparative studies with other related species. S. teitensis displayed a chloroplast genome size of 153,207 bp, sheltering a pair of inverted repeats (IR) of 25,402 bp each split by small and large single-copy (SSC and LSC) regions of 18,300 and 84,103 bp, respectively. The chloroplast genome was observed to contain 116 unique genes, of which 80 are protein-coding, 32 are transfer RNAs, and four are ribosomal RNAs. In addition, a total of 196 SSR markers were detected in the chloroplast genome of Streptocarpus teitensis with mononucleotides (57.1%) being the majority, followed by trinucleotides (33.2%) and dinucleotides and tetranucleotides (both 4.1%), and pentanucleotides being the least (1.5%). Genome alignment indicated that this genome was comparable to other sequenced members of order Lamiales. The phylogenetic analysis suggested that Streptocarpus teitensis is closely related to Lysionotus pauciflorus and Dorcoceras hygrometricum.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29770326 PMCID: PMC5889905 DOI: 10.1155/2018/1507847
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Comparison of the features of four Gesneriaceae chloroplast genomes.
| Feature |
|
|
|
|
|---|---|---|---|---|
| Genome size (bp) | 153,207 | 153,493 | 153,856 | 153,099 |
| Large single copy (bp) | 84,103 | 84,692 | 85,087 | 84,443 |
| Small single copy (bp) | 18,300 | 17,901 | 17,839 | 17,826 |
| Inverted repeats (bp) | 25,402 | 25,450 | 25,465 | 25,415 |
| GC content in LSC (%) | 35.5 | 35.6 | 35.4 | 35.7 |
| GC content in SSC (%) | 31.4 | 36.4 | 36.6 | 31.7 |
| GC content in IR (%) | 43.2 | 40.7 | 40.6 | 43.3 |
| Overall AT content (%) | 62.4 | 62.4 | 62.5 | 62.2 |
| Overall GC content (%) | 37.6 | 37.6 | 37.5 | 37.8 |
Comparison of the features of Streptocarpus teitensis with other 11 Lamiales chloroplast genomes.
| Species | Family | LSC (bp) | SSC (bp) | IR (bp) | Total (bp) | CG content% | Accession number |
|---|---|---|---|---|---|---|---|
|
| Gesneriaceae | 84,103 | 18,300 | 25,402 | 153,207 | 37.58 | MF596485 |
|
| Orobanchaceae | 49,130 | 8,819 | 22,354 | 102,657 | 36.8 | KC128846 |
|
| Pedaliaceae | 85,170 | 17,872 | 25,141 | 153,324 | 38 | JN637766 |
|
| Lamiaceae | 86,078 | 17,689 | 25,763 | 155,293 | 37.9 | KM981744 |
|
| Lamiaceae | 82,695 | 17,555 | 25,539 | 151,328 | 38 | JX312195 |
|
| Lamiaceae | 83,136 | 17,745 | 25,527 | 151,935 | 38 | JX880022 |
|
| Lentibulariaceae | 82,720 | 17,481 | 25,325 | 150,851 | 37.32 | KY025562 |
|
| Acanthaceae | 82,459 | 17,190 | 25,300 | 150,249 | 38.3 | NC022451 |
|
| Bignoniaceae | 84,612 | 17,586 | 25,789 | 153,776 | 38.3 | KR534325 |
|
| Paulowniaceae | 85,241 | 17,736 | 25,784 | 154,545 | 38 | KP718622 |
|
| Scrophulariaceae | 84,058 | 17,449 | 25,523 | 152,553 | 38 | KT428154 |
|
| Oleaceae | 92,877 | 13,272 | 29,486 | 165,121 | 38 | NC_008407 |
Figure 3Mauve multiple alignment of 15 Lamiales, with Nicotiana tabacum set as the reference genome. Color-coded segments indicate regions that shared same genes across different species' genomes. The extent of sequence similarities is indicated by the colored parts inside each region. Lines connect regions with homologous sequences among two genomes.
Figure 2Comparison of SSR repeats (a) and AT repeats richness (b) among four Gesneriaceae species.
Figure 1The gene map of the chloroplast genome of Streptocarpus teitensis. Genes drawn inside the map are transcribed clockwise, while genes drawn outside are transcribed counterclockwise. Different colors represent genes of different functional groups. Inverted repeats (IRA and IRB) are marked by the dark bold lines; GC and AT contents are, respectively, represented by the dark and light grey colors inside the map.
The functional classification of genes found in Streptocarpus teitensis chloroplast genome.
| Function | Group of genes | Gene names |
|---|---|---|
| Photosynthesis | Photosystem 1 |
|
| Photosystem 11 |
| |
| NADH dehydrogenase |
| |
| ATP synthase |
| |
| Cytochrome b/f complex |
| |
| RubisCO large subunit |
| |
|
| ||
| Self-replication | RNA polymerase |
|
| Ribosomal proteins (Large Sub-unit) |
| |
| Ribosomal proteins (small subunit) |
| |
| Ribosomal RNAs |
| |
| Transfer RNAs |
| |
|
| ||
|
| ||
|
| ||
|
| ||
| Proteins of unknown function |
| |
|
| ||
| Other genes | Protease |
|
| Maturase |
| |
| Translational initiation factor |
| |
| Envelope membrane protein |
| |
| Subunit of acetyl-CoA-carboxylase |
| |
| c-type cytochrome synthesis |
| |
∗ marks genes with one intron; ∗∗ marks genes with two introns; (x2) shows genes with duplicates.
The genes with introns in the Streptocarpus teitensis chloroplast genome and the length of the exons and introns.
| Gene | Region | Exon 1 (bp) | Intron 1 (bp) | Exon 2 (bp) | Intron 2 (bp) | Exon 3 (bp) |
|---|---|---|---|---|---|---|
|
| LSC | 472 | 633 | 144 | ||
|
| LSC | 207 | 1636 | 48 | ||
|
| IR | 435 | 670 | 393 | ||
|
| LSC | 1620 | 784 | 456 | ||
|
| SSC | 540 | 1069 | 552 | ||
|
| IR | 756 | 679 | 777 | ||
|
| IR | 38 | 834 | 35 | ||
|
| IR | 35 | 935 | 42 | ||
|
| LSC | 35 | 2493 | 37 | ||
|
| LSC | 23 | 704 | 37 | ||
|
| LSC | 37 | 469 | 50 | ||
|
| LSC | 37 | 575 | 38 | ||
|
| LSC | 150 | 712 | 228 | 688 | 129 |
|
| LSC | 234 | 631 | 297 | 819 | 69 |
The AT and GC% in different regions of Streptocarpus teitensis cp genome.
| Region | Length (bp) | A (%) | T (%) | G (%) | C (%) | GC (%) |
|---|---|---|---|---|---|---|
| LSC | 84,103 | 31.57 | 32.88 | 17.36 | 18.18 | 35.54 |
| SSC | 18,300 | 34.13 | 34.5 | 15.15 | 16.22 | 31.37 |
| IRA | 25,402 | 28.46 | 28.34 | 22.43 | 20.77 | 43.2 |
| IRB | 25,402 | 28.34 | 28.45 | 20.77 | 22.43 | 43.2 |
| Total genome | 153,207 | 30.82 | 31.59 | 18.5 | 19.08 | 37.58 |
Number of SSR repeats in Streptocarpus teitensis chloroplast genome.
| Repeat sequences | Number of repeats | Total | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | ||
| A/T | 55 | 33 | 8 | 5 | 4 | 1 | 106 | |||||
| C/G | 4 | 2 | 6 | |||||||||
| AG/CT | 2 | 2 | ||||||||||
| AT/AT | 4 | 1 | 1 | 6 | ||||||||
| AAC/GTT | 10 | 10 | ||||||||||
| AAG/CTT | 20 | 1 | 21 | |||||||||
| AAT/ATT | 18 | 1 | 19 | |||||||||
| AGC/GCT | 6 | 6 | ||||||||||
| AGG/CCT | 4 | 4 | ||||||||||
| ATC/GAT | 5 | 5 | ||||||||||
| AAAT/ATTT | 3 | 3 | ||||||||||
| AATC/GATT | 2 | 2 | ||||||||||
| AATT/AATT | 1 | 1 | ||||||||||
| AGAT/ATCT | 2 | 2 | ||||||||||
| AAAAG/CTTTT | 2 | 2 | ||||||||||
| AATTC/GAATT | 1 | 1 | ||||||||||
| Total | 196 |
Figure 4Comparison of LSC, SSC, and IR junctions among Gesneriaceae species. Ψ indicates a pseudogene. Genes above the regions are direct while genes below are reverse.
Figure 5Maximum Likelihood phylogenetic tree of 14 species of Lamiales, and two other Asterid species as outgroups based on 67 chloroplast genome protein-coding genes' sequences.