| Literature DB >> 33911961 |
Dhafer Alzahrani1, Enas Albokhari1,2, Samaila Yaradua1,3, Abidina Abba1,4.
Abstract
This current study presents, for the first time, the complete chloroplast genome of two Cleomaceae species: Dipterygium glaucum and Cleome chrysantha in order to evaluate the evolutionary relationship. The cp genome is 158,576 bp in length with 35.74% GC content in D. glaucum and 158,111 bp with 35.96% GC in C. chrysantha. Inverted repeats IR 26,209 bp, 26,251 bp each, LSC of 87,738 bp, 87,184 bp and SSC of 18,420 bp, 18,425 bp respectively. There are 136 genes in the genome, which includes 80 protein coding genes, 31 tRNA genes and four rRNA genes were observed in both chloroplast genomes. 117 genes are unique while the remaining 19 genes are duplicated in IR regions. The analysis of repeats shows that the cp genome includes all types of repeats with more frequent occurrences of palindromic; Also, this analysis indicates that the total number of simple sequence repeats (SSR) were 323 in D. glaucum, and 313 in C. chrysantha, of which the majority of the SSRs in these plastid genomes were mononucleotide repeats A/T which are located in the intergenic spacer. Moreover, the comparative analysis of the four cp sequences revealed four hotspot genes (atpF, rpoC2, rps19, and ycf1), these variable regions could be used as molecular makers for the species authentication as well as resources for inferring phylogenetic relationships of the species. All the relationships in the phylogenetic tree are with high support, this indicate that the complete chloroplast genome is a useful data for inferring phylogenetic relationship within the Cleomaceae and other families. The simple sequence repeats identified will be useful for identification, genetic diversity, and other evolutionary studies of the species. This study reported the first cp genome of the genus Dipterygium and Cleome. The finding of this study will be beneficial for biological disciplines such as evolutionary and genetic diversity studies of the species within the core Cleomaceae.Entities:
Keywords: Chloroplast genome (cp); Cleomaceae; Cleome chrysantha; DNA, Deoxyribonucleic acid; Dipterygium glaucum; IR, Inverted repeat; LSC, Large single copy region; SSC, Small single copy region; SSR; SSR, Simple sequence repeats
Year: 2021 PMID: 33911961 PMCID: PMC8071925 DOI: 10.1016/j.sjbs.2021.01.049
Source DB: PubMed Journal: Saudi J Biol Sci ISSN: 2213-7106 Impact factor: 4.219
Fig. 1Chloroplast genome maps of the two Cleomaceae (A) D. glaucum and (B) C. chrysantha. Genes inside of the circles are transcribed clockwise, while genes outside circles are transcribed counterclockwise. The light grey inner circle and the dark grey corresponds to the AT and GC content respectively. The different colours represent the different functional groups of the genes.
Bases content in the D. glaucum and C. chrysantha chloroplast genomes.
| Species | D. glaucum | C. chrysantha |
|---|---|---|
| Genome size (bp) | 158,576 | 158,111 |
| IR (bp) | 26,209 | 26,251 |
| LSC (bp) | 87,738 | 87,184 |
| SSC (bp) | 18,420 | 18,425 |
| Total number of genes | 136 | 136 |
| rRNA | 4 | 4 |
| tRNA | 31 | 31 |
| Protein-coding genes | 80 | 80 |
| T (U) % | 32.57 | 32.48 |
| C % | 18.2 | 18.32 |
| A % | 31.66 | 31.54 |
| G % | 17.54 | 17.64 |
Gene contents in the chloroplast genomes of D. glaucum and C. chrysantha.
| Category | Group of genes | Name of genes |
|---|---|---|
| RNA genes | ribosomal RNA genes (rRNA) | |
| Transfer RNA genes (tRNA) | ||
| Ribosomal proteins | Small subunit of ribosome | |
| Transcription | Large subunit of ribosome | |
| DNA dependent RNA polymerase | ||
| Translational initiation factor | ||
| Protein genes | Photosystem I | |
| Photosystem II | ||
| Subunit of cytochrome | ||
| Subunit of synthase | ||
| Large subunit of rubisco | ||
| NADH dehydrogenase | ||
| ATP dependent protease subunit P | ||
| Chloroplast envelope membrabe protein | ||
| Other genes | Maturase | |
| Subunit acetyl-coA carboxylase | ||
| C-type cytochrome systhesis | ||
| Hypothetical proteins | ||
| Component of TIC complex |
*(A) D. glaucum cp genome; (B) C. chrysantha cp genome.
Gene with one intron, ++ Gene with two intron and a Gene with copies.
Codon – anticodon recognition patterns and codon usage of D. glaucum chloroplast genome.
| Codon | Amino Acid | RSCU | tRNA | Codon | Amino Acid | RSCU | tRNA |
|---|---|---|---|---|---|---|---|
| UUU | Phe | 1.3 | UAU | Tyr | 1.45 | ||
| UUC | Phe | 0.7 | UAC | Tyr | 0.55 | ||
| UUA | Leu | 1.76 | UAA | Stop | 1.08 | ||
| UUG | Leu | 1.15 | UAG | Stop | 0.7 | ||
| CUU | Leu | 1.28 | CAU | His | 1.5 | ||
| CUC | Leu | 0.52 | CAC | His | 0.5 | ||
| CUA | Leu | 0.86 | CAA | Gln | 1.52 | ||
| CUG | Leu | 0.42 | CAG | Gln | 0.48 | ||
| AUU | Ile | 1.46 | AAU | Asn | 1.43 | ||
| AUC | Ile | 0.73 | AAC | Asn | 0.57 | ||
| AUA | Ile | 0.81 | AAA | Lys | 1.48 | ||
| AUG | Met | 1 | AAG | Lys | 0.52 | ||
| GUU | Val | 1.45 | GAU | Asp | 1.51 | ||
| GUC | Val | 0.73 | GAC | Asp | 0.49 | ||
| GUA | Val | 1.27 | GAA | Glu | 1.45 | ||
| GUG | Val | 0.54 | GAG | Glu | 0.55 | ||
| UCU | Ser | 1.44 | UGU | Cys | 1.2 | ||
| UCC | Ser | 0.92 | UGC | Cys | 0.8 | ||
| UCA | Ser | 1.3 | UGA | Stop | 1.22 | ||
| UCG | Ser | 0.69 | UGG | Trp | 1 | ||
| CCU | Pro | 1.4 | CGU | Arg | 0.83 | ||
| CCC | Pro | 0.78 | CGC | Arg | 0.32 | ||
| CCA | Pro | 1.19 | CGA | Arg | 1.12 | ||
| CCG | Pro | 0.63 | CGG | Arg | 0.49 | ||
| ACU | Thr | 1.29 | AGA | Arg | 0.99 | ||
| ACC | Thr | 0.86 | AGG | Arg | 0.65 | ||
| ACA | Thr | 1.23 | AGU | Ser | 2.07 | trnS-GCU | |
| ACG | Thr | 0.61 | AGC | Ser | 1.18 | ||
| GCU | Ala | 1.62 | GGU | Gly | 1.09 | ||
| GCC | Ala | 0.65 | GGC | Gly | 0.51 | ||
| GCA | Ala | 1.24 | GGA | Gly | 1.54 | ||
| GCG | Ala | 0.49 | GGG | Gly | 0.86 | trnG-UCC |
Codon – anticodon recognition patterns and codon usage of C. chrysantha chloroplast genome.
| Codon | Amino Acid | RSCU | tRNA | Codon | Amino Acid | RSCU | tRNA |
|---|---|---|---|---|---|---|---|
| UUU | Phe | 1.31 | UAU | Tyr | 1.6 | ||
| UUC | Phe | 0.69 | UAC | Tyr | 0.4 | ||
| UUA | Leu | 1.88 | UAA | Stop | 0.86 | ||
| UUG | Leu | 1.16 | UAG | Stop | 1.48 | ||
| CUU | Leu | 1.25 | CAU | His | 1.51 | ||
| CUC | Leu | 0.46 | CAC | His | 0.49 | ||
| CUA | Leu | 0.84 | CAA | Gln | 1.55 | ||
| CUG | Leu | 0.41 | CAG | Gln | 0.45 | ||
| AUU | Ile | 1.48 | AAU | Asn | 1.54 | ||
| AUC | Ile | 0.61 | AAC | Asn | 0.46 | ||
| AUA | Ile | 0.92 | AAA | Lys | 1.49 | ||
| AUG | Met | 1 | AAG | Lys | 0.51 | ||
| GUU | Val | 1.42 | GAU | Asp | 1.61 | ||
| GUC | Val | 0.55 | GAC | Asp | 0.39 | ||
| GUA | Val | 1.43 | GAA | Glu | 1.47 | ||
| GUG | Val | 0.6 | GAG | Glu | 0.53 | ||
| UCU | Ser | 1.7 | UGU | Cys | 1.45 | trnC-GCA | |
| UCC | Ser | 0.92 | UGC | Cys | 0.55 | ||
| UCA | Ser | 1.21 | UGA | Stop | 0.67 | ||
| UCG | Ser | 0.58 | UGG | Trp | 1 | trnW-CCA | |
| CCU | Pro | 1.58 | CGU | Arg | 1.33 | trnR-ACG | |
| CCC | Pro | 0.78 | CGC | Arg | 0.4 | trnR-UCU | |
| CCA | Pro | 1.14 | CGA | Arg | 1.38 | ||
| CCG | Pro | 0.5 | CGG | Arg | 0.48 | ||
| ACU | Thr | 1.63 | AGA | Arg | 1.71 | ||
| ACC | Thr | 0.74 | AGG | Arg | 0.7 | ||
| ACA | Thr | 1.21 | AGU | Ser | 1.17 | trnS-GCU | |
| ACG | Thr | 0.42 | AGC | Ser | 0.42 | ||
| GCU | Ala | 1.85 | GGU | Gly | 1.3 | ||
| GCC | Ala | 0.59 | GGC | Gly | 0.37 | ||
| GCA | Ala | 1.1 | GGA | Gly | 1.66 | ||
| GCG | Ala | 0.45 | GGG | Gly | 0.67 | trnG-UCC |
Fig. 2Amino acids frequencies in D. glaucum and C. chrysantha chloroplast genome protein coding sequences.
Predicted RNA editing site in the D. glaucum chloroplast genome.
| Gene | Nucleotide Position | Amino Acid Position | Codon | Amino Acid | Score |
|---|---|---|---|---|---|
| 791 | 264 | TCG => TTG | S => L | 0.8 | |
| 464 | 155 | GCT => GTT | A => V | 1 | |
| 493 | 165 | CAT => TAT | H => Y | 1 | |
| 706 | 236 | CAT => TAT | H => Y | 1 | |
| 1250 | 417 | TCA => TTA | S => L | 0.86 | |
| 1309 | 437 | CAC => TAC | H => Y | 1 | |
| 125 | 42 | ACA => ATA | T => I | 0.8 | |
| 341 | 114 | TCA => TTA | S => L | 1 | |
| 149 | 50 | TCA => TTA | S => L | 1 | |
| 467 | 156 | CCA => CTA | P => L | 1 | |
| 586 | 196 | CAT => TAT | H => Y | 1 | |
| 611 | 204 | TCA => TTA | S => L | 0.8 | |
| 746 | 249 | TCT => TTT | S => F | 1 | |
| 830 | 277 | TCA => TTA | S => L | 1 | |
| 836 | 279 | TCA => TTA | S => L | 1 | |
| 1255 | 419 | CAT => TAT | H => Y | 1 | |
| 1481 | 494 | CCA => CTA | P => L | 1 | |
| 1526 | 509 | CCT => CTT | P => L | 1 | |
| 65 | 22 | TCT => TTT | S => F | 0.8 | |
| 401 | 134 | TCA => TTA | S => L | 1 | |
| 692 | 231 | TCG => TTG | S => L | 1 | |
| 896 | 299 | TCA => TTA | S => L | 1 | |
| 905 | 302 | CCC => CTC | P => L | 1 | |
| 1328 | 443 | TCA => TTA | S => L | 0.8 | |
| 1423 | 475 | CTT => TTT | L => F | 0.8 | |
| 205 | 69 | CAT => TAT | H => Y | 0.8 | |
| 290 | 97 | TCA => TTA | S => L | 1 | |
| 586 | 196 | CTT => TTT | L => F | 0.8 | |
| 166 | 56 | CAT => TAT | H => Y | 0.8 | |
| 314 | 105 | ACA => ATA | T => I | 0.8 | |
| 452 | 151 | ACA => ATA | T => I | 1 | |
| 214 | 72 | CCT => TCT | P => S | 1 | |
| 77 | 26 | TCT => TTT | S => F | 1 | |
| 338 | 113 | TCT => TTT | S => F | 1 | |
| 551 | 184 | TCA => TTA | S => L | 1 | |
| 2432 | 811 | TCA => TTA | S => L | 0.86 | |
| 41 | 14 | TCA => TTA | S => L | 1 | |
| 1943 | 648 | ACT => ATT | T => I | 0.86 | |
| 2335 | 779 | GCC => GTC | A => V | 0.86 | |
| 80 | 27 | TCA => TTA | S => L | 1 | |
| 176 | 59 | TCA => TTA | S => L | 0.83 |
Predicted RNA editing site in the C. chrysantha chloroplast genome.
| Gene | Nucleotide Position | Amino Acid Position | Codon | Amino Acid | Score |
|---|---|---|---|---|---|
| 791 | 264 | TCG => TTG | S => L | 0.8 | |
| 1400 | 467 | CCT => CTT | P => L | 1 | |
| 167 | 56 | GCT => GTT | A => V | 1 | |
| 196 | 66 | CAT => TAT | H => Y | 1 | |
| 709 | 237 | CAT => TAT | H => Y | 1 | |
| 1253 | 418 | TCA => TTA | S => L | 0.86 | |
| 1312 | 438 | CAC => TAC | H => Y | 1 | |
| 149 | 50 | TCA => TTA | S => L | 1 | |
| 467 | 156 | CCA => CTA | P => L | 1 | |
| 586 | 196 | CAT => TAT | H => Y | 1 | |
| 611 | 204 | TCA => TTA | S => L | 0.8 | |
| 746 | 249 | TCT => TTT | S => F | 1 | |
| 20 | 7 | ACG => ATG | T => M | 1 | |
| 401 | 134 | TCA => TTA | S => L | 1 | |
| 692 | 231 | TCA => TTA | S => L | 1 | |
| 896 | 299 | TCA => TTA | S => L | 1 | |
| 905 | 302 | CCC => CTC | P => L | 1 | |
| 1328 | 443 | TCA => TTA | S => L | 0.8 | |
| 1423 | 475 | CTT => TTT | L => F | 0.8 | |
| 205 | 69 | CAT => TAT | H => Y | 0.8 | |
| 290 | 97 | TCA => TTA | S => L | 1 | |
| 586 | 196 | CTT => TTT | L => F | 0.8 | |
| 166 | 56 | CAT => TAT | H => Y | 0.8 | |
| 314 | 105 | ACA => ATA | T => I | 0.8 | |
| 214 | 72 | CCT => TCT | P => S | 1 | |
| 77 | 26 | TCT => TTT | S => F | 1 | |
| 338 | 113 | TCT => TTT | S => F | 1 | |
| 551 | 184 | TCA => TTA | S => L | 1 | |
| 566 | 189 | TCG => TTG | S => L | 1 | |
| 1981 | 561 | CCC => TCC | P => S | 0.86 | |
| 2434 | 812 | TCA => TTA | S => L | 0.86 | |
| 1517 | 506 | ACT => ATT | T => I | 0.86 | |
| 2342 | 781 | GCC => GTC | A => V | 0.86 | |
| 80 | 27 | TCA => TTA | S => L | 1 | |
| 176 | 59 | TCA => TTA | S => L | 0.83 |
Fig. 3Number of different repeats in the chloroplast genomes of the four Cleomaceae species. P = palindromic, F = forward, R = reverse and C = complement.
Repeat sequences present in the D. glaucum chloroplast genome.
| SN | Repeat Size | Repeat Position 1 | Repeat Type | Repeat Location | Repeat Position 2 | Repeat Location 2 | E-Value |
|---|---|---|---|---|---|---|---|
| 1 | 55 | 0 | P | IGS | 87,683 | 5.45E-24 | |
| 2 | 52 | 30,403 | P | IGS | 30,403 | IGS | 3.49E-22 |
| 3 | 47 | 40,642 | F | 42,866 | 3.57E-19 | ||
| 4 | 39 | 45,750 | F | 102,076 | IGS | 2.34E-14 | |
| 5 | 39 | 45,750 | P | 144,199 | IGS | 2.34E-14 | |
| 6 | 34 | 9240 | R | IGS | 9240 | IGS | 2.40E-11 |
| 7 | 34 | 44,864 | P | 44,864 | 2.40E-11 | ||
| 8 | 31 | 9248 | R | IGS | 9248 | IGS | 1.53E-09 |
| 9 | 30 | 8750 | P | IGS | 8750 | IGS | 6.13E-09 |
| 10 | 29 | 30,292 | F | IGS | 30,321 | IGS | 2.45E-08 |
| 11 | 29 | 95,068 | F | 95,086 | 2.45E-08 | ||
| 12 | 29 | 95,068 | P | 151,199 | 2.45E-08 | ||
| 13 | 29 | 95,086 | P | 151,217 | 2.45E-08 | ||
| 14 | 29 | 151,199 | F | 151,217 | 2.45E-08 | ||
| 15 | 28 | 8507 | P | 46,871 | 9.81E-08 | ||
| 16 | 26 | 9240 | F | IGS | 9253 | IGS | 1.57E-06 |
| 17 | 26 | 10,325 | P | IGS | 10,357 | IGS | 1.57E-06 |
| 18 | 25 | 38,275 | R | IGS | 38,275 | IGS | 6.28E-06 |
| 19 | 25 | 128,760 | R | 128,760 | 6.28E-06 | ||
| 20 | 24 | 8760 | R | IGS | 8760 | IGS | 2.51E-05 |
| 21 | 24 | 32,513 | P | IGS | 32,513 | IGS | 2.51E-05 |
| 22 | 24 | 44,474 | P | IGS | 73,687 | 2.51E-05 | |
| 23 | 24 | 127,804 | P | IGS | 127,804 | IGS | 2.51E-05 |
| 24 | 23 | 3434 | P | 91,728 | 1.01E-04 | ||
| 25 | 23 | 3434 | F | 154,563 | 1.01E-04 | ||
| 26 | 23 | 8872 | F | IGS | 8895 | IGS | 1.01E-04 |
| 27 | 23 | 10,643 | R | IGS | 10,643 | IGS | 1.01E-04 |
| 28 | 23 | 30,247 | F | IGS | 30,268 | IGS | 1.01E-04 |
| 29 | 23 | 44,479 | R | IGS | 44,479 | IGS | 1.01E-04 |
| 30 | 23 | 92,588 | F | 92,612 | 1.01E-04 | ||
| 31 | 23 | 92,588 | P | 153,679 | 1.01E-04 | ||
| 32 | 23 | 92,612 | P | 153,703 | 1.01E-04 | ||
| 33 | 23 | 153,679 | F | 153,703 | 1.01E-04 | ||
| 34 | 22 | 10,745 | F | IGS | 10,767 | IGS | 4.02E-04 |
| 35 | 22 | 15,273 | C | IGS | 28,813 | IGS | 4.02E-04 |
| 36 | 22 | 43,907 | P | IGS | 96,899 | 4.02E-04 | |
| 37 | 22 | 43,907 | F | IGS | 149,393 | IGS | 4.02E-04 |
| 38 | 22 | 44,898 | P | 44,898 | 4.02E-04 | ||
| 39 | 22 | 48,054 | F | IGS | 48,074 | IGS | 4.02E-04 |
| 40 | 22 | 63,409 | P | IGS | 63,409 | IGS | 4.02E-04 |
| 41 | 22 | 103,201 | R | IGS | 103,201 | IGS | 4.02E-04 |
| 42 | 22 | 103,201 | C | IGS | 143,091 | IGS | 4.02E-04 |
| 43 | 22 | 113,539 | P | 113,539 | 4.02E-04 | ||
| 44 | 22 | 113,539 | F | 132,753 | 4.02E-04 | ||
| 45 | 22 | 132,753 | P | 132,753 | 4.02E-04 | ||
| 46 | 22 | 143,091 | R | IGS | 143,091 | IGS | 4.02E-04 |
| 47 | 21 | 8511 | F | 37,322 | 1.61E-03 | ||
| 48 | 21 | 8723 | R | IGS | 8723 | IGS | 1.61E-03 |
| 49 | 21 | 9240 | R | IGS | 9240 | IGS | 1.61E-03 |
Repeat sequences present in the C. chrysantha chloroplast genome.
| SN | Repeat Size | Repeat Position 1 | Repeat Type | Repeat Location | Repeat Position 2 | Repeat Location 2 | E-Value |
|---|---|---|---|---|---|---|---|
| 1 | 52 | 30,002 | P | IGS | 30,002 | IGS | 3.47E-22 |
| 2 | 47 | 40,405 | F | 42,629 | 3.55E-19 | ||
| 3 | 38 | 29,603 | P | IGS | 29,603 | IGS | 9.31E-14 |
| 4 | 34 | 32,963 | P | IGS | 32,963 | IGS | 2.38E-11 |
| 5 | 33 | 117,125 | F | IGS | 117,141 | IGS | 9.53E-11 |
| 6 | 29 | 92,074 | F | 92,098 | 2.44E-08 | ||
| 7 | 29 | 92,074 | P | 153,124 | 2.44E-08 | ||
| 8 | 29 | 92,098 | P | 153,148 | 2.44E-08 | ||
| 9 | 29 | 118,905 | F | IGS | 118,921 | IGS | 2.44E-08 |
| 10 | 29 | 153,124 | F | 153,148 | 2.44E-08 | ||
| 11 | 28 | 8401 | P | 46,575 | 9.76E-08 | ||
| 12 | 27 | 37,820 | R | IGS | 37,820 | IGS | 3.90E-07 |
| 13 | 26 | 79 | P | IGS | 79 | IGS | 1.56E-06 |
| 14 | 26 | 9949 | P | IGS | 9981 | IGS | 1.56E-06 |
| 15 | 26 | 116,265 | P | IGS | 116,265 | IGS | 1.56E-06 |
| 16 | 24 | 28,548 | P | IGS | 28,548 | IGS | 2.50E-05 |
| 17 | 24 | 32,078 | P | IGS | 32,078 | IGS | 2.50E-05 |
| 18 | 24 | 58,529 | P | IGS | 58,558 | IGS | 2.50E-05 |
| 19 | 24 | 76,983 | P | IGS | 77,008 | IGS | 2.50E-05 |
| 20 | 24 | 116,687 | F | IGS | 116,710 | IGS | 2.50E-05 |
| 21 | 23 | 4642 | F | IGS | 4708 | IGS | 9.99E-05 |
| 22 | 23 | 46,656 | F | IGS | 46,676 | IGS | 9.99E-05 |
| 23 | 23 | 97,249 | F | IGS | 97,271 | IGS | 9.99E-05 |
| 24 | 23 | 97,249 | P | IGS | 147,957 | IGS | 9.99E-05 |
| 25 | 23 | 97,271 | P | IGS | 147,979 | IGS | 9.99E-05 |
| 26 | 23 | 147,957 | F | IGS | 147,979 | IGS | 9.99E-05 |
| 27 | 22 | 38,011 | P | IGS | 38,011 | IGS | 4.00E-04 |
| 28 | 22 | 67,702 | F | IGS | 67,724 | IGS | 4.00E-04 |
| 29 | 22 | 113,002 | P | 113,002 | 4.00E-04 | ||
| 30 | 22 | 113,002 | F | 132,227 | 4.00E-04 | ||
| 31 | 22 | 117,470 | R | IGS | 117,470 | IGS | 4.00E-04 |
| 32 | 22 | 132,227 | P | 132,227 | 4.00E-04 | ||
| 33 | 21 | 314 | P | IGS | 363 | IGS | 1.60E-03 |
| 34 | 21 | 4878 | R | IGS | 4878 | IGS | 1.60E-03 |
| 35 | 21 | 5114 | C | IGS | 32,753 | IGS | 1.60E-03 |
| 36 | 21 | 5141 | F | IGS | 5166 | IGS | 1.60E-03 |
| 37 | 21 | 8068 | C | IGS | 30,261 | IGS | 1.60E-03 |
| 38 | 21 | 8405 | F | 37,002 | 1.60E-03 | ||
| 39 | 21 | 37,002 | P | 46,578 | 1.60E-03 | ||
| 40 | 21 | 38,404 | F | 69,049 | 1.60E-03 | ||
| 41 | 21 | 56,602 | F | IGS | 56,623 | IGS | 1.60E-03 |
| 42 | 21 | 74,400 | R | IGS | 74,400 | IGS | 1.60E-03 |
| 43 | 21 | 117,116 | R | IGS | 117,116 | IGS | 1.60E-03 |
| 44 | 21 | 119,109 | R | IGS | 119,109 | IGS | 1.60E-03 |
| 45 | 20 | 160 | F | IGS | 8149 | 6.39E-03 | |
| 46 | 20 | 4063 | F | 23,615 | 6.39E-03 | ||
| 47 | 20 | 4063 | P | 28,441 | IGS | 6.39E-03 | |
| 48 | 20 | 4064 | F | 111,518 | IGS | 6.39E-03 | |
| 49 | 20 | 4064 | P | 133,713 | IGS | 6.39E-03 |
The SSRs in four chloroplast genomes of Cleomaceae.
| SSR type | Repeat unit | Species | |||
|---|---|---|---|---|---|
| Mono | A/T | 277 | 268 | 212 | 282 |
| C/G | 5 | 4 | 7 | 4 | |
| Di | AG/CT | 1 | 1 | 1 | 0 |
| AT/AT | 16 | 12 | 17 | 11 | |
| Tri | AAC/GTT | 1 | 1 | 0 | 0 |
| AAG/CTT | 0 | 1 | 0 | 1 | |
| AAT/ATT | 4 | 3 | 5 | 7 | |
| Tetra | AAAC/GTTT | 0 | 0 | 0 | 1 |
| AAAG/CTTT | 2 | 2 | 0 | 1 | |
| AAAT/ATTT | 11 | 12 | 10 | 14 | |
| AACG/CGTT | 1 | 0 | 0 | 1 | |
| AATT/AATT | 1 | 5 | 3 | 1 | |
| AGAT/ATCT | 1 | 1 | 1 | 1 | |
| ATCC/ATGG | 1 | 1 | 1 | 1 | |
| Penta | AAAAG/CTTTT | 0 | 0 | 0 | 1 |
| AAATG/ATTTC | 1 | 0 | 0 | 0 | |
| AATAT/ATATT | 1 | 0 | 0 | 1 | |
| AATTC/AATTG | 0 | 1 | 0 | 0 | |
| Hexa | AAAAAG/CTTTTT | 0 | 0 | 1 | 0 |
| AAATTC/AATTTG | 0 | 1 | 0 | 1 | |
Fig. 4Frequency of different SSR motifs in different repeat types in four chloroplast genomes of Cleomaceae.
Fig. 5SSR Number in complete genome, protein coding regions, and noncoding genes: (A) D. glaucum, (B) C. chrysantha.
Fig. 6Number of different SSR types in the four-chloroplast genome of Cleomaceae.
Fig. 7Comparison of four chloroplast genomes in Cleomaceae family using mVISTA. Annotation of D. glaucum was used as reference. The top arrows above the alignment indicate gene orientation. Blue bars represent protein coding, pink bars represent noncoding sequences (CNS) and light green indicates tRNAs and rRNAs. The horizontal axis indicates the coordinates in the cp genome, the vertical scale indicates the percentage identity between 50 and 100%.
Fig. 8Comparison of the IR, SSC and LSC junction positions among four chloroplast genomes of Cleomaceae.
Fig. 9The synonymous (dS) and dN/dS ration values of 80 protein coding genes from four Cleomaceae cp genomes.
Fig. 10Phylogenetic tree reconstruction based on the complete chloroplast genome of sixteen taxa inferred from Bayesian Inference (BI) methods showing relationship within Cleomaceae and other families. Numbers in the clade represent posterior probability (PP) values.