| Literature DB >> 25001059 |
Hui Huang, Chao Shi, Yuan Liu, Shu-Yan Mao, Li-Zhi Gao1.
Abstract
BACKGROUND: Camellia is an economically and phylogenetically important genus in the family Theaceae. Owing to numerous hybridization and polyploidization, it is taxonomically and phylogenetically ranked as one of the most challengingly difficult taxa in plants. Sequence comparisons of chloroplast (cp) genomes are of great interest to provide a robust evidence for taxonomic studies, species identification and understanding mechanisms that underlie the evolution of the Camellia species.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25001059 PMCID: PMC4105164 DOI: 10.1186/1471-2148-14-151
Source DB: PubMed Journal: BMC Evol Biol ISSN: 1471-2148 Impact factor: 3.260
Information of the sequenced chloroplast genomes according Min’s taxonomic treatment[2]
| TRI | |||
| ICSG | |||
| TRI | |||
| TRI | |||
| ICSG | |||
| ICSG | |||
| TRI | |||
| TRI | |||
| TRI | |||
| TRI | |||
| ICSG | |||
| ICSG | |||
| KIB |
TRI, Tea Research Institute, Yunnan Academy of Agricultural Science (Menghai, Yunnan, China);
ICSG, International Camellia Species Garden (Jinhua, Zhejiang, China);
KIB, Kunming Institute of Botany, Chinese Academy of Sciences (Kunming, Yunnan, China).
The sequenced chloroplast genome features
| 24,127,775 | 157,127 | 154 | 86,657 | 18,286 | 26,092 | 37.29 | |
| 26,635,918 | 157,102 | 170 | 86,648 | 18,276 | 26,089 | 37.30 | |
| 24,978,790 | 157,110 | 159 | 86,656 | 18,276 | 26,089 | 37.30 | |
| 111,673,521 | 157,117 | 711 | 86,663 | 18,276 | 26,089 | 37.29 | |
| 7,753,104 | 157,086 | 49 | 86,679 | 18,267 | 26,096 | 37.30 | |
| 9,358,318 | 157,121 | 60 | 86,660 | 18,283 | 26,089 | 37.29 | |
| 36,142,305 | 157,076 | 230 | 86,650 | 18,280 | 26,073 | 37.30 | |
| 56,357,778 | 156,971 | 359 | 86,606 | 18,235 | 26,065 | 37.30 | |
| 8,162,492 | 157,145 | 52 | 86,676 | 18,291 | 26,089 | 37.28 | |
| 2,828,916 | 157,121 | 18 | 86,651 | 18,286 | 26,092 | 37.29 | |
| 2,828,754 | 157,087 | 18 | 86,650 | 18,287 | 26,075 | 37.29 | |
| | | ||||||
| 6,595,133 | 157,100 | 42 | 180 | 40,530 | | | |
| 39,446,507 | 157,364 | 251 | 138 | 30,491 | | | |
| 8,553,876 | 156,992 | 54 | 207 | 36,424 | | | |
| 8,426,943 | 157,057 | 54 | 148 | 22,817 | | | |
| 5,504,058 | 157,009 | 35 | 345 | 57,250 |
Figure 1Gene map of the chloroplast genomes. Genes shown outside the outer circle are transcribed clockwise and those inside are transcribed counterclockwise. Genes belonging to different functional groups are color-coded. Dashed area in the inner circle indicates the GC content of the chloroplast genome.
Genes contained in the sequenced chloroplast genomes
| Self replication | Large subunit of ribosomal proteins | rpl2b,c, 14, 16b, 20, 22, 23c, 32, 33, 36 |
| | Small subunit of ribosomal proteins | rps2, 3, 4, 7c, 8, 11, 12b-d, 14, 15, 16b, 18, 19 |
| | DNA dependent RNA polymerase | rpoA, B, C1b, C2 |
| | rRNA genes | rrn4.5c, 5c, 16c, 23c |
| | tRNA genes | trnA-UGCb,c, C-GCA, D-GUC, E-UUC, F-GAA, G-UCC, G-GCCb, H-GUG, I-CAUc, I-GAUb,c, K-UUUb, L-UAG, L-CAAc, L-UAAb, M-CAU, fM-CAU, N-GUUc, P-UGC, Q-UUG, R-ACGc, R-UCU, S-GGA, S-GCU, S-UGA, T-GGU, T-UGU, V-UACb, V-GACc, W-CCA, Y-GUA |
| Photosynthesis | Photosystem I | psaA, B, C, I, J, ycf3a, ycf4 |
| | Photosystem II | psbA, B, C, D, E, F, H, I, J, K, L, M, N, T, Z |
| | NADH oxidoreductase | ndhAb, Bb,c, C, D, E, F, G, H, I, J, K |
| | Cytochrome b6/f complex | petA, Bb, Db, G, L, N |
| | ATP synthase | atpA, B, E, Fb, H, I |
| | Rubisco | rbcL |
| Other gene | Translational initiation factor | infA |
| | Maturase | matK |
| | Protease | clpPa |
| | Envelop membrane protein | cemA |
| | Subunit Acetyl-CoA-carboxylase | accD |
| | c-type cytochrom synthesis gene | ccsA |
| Unknown gene | Conserved Open Reading Frames | ycf1, 2c,15c, orf42 |
aGenes containing two introns;
bGenes containing a single intron;
CTwo gene copies in the IRs;
dGenes split into two independent transcription units.
Figure 2The comparison of the LSC, IR and SSC border regions among the eighteen chloroplast genomes.
Figure 3Visualization alignment of chloroplast genome sequences. VISTA-based identity plots showing sequence identity between the eighteen sequenced Camellia chloroplast genomes and nine other representative flowering plants, with Camellia sinensis var. assamica as a reference. Thick black lines show the inverted repeats (IRs) in the chloroplast genomes. Genome regions are color-coded as protein coding, rRNA coding, tRNA coding or conserved noncoding sequences (CNS).
Figure 4Analyses of repeated sequences in the five chloroplast genomes. A Number of the three repeat types; B Frequency of the direct repeats by length; C Frequency of the reverse repeats by length; D Frequency of palindromic repeats by length; E Location of repeats; F Summary of shared repeats among the five Camellia chloroplast genomes. IGS, intergenic spacer.
Figure 5The distribution of simple sequence repeats (SSRs) in the five chloroplast genomes. M: Mononucleotide; T: Tetranucleotide; H: Hexanucleotide.
Simple sequence repeats (SSRs) in the five representative chloroplast genomes
| ASSA | | | | | | | | | |
| | A/T | 51 | 5 | 34 | 1 | 11 | 39 | 3 | 9 |
| | AAAAAG/CTTTTT | 2 | | 2 | | | | | 2 |
| OLEI | | | | | | | | | |
| | A/T | 49 | 6 | 31 | 1 | 11 | 36 | 4 | 9 |
| | AAAAAG/CTTTTT | 2 | | 2 | | | | | 2 |
| PUBI | | | | | | | | | |
| | A/T | 48 | 5 | 31 | 1 | 11 | 35 | 4 | 9 |
| | AAAAAG/CTTTTT | 2 | | 2 | | | | | 2 |
| PETE | | | | | | | | | |
| | A/T | 53 | 7 | 35 | 0 | 11 | 41 | 3 | 9 |
| | AAAAAG/CTTTTT | 2 | | 2 | | | | | 2 |
| RETI | | | | | | | | | |
| | A/T | 53 | 7 | 34 | 1 | 11 | 40 | 4 | 9 |
| | AGGG/CCCT | 1 | | 1 | | | | | 1 |
| AAAAAG/CTTTTT | 1 | 1 | 1 | ||||||
ASSA, C. sinensis var. assamica; OLEI, C. oleifera; PUBI, C. pubicosta; PETE, C. petelotii; RETI, C. reticulata.
The numbers and ratios of nucleotide substitutions and indels in the five chloroplast genomes
| / | 53 | 28 | 57 | 69 | |
| 265 (5.00) | / | 61 | 45 | 65 | |
| 82 (2.93) | 252 (4.13) | / | 65 | 72 | |
| 217 (3.81) | 204 (4.53) | 193 (2.97) | / | 64 | |
| 242 (3.51) | 224 (3.45) | 221 (3.07) | 171 (2.67) | / |
The upper triangle shows the number of indels, while the lower triangle indicates the total nucleotide substitutions. The ratios of nucleotide substitutions to indels (S/I) are given in brackets.
Figure 6The distribution of indel types in the five chloroplast genomes. The pairwise comparisons were performed to identify indels among the five Camellia chloroplast genomes. A includes ASSA vs. OLEI/PUBI/PETE/RETI and OLEI vs. PUBI. B includes OLEI vs. PETE/RETI, PUBI vs. PETE/RETI and PETE vs. RETI.
Figure 7Percentages of variable characters in homologous regions across the five chloroplast genomes. A Coding regions; B Non-coding regions.
Figure 8Levels of evolutionary divergences among different regions of the five chloroplast genomes.
Figure 9Phylogenetic relationships of the thirteen species of section and the eighteen species of constructed by maximum likelihood (A, C) and maximum parsimony (B, D) with and as outgroup. The A and C ML trees have a -InL = 124830.0859 and -InL = 290325.4563. The B MP tree has a length of 1,129 with a consistency index of 0.890 and a retention index of 0.766. The D MP tree has a length of 14,892 with a consistency index of 0.983 and a retention index of 0.796. Numbers above node are bootstrap support values (>50%).
Figure 10Distribution of indels within introns and coding sequences of the five chloroplast genomes. The phylogenetic tree was a subtree of Figure 10 using C. arabic as outgroup. The insertions are indicated as ‘+’ and deletions are marked as ‘-’ on the branch. The genes are designated as ‘*’. Synapomorphy and homoplasy are shown by black and gray bars, respectively.