| Literature DB >> 27452401 |
Maria D Logacheva1,2, Mikhail I Schelkunov3, Victoria Y Shtratnikova4, Maria V Matveeva2, Aleksey A Penin3,5.
Abstract
Although plastid genomes of flowering plants are typically highly conserved regarding their size, gene content and order, there are some exceptions. Ericaceae, a large and diverse family of flowering plants, warrants special attention within the context of plastid genome evolution because it includes both non-photosynthetic and photosynthetic species with rearranged plastomes and putative losses of "essential" genes. We characterized plastid genomes of three species of Ericaceae, non-photosynthetic Monotropa uniflora and Hypopitys monotropa and photosynthetic Pyrola rotundifolia, using high-throughput sequencing. As expected for non-photosynthetic plants, M. uniflora and H. monotropa have small plastid genomes (46 kb and 35 kb, respectively) lacking genes related to photosynthesis, whereas P. rotundifolia has a larger genome (169 kb) with a gene set similar to other photosynthetic plants. The examined genomes contain an unusually high number of repeats and translocations. Comparative analysis of the expanded set of Ericaceae plastomes suggests that the genes clpP and accD that are present in the plastid genomes of almost all plants have not been lost in this family (as was previously thought) but rather persist in these genomes in unusual forms. Also we found a new gene in P. rotundifolia that emerged as a result of duplication of rps4 gene.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27452401 PMCID: PMC4958920 DOI: 10.1038/srep30042
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1Plastid genome maps of Pyrola rotundifolia, Monotropa uniflora and Hypopitys monotropa.
Genes shown inside the circle are transcribed clockwise; those outside the circle are transcribed counterclockwise. Blue and orange lines indicate direct and inverted repeats, respectively. The filled orange area indicates the inverted repeat region.
General characteristics of Ericaceae plastomes.
| Species | Plastome length, bp | GC content, % | Total number of genes | Number of protein-coding genes | Number of rRNA-coding genes | Number of tRNA-coding genes |
|---|---|---|---|---|---|---|
| 150 897 | 37.3 | 109 | 75 | 4 | 30 | |
| 176 045 | 36.8 | 100 | 68 | 4 | 28 | |
| 168 995 | 35.7 | 101 | 67 | 4 | 30 | |
| 45 111 | 28.9 | 40 | 22 | 4 | 14 | |
| 35 062 | 34.4 | 44 | 22 | 4 | 18 |
*–duplicated genes are counted as a single gene. Pseudogenes are not counted.
Gene content in Ericaceae plastomes.
| Species | |||||
|---|---|---|---|---|---|
| Ribosomal proteins, small subunit | rps2, rps3, rps4, | rps2, rps3, rps4, rps7, rps8, rps11, Ψrps12, rps14, | rps2, rps3, rps4, rps4-like gene (ORF357), | rps2, rps3, | rps2, rps3, rps4, rps7, rps8, rps11, rps12, rps14, rps18, Ψrps19 |
| Ribosomal proteins, large subunit | rpl2, rpl14, rpl16, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36 | rpl2, rpl14, rpl16, | Ψrpl2, rpl14, rpl16, rpl20, rpl22, rpl23, | rpl2, rpl14, rpl16, rpl20, rpl22, rpl33, rpl36 | rpl2, rpl14, rpl16, rpl20, rpl22, rpl23, rpl32, rpl33, rpl36 |
| Ribosomal RNA | rrn16, rrn23, rrn5, rrn4.5 | ||||
| Transfer RNA | trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnH-GUG, trnI-CAU, trnfM-CAU, | trnC-GCA, trnD-GUC, trnE-UUC, trnF-GAA, trnH-GUG, trnI-CAU, trnfM-CAU, trnL-UAG, trnM-CAU, trnN-GUU, trnP-UGG, trnQ-UUG, trnR-ACG, trnR-UCU, trnS-UGA, trnS-GCU, trnW-CCA, trnY-GUA | |||
| Photosystem I | psaA, psaB, | psaA, psaB, | psaA, psaB, psaC, psaI, psaJ | ΨpsaB | |
| Photosystem II | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | psbA, psbB, psbC, psbD, psbE, psbF, psbH, psbI, psbJ, psbK, psbL, psbM, psbN, psbT, psbZ | |||
| Cytochrome b6/f complex | petA, petB, petD, petG, petL, petN | petA, petB, petD, petG, petL, petN | petA, petB, ΨpetD, petG, petL, petN | ||
| Photosynthesis – others | rbcL, | rbcL, | rbcL, | ΨccsA | |
| NADH-dehydrogenase | ΨndhA, | ΨndhB | |||
| RNA polymerase | rpoA, rpoB, rpoC1, rpoC2 | rpoA, rpoB, rpoC1, rpoC2 | rpoA, rpoB, rpoC1, rpoC2 | ||
| ATP synthase | atpA, atpB, atpE, atpF, atpH, atpI | atpA, atpB, atpE, ΨatpF, atpH, atpI | atpA, atpB, atpE, atpF, atpH, atpI | ΨatpB | |
| Others | matK, infA, cemA, accD, ΨclpP, Ψycf1 | matK, ΨinfA, cemA, ΨaccD, clpP, Ψycf2 | matK, infA, cemA, accD, clpP, | matK, infA, accD, | matK, infA, accD, clpP |
The names of genes situated within inverted repeats are in bold. Genes with copies both in IR and single-copy regions or that are situated partially in the IR and partially in single-copy regions are marked with an asterisk. Pseudogenes are marked with Ψ. If both the normal and pseudogenized copy of a gene are present in the genome, only the normal one is indicated here.
GC content in Ericaceae and Camellia sinensis.
| Species | GC1 | GC2 | GC3 | Total GC |
|---|---|---|---|---|
| 0.424 | 0.377 | 0.263 | 0.355 | |
| 0.365 | 0.332 | 0.171 | 0.289 | |
| 0.438 | 0.399 | 0.248 | 0.362 | |
| 0.447 | 0.4 | 0.266 | 0.371 | |
| 0.459 | 0.409 | 0.273 | 0.38 | |
| 0.461 | 0.406 | 0.268 | 0.378 |
GC1, GC2 and GC3 refer to the first, second and third positions of codons in protein-coding genes.
Figure 2Phylogenetic tree of Ericaceae reconstructed based on plastid rDNA, plastid protein-coding and nuclear rDNA sequences.
Branch lengths correspond to dN, dS and dN/dS values. The high dN/dS (3.83) value for Pyrola/Vaccinium branch is presumably an artefact caused by an insufficient number of synonymous substitutions.
Figure 3Mode of selection for accD (a) and clpP (b) in Ericaceae. The dots indicate the probability of negative selection, neutral evolution and positive selection for codons in genes accD (a) and clpP (b). The probabilities are estimated by the Empirical Bayes method in PAML. Green lines above the accD diagram designate regions homologous to parts of accD from plants with typical accD structure.
Figure 4(a) Domain structure of the hypothetical product of ORF357; (b) transmembrane domain prediction for ORF357 and random ORFs with different GC contents. Blue dots denote mean values of parts of those random ORFs that are predicted to form transmembrane helices by TMHMM, whiskers represent the standard deviation.