| Literature DB >> 28265281 |
Zdravka Ivanova1, Gaurav Sablok2, Evelina Daskalova1, Gergana Zahmanova1, Elena Apostolova1, Galina Yahubyan1, Vesselin Baev1.
Abstract
Haberlea rhodopensis is a paleolithic tertiary relict species, best known as a resurrection plant with remarkable tolerance to desiccation. When exposed to severe drought stress, H. rhodopensis shows an ability to maintain the structural integrity of its photosynthetic apparatus, which re-activates easily upon rehydration. We present here the results from the assembly and annotation of the chloroplast (cp) genome of H. rhodopensis, which was further subjected to comparative analysis with the cp genomes of closely related species. H. rhodopensis showed a cp genome size of 153,099 bp, harboring a pair of inverted repeats (IR) of 25,415 bp separated by small and large copy regions (SSC and LSC) of 17,826 and 84,443 bp. The genome structure, gene order, GC content and codon usage are similar to those of the typical angiosperm cp genomes. The genome hosts 137 genes representing 70.66% of the plastome, which includes 86 protein-coding genes, 36 tRNAs, and 4 rRNAs. A comparative plastome analysis with other closely related Lamiales members revealed conserved gene order in the IR and LSC/SSC regions. A phylogenetic analysis based on protein-coding genes from 33 species defines this species as belonging to the Gesneriaceae family. From an evolutionary point of view, a site-specific selection analysis detected positively selected sites in 17 genes, most of which are involved in photosynthesis (e.g., rbcL, ndhF, accD, atpE, etc.). The observed codon substitutions may be interpreted as being a consequence of molecular adaptation to drought stress, which ensures an evolutionary advantage to H. rhodopensis.Entities:
Keywords: Haberlea rhodopensis; SSR; chloroplast genome; desiccation stress; rbcL; site-specific selection
Year: 2017 PMID: 28265281 PMCID: PMC5316520 DOI: 10.3389/fpls.2017.00204
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
List of genes encoded by Haberlea rhodopensis chloroplast genome.
| Category | Gene group | Gene name | ||||
|---|---|---|---|---|---|---|
| Subunits of photosystem I | ||||||
| Subunits of photosystem II | ||||||
| Subunits of cytochrome b/f complex | ||||||
| Subunits of ATP synthase | ||||||
| Large subunit of RuBisCo | ||||||
| Subunits of NADH dehydrogenase | ||||||
| Ribosomal RNA genes | ||||||
| Transfer RNA genes | ||||||
| Ribosomal proteins (SSU) | ||||||
| Ribosomal proteins (LSU) | ||||||
| RNA polymerase | ||||||
| Translational initiation factor | ||||||
| Maturase | ||||||
| Envelope membrane protein | ||||||
| LHC of PSII associated factor 1 | ||||||
| Subunit of acetyl-CoA | ||||||
| C-Type cytochrome synthesis gene | ||||||
| Protease | ||||||
| Hypothetical chloroplast reading | ||||||
| Frames | ||||||
| ORF | ||||||
Location and length of intron-containing genes in Haberlea rhodopensis chloroplast genome.
| Gene | Location | Exon I (bp) | Intron I (bp) | Exon II (bp) | intron II (bp) | Exon II (bp) |
|---|---|---|---|---|---|---|
| LSC | 48 | 914 | 213 | |||
| LSC | 144 | 656 | 471 | |||
| LSC | 456 | 781 | 1620 | |||
| LSC | 129 | 703 | 228 | 714 | 153 | |
| LSC | 69 | 808 | 291 | 615 | 228 | |
| IR | 393 | 669 | 435 | |||
| IR | 777 | 680 | 756 | |||
| SSC | 558 | 941 | 540 | |||
| LSC | 37 | 2527 | 26 | |||
| LSC | 37 | 475 | 50 | |||
| LSC | 38 | 582 | 37 | |||
| IR | 42 | 936 | 35 | |||
| IR | 38 | 823 | 35 | |||
Comparison of genome contents of H. rhodopensis and B. hygrometrica.
| Total sequence length (bp) | 153, 099 | 153, 493 |
| Large single copy (bp) | 84, 443 | 84, 692 |
| Small single copy (bp) | 17, 826 | 17, 901 |
| Inverted repeat region (bp) | 25, 415 | 25, 450 |
| GC% content | 37, 8% | 37, 59% |
| Total CDS bases (bp) | 79747 | 79218 |
| Average CDS length (bp) | 917 | 932 |
| Total RNA bases (bp) | 11769 | 11780 |
| Average intergenic distance (bp) | 390 | 396.66 |
Cumulative SSR frequency and corresponding primer pairs in Haberlea rhodopensis (SSR search parameters: 1–10; 2–4; 3–4; 4–3; 5–3; 6–3 where 1, 2, 3, 4, 5, and 6 indicate the mono- di-, tri-, tetra-, penta-, and hexa-nucleotide repeats).
| Total size of examined sequences (bp): | 153099 |
| Total number of identified SSRs: | 71 |
| Number of SSRs present in compound formation: | 10 |
| A/T | 24 |
| C/G | 1 |
| AC/GT | 4 |
| AG/CT | 15 |
| AT/AT | 19 |
| AAG/CTT | 1 |
| AAT/ATT | 1 |
| AAAC/GTTT | 1 |
| AAAT/ATTT | 2 |
| AACT/AGTT | 1 |
| AATC/ATTG | 1 |
| AGAT/ATCT | 1 |
Distribution and localization of repeat sequences in cpDNA of Haberlea rhodopensis.
| Size (bp) | Start position1 | Start position2 | Type | Location | Region | |
|---|---|---|---|---|---|---|
| 59 | 109798 | 109857 | F | ycf1 | IRA | 1.98e-26 |
| 59 | 109857 | 127696 | P | ycf1 | SSC | 1.98e-26 |
| 56 | 84299 | 84443 | F | rps19, IGS | LSC | 1.27e-24 |
| 56 | 84299 | 153043 | P | rps19, IGS(rpl2,trnH-GUG) | IRA, IRB | 1.27e-24 |
| 55 | 36105 | 36159 | F | IGS(trnS-UGA,lhbA) | LSC | 5.08e-24 |
| 41 | 98642 | 119664 | F | IGS(rps7,trnV-GAC),ndhA | IRA, SSC | 1.36e-15 |
| 41 | 119664 | 138870 | P | ndhA, IGS(trnV-GAC,ndhB) | SSC, IRB | 1.36e-15 |
| 40 | 59562 | 59562 | P | IGS(accD, psaI) | LSC | 5.45e-15 |
| 37 | 57457 | 57494 | F | IGS(rbsL, accD) | LSC | 3.49e-13 |
| 44 | 38927 | 41151 | F | psaB, psaA | LSC | 7.62e-12 |
| 37 | 6418 | 80983 | F | IGS(rps16,trnQ-UUG), IGS(rps8,rpl14) | LSC | 3.87e-11 |
| 33 | 113317 | 113349 | F | IGS(rpl32,trnL-UAG) | SSC | 8.93e-11 |
| 42 | 43919 | 119663 | F | ycf3,ndhA | LSC, SSC | 1.06e-10 |
| 39 | 43922 | 138870 | P | ycf3, IGS(trnV-GAC,rps7) | LSC, IRB | 1.45e-10 |
| 39 | 43922 | 98644 | F | ycf3, IGS(rps7, trnV-GAC) | LSC, IRA | 1.45e-10 |
| 31 | 162 | 199 | P | IGS(trnH-GUG,psbA) | LSC | 1.43e-09 |
| 30 | 8063 | 45627 | P | IGS(psbI, trnS-GCU) | LSC | 5.72e-09 |
| 33 | 63845 | 63845 | P | IGS(petA, psbJ) | LSC | 8.84e-09 |
| 36 | 91664 | 145820 | P | ycf2 | IRA, IRB | 2.69e-07 |
| 36 | 91664 | 91682 | F | ycf2 | IRA | 2.69e-07 |
| 36 | 91682 | 145838 | P | ycf2 | IRA, IRB | 2.69e-07 |
| 36 | 145820 | 145838 | F | ycf2 | IRB | 2.69e-07 |
| 35 | 94093 | 94093 | P | IGS(ycf15, trnL-CAA) | IRA | 9.87e-07 |
| 35 | 94093 | 143408 | F | IGS(ycf15,trnL-CAA), IGS(trnL-CAA,ycf15) | IRA, IRB | 9.87e-07 |
| 35 | 143408 | 143408 | P | IGS(trnL-CAA, ycf15) | IRB | 9.87e-07 |
| 32 | 8501 | 8501 | P | IGS(trnS-GCU,trnR-UCU) | LSC | 1.60e-06 |
| 34 | 74589 | 74599 | P | IGS(psbT, psbN) | LSC | 3.61e-06 |
| 31 | 20927 | 20929 | P | rpoC1 | LSC | 5.98e-06 |
| 30 | 13264 | 13264 | P | IGS(atpF, atpH) | LSC | 2.24e-05 |
| 31 | 54809 | 66123 | P | IGS(atpB,rbcL), IGS(psbE,petL) | LSC | 1.73e-04 |
| 30 | 9660 | 36718 | F | IGS(trnS-GCU,trnR-UCU), trnG-UCC | LSC | 6.27e-04 |
| 30 | 43934 | 138867 | P | ycf3, IGS(trnV-GAC, rps7) | LSC, IRB | 6.27e-04 |
| 30 | 43934 | 98656 | F | ycf3, IGS(rps7, trnV-GAC) | LSC, IRA | 6.27e-04 |
| 30 | 89251 | 148215 | P | ycf2 | IRA, IRB | 6.27e-04 |
| 30 | 89251 | 89293 | F | ycf2 | IRA | 6.27e-04 |
| 30 | 89293 | 148257 | P | ycf2 | IRA, IRB | 6.27e-04 |
| 30 | 91675 | 145815 | P | ycf2 | IRA, IRB | 6.27e-04 |
| 30 | 91675 | 91693 | F | ycf2 | IRA | 6.27e-04 |
| 30 | 91693 | 145833 | P | ycf2 | IRA, IRB | 6.27e-04 |
| 30 | 148215 | 148257 | F | ycf2 | IRB | 6.27e-04 |
Positive selection sites identified with selection with df = 1.
| Gene | NULL (M8a) | POSITIVE (M8) | Putative sites under positive selection |
|---|---|---|---|
| –2156.21 | –2153.31 | 1 (131 M) | |
| –11147.00 | –11140.60 | 1 (70 S) | |
| –14940.40 | –14937.90 | 7 (468 N, 486 R, 557 L, 597 K, 604 R, 693 Y, 695N) | |
| –2045.17 | –2043.14 | 1 (93 S) | |
| –7646.86 | –7644.55 | 2 (165 V, 600 A) | |
| –446.59 | –443.27 | 1 (2 S) | |
| –978.46 | –973.65 | 1 (21 A) | |
| –687.88 | –683.14 | 1 (40 L) | |
| –285.42 | –282.01 | 1 (1 M) | |
| –6375.16 | –6358.92 | 13 (86 H, 142 P, 145 T, 225 I, 251 I, 279 S, 354 V, 429 Q, 439 V,449 C, 468 N, 470 P, 472 I) | |
| –1878.87 | –1874.32 | 2 (6 L, 10 S) | |
| –2824.08 | –2819.11 | 5 (73 L, 107 D, 108 K, 110 E, 113 R) | |
| –22883.50 | –22872.20 | 11 (522 Q, 536 Y, 545 H, 549 H, 577 Q, 725 K, 727 S, 935 S, 941 L, 1208 V, 1268 H) | |
| –2736.18 | –2733.16 | 3 (33 K, 154 Q, 164 K) | |
| –2276.40 | –2273.57 | 2 (57 Y, 72 I) | |
| –1419.21 | –1417.18 | 3 (79 Q, 85 T, 93 N) | |
| –7447.39 | –7432.39 | 8 (32 I, 52 W, 97 L, 142 C, 152 L, 156 A, 155 R, 298 R) | |