| Literature DB >> 26933149 |
Sang-Chul Kim1, Jung Sung Kim1, Joo-Hwan Kim2.
Abstract
Genomic events including gene loss, duplication, pseudogenization and rearrangement in plant genomes are valuable sources for exploring and understanding the process of evolution in angiosperms. The family Melanthiaceae is distributed in temperate regions of the Northern Hemisphere and divided into five tribes (Heloniadeae, Chionographideae, Xerophylleae, Melanthieae and Parideae) based on the molecular phylogenetic analyses. At present, complete chloroplast genomes of the Melanthiaceae have been reported from three species. In the previous genomic study of Liliales, atrnI-CAU gene duplication event was reported fromParis verticillata, a member of Parideae. To clarify the significant genomic events of the tribe Parideae, we analysed the complete chloroplast genome sequences of twoTrilliumspecies representing two subgenera:TrilliumandPhyllantherum InTrillium tschonoskii(subgenusTrillium), the circular double-stranded cpDNA sequence of 156 852 bp consists of two inverted repeat (IR) regions of 26 501 bp each, a large single-copy (LSC) region of 83 981 bp and a small single-copy (SSC) region of 19 869 bp. The chloroplast genome sequence ofT. maculatum(subgenusPhyllantherum) is 157 359 bp in length, consisting of two IRs (25 535 bp), one SSC (19 949 bp) and one LSC (86 340 bp), and is longer than that ofT. tschonoskii The results showed that the cpDNAs of Parideae are highly conserved across genome structure, gene order and contents. However, the chloroplast genome ofT. maculatumcontained a 3.4-kb inverted sequence betweenndhCandrbcLin the LSC region, and it was a unique feature for subgeneraPhyllantherum In addition, we found three different types of gene duplication in the intergenic spacer betweenrpl23andycf2containingtrnI-CAU, which were in agreement with the circumscription of subgenera and sections in Parideae excludingT. govanianum These genomic features provide informative molecular markers for identifying the infrageneric taxa ofTrilliumand improve our understanding of the evolution patterns of Parideae in Melanthiaceae. Published by Oxford University Press on behalf of the Annals of Botany Company.Entities:
Keywords: Chloroplast genome; Trillium maculatum; Trillium tschonoskii, trnI-CAU; comparative genomics; gene duplication; single inversion
Year: 2016 PMID: 26933149 PMCID: PMC4823371 DOI: 10.1093/aobpla/plw015
Source DB: PubMed Journal: AoB Plants Impact factor: 3.276
Figure 1.Gene maps and summary of the T. tschonoskii Maxim. and T. maculatum Raf. chloroplast genomes. IR, inverted repeat; LCS, large single-copy region; SSC, small single-copy region.
Figure 2.Comparison of the IR boundaries among five species within Melanthiaceae.
Figure 3.Confirmation of inversion (3492 bp) between ndhC and rbcL in the genus Trillium. (A) Design of primer to amplify junction regions between atpB and rbcL regions. The positions of atpB and rbcL genes in LSC regions are drawn based on the sequence assembly results of T. tschonoskii, T. maculatum in this study (red text). *The data downloaded from the NCBI. The forward primer I1F contains the sequence in atpB region. The sequence of the reverse primer (I1R) is located in the rbcL gene. Polymerase chain reaction amplification of IGS between atpB and rbcL. Relationships of Parideae lineages followed the phylogenetic trees of S. C. Kim, J. S. Kim, W. C. Mark, F. F. Michael and J. H. Kim (unpublished data). (B) Primers were designed to amplify junction regions between trnV-UAC and rbcL regions. The positions of trnV-UAC and rbcL genes in LSC regions are drawn based on the sequence assembly results of T. tschonoskii, T. maculatum in this study (red text). *The data downloaded from the NCBI. The forward primer I2R contains the sequence in trnV-UAC region. The sequence of the reverse primer I1R is located in the rbcL gene. Polymerase chain reaction amplification of IGS between trnV-UAC and rbcL. Relationships of Parideae lineages followed the phylogenetic trees of S. C. Kim, J. S. Kim, W. C. Mark, F. F. Michael and J. H. Kim (unpublished data).
The number and total length of insertion–deletion mutations between the chloroplast genomes of T. tschonoskii and T. maculatum in Parideae.
| Region | Number of indels | Total length of indels |
|---|---|---|
| IGS | 262 | 5139 |
| Intron | 51 | 528 |
| Coding gene | 86 | 3005 |
Single nucleotide polymorphisms found between the plastid genomes of T. tschonoskii and T. maculatum. (A) Single nucleotide polymorphisms in coding gene. (B) Single nucleotide polymorphisms in intron. (C) Single nucleotide polymorphisms in IGS regions. Bold values represent p-distance >0.08.
| Gene | Aligned length (bp) | No. SNP | Gene | Aligned length (bp) | No. SNP | Gene | Aligned length (bp) | No. SNP | |||
|---|---|---|---|---|---|---|---|---|---|---|---|
| (A) | |||||||||||
| | 1062 | 2 | 0.002 | 555 | 4 | 0.007 | 828 | 14 | 0.017 | ||
| | 2205 | 5 | 0.002 | 695 | 5 | 0.007 | 121 | 2 | 0.017 | ||
| | 2253 | 4 | 0.002 | 2814 | 19 | 0.007 | 114 | 2 | 0.018 | ||
| | 405 | 1 | 0.002 | 1503 | 10 | 0.007 | 103 | 2 | 0.019 | ||
| | 1527 | 3 | 0.002 | 306 | 2 | 0.007 | 1494 | 30 | 0.02 | ||
| | 648 | 1 | 0.002 | 543 | 4 | 0.007 | 369 | 9 | 0.024 | ||
| | 1062 | 3 | 0.003 | 276 | 3 | 0.007 | 7209 | 146 | 0.024 | ||
| | 768 | 2 | 0.003 | 3213 | 25 | 0.008 | 468 | 11 | 0.024 | ||
| | 363 | 1 | 0.003 | 120 | 1 | 0.008 | 387 | 9 | 0.025 | ||
| | 1434 | 5 | 0.003 | 399 | 3 | 0.008 | 363 | 8 | 0.026 | ||
| | 963 | 3 | 0.003 | 132 | 1 | 0.008 | 657 | 18 | 0.027 | ||
| | 1533 | 5 | 0.003 | 234 | 2 | 0.009 | 73 | 2 | 0.027 | ||
| | 969 | 3 | 0.003 | 108 | 1 | 0.009 | 73 | 2 | 0.027 | ||
| | 252 | 1 | 0.004 | 303 | 3 | 0.01 | 71 | 2 | 0.028 | ||
| | 1524 | 6 | 0.004 | 606 | 6 | 0.01 | 72 | 2 | 0.028 | ||
| | 246 | 1 | 0.004 | 1023 | 10 | 0.01 | 372 | 11 | 0.03 | ||
| | 1422 | 6 | 0.004 | 243 | 2 | 0.01 | 201 | 7 | 0.035 | ||
| | 252 | 1 | 0.004 | 2097 | 22 | 0.011 | 387 | 13 | 0.035 | ||
| | 483 | 2 | 0.004 | 88 | 1 | 0.011 | 156 | 6 | 0.04 | ||
| | 1521 | 7 | 0.005 | 282 | 3 | 0.011 | 74 | 3 | 0.041 | ||
| | 376 | 2 | 0.005 | 2232 | 26 | 0.012 | 74 | 3 | 0.041 | ||
| | 531 | 3 | 0.006 | 1554 | 22 | 0.014 | 405 | 19 | 0.048 | ||
| | 1083 | 7 | 0.006 | 74 | 1 | 0.014 | 351 | 24 | 0.084 | ||
| | 1182 | 7 | 0.006 | 74 | 1 | 0.014 | 639 | 65 | 0.111 | ||
| | 555 | 4 | 0.007 | 411 | 6 | 0.015 | 6778 | 664 | 0.121 | ||
| | 4140 | 27 | 0.007 | 711 | 12 | 0.017 | 1566 | 323 | 0.23 | ||
| Gene | Aligned length (bp) | No. SNP | Gene | Aligned length (bp) | No. SNP | ||||||
| (B) | |||||||||||
| | 812 | 14 | 0.018 | 714 | 9 | 0.013 | |||||
| | 709 | 12 | 0.034 | 783 | 11 | 0.015 | |||||
| | 983 | 29 | 0.012 | 936 | 2 | 0.002 | |||||
| | 1077 | 6 | 0.006 | 1109 | 34 | 0.012 | |||||
| | 695 | 1 | 0.001 | 538 | 2 | 0.004 | |||||
| | 823 | 4 | 0.005 | 595 | 4 | 0.007 | |||||
| | 747 | 5 | 0.007 | 737 | 7 | 0.006 | |||||
| | 1075 | 26 | 0.026 | 738 | 8 | 0.01 | |||||
| | 664 | 3 | 0.005 | ||||||||
| IGS | Aligned length (bp) | No. SNP | IGS | Aligned length (bp) | No. SNP | ||||||
| (C) | |||||||||||
| | 578 | 1 | 0.002 | 102 | 2 | 0.02 | |||||
| | 323 | 1 | 0.003 | 938 | 16 | 0.02 | |||||
| | 242 | 1 | 0.004 | 110 | 2 | 0.02 | |||||
| | 1905 | 8 | 0.004 | 859 | 16 | 0.022 | |||||
| | 376 | 2 | 0.005 | 147 | 3 | 0.022 | |||||
| | 205 | 1 | 0.005 | 92 | 2 | 0.023 | |||||
| | 952 | 6 | 0.006 | 132 | 3 | 0.023 | |||||
| | 168 | 1 | 0.006 | 1139 | 26 | 0.023 | |||||
| | 785 | 5 | 0.007 | 138 | 3 | 0.023 | |||||
| | 296 | 2 | 0.007 | 179 | 4 | 0.023 | |||||
| | 144 | 1 | 0.007 | 301 | 4 | 0.023 | |||||
| | 572 | 4 | 0.007 | 429 | 9 | 0.023 | |||||
| | 132 | 1 | 0.008 | 23 | 0.024 | ||||||
| | 124 | 1 | 0.008 | 1208 | 25 | 0.024 | |||||
| | 296 | 2 | 0.008 | 82 | 2 | 0.025 | |||||
| | 831 | 8 | 0.01 | 119 | 3 | 0.025 | |||||
| | 412 | 4 | 0.01 | 242 | 6 | 0.026 | |||||
| | 307 | 3 | 0.01 | 360 | 9 | 0.027 | |||||
| | 674 | 2 | 0.01 | 150 | 4 | 0.027 | |||||
| | 1031 | 11 | 0.011 | 739 | 21 | 0.029 | |||||
| | 1016 | 12 | 0.011 | 68 | 2 | 0.029 | |||||
| | 89 | 1 | 0.011 | 655 | 19 | 0.031 | |||||
| | 482 | 3 | 0.011 | 1204 | 37 | 0.037 | |||||
| | 179 | 2 | 0.011 | 167 | 6 | 0.037 | |||||
| | 283 | 3 | 0.011 | 778 | 28 | 0.038 | |||||
| | 686 | 8 | 0.012 | 396 | 15 | 0.039 | |||||
| | 246 | 3 | 0.013 | 380 | 14 | 0.04 | |||||
| | 759 | 8 | 0.013 | 134 | 5 | 0.042 | |||||
| | 320 | 4 | 0.013 | 724 | 26 | 0.044 | |||||
| | 386 | 5 | 0.013 | 200 | 8 | 0.049 | |||||
| | 474 | 7 | 0.015 | 507 | 23 | 0.049 | |||||
| | 296 | 4 | 0.015 | 151 | 7 | 0.051 | |||||
| | 206 | 3 | 0.015 | 1178 | 57 | 0.052 | |||||
| | 642 | 10 | 0.016 | 124 | 6 | 0.054 | |||||
| | 183 | 3 | 0.016 | 140 | 8 | 0.057 | |||||
| | 154 | 2 | 0.016 | 210 | 11 | 0.065 | |||||
| | 126 | 2 | 0.016 | 37 | 2 | 0.077 | |||||
| | 243 | 10 | 0.017 | 782 | 31 | 0.086 | |||||
| | 712 | 12 | 0.017 | 115 | 4 | 0.091 | |||||
| | 59 | 1 | 0.017 | 80 | 5 | 0.098 | |||||
| | 242 | 4 | 0.018 | 285 | 20 | 0.099 | |||||
| | 154 | 3 | 0.019 | 65 | 10 | 0.154 | |||||
| | 388 | 7 | 0.019 | 44 | 8 | 0.186 | |||||
| | 166 | 3 | 0.019 | 210 | 20 | 0.238 | |||||
| | 54 | 1 | 0.019 | 2090 | 236 | 0.291 | |||||
Figure 4.Average p-distance across five classes of genomic regions between two Trillium.
Number of SSRs present in the three Parideae chloroplast genomes.
| Taxon | |||
|---|---|---|---|
| Genome size | 157 379 | 157 359 | 156 852 |
| No. of SSRs | |||
| A/T | 133 | 117 | 127 |
| C/G | 5 | 4 | 6 |
| AC/GT | 3 | 3 | 3 |
| AG/CT | 17 | 19 | 18 |
| AT/TA | 32 | 35 | 32 |
| AAG/CTT | 2 | 2 | |
| AAT/ATT | 6 | 10 | 3 |
| ACT/AGT | 2 | 1 | |
| ATC/GAT | 1 | ||
| AAAG/CTTT | – | – | 1 |
| AAAT/ATTT | 3 | 4 | 4 |
| AAGG/CCTT | 1 | 1 | 1 |
| AATC/GATT | 1 | 1 | 1 |
| AATG/CATT | 1 | 1 | 3 |
| AGAT/ATCT | 1 | 1 | 1 |
| ACTAT/ATAGT | 1 | – | 1 |
| AAAAT/ATTTT | – | 1 | 1 |
| AATAT/ATATT | – | 2 | 1 |
| AATATG/CATATT | – | 1 | – |
| AAAATC/GATTTT | – | 1 | – |
| ATATCC/GGATAT | – | – | 1 |
| AAAAAT/ATTTTT | – | – | 2 |
| AAGACT/AGTCTT | – | – | 1 |
| AACTAC/GTAGTT | – | – | 1 |
| AAAGAG/CTCTTT | – | – | 1 |
| Total | 204 | 205 | 213 |
Figure 5.Summary of three types of trnI-CAU gene duplication in the tribe Parideae. *Including tandem repeats.