Hoang Dang Khoa Do, Joo-Hwan Kim.
Abstract
Transcriptome data provide useful information for studying the evolutionary history of angiosperms. Previously, different genomic events (i.e., duplication, deletion, and pseudogenization) were discovered in the plastid genomes of Liliales; however, the effects of these events have not been addressed because of the lack of transcriptome data. In this study, we completed the plastid genome (cpDNA) and generated transcriptome data of Lilium lancifolium. The cpDNA of L. lancifolium is 152,479 bp in length and consists of one large single copy (81,888 bp), one small single copy (17,607 bp), and two inverted repeat regions (26,544 bp). The comparative analysis of the newly sequenced cpDNA and the transcriptome data revealed 90 RNA editing sites, of which two are located in the rRNA coding regions of L. lancifolium. A further check of the rRNA secondary structure showed that RNA editing causes notable structural changes. Most of the RNA editing events are C-to-U conversions, which result in nonsynonymous substitutions. Among coding regions, ndh genes have the highest number of RNA editing sites. Our study provides the first plastid transcriptome profile in Liliales and fundamental information for further studies on post-transcriptional events in this order as well as in other petaloid monocotyledonous species.
Year: 2019 PMID: 31040371 PMCID: PMC6491592 DOI: 10.1038/s41598-019-43259-7
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1. The map of the plastid genome and the number of RNA editing sites in different gene groups. (A) The map of the plastid genome of Lilium lancifolium. Genes shown outside and inside the outer circle are transcribed counterclockwise and clockwise, respectively. The dark gray area in the inner circle indicates the GC content of the chloroplast genome. The colors represent different groups of genes in the cpDNA. LSC: large single copy; SSC: small single copy; IRA: inverted repeat region A; IRB: inverted repeat region B. (B) The number of RNA editing sites in different gene groups. A: Rubisco; B: ATP dehydrogenase subunit P; C: Ribosomal RNAs; D: Cytochrome b6/f; E: Hypothetical proteins; F: ATP synthase; G: Miscellaneous proteins; H: Large and small subunit ribosomal proteins; I: Photosystem I and II; J: RNA polymerase; K: NADH oxidoreductase.
Pairwise identity of IGS region among three complete cpDNAs of Lilium lancifolium.
| Regions | Identity (%): KY748297-China | Identity (%): KY940844-Korea | Regions | Identity (%): KY748297-China | Identity (%): KY940844-Korea |
|---|---|---|---|---|---|
|
| 99.67 | 99.67 |
| 100 | 99.72 |
|
| 99.87 | 99.87 |
| 100 | 99.86 |
|
| 99.73 | 99.46 |
| ||
|
| 99.68 | 99.68 |
| 100 | 99.92 |
|
| 99.2 | 99.2 |
| 99.6 | 100 |
|
| 99.71 | 99.71 |
| 99.43 | 99.43 |
|
| 99.12 | 98.23 |
| 100 | 99.63 |
|
| 99.89 | 99.89 | 99.87 | 99.75 | |
|
| 100 | 98.28 | 100 | 99.88 | |
| 99.87 | 99.74 |
| 100 | 99.24 | |
|
| 99.92 | 100 |
| 96.95 | 96.95 |
|
| 100 | 99.85 |
| 100 | 99.9 |
|
| 99.89 | 99.89 |
| 100 | 99.86 |
|
| 99.75 | 99.75 |
| 98.28 | 98.28 |
| 100 | 99.02 |
| 99.95 | 100 | |
|
| 99.65 | 99.65 | 100 | 99.89 | |
|
| 99.87 | 100 |
| ||
| 100 | 99.26 |
| |||
|
| 99.86 | 99.86 |
| 99.75 | 99.75 |
|
| 100 | 99.88 |
| 99.51 | 99.51 |
The bold letters indicate regions which have low similarity (<95%).
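The identity percentages in the table above can be illustrated with a minimal sketch. It assumes identity is the share of matching columns in a pairwise alignment of one IGS region; the record does not say which tool was used or how gaps were scored, so the `percent_identity` function, its `count_gaps` switch, and the example sequences are hypothetical.

```python
def percent_identity(seq_a: str, seq_b: str, count_gaps: bool = True) -> float:
    """Percent identity between two aligned sequences of equal length.

    Assumption: '-' marks a gap. Columns where both sequences have a gap
    are ignored. With count_gaps=True a gap facing a base counts as a
    mismatch; with count_gaps=False such columns are skipped entirely.
    """
    if len(seq_a) != len(seq_b):
        raise ValueError("aligned sequences must have the same length")

    compared = 0
    matches = 0
    for a, b in zip(seq_a.upper(), seq_b.upper()):
        if a == "-" and b == "-":
            continue  # uninformative column
        if not count_gaps and "-" in (a, b):
            continue  # skip gapped columns when gaps are not scored
        compared += 1
        if a == b:
            matches += 1

    if compared == 0:
        raise ValueError("no comparable alignment columns")
    return 100.0 * matches / compared


if __name__ == "__main__":
    # Hypothetical aligned fragments, not sequences from the study.
    ref = "ACGTACGTAC"
    alt = "ACGTACGTAT"
    print(f"{percent_identity(ref, alt):.2f}%")
```

Whether an indel counts as a mismatch changes the result for short intergenic spacers, so the gap convention should be stated alongside any reported identity value.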
The number of RNA editing sites among coding regions of plastid genome of L. lancifolium.
| Gene | Site | Position (aa/nucleotide) | Editing content | Coverage | Number of reads (percentage) |
|---|---|---|---|---|---|
|  | 1 | 50/150 | P(cc | 188389 | C: 39 (0.02%); U: 187680 (99.62%) |
|  | 1 | 160/478 | H( | 751 | C: 195 (26.1%); U: 556 (73.9%) |
|  | 2 | 245/734 | S(u | 3612 | C: 1888 (52.3%); U: 1718 (47.6%) |
|  | 1 | 232/696 | S(uc | 2011054 | C: 387 (0.02%); U: 2002141 (99.56%) |
|  | 1 | 258/774 | S(u | 2684 | C: 37 (1.4%); U: 2646 (98.5%) |
|  | 2 | 383/1148 | S(u | 3878 | C: 51 (1.3%); U: 3821 (98.5%) |
|  | 1 | 31/92 | P(c | 3913 | C: 331 (8.5%); U: 3579 (91.4%) |
|  | 1 | 15/45 | Y(ua | 3682 | C: 2736 (74.3%); U: 940 (25.5%) |
|  | 2 | 210/629 | S(u | 3580 | C: 47 (1.3%); U: 3528 (98.5%) |
|  | 1 | 1235/3704 | S(u | 493 | C: 43 (8.7%); U: 451 (91.3%) |
|  | 1 | 14/41 | P(c | 764 | C: 203 (26.5%); U: 558 (72.9%) |
|  | 2 | 61/182 | S(u | 1004 | C: 543 (54%); U: 462 (46%) |
|  | 3 | 107/321 | I(au | 818 | C: 487 (59.5%); U: 331 (40.4%) |
|  | 4 | 178/500 | S(u | 848 | C: 98 (11.5%); U: 750 (88.3%) |
|  | 5 | 210/629 | S(u | 857 | C: 149 (17.4%); U: 709 (82.6%) |
|  | 6 | 267/799 | R( | 906 | C: 43 (4.7%); U: 859 (94.7%) |
|  | 1 | 10/29 | S(u | 223 | C: 81 (36.2%); U: 142 (63.4%) |
|  | 2 | 113/338 | S(u | 205 | C: 82 (39.8%); U: 123 (59.7%) |
|  | 3 | 184/551 | S(u | 78 | C: 66 (83.5%); U: 13 (16.5%) |
|  | 4 | 189/566 | S(u | 112 | C: 54 (47.8%); U: 59 (52.2%) |
|  | 5 | 665/1994 | S(u | 233 | C: 11 (4.7%); U: 223 (95.3%) |
|  | 6 | 807/2420 | S(u | 230 | C: 31 (13.4%); U: 200 (86.6%) |
|  | 7 | 900/2698 | P( | 286 | C: 75 (26.1%); U: 212 (73.9%) |
|  | 1 | 17/50 | S(u | 23558 | C: 869 (3.7%); U: 22664 (96.2%) |
|  | 2 | 60/180 | L(cu | 15389 | C: 14560 (94.6%); U: 810 (5.3%) |
|  | 1 | 27/80 | S(u | 14988 | C: 470 (3.1%); U: 14472 (96.6%) |
|  | 1 | 51/153 | A(gc | 10434 | C: 6 (0.1%); U: 10407 (99.7%) |
|  | 1 | 15/44 | S(u | 2070 | C: 694 (33.5%); U: 1372 (66.2%) |
|  | 2 | 21/63 | I(au | 1477 | C: 640 (43.3%); U: 834 (56.4%) |
|  | 3 | 62/185 | T(a | 648 | C: 502 (77.3%); U: 146 (22.5%) |
|  | 4 | 64/191 | P(c | 903 | C: 423 (46.8%); U: 469 (51.9%) |
|  | 1 | 43/128 | S(u | 968 | C: 245 (25.3%); U: 724 (74.7%) |
|  | 1 | 23/69 | P(cc | 264 | C: 155 (58.5%); U: 110 (41.5%) |
|  | 2 | 27/81 | F(uu | 343 | C: 191 (55.5%); U: 153 (44.5%) |
|  | 1 | 5/13 | H( | 335 | C: 69 (20.5%); U: 267 (79.5%) |
|  | 2 | 104/311 | P(c | 264 | C: 155 (58.5%); U: 110 (41.5%) |
|  | 3 | 108/323 | S(u | 343 | C: 191 (55.5%); U: 153 (44.5%) |
|  | 1 | 395/1184 | S(u | 8704 | C: 140 (1.6%); U: 8556 (98.3%) |
|  | 1 | 452/1355 | S(u | 625 | C: 224 (35.8%); U: 393 (62.8%) |
|  | 2 | 466/1397 | P(u | 574 | C: 245 (42.6%); U: 329 (57.2%) |
|  | 1 | 25/74 | S(u | 1721 | C: 579 (33.6%); U: 1140 (66.2%) |
|  | 2 | 27/80 | H( | 1152 | C: 1078 (93.5%); U: 72 (6.2%) |
|  | 3 | 34/102 | V(gu | 3243 | C: 2758 (85%); U: 482 (14.9%) |
|  | 1 | 176/528 | F(uu | 2311 | C: 1774 (76.7%); U: 538 (23.3%) |
|  | 1 | 20/59 | P(c | 21133 | C: 392 (1.9%); U: 20673 (97.8%) |
|  | 1 | 26/77 | S(u | 8436 | C: 252 (3%); U: 8177 (96.9%) |
|  | 1 | 72/214 | P( | 13182 | C: 145 (1.1%); U: 13023 (98.8%) |
|  | 1 | 2/5 | S(u | 2646 | C: 559 (21.1%); U: 2084 (78.7%) |
|  | 2 | 19/56 | P(c | 1591 | C: 48 (3%); U: 1540 (96.7%) |
|  | 1 | 74/221 | S(u | 5264 | C: 283 (5.4%); U: 4975 (94.5%) |
|  | 1 | 26/82 | H( | 865 | C: 110 (12.7%); U: 756 (87.3%) |
|  | 2 | 187/559 | H( | 1533 | C: 107 (5.4%); U: 1402 (91.4%) |
|  | 1 | 10/30 | F(uu | 57622 | C: 12298 (21.3%); U: 45238 (78.5%) |
|  | 1 | 4/11 | N(a | 10646 | A: 8356 (78.5%); G: 2268 (21.3%) |
|  | 2 | 142/424 | R( | 27257 | C: 290 (1.1%); U: 26917 (98.7%) |
|  | 3 | 206/617 | P(c | 9585 | C: 220 (2.3%); U: 9351 (97.5%) |
|  | 1 | 162/484 | Q( | 13552 | C: 248 (1.8%); U: 13286 (98%) |
|  | 1 | 67/200 | S(u | 947 | C: 323 (34.1%); U: 622 (65.6%) |
|  | 2 | 123/368 | S(u | 1193 | C: 254 (21.3%); U: 938 (78.6%) |
|  | 1 | 5/14 | V(g | 3479 | U: 2 (0.1%); C: 3475 (99.8%) |
|  | 1 | 157/470 | T(a | 1513 | C: 74 (4.9%); U: 1440 (95.1%) |
|  | 2 | 195/583 | H( | 1951 | C: 309 (15.8%); U: 1638 (83.9%) |
|  | 1 | 1/2 | T(a | 708 | C: 340 (48%); U: 369 (52%) |
|  | 1 | 24/71 | S(u | 562 | C: 84 (48%); U: 479 (85.1%) |
|  | 1 | 50/149 | S(u | 803 | C: 88 (10.9%); U: 716 (89.1%) |
|  | 2 | 156/467 | P(c | 929 | C: 41 (4.4%); U: 887 (95.4%) |
|  | 3 | 181/542 | T(a | 536 | C: 55 (10.2%); U: 482 (89.8%) |
|  | 4 | 204/611 | S(u | 343 | C: 58 (16.9%); U: 286 (83.1%) |
|  | 5 | 205/704 | S(u | 347 | C: 53 (15.2%); U: 295 (84.8%) |
|  | 6 | 246/737 | P(c | 167 | C: 65 (38.7%); U: 102 (60.7%) |
|  | 7 | 277/830 | S(u | 193 | C: 123 (61.3%); U: 71 (36.4%) |
|  | 8 | 279/836 | S(u | 162 | C: 80 (49.1%); U: 83 (50.9%) |
|  | 9 | 371/112 | S(u | 1134 | C: 108 (9.5%); U: 1025 (90.3%) |
|  | 10 | 494/1481 | P(c | 1271 | C: 188 (14.8%); U: 1082 (85.1%) |
|  | 1 | 21/62 | S(u | 674 | C: 46 (6.8%); U: 628 (93%) |
|  | 2 | 87/259 | H( | 208 | C: 51 (24.4%); U: 158 (75.6%) |
|  | 3 | 131/392 | S(u | 1205 | C: 13 (11.1%); U: 1191 (98.8%) |
|  | 1 | 118/353 | S(u | 750 | C: 70 (9.3%); U: 680 (90.5%) |
|  | 2 | 272/815 | S(u | 731 | C: 68 (9.3%); U: 662 (90.4%) |
|  | 1 | 1/2 | T(a | 791 | C: 285 (36%); U: 505 (63.8%) |
|  | 2 | 22/65 | S(u | 628 | C: 82 (13%); U: 545 (86.6%) |
|  | 3 | 130/389 | S(u | 722 | C: 129 (17.8%); U: 593 (82%) |
|  | 4 | 227/680 | S(u | 840 | C: 190 (22.6%); U: 648 (77.1%) |
|  | 5 | 318/953 | T(a | 930 | C: 124 (13.3%); U: 803 (86.3%) |
|  | 1 | 17/50 | S(u | 531 | C: 115 (21.6%); U: 417 (78.4%) |
|  | 2 | 116/347 | P(c | 1198 | C: 107 (8.9%); U: 1088 (90.7%) |
|  | 1 | 358/1073 | S(u | 1467 | C: 198 (13.5%); U: 1226 (86.2%) |
|  | 1 | 10/30 | L(cu | 1107 | C: 1020 (92.1%); U: 85 (7.7%) |
|  | 2 | 169/505 | H( | 740 | C: 85 (11.5%); U: 651 (87.9%) |
|  | 1 | −/72 | C → U | 9350 | C: 24 (0.3%); U: 9258 (99%) |
|  | 1 | −/1327 | U → C | 2532 | C: 2148 (84.8%); U: 381 (15%) |
The asterisk indicates synonymous substitution. Bold letters represent changes of nucleotides and their positions in the codons.
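The columns of the editing table can be related to one another with a short sketch. It assumes that each read percentage is the read count divided by the coverage, that nucleotide positions are 1-based within the coding sequence, and that the standard genetic code applies to the edited codons. The function names are hypothetical, and the codons `UCA` and `CUC` are illustrative only, because the full codons are not recoverable from the table as extracted.

```python
BASES = "UCAG"
# Standard genetic code, codons ordered U, C, A, G at each position.
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def read_percentages(coverage: int, c_reads: int, u_reads: int) -> tuple[float, float]:
    """Share of reads supporting C and U at a site, as a percentage of coverage."""
    return (round(100 * c_reads / coverage, 1), round(100 * u_reads / coverage, 1))


def codon_coordinates(nt_pos: int) -> tuple[int, int]:
    """Map a 1-based nucleotide position to (codon number, position in codon)."""
    return ((nt_pos - 1) // 3 + 1, (nt_pos - 1) % 3 + 1)


def c_to_u_effect(codon: str, pos_in_codon: int) -> tuple[str, str, bool]:
    """Apply a C-to-U edit and return (amino acid before, after, is_synonymous)."""
    if codon[pos_in_codon - 1] != "C":
        raise ValueError("no C at the requested codon position")
    edited = codon[: pos_in_codon - 1] + "U" + codon[pos_in_codon:]
    before, after = CODON_TABLE[codon], CODON_TABLE[edited]
    return (before, after, before == after)


if __name__ == "__main__":
    # Row with position 245/734, coverage 3612, C: 1888, U: 1718.
    print(read_percentages(3612, 1888, 1718))
    print(codon_coordinates(734))
    # Illustrative codons (assumed, not taken from the table).
    print(c_to_u_effect("UCA", 2))  # second-position edit
    print(c_to_u_effect("CUC", 3))  # third-position edit
```

The same position check (nucleotide position divided by three, rounded up, should equal the amino acid position) is a quick way to spot transcription errors in the Position column.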
Figure 2. The predicted secondary structure of rrn5S with (lower) and without (upper) the RNA editing site. The color gradient (from purple to red) indicates the probability of connection among nucleotides (from 0 to 1). The black arrows indicate the position of RNA editing.
RNA expression of protein-coding genes in the L. lancifolium chloroplast genome.
| Gene | Length (bp) | RPKM | Gene | Length (bp) | RPKM | Gene | Length (bp) | RPKM |
|---|---|---|---|---|---|---|---|---|
|  | 1032 | 557101 |  | 744 | 1798 |  | 872 | 459 |
|  | 1443 | 72754 |  | 120 | 1780 |  | 534 | 454 |
|  | 1467 | 11128 |  | 1338 | 1766 |  | 2837 | 451 |
|  | 1416 | 9863 |  | 555 | 1700 |  | 2215 | 425 |
|  | 1233 | 9292 |  | 709 | 1460 |  | 1506 | 411 |
|  | 222 | 8993 |  | 417 | 1391 |  | 1470 | 400 |
|  | 1062 | 8199 |  | 963 | 1363 |  | 606 | 382 |
|  | 189 | 6822 |  | 306 | 1341 |  | 273 | 369 |
|  | 246 | 6521 |  | 1949 | 1259 |  | 1182 | 365 |
|  | 1527 | 5815 |  | 369 | 1235 |  | 966 | 362 |
|  | 132 | 5486 |  | 1420 | 1158 |  | 914 | 360 |
|  | 252 | 5439 |  | 399 | 1052 |  | 114 | 338 |
|  | 303 | 5273 |  | 228 | 1042 |  | 1497 | 310 |
|  | 2253 | 4791 |  | 2080 | 1004 |  | 711 | 303 |
|  | 2205 | 4244 |  | 102 | 932 |  | 363 | 292 |
|  | 123 | 3913 |  | 111 | 837 |  | 282 | 220 |
|  | 1497 | 3584 |  | 477 | 827 |  | 5577 | 207 |
|  | 204 | 3280 |  | 657 | 788 |  | 105 | 190 |
|  | 408 | 3191 |  | 1008 | 705 |  | 354 | 177 |
|  | 246 | 2600 |  | 540 | 680 |  | 4125 | 171 |
|  | 117 | 2587 |  | 2229 | 641 |  | 3207 | 133 |
|  | 129 | 2369 |  | 174 | 626 |  | 105 | 112 |
|  | 192 | 2181 |  | 468 | 622 |  | 90 | 89 |
|  | 306 | 2058 |  | 114 | 555 |  | 6621 | 70 |
|  | 1524 | 1966 |  | 279 | 523 |  | 231 | 56 |
|  | 1142 | 1964 |  | 393 | 496 |  | 960 | 13 |
|  | 1539 | 1865 |  | 1991 | 488 |  |  |  |
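For readers unfamiliar with the RPKM column, the following minimal sketch shows the standard definition: reads mapped to a gene, scaled per kilobase of gene length and per million mapped reads. The record does not describe how the authors computed their values, so the `rpkm` function and the read counts in the example are assumptions for illustration only.

```python
def rpkm(mapped_reads: int, gene_length_bp: int, total_mapped_reads: int) -> float:
    """Reads Per Kilobase of transcript per Million mapped reads.

    RPKM = mapped_reads * 1e9 / (gene_length_bp * total_mapped_reads)
    The factor 1e9 combines the per-kilobase (1e3) and per-million (1e6) scaling.
    """
    if gene_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("gene length and total mapped reads must be positive")
    return mapped_reads * 1e9 / (gene_length_bp * total_mapped_reads)


if __name__ == "__main__":
    # Hypothetical counts: 5,000 reads on a 1,032 bp gene in a library
    # of 10 million mapped reads. These numbers are not from the study.
    print(f"{rpkm(5_000, 1_032, 10_000_000):.1f}")
```

Because the value is normalized by length, a short gene with few reads can have a higher RPKM than a long gene with many reads, which is worth keeping in mind when comparing genes of very different lengths in the table.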