| Literature DB >> 33104732 |
Yang Jae Kang1,2, Soonok Kim3, Jungho Lee4, Hyosig Won5, Gi-Heum Nam3, Myounghai Kwak3.
Abstract
Iris is one of the largest genera in the family Iridaceae, comprising hundreds of species, including numerous economically important horticultural plants used in landscape gardening and herbal medicine. Improved taxonomic classification of Iris species, particularly the endangered Korean-native Iris, is needed for correct species delineation. To this end, identification of diverse genetic markers from Iris genomes would facilitate molecular identification and resolve ambiguous classifications from molecular analyses; however, only two Iris plastid genomes, from Iris gatesii and Iris sanguinea, have been sequenced. Here, we used high-throughput next-generation sequencing, combined with Sanger sequencing, to construct the plastid genomes of 14 Korean-native Iris species with one outgroup and predict their gene content. Using these data, combined with previously published plastid genomes from Iris and one outgroup (Sisyrinchium angustifolium), we constructed a Bayesian phylogenetic tree showing clear speciation among the samples. We further identified sub-genomic regions that have undergone neutral evolution and accurately recapitulate Bayesian-inferred speciation. These contain key markers that could be used to identify and classify Iris samples into taxonomic clades. Our results confirm previously reported speciation patterns and resolve questionable relationships within the Iris genus. These data also provide a valuable resource for studying genetic diversity and refining phylogenetic relationships between Iris species.Entities:
Mesh:
Year: 2020 PMID: 33104732 PMCID: PMC7588056 DOI: 10.1371/journal.pone.0241178
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Iris species and outgroup used in this study.
| Scientific name | Sequencing method | GenBank ID | Reference |
|---|---|---|---|
| Mi-seq | MK593170 | In this study | |
| Mi-seq | NC_029227 | [ | |
| Hi-seq | NC_024936 | [ | |
| Mi-seq | MK593161 | In this study | |
| Mi-seq | MK593168 | In this study | |
| Mi-seq | MK593158 | In this study | |
| Mi-seq | MK593164 | In this study | |
| Mi-seq | MK593160 | In this study | |
| Hi-seq | MK593167 | In this study | |
| Hi-seq | MK593169 | In this study | |
| Mi-seq | MK593156 | In this study | |
| Sanger | MK593157 | In this study | |
| Sanger | MK593163 | In this study | |
| Mi-seq | MK593166 | In this study | |
| Mi-seq | MK593165 | In this study | |
| Sanger | MK593159 | In this study | |
| Mi-seq | MK593162 | In this study |
NGS sequencing statistics for Iris species and outgroup.
| Species | Trimmed data (bp) | Percentage of trimmed bases to original raw reads | # of aligned reads to the plastid genome | Plastid genome coverage (x) | Plastid genome length (bp) |
|---|---|---|---|---|---|
| 1,412,822,734 | 79.03% | 92,374 | 140 | 151,305 | |
| 2,169,071,850 | 78.93% | 309,614 | 544 | 153730 | |
| 1,305,887,854 | 71.47% | 338,839 | 661 | 153,055 | |
| 1,443,127,793 | 70.37% | 94,059 | 162 | 152,294 | |
| 4,340,216,662 | 86.23% | 654,027 | 381 | 151,081 | |
| 1,133,976,531 | 69.96% | 158,227 | 276 | 150,955 | |
| 1,033,295,333 | 69.29% | 143,774 | 246 | 152,562 | |
| 929,066,495 | 69.65% | 122,004 | 226 | 152,654 | |
| 1,350,741,033 | 69.42% | 179,282 | 309 | 153,083 | |
| 4,265,652,485 | 89.30% | 2,067,701 | 1,218 | 152,287 | |
| 1,330,043,383 | 70.53% | 179,563 | 309 | 152,408 | |
| 1,262,063,831 | 69.91% | 104,833 | 178 | 152,921 | |
| 2,344,526,509 | 77.33% | 256,424 | 455 | 152,281 |
Summary of plastid genome assemblies and gene annotations.
| Species | Plastid genome size | LSC | IRb | SSC | IRa | Predicted genes | High- quality predictions | tRNAs | rRNAs |
|---|---|---|---|---|---|---|---|---|---|
| 151305 | 80790 | 26177 | 18161 | 26177 | 83 | 63 | 30 | 12 | |
| 153573 | 83071 | 26181 | 18140 | 26181 | 83 | 71 | 31 | 12 | |
| 153730 | 83137 | 26199 | 18195 | 26199 | 83 | 70 | 31 | 12 | |
| 153055 | 82579 | 26094 | 18288 | 26094 | 83 | 68 | 30 | 12 | |
| 150947 | 81514 | 25549 | 18335 | 25549 | 83 | 72 | 31 | 12 | |
| 152294 | 82159 | 26023 | 18089 | 26023 | 83 | 72 | 30 | 12 | |
| 151081 | 81144 | 26022 | 17893 | 26022 | 83 | 71 | 31 | 12 | |
| 150955 | 81516 | 25549 | 18341 | 25549 | 83 | 68 | 30 | 12 | |
| 153620 | 82777 | 26784 | 17275 | 26784 | 83 | 68 | 31 | 12 | |
| 152562 | 82786 | 25948 | 17880 | 25948 | 83 | 73 | 31 | 12 | |
| 152654 | 82212 | 26183 | 18076 | 26183 | 83 | 71 | 31 | 12 | |
| 153083 | 82635 | 26183 | 18082 | 26183 | 83 | 71 | 31 | 12 | |
| 152287 | 82311 | 25920 | 18136 | 25920 | 83 | 71 | 31 | 12 | |
| 152921 | 82900 | 26067 | 17887 | 26067 | 83 | 72 | 31 | 12 | |
| 152281 | 82307 | 25920 | 18134 | 25920 | 83 | 71 | 30 | 12 | |
| Mean (bp) | 152423.20 | 82255.87 | 26053.27 | 18060.80 | 26053.27 | 83.00 | 70.13 | 30.67 | 12 |
*LSC: long single copy section, IRa and IRb: inverted repeats, SSC: short single copy section
Fig 1Ks distributions for Iris species pairwise comparisons.
(a) Pairwise Ks histogram nested to I. odaesanensis. (b) All-to-all triangle heatmap for modal values of pairwise Ks distributions.
Fig 2Bayesian inference phylogenetic tree generated using 57 intact single-copy plastid protein sequences that were predicted from each plastid genome assembly.
The species with pictures are Korean-native Iris. The branch colours and values correspond to posterior values.
Fig 3Plastid sub-genome selection by Tajima’s D statistics.
(a) Whole-plastid genome Tajima’s D distribution revealing few regions with values higher than -0.5. Upper panel shows diversity (pi) values, and lower panel depicts Tajima’s D values. (b) Hierarchical clustering of samples (rows) and polymorphic sites (columns) in the selected plastid sub-genome matrix by Tajima’s D. (c) Maximum likelihood phylogenetic tree with 1,000 bootstrap values generated from the genotype matrix of the selected plastid sub-genomes by Tajima’s D.
Species collected in this study.
| Species | Notes | Collection Localities | Voucher Number |
|---|---|---|---|
| Korea, Jeju-do, Seogwipo | KB NIBRVP0000470408 | ||
| Korea, Gyeonggi-do, Pocheon | KB NIBRVP0000138241 | ||
| Endangered | Korea, Gangwon-do, Goseong | KB NIBRVP0000472979 | |
| Korea, Gangwon-do, Goseong | KB NIBRVP0000410251 | ||
| Korea, Gyeongsangbuk-do, Yeongdeok | KB NIBRVP0000281433 | ||
| Korea, Jeollanam-do, Boseong | KB NIBRVP0000480267 | ||
| Korea, Gyeongsangbuk-do, Yeongdeok | KB NIBRVP0000281459 | ||
| Endangered | Korea, Daegu | DGU Won12627 | |
| Korea, Gangwon-do, Gangneung | KB NIBRVP0000612157 | ||
| Korea, Incheon, National Institute of Biological Resources | Living collection | ||
| Korea, Incheon, Ongjin, Is. Baengyeong | KB NIBRGR0000107578 | ||
| Endemic to Korea | Korea, Gyeongsangbuk-do, Cheongsong | KB GPI2009-005C | |
| Endemic to Korea | Korea, Gyeonggi-do, Yongin-si | KB NIBRVP0000480288 | |
| Korea, Jeollanam-do, Haenam | KB NIBRVP0000480289 | ||
| Endangered | Korea, Jeollanam-do, Jeongeup-si, Naejangsan National Park | KB GEIBVP0000170010 | |
| Korea, Gyeonggi-do, Yangpyeong | KB NIBRGR0000164710 |