| Literature DB >> 29118779 |
Yun Song1,2, Yan Chen1, Jizhou Lv3, Jin Xu1,2, Shuifang Zhu1, MingFu Li1,2, Naizhong Chen1,2.
Abstract
Rice is the most important crop in the world as the staple food for over half of the population. The wild species of Oryza represent an enormous gene pool for genetic improvement of rice cultivars. Accurate and rapid identification of these species is critical for effective utilization of the wild rice germplasm. In this study, we developed valuable chloroplast molecular markers by comparing the chloroplast genomes for species identification. Four chloroplast genomes of Oryza were newly sequenced on the Illumina HiSeq platform and other 14 Oryza species chloroplast genomes from Genbank were simultaneously taken into consideration for comparative analyses. Among 18 Oryza chloroplast genomes, five variable regions (rps16-trnQ, trnTEYD, psbE-petL, rpoC2 and rbcL-accD) were detected for DNA barcodes, in addition to differences in simple sequence repeats (SSR) and repeat sequences. The highest species resolution (72.22%) was provided by rpoC2 and rbcL-accD with distance-based methods. Three-marker combinations (rps16-trnQ + trnTEYD + rbcL-accD, rps16-trnQ + trnTEYD + rpoC2 and rpoC2 + trnTEYD + psbE-petL) showed the best species resolution (100%). Phylogenetic analysis based on the chloroplast genome provided the best resolution of Oryza. In the comparison of chloroplast genomes in this study, identification of the most variable regions and assessment of the focal regions of divergence were efficient in developing species-specific DNA barcodes. Based on evaluation of the chloroplast genomic resources, we conclude that chloroplast genome sequences are a reliable and valuable molecular marker for exploring the wild rice genetic resource in rice improvement.Entities:
Keywords: DNA barcoding; Oryza; chloroplast genome; sequence divergence; variable markers
Year: 2017 PMID: 29118779 PMCID: PMC5661024 DOI: 10.3389/fpls.2017.01854
Source DB: PubMed Journal: Front Plant Sci ISSN: 1664-462X Impact factor: 5.753
A list of the 14 taxa sampled from Genbank in this study.
| Species | Accession number in Genbank |
|---|---|
| KJ830774 | |
| KM103379 | |
| KT992850 | |
| KM881638 | |
| KM881640 | |
| KM881642 | |
| KM103373 | |
| KU179220 | |
| KM088022 | |
| KM881643 | |
| KM103375 | |
| KF562709 | |
| JN861110 | |
| KM088016 |
Summary statistics for assembly of four Oryza species chloroplast genomes.
| Gene features | ||||
|---|---|---|---|---|
| Raw data no. | 9,642,763 | 10,287,100 | 9,797,240 | 9,910,302 |
| Mapped read no. | 105,824 | 140,832 | 211,039 | 499,727 |
| Mapped to reference genome (%) | 1.10 | 1.37 | 2.15 | 5.04 |
| Chloroplast genome coverage (×) | 117 | 156 | 235 | 556 |
| Size (bp) | 135,236 | 135,191 | 134,817 | 134,748 |
| LSC length (bp) | 81,135 | 81,212 | 80,844 | 80,788 |
| IR length (bp) | 20,802 | 20,820 | 20,822 | 20,817 |
| SSC length (bp) | 12,497 | 12,339 | 12,329 | 12,326 |
| Number of genes | 110 | 110 | 110 | 110 |
| Protein coding genes | 77 | 77 | 77 | 77 |
| tRNA genes | 29 | 29 | 29 | 29 |
| rRNA genes | 4 | 4 | 4 | 4 |
| GC content (%) | 39.1 | 39.0 | 39.0 | 39.0 |
| Accession number in Genbank | MF401453 | MF401451 | MF401450 | MF401452 |
Variability of the five new markers and the universal chloroplast DNA barcodes in Oryza.
| Markers | Length | Variable sites | Information sites | Discrimination success (%) based on Distance method | ||
|---|---|---|---|---|---|---|
| Numbers | % | Numbers | % | |||
| 1,500 | 101 | 6.73% | 45 | 3.00% | 66.67% | |
| 1,000 | 68 | 6.80% | 37 | 3.70% | 72.22% | |
| 900 | 47 | 5.22% | 16 | 1.78% | 72.22% | |
| 800 | 48 | 6.00% | 22 | 2.75% | 50.00% | |
| 800 | 54 | 6.75% | 28 | 3.50% | 66.67% | |
| 2,300 | 149 | 6.48% | 67 | 2.91% | 88.89% | |
| 2,300 | 155 | 6.74% | 73 | 3.17% | 77.78% | |
| 1,800 | 116 | 6.44% | 59 | 3.28% | 88.89% | |
| 1,800 | 122 | 6.78% | 65 | 3.61% | 94.44% | |
| 1,700 | 95 | 5.59% | 38 | 2.24% | 83.33% | |
| 1,700 | 101 | 5.94% | 44 | 2.59% | 83.33% | |
| 2,400 | 148 | 6.17% | 61 | 2.54% | 88.89% | |
| 2,500 | 169 | 6.76% | 82 | 3.28% | 83.33% | |
| 1,900 | 115 | 6.05% | 53 | 2.79% | 83.33% | |
| 1,600 | 102 | 6.38% | 50 | 3.13% | 77.78% | |
| 3,200 | 196 | 6.13% | 83 | 2.59% | 100.00% | |
| 2,600 | 170 | 6.54% | 87 | 3.35% | 100.00% | |
| 3,300 | 217 | 6.58% | 104 | 3.15% | 100.00% | |
| 800 | 22 | 2.75% | 12 | 1.50% | 33.33% | |
| 818 | 41 | 5.01% | 26 | 3.18% | 33.33% | |
| 563 | 12 | 2.13% | 9 | 1.60% | 16.67% | |
| 1,618 | 63 | 3.89% | 38 | 2.35% | 33.33% | |
| 2,181 | 75 | 3.44% | 47 | 2.15% | 38.89% | |