| Literature DB >> 30578434 |
Marie Noelle Ndjiondjop1, Nikolaos Alachiotis2, Pavlos Pavlidis2, Alphonse Goungoulou3, Sèdjro Bienvenu Kpeki3, Dule Zhao3, Kassa Semagn4.
Abstract
KEY MESSAGE: The extent of molecular diversity parameters across three rice species was compared using large germplasm collection genotyped with genomewide SNPs and SNPs that fell within selective sweep regions. Previous studies conducted on limited number of accessions have reported very low genetic variation in African rice (Oryza glaberrima Steud.) as compared to its wild progenitor (O. barthii A. Chev.) and to Asian rice (O. sativa L.). Here, we characterized a large collection of African rice and compared its molecular diversity indices and population structure with the two other species using genomewide single nucleotide polymorphisms (SNPs) and SNPs that mapped within selective sweeps. A total of 3245 samples representing African rice (2358), Asian rice (772) and O. barthii (115) were genotyped with 26,073 physically mapped SNPs. Using all SNPs, the level of marker polymorphism, average genetic distance and nucleotide diversity in African rice accounted for 59.1%, 63.2% and 37.1% of that of O. barthii, respectively. SNP polymorphism and overall nucleotide diversity of the African rice accounted for 20.1-32.1 and 16.3-37.3% of that of the Asian rice, respectively. We identified 780 SNPs that fell within 37 candidate selective sweeps in African rice, which were distributed across all 12 rice chromosomes. Nucleotide diversity of the African rice estimated from the 780 SNPs was 8.3 × 10-4, which is not only 20-fold smaller than the value estimated from all genomewide SNPs (π = 1.6 × 10-2), but also accounted for just 4.1%, 0.9% and 2.1% of that of O. barthii, lowland Asian rice and upland Asian rice, respectively. The genotype data generated for a large collection of rice accessions conserved at the AfricaRice genebank will be highly useful for the global rice community and promote germplasm use.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30578434 PMCID: PMC6449321 DOI: 10.1007/s00122-018-3268-2
Source DB: PubMed Journal: Theor Appl Genet ISSN: 0040-5752 Impact factor: 5.699
Summary of the different datasets, chromosomal distribution and physical map length of SNP markers used in the present study
| Chromosome | All SNPs used for genotyping | Dataset-1a | Dataset-2b | Number of SNPs polymorphic within | Map length (bp) | |||||
|---|---|---|---|---|---|---|---|---|---|---|
| Lowland | Upland | All | ||||||||
| Chr1 | 3124 | 3113 | 2758 | 309 | 570 | 390 | 1636 | 1046 | 1796 | 43,229,577 |
| Chr2 | 2812 | 2809 | 2458 | 281 | 468 | 333 | 1429 | 742 | 1566 | 35,885,080 |
| Chr3 | 2678 | 2668 | 2333 | 256 | 508 | 326 | 1362 | 897 | 1573 | 36,413,113 |
| Chr4 | 2378 | 2368 | 2077 | 242 | 380 | 300 | 1243 | 845 | 1376 | 35,497,590 |
| Chr5 | 1914 | 1914 | 1685 | 212 | 378 | 256 | 916 | 565 | 1024 | 29,763,242 |
| Chr6 | 2167 | 2156 | 1880 | 232 | 437 | 322 | 1274 | 848 | 1361 | 31,190,621 |
| Chr7 | 2017 | 2011 | 1793 | 226 | 399 | 282 | 1016 | 658 | 1146 | 29,679,399 |
| Chr8 | 1877 | 1877 | 1668 | 148 | 288 | 188 | 1298 | 723 | 1336 | 28,429,363 |
| Chr9 | 1559 | 1558 | 1387 | 157 | 267 | 192 | 849 | 389 | 943 | 22,947,498 |
| Chr10 | 1576 | 1573 | 1453 | 225 | 301 | 253 | 788 | 559 | 878 | 23,205,328 |
| Chr11 | 2085 | 2085 | 1851 | 318 | 449 | 352 | 1233 | 987 | 1405 | 29,000,206 |
| Chr12 | 1946 | 1941 | 1736 | 234 | 362 | 290 | 1062 | 601 | 1170 | 27,504,691 |
| Unmapped | 5606 | |||||||||
| Total | 31,739 | 26,073 | 23,079 | 2840 | 4807 | 3484 | 14,106 | 8860 | 15,574 | 372,745,708 |
| Polymorphism (%) | 82.1 | 88.5 | 10.9 | 18.4 | 13.4 | 54.1 | 34.0 | 59.7 | ||
aDataset-1 refers to the number of SNPs with two alleles irrespective of minor allele frequency (MAF)
bDataset-2 refers to the number of polymorphic SNPs across all 3245 accessions, each with a minor allele frequency of ≥ 0.01
Fig. 1Plot of PC1 and PC2 from principal component analysis of 3245 samples representing Oryza glaberrima (2358), O. barthii (115) and O. sativa (772) using 23,079 polymorphic SNPs. As only 14.4% of the 23,079 SNPs were polymorphic between O. barthii and O. glaberrima, these two species appeared nearly identical, while O. sativa genotypes showed clear separation into upland (218) and lowland (554) ecologies
Summary of the molecular diversity indices of three rice species based on six datasets of different marker density
| Species and dataseta | Results from Tajima’s neutrality testb | |||||
|---|---|---|---|---|---|---|
|
|
| Ps |
|
|
| |
| All | ||||||
| Dataset-1 | 2358 | 6843 | 0.262 | 0.031 | 0.01620 | − 1.381 |
| Dataset-2 | 2358 | 6057 | 0.262 | 0.031 | 0.01806 | − 1.213 |
| Dataset-3 | 2358 | 158 | 0.203 | 0.024 | 0.00083 | − 2.641 |
| Dataset-4 | 2358 | 320 | 0.272 | 0.033 | 0.01247 | − 1.725 |
| Dataset-5 | 2358 | 416 | 0.292 | 0.035 | 0.02194 | − 1.045 |
| Dataset-6 | 2358 | 374 | 0.263 | 0.032 | 0.01391 | − 1.565 |
| Group-1 | ||||||
| Dataset-1 | 983 | 4427 | 0.170 | 0.023 | 0.01425 | − 1.101 |
| Dataset-2 | 983 | 4108 | 0.178 | 0.024 | 0.01592 | − 0.979 |
| Dataset-3 | 983 | 76 | 0.097 | 0.013 | 0.00070 | − 2.598 |
| Dataset-4 | 983 | 204 | 0.174 | 0.023 | 0.01378 | − 1.169 |
| Dataset-5 | 983 | 261 | 0.183 | 0.025 | 0.01866 | − 0.690 |
| Dataset-6 | 983 | 246 | 0.173 | 0.023 | 0.01423 | − 1.115 |
| Group-2 | ||||||
| Dataset-1 | 1375 | 5414 | 0.208 | 0.027 | 0.01499 | − 1.270 |
| Dataset-2 | 1375 | 4834 | 0.209 | 0.027 | 0.01664 | − 1.106 |
| Dataset-3 | 1375 | 106 | 0.136 | 0.017 | 0.00089 | − 2.612 |
| Dataset-4 | 1375 | 244 | 0.208 | 0.027 | 0.00944 | − 1.832 |
| Dataset-5 | 1375 | 354 | 0.248 | 0.032 | 0.02054 | − 1.014 |
| Dataset-6 | 1375 | 285 | 0.201 | 0.026 | 0.01176 | − 1.546 |
| Minicore | ||||||
| Dataset-1 | 350 | 4664 | 0.179 | 0.028 | 0.01870 | − 1.020 |
| Dataset-2 | 350 | 4061 | 0.176 | 0.027 | 0.02066 | − 0.763 |
| Dataset-3 | 350 | 78 | 0.100 | 0.016 | 0.00106 | − 2.733 |
| Dataset-4 | 350 | 198 | 0.169 | 0.026 | 0.01473 | − 1.331 |
| Dataset-5 | 350 | 294 | 0.206 | 0.032 | 0.02453 | − 0.719 |
| Dataset-6 | 350 | 265 | 0.186 | 0.029 | 0.01720 | − 1.244 |
| Subset of minicore | ||||||
| Dataset-1 | 163 | 4114 | 0.158 | 0.028 | 0.01948 | − 0.986 |
| Dataset-2 | 163 | 3632 | 0.157 | 0.028 | 0.02147 | − 0.744 |
| Dataset-3 | 163 | 54 | 0.069 | 0.012 | 0.00132 | − 2.716 |
| Dataset-4 | 163 | 179 | 0.152 | 0.027 | 0.01655 | − 1.232 |
| Dataset-5 | 163 | 265 | 0.186 | 0.033 | 0.02544 | − 0.724 |
| Dataset-6 | 163 | 231 | 0.163 | 0.029 | 0.01809 | − 1.190 |
| Subset of minicore | ||||||
| Dataset-1 | 115 | 3904 | 0.150 | 0.028 | 0.02006 | − 0.971 |
| Dataset-2 | 115 | 3491 | 0.151 | 0.028 | 0.02211 | − 0.751 |
| Dataset-3 | 115 | 54 | 0.069 | 0.013 | 0.00162 | − 2.756 |
| Dataset-4 | 115 | 174 | 0.148 | 0.028 | 0.01790 | − 1.180 |
| Dataset-5 | 115 | 244 | 0.171 | 0.032 | 0.02622 | − 0.615 |
| Dataset-6 | 115 | 218 | 0.153 | 0.029 | 0.01872 | − 1.164 |
|
| ||||||
| Dataset-1 | 115 | 5685 | 0.218 | 0.041 | 0.04366 | 0.219 |
| Dataset-2 | 115 | 4919 | 0.213 | 0.040 | 0.04396 | 0.326 |
| Dataset-3 | 115 | 108 | 0.138 | 0.026 | 0.02020 | − 0.731 |
| Dataset-4 | 115 | 254 | 0.216 | 0.041 | 0.02258 | − 1.478 |
| Dataset-5 | 115 | 370 | 0.259 | 0.049 | 0.05148 | 0.184 |
| Dataset-6 | 115 | 317 | 0.223 | 0.042 | 0.04903 | 0.563 |
| Dataset-1 | 554 | 19,090 | 0.732 | 0.106 | 0.09722 | − 0.257 |
| Dataset-2 | 554 | 17,327 | 0.751 | 0.109 | 0.10654 | − 0.066 |
| Dataset-3 | 554 | 585 | 0.750 | 0.109 | 0.08782 | − 0.581 |
| Dataset-4 | 554 | 819 | 0.697 | 0.101 | 0.09336 | − 0.232 |
| Dataset-5 | 554 | 856 | 0.600 | 0.087 | 0.06371 | − 0.810 |
| Dataset-6 | 554 | 1015 | 0.714 | 0.104 | 0.09803 | − 0.163 |
| Dataset-1 | 218 | 12,734 | 0.488 | 0.082 | 0.04340 | − 1.513 |
| Dataset-2 | 218 | 12,066 | 0.523 | 0.088 | 0.04751 | − 1.474 |
| Dataset-3 | 218 | 398 | 0.510 | 0.086 | 0.04041 | − 1.679 |
| Dataset-4 | 218 | 607 | 0.517 | 0.087 | 0.04355 | − 1.588 |
| Dataset-5 | 218 | 604 | 0.424 | 0.071 | 0.04088 | − 1.356 |
| Dataset-6 | 218 | 426 | 0.299 | 0.050 | 0.01159 | − 2.447 |
| Both lowland and upland | ||||||
| Dataset-1 | 772 | 20,642 | 0.792 | 0.110 | 0.17332 | − 5.807 |
| Dataset-2 | 772 | 18,763 | 0.813 | 0.113 | 0.19298 | − 6.165 |
| Dataset-3 | 772 | 654 | 0.838 | 0.116 | 0.18026 | 1.639 |
| Dataset-4 | 772 | 910 | 0.774 | 0.107 | 0.17936 | 1.999 |
| Dataset-5 | 772 | 961 | 0.674 | 0.093 | 0.13485 | 1.324 |
| Dataset-6 | 772 | 1067 | 0.751 | 0.104 | 0.15331 | 1.413 |
aNumber of sequences or sample size (m); number of segregating sites (S); proportion of polymorphic sites (Ps) = S/n); θS = Ps/a1; nucleotide diversity (θπ); Tajima test (D)
bDataset-1 = 26,073 SNPs; Dataset-2 = 23,079 SNPs; Dataset-2 = 780 SNPs; Dataset-4 = 1175 SNPs; Dataset-5 = 1426 SNPs; Dataset-6 = 1421 SNPs. See Table 1 for details
Fig. 2Manhattan plots showing selective sweep regions detected in African rice (O. glaberrima). The horizontal solid line indicates the threshold value for declaring candidate selective sweeps (see material and method section for details)
Chromosomal distribution of number of selective sweep regions identified in three rice species
| Chromosome |
| Lowland | Upland |
|
|---|---|---|---|---|
| 1 | 3 | 11 | 8 | 6 |
| 2 | 6 | 11 | 6 | 1 |
| 3 | 1 | 15 | 5 | 2 |
| 4 | 2 | 9 | 10 | 3 |
| 5 | 4 | 5 | 8 | 3 |
| 6 | 5 | 10 | 4 | 2 |
| 7 | 3 | 8 | 4 | 1 |
| 8 | 2 | 2 | 3 | 1 |
| 9 | 2 | 4 | 10 | 2 |
| 10 | 2 | 3 | 5 | 4 |
| 11 | 3 | 5 | 3 | 6 |
| 12 | 4 | 6 | 2 | 5 |
| Grand total | 37 | 89 | 68 | 36 |
Fig. 3Genetic distance summaries between pairs of 3245 samples computed from SNPs that were polymorphic across all three species (23,079 SNPs), O. glaberrima (2840 SNPs), O. barthii (4807 SNPs), lowland O. sativa (14,106 SNPs) and upland O. sativa (8860 SNPs). The number of samples is shown in the legend within brackets
Analysis of molecular variance (AMOVA) for the extraction of SNP variation among and within groups (populations)
| Category | No. of SNPs used for analysis | Source of variation | Sum of squares | Variance components | Percentage of variation | |
|---|---|---|---|---|---|---|
| Four groups ( | 23,079 | Among groups | 3 | 7,600,354.2 | 5358.1 | 92.8 |
| Within groups | 3241 | 1,347,725.3 | 415.8 | 7.2 | ||
| Total | 3244 | 8,948,079.4 | 5774.0 | 100.0 | ||
| Two groups ( | 23,079 | Among groups | 1.0 | 18,837.7 | 84.9 | 27.65 |
| Within groups | 2471.0 | 548,912.1 | 222.1 | 72.35 | ||
| Total | 2472 | 567,749.8 | 307.0 | 100.0 | ||
| Two groups ( | 3484 | Among groups | 1.0 | 18,466.0 | 83.2 | 27.67 |
| Within groups | 2471.0 | 537,451.2 | 217.5 | 72.33 | ||
| Total | 2472 | 555,917.2 | 300.7 | 100.0 |