| Literature DB >> 30416368 |
Marie Noelle Ndjiondjop1, Kassa Semagn1,2, Jianwei Zhang3, Arnaud Comlan Gouda1, Sèdjro Bienvenu Kpeki1, Alphonse Goungoulou1, Peterson Wambugu4, Khady Nani Dramé5, Isaac Kofi Bimpong1, Dule Zhao1.
Abstract
Species misclassification (misidentification) and handling errors have been frequently reported in various plant species conserved at diverse gene banks, which could restrict use of germplasm for correct purpose. The objectives of the present study were to (i) determine the extent of genotyping error (reproducibility) on DArTseq-based single-nucleotide polymorphisms (SNPs); (ii) determine the proportion of misclassified accessions across 3134 samples representing three African rice species complex (Oryza glaberrima, O. barthii, and O. longistaminata) and an Asian rice (O. sativa), which are conserved at the AfricaRice gene bank; and (iii) develop species- and sub-species (ecotype)-specific diagnostic SNP markers for rapid and low-cost quality control (QC) analysis. Genotyping error estimated from 15 accessions, each replicated from 2 to 16 times, varied from 0.2 to 3.1%, with an overall average of 0.8%. Using a total of 3134 accessions genotyped with 31,739 SNPs, the proportion of misclassified samples was 3.1% (97 of the 3134 accessions). Excluding the 97 misclassified accessions, we identified a total of 332 diagnostic SNPs that clearly discriminated the three indigenous African species complex from Asian rice (156 SNPs), O. longistaminata accessions from both O. barthii and O. glaberrima (131 SNPs), and O. sativa spp. indica from O. sativa spp. japonica (45 SNPs). Using chromosomal position, minor allele frequency, and polymorphic information content as selection criteria, we recommended a subset of 24 to 36 of the 332 diagnostic SNPs for routine QC genotyping, which would be highly useful in determining the genetic identity of each species and correct human errors during routine gene bank operations.Entities:
Keywords: African rice; Asian rice; Cultivated rice; DArTseq; Genotyping by sequencing; Wild rice
Year: 2018 PMID: 30416368 PMCID: PMC6208651 DOI: 10.1007/s11032-018-0885-z
Source DB: PubMed Journal: Mol Breed ISSN: 1380-3743 Impact factor: 2.589
Fig. 1Comparison of identity-by-state-based genetic distance among pairs of DNA samples from the same accession as a measure of genotyping error of 117 DNA samples from 15 accessions. Each accession was represented from 2 to 16 DNA samples and genotyped with 31,7369 SNPs of which 23,490 SNPs were polymorphic. Only three accessions showed genotyping error between pairs of DNA samples greater than 1%
Fig. 2Neighbor-joining tree of 619 lowland (black) and upland (blue) O. sativa accessions based on genetic distance matrix computed from 27,645 polymorphic SNPs to demonstrate 64 misclassified O. glaberrima (red) and 4 O. barthii (green) accessions. Thirty-two of the 64 misclassified O. glaberrima accessions originated from regenerated seed lots
Fig. 3Neighbor-joining tree of O. glaberrima (black) accessions with misclassified O. sativa (blue) and O. barthii (green) accessions based on genetic distance matrix computed from 27,645 polymorphic SNPs
Summary of the 36 diagnostic SNPs recommended for routine quality control genotyping in three sets of rice germplasm. (See Supplementary Table S3 for details, including major and minor alleles and sequences)
| Clone (SNP) ID | Chromosome* | Physical position (bp)* | Minor allele frequency | Polymorphism information content (PIC) | Comment** |
|---|---|---|---|---|---|
| 19323604|F|0-33:G>A-33:G>A | 1 | 13,006,094 | 0.175 | 0.179 | Diagnostic between lowland |
| 3048732|F|0-43:C>T-43:C>T | 2 | 33,878,778 | 0.175 | 0.155 | Diagnostic between lowland |
| 5143398|F|0-19:C>A-19:C>A | 3 | 878,615 | 0.176 | 0.248 | Diagnostic between lowland |
| 3061709|F|0-55:T>C-55:T>C | 4 | 27,782,473 | 0.175 | 0.150 | Diagnostic between lowland |
| 19322100|F|0-34:G>A-34:G>A | 5 | 7,309,276 | 0.175 | 0.192 | Diagnostic between lowland |
| 5398605|F|0-5:A>C-5:A>C | 6 | 18,012,850 | 0.176 | 0.177 | Diagnostic between lowland |
| 5408937|F|0-68:G>A-68:G>A | 7 | 7,357,816 | 0.175 | 0.181 | Diagnostic between lowland |
| 3054571|F|0-22:G>T-22:G>T | 8 | 18,620,171 | 0.177 | 0.168 | Diagnostic between lowland |
| 3764294|F|0-38:T>G-38:T>G | 9 | 18,312,602 | 0.176 | 0.159 | Diagnostic between lowland |
| 3061808|F|0-33:G>A-33:G>A | 11 | 11,776,508 | 0.176 | 0.257 | Diagnostic between lowland |
| 3451491|F|0-7:A>G-7:A>G | 12 | 12,303,324 | 0.176 | 0.161 | Diagnostic between lowland |
| 3999042|F|0-64:A>T-64:A>T | 1 | 3,338,923 | 0.225 | 0.343 | Diagnostic between |
| 5400312|F|0-7:T>C-7:T>C | 2 | 14,337,444 | 0.224 | 0.346 | Diagnostic between |
| 3772372|F|0-31:A>C-31:A>C | 3 | 32,703,627 | 0.222 | 0.341 | Diagnostic between |
| 5404734|F|0-26:G>C-26:G>C | 4 | 12,064,222 | 0.224 | 0.341 | Diagnostic between |
| 9759889|F|0-13:C>A-13:C>A | 5 | 23,874,724 | 0.224 | 0.340 | Diagnostic between |
| 3767922|F|0-20:A>C-20:A>C | 6 | 29,415,564 | 0.224 | 0.342 | Diagnostic between |
| 3756907|F|0-10:G>A-10:G>A | 7 | 22,969,163 | 0.223 | 0.252 | Diagnostic between |
| 4392882|F|0-24:G>A-24:G>A | 9 | 17,267,685 | 0.225 | 0.470 | Diagnostic between |
| 5402462|F|0-8:A>G-8:A>G | 10 | 21,079,827 | 0.224 | 0.339 | Diagnostic between |
| 3766781|F|0-33:G>C-33:G>C | 11 | 19,638,886 | 0.223 | 0.342 | Diagnostic between |
| 9760013|F|0-21:G>C-21:G>C | 12 | 17,069,058 | 0.225 | 0.487 | Diagnostic between |
| 9752577|F|0-33:A>T-33:A>T | 1 | 40,671,128 | 0.219 | 0.344 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3449013|F|0-42:C>T-42:C>T | 2 | 2,687,374 | 0.219 | 0.382 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 9756975|F|0-38:A>C-38:A>C | 3 | 21,681,288 | 0.219 | 0.305 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3442388|F|0-60:C>A-60:C>A | 4 | 23,879,007 | 0.219 | 0.262 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 9759135|F|0-10:T>C-10:T>C | 5 | 20,316,123 | 0.219 | 0.318 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 5410301|F|0-24:C>T-24:C>T | 6 | 5,959,340 | 0.219 | 0.251 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3987858|F|0-29:T>A-29:T>A | 7 | 4,661,018 | 0.219 | 0.349 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 6997101|F|0-26:G>A-26:G>A | 8 | 8,968,236 | 0.219 | 0.384 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3052158|F|0-20:G>A-20:G>A | 9 | 1,894,850 | 0.219 | 0.252 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3771354|F|0-14:C>T-14:C>T | 9 | 18,345,726 | 0.219 | 0.339 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 5400008|F|0-23:G>A-23:G>A | 9 | 21,792,333 | 0.219 | 0.357 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3055213|F|0-32:A>G-32:A>G | 10 | 10,645,458 | 0.219 | 0.324 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 5388786|F|0-44:T>A-44:T>A | 11 | 2,335,078 | 0.219 | 0.386 | Diagnostic SNP between Africa rice species complex and Asian rice |
| 3995884|F|0-25:G>A-25:G>A | 12 | 25,433,207 | 0.219 | 0.332 | Diagnostic SNP between Africa rice species complex and Asian rice |
*Chromosome numbers and physical positions in base pairs (bp) are based on Rice_v9
**African rice species complex refers to O. glaberrima, O. barthii, and O. longistaminata
Fig. 4Haplotype pattern of 36 diagnostic SNP markers recommended for quality control genotyping between the three African species complex and O. sativa (14 SNPs), between O. longistaminata and O. barthii/O. glaberrima (11 SNPs) and between lowland O. sativa spp. indica and upland O. sativa spp. japonica (11 SNPs). Each species or ecotype is represented by seven randomly selected accessions, which are listed in rows on the left side with prefix (Glab O. glaberrima, Bart O. barthii, Long O. longistaminata, Indica O. sativa spp. indica, japonica O. sativa spp. japonica), followed by accession number. The SNP IDs are listed as column headings. Each column has two alleles that are shaded either in orange or blue. See Supplementary Table S1 for details on accessions, Supplementary Table S3 for SNP summary, and Supplementary Table S4 for the genotype data of all diagnostic markers across all 3037 accessions (excluding misclassified accessions)