| Literature DB >> 30816245 |
S Senthilvel1, Arpita Ghosh2, Mobeen Shaik3, Ranjan K Shaw3, Prashanth G Bagali2.
Abstract
Castor is a commercially important oilseed crop that provides raw materials for several industries. Currently, the availability of genomic resources for castor is very limited. In this study, genome-wide SNPs were discovered in castor via whole-genome sequencing of 14 diverse lines to an average of 34X coverage. A total of 2,179,759 putative SNPs were detected, and a genotyping array was designed with 6,000 high-quality SNPs representing 2,492 scaffolds of the draft castor genome (87.5% genome coverage). The array was validated by genotyping a panel of 314 inbred castor lines, which resulted in 5,025 scorable SNPs with a high call rate (98%) and reproducibility (100%). Using this array, a consensus linkage map consisting of 1,978 SNP loci was constructed with an average inter-marker distance of 0.55 cM. The genome-wide SNP data, the genotyping array and the dense linkage map are valuable genomic tools for promoting high-throughput genomic research and molecular breeding in castor.Entities:
Year: 2019 PMID: 30816245 PMCID: PMC6395776 DOI: 10.1038/s41598-019-39967-9
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Characteristics of castor inbred lines used for sequencing.
| Genotype | Morphological features | Special attribute |
|---|---|---|
| JI220 | Medium height, green stem, triple bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Elite inbred line, resistant to Fusarium wilt |
| JC12 | Medium height, red stem, double bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Elite inbred line, parent of a mapping population, resistant to reniform nematode |
| TMV5 | Medium height, red stem, triple bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Commercial variety |
| RG43 | Dwarf, red stem, triple bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, resistant to Fusarium wilt, reniform nematode and leafhopper ( |
| RG72 | Dwarf, red stem, double bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, early maturing and drought tolerant |
| RG3309 | Tall, Magogany stem, double bloom, short-spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, moderately resistant to gray mold and resistant to Fusarium wilt |
| DPC9 | Medium height, green stem, no bloom, spiny capsule, normal internode, flat leaves, pistillate inflorescence | Female parent of a commercial hybrid |
| 48-1 | Medium height, red stem, doube bloom, non-spiny capsule, normal internode, flat leaves, monoecious inflorescence | Commercial variety, parent of a mapping population, resistant to Fusarium wilt |
| RG1139 | Medium height, red stem, no bloom, short-spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, parent of a mapping population, moderately resistant to gray mold |
| RG2787 | Tall, Mahogany stem, double bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, resistant to Macrophomina root rot and Fusarium wilt |
| RG2819 | Tall, green stem, double bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, resistant to Macrophomina root rot and Fusarium wilt |
| VP1 | Dwarf, green stem, triple bloom, spiny capsule, condensed internode, cup shaped leaves, pistillate inflorescence | Female parent of a commercial hybrid |
| DCS9 | Medium height, red stem, double bloom, spiny capsule, normal internode, flat leaves, monoecious inflorescence | Commercial variety and parental line of a mapping population |
| RG3216 | Medium height, red stem, no bloom, short-spiny capsule, normal internode, flat leaves, monoecious inflorescence | Germplasm accession, parent of a mapping population |
Statistics on whole genome sequencing of 14 castor inbred lines.
| Genotype | Number of raw reads | Total data (Gb) | Number of filtered reads | Alignment (%) | Genome coverage (%) | Platform used |
|---|---|---|---|---|---|---|
| JI220 | 11,251,618 | 5.40 | 10,531,223 | 89.76 | 94.63 | MiSeq |
| JC12 | 12,626,709 | 7.20 | 11,906,343 | 96.91 | 94.97 | MiSeq |
| TMV5 | 62,392,969 | 12.47 | 57,398,835 | 97.36 | 95.57 | HiSeq |
| RG72 | 59,087,998 | 11.80 | 55,979,084 | 96.51 | 95.29 | HiSeq |
| RG3309 | 70,686,223 | 14.13 | 66,770,385 | 95.66 | 95.04 | HiSeq |
| DPC9 | 65,820,623 | 13.10 | 60,207,321 | 96.37 | 95.50 | HiSeq |
| RG43 | 12,787,405 | 5.90 | 11,622,830 | 87.41 | 92.84 | MiSeq |
| 48-1 | 51,270,973 | 10.25 | 48,407,365 | 97.26 | 95.50 | HiSeq |
| RG1139 | 73,466,140 | 14.69 | 69,352,963 | 97.83 | 97.02 | HiSeq |
| RG2787 | 63,835,573 | 12.76 | 58,743,049 | 95.58 | 95.57 | HiSeq |
| RG2819 | 62,782,508 | 12.55 | 58,926,757 | 96.90 | 95.70 | HiSeq |
| VP1 | 62,126,053 | 12.42 | 58,885,533 | 98.09 | 95.63 | HiSeq |
| DCS9 | 75,062,342 | 15.00 | 62,244,984 | 97.41 | 94.87 | HiSeq |
| RG3216 | 108,880,111 | 22.00 | 91,131,064 | 97.73 | 97.39 | HiSeq |
Statistics on SNPs/InDels identified between the samples and the reference.
| Genotype | Total SNPs identified | Transition | Transversions | Triallelic SNPs | Number of SNPs after filtration | Genic SNPs |
|---|---|---|---|---|---|---|
| JI220 | 294,956 | 167,623 | 78,092 | 51 | 55,587 | 29,439 |
| JC12 | 929,114 | 577,558 | 245,896 | 24 | 63,221 | 84,792 |
| TMV5 | 713,512 | 453,056 | 192,247 | 9 | 74,178 | 73,923 |
| RG72 | 644,037 | 406,047 | 172,030 | 5 | 82,802 | 62,838 |
| RG3309 | 852,482 | 225,033 | 538,652 | 6 | 105,300 | 80,195 |
| DPC9 | 604,063 | 380,789 | 161,192 | 6 | 78,457 | 56,241 |
| RG43 | 387,296 | 235,152 | 100,703 | 27 | 62,842 | 32,010 |
| 48-1 | 592,812 | 373,094 | 160,160 | 4 | 78,747 | 61,035 |
| RG1139 | 490,382 | 295,749 | 129,786 | 2 | 74,252 | 63,802 |
| RG2787 | 670,889 | 423,464 | 176,940 | 11 | 88,115 | 60,934 |
| RG2819 | 672,843 | 425,258 | 178,583 | 10 | 88,806 | 61,609 |
| VP1 | 440,832 | 266,069 | 118,137 | 2 | 73,224 | 53,660 |
| DCS9 | 305,433 | 187,765 | 90,138 | 0 | 111,179 | 39,500 |
| RG3216 | 415,777 | 117,476 | 56,680 | 0 | 74,089 | 63,916 |
Figure 1Distribution of call rate of SNPs across 314 castor inbred lines.
Figure 2Distribution of minor allele frequency (MAF) of SNPs in the array.
Figure 3Distribution of polymorphism information content (PIC) values of SNPs in the array.
Details of linkage maps constructed.
| Linkage group | (JC12 × 48-1 map) | (DCS9 × RG1139 map) | Consensus map | ||||||
|---|---|---|---|---|---|---|---|---|---|
| No. of markers | Map length (cM) | Average distance between markers (cM) | No. of markers | Map length (cM) | Average distance between markers (cM) | No. of markers | Map length (cM) | Average distance between markers (cM) | |
| LG1 | 207 | 200.0 | 1.07 | 182 | 148.2 | 0.91 | 314 | 155.1 | 0.54 |
| LG2 | 125 | 92.1 | 0.76 | 132 | 60.8 | 0.59 | 228 | 83.8 | 0.43 |
| LG3 | 107 | 105.6 | 1.08 | 155 | 102.9 | 0.75 | 207 | 105.6 | 0.54 |
| LG4 | 105 | 139.4 | 1.38 | 74 | 73.0 | 1.12 | 147 | 75.4 | 0.55 |
| LG5 | 83 | 128.2 | 1.62 | 74 | 84.5 | 1.28 | 135 | 86.9 | 0.70 |
| LG6 | 74 | 96.5 | 1.38 | 85 | 93.4 | 1.18 | 137 | 96.8 | 0.74 |
| LG7 | 139 | 105.6 | 0.80 | 174 | 105.9 | 0.68 | 249 | 105.9 | 0.45 |
| LG8 | 82 | 89.1 | 1.14 | 169 | 74.1 | 0.48 | 215 | 84.9 | 0.43 |
| LG9 | 102 | 95.5 | 1.06 | 183 | 115.1 | 0.73 | 249 | 115.1 | 0.53 |
| LG10 | 66 | 87.8 | 1.42 | 45 | 46.9 | 1.20 | 103 | 86.4 | 0.92 |
| Total map | 1090 | 1139.8 | 1.12 | 1273 | 904.8 | 0.81 | 1978 | 995.8 | 0.55 |
Figure 4Comparison of DCS9 × RG1139 and JC12 × 48-1 maps.