| Literature DB >> 27171472 |
Yan Holtz1, Morgane Ardisson2, Vincent Ranwez1, Alban Besnard2, Philippe Leroy3,4, Gérard Poux2, Pierre Roumet2, Véronique Viader2, Sylvain Santoni2, Jacques David1.
Abstract
Targeted sequence capture is a promising technology which helps reduce costs for sequencing and genotyping numerous genomic regions in large sets of individuals. Bait sequences are designed to capture specific alleles previously discovered in parents or reference populations. We studied a set of 135 RILs originating from a cross between an emmer cultivar (Dic2) and a recent durum elite cultivar (Silur). Six thousand sequence baits were designed to target Dic2 vs. Silur polymorphisms discovered in a previous RNAseq study. These baits were exposed to genomic DNA of the RIL population. Eighty percent of the targeted SNPs were recovered, 65% of which were of high quality and coverage. The final high density genetic map consisted of more than 3,000 markers, whose genetic and physical mapping were consistent with those obtained with large arrays.Entities:
Mesh:
Year: 2016 PMID: 27171472 PMCID: PMC4865223 DOI: 10.1371/journal.pone.0154609
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Polymorphisms available for bait design.
| Number | |
|---|---|
| SNPs on DWr | 17,543 |
| SNPs on BWr | 1,356 |
| SNPs available (total) | 18,899 |
| SNPs targeted by baits | 4,352 |
| SNPs available (total) | 84,710 |
| SNPs targeted by baits | 1,888 |
Fig 1Description of the bait design.
Orange lines represent the durum wheat genome, with the targeted SNP in brackets. Bait sequences are represented in grey. The number of SNPs targeted by each type of bait is specified.
Efficiency of the genotyping by capture protocol.
| Number of targeted SNPs | 6,240 |
| | |
| Not captured | 939 (15%) |
| Captured but monomorphic | 768 |
| Heterozygote state only | 160 (3%) |
| | |
| OR-SNPs | 4,373 |
| OR-clean SNPs (targeted and clean) (1) | |
| Clean bonus SNPs (2) | 2,305 |
| On untargeted contig | 1,410 |
| Keeping only one SNP per contig (3) | |
| Clean SNPs available (1+2) | 5,127 |
| SNP used for the genetic map (one per contig) (1+3) |
Fig 2Detection of a bonus SNP on the homeolog of a targeted contig.
A portion of A and B genomes were represented, with an SNP on the A genome (in red) and an SNP on the B genome (in blue). Divergences between both genomes are represented in green. The bait shown in grey was designed initially to capture a portion of the A genome, but captured also the homeologous portion of the B genome, with the related bonus SNP.
Features of the DS durum wheat genetic map.
| Chr. | SNPs | Length (cM) | Intermarker distance cM/marker | Biggest gap (cM) | unique pos. | Spearman r with gen. pos. | Spearman r with phys. pos. |
|---|---|---|---|---|---|---|---|
| 1A | 231 | 175.5 | 0.76 | 18.2 | 90 | 0.96 | 0.94 |
| 1B | 298 | 181.4 | 0.61 | 11.9 | 121 | 0.95 | 0.95 |
| 2A | 403 | 218.3 | 0.54 | 12.6 | 132 | 0.65 | 0.97 |
| 2B | 324 | 234.9 | 0.73 | 13 | 141 | 0.95 | 0.95 |
| 3A | 204 | 199.6 | 0.98 | 13.6 | 97 | 0.99 | 0.96 |
| 3B | 337 | 229.7 | 0.68 | 14.2 | 142 | 0.94 | 0.9 |
| 4A | 231 | 229.3 | 1.00 | 21.1 | 115 | 0.92 | 0.96 |
| 4B | 281 | 163.7 | 0.58 | 18.8 | 100 | 0.84 | 0.97 |
| 5A | 279 | 288.2 | 1.04 | 15.5 | 134 | 1 | 0.96 |
| 5B | 280 | 246.4 | 0.88 | 12.1 | 134 | 0.99 | 0.74 |
| 6A | 172 | 178.2 | 1.04 | 11.1 | 96 | 0.99 | 0.9 |
| 6B | 253 | 183.4 | 0.73 | 20.8 | 114 | 0.9 | 0.97 |
| 7A | 292 | 237.6 | 0.82 | 12.2 | 135 | 0.99 | 0.97 |
| 7B | 144 | 197.3 | 1.38 | 20.9 | 73 | 0.94 | 0.9 |
| Mean | 266.4 | 211.7 | 0.8 | 15.4 | 116.0 | 0.93 | 0.93 |
| Total | 3729 | 2964 | - | - | 1624 | - | - |
| Mean A | 258.9 | 218.1 | 0.9 | 14.9 | 114.1 | 0.93 | 0.95 |
| Mean B | 273.9 | 205.3 | 0.8 | 16.0 | 117.9 | 0.93 | 0.91 |
| Total A | 1812 | 1526.7 | - | - | 799 | - | - |
| Total B | 1917 | 1436.8 | - | - | 825 | - | - |
SNP number is given for each chromosome. Unique pos.: number of unique positions present for each chromosome. Spearman r with gen. pos.: Spearman’s rank correlation coefficients between DS genetic map and putative genetic positions on bread wheat (BWr). Spearman r with phys. pos.: Spearman’s rank correlation coefficients between DS genetic map and putative physical position. (See text).
Fig 3Correlations between putative physical and genetic positions.
The 14 durum wheat chromosomes are shown separately, with the putative physical position on the X-axis (bp) and the genetic position on the Y-axis (cM). The chromosome name is given at the top left of each plot. The number of markers is given in brackets. A red line represents the fitted polynomial model and a grey area represents the 95% confidence interval. The two vertical grey lines are maximum and minimum values of the physical positions.