| Literature DB >> 24278269 |
James Breen1, Thomas Wicker, Margarita Shatalina, Zeev Frenkel, Isabelle Bertin, Romain Philippe, Wolfgang Spielmeyer, Hana Simková, Jan Safář, Federica Cattonaro, Simone Scalabrin, Federica Magni, Sonia Vautrin, Hélène Bergès, Etienne Paux, Tzion Fahima, Jaroslav Doležel, Abraham Korol, Catherine Feuillet, Beat Keller.
Abstract
Bread wheat (Triticum aestivum) has a large and highly repetitive genome which poses major technical challenges for its study. To aid map-based cloning and future genome sequencing projects, we constructed a BAC-based physical map of the short arm of wheat chromosome 1A (1AS). From the assembly of 25,918 high information content (HICF) fingerprints from a 1AS-specific BAC library, 715 physical contigs were produced that cover almost 99% of the estimated size of the chromosome arm. The 3,414 BAC clones constituting the minimum tiling path were end-sequenced. Using a gene microarray containing ∼40 K NCBI UniGene EST clusters, PCR marker screening and BAC end sequences, we arranged 160 physical contigs (97 Mb or 35.3% of the chromosome arm) in a virtual order based on synteny with Brachypodium, rice and sorghum. BAC end sequences and information from microarray hybridisation was used to anchor 3.8 Mbp of Illumina sequences from flow-sorted chromosome 1AS to BAC contigs. Comparison of genetic and synteny-based physical maps indicated that ∼50% of all genetic recombination is confined to 14% of the physical length of the chromosome arm in the distal region. The 1AS physical map provides a framework for future genetic mapping projects as well as the basis for complete sequencing of chromosome arm 1AS.Entities:
Mesh:
Substances:
Year: 2013 PMID: 24278269 PMCID: PMC3836966 DOI: 10.1371/journal.pone.0080272
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Genetic markers linked to the chromosome 1AS physical map.
| Marker | Zipper | TmGxG | TaBxT | TaAxF | TaNxW | TaSxO | TaCxCS | ConSSR | Com2004 | ConsPos |
| bcd1434 | 18 | 5.1 | – | – | – | – | – | – | – | 5 |
| psp2999(Pm3) | 28 | 4.8 | 3 | 1 | – | 14 | – | – | 12 | 7 |
| whs179 | 29 | – | – | – | – | – | – | – | – | 10 |
| gmw136 | 29 | – | – | – | – | – | – | – | 14 | 14 |
| cfa2153 | 39 | – | – | 1 | - | 13.8 | 5.4 | – | 15 | 9 |
| barc148 | 48 | – | – | – | 55 | 33.4 | – | 56.8 | – | 48 |
| F640(Lr10) | 56 | – | – | – | – | – | – | – | – | 22 |
| mwg2245b (Lr10) | 56 | – | – | – | – | – | – | 22 | – | 22 |
| mag1884 | 88 | – | – | – | 22 | – | – | – | – | 22 |
| gpw2005 | 122 | – | – | – | - | – | 27.3 | – | 31 | 29 |
| wmc24 | 176 | – | – | – | 45 | 35 | – | 48.8 | 37 | 41 |
| gpw2142 | 343 | – | – | – | – | 43.1 | – | – | 54 | 49 |
| cfd58 | 452 | – | – | 52.9 | – | – | – | – | - | 53 |
| psp3027 | 496 | – | – | – | – | – | – | – | 50 | 50 |
| fba393 | 588 | – | – | – | – | – | 12.5 | – | 21 | 17 |
| wmc286 | C | – | – | – | – | – | – | – | 45 | 45 |
Position of the closest gene in the reference zipper.
TmGxG: T. monococcum G1777×G2528 [67].
TaBxT: Banks×Banks+tin (Spielmeyer et al., unpublished).
TaAxF: Arina×Forno [68].
TaNxW: Nanda2419×Wangshuibai [69].
TaSxO : Syntethic×Opata [70].
TaCxCS: Courtot×Chinese Spring (Sourdille, unpublished).
ConSSR: Consensus SSR [53].
Com2004: Wheat-Composite 2004 (Appels et al., unpublished).
ConsPos: Consensus position (approximate average of all cM positions for each marker).
Genetic positions (telomere to centromere) from different maps were taken from published genetic maps in GrainGenes (wheat.pw.usda.gov). The numbers in the individual marker fields indicate the cM position of the respective marker. Note that markers mapping to the same position in the reference zipper can have multiple different cM positions, depending on the map/population.
Comparison of the results of two physical map assembly algorithms used to assemble chromosome 1AS fingerprints, FPC and LTC.
| FPC | LTC | |
| Number of contigs | 805 | 505 |
| Contigs with more than 5 BACs | 631 | 394 |
| Assembly length [Mb] | 236 | 226 |
| Chromosome arm fraction | 85.8% | 82% |
| Maximum clones on a contig | 274 | 317 |
| Clones in contigs | 20,705 | 21,622 |
| Singletons | 5,213 | 4,296 |
| Contigs in N50 | 175 | 90 |
| Length of N50 contig [kb] | 466 | 798 |
Figure 1Pajek analysis of BAC contigs from chromosome 1AS assembled with the FPC software.
Repetitive BAC clones within the BAC fingerprints were problematic for the FPC assembly, leading to non-linear contig patterns. The reassembly of fingerprints using the LTC assembly program resolved non-linear contigs. The dashed line indicates where the non-linear contig was cut into two contig segments, because the two segments are only connected by a single BAC clone (indicated by all connections converging in one point.
Final composition of the BAC clone backbone of the wheat 1AS physical map.
| LTC contigs with >5 BACs | 394 |
| BACs in LTC contigs | 21,255 |
| Total size of LTC contigs [Mbp] | 226 |
| Included FPC contigs | 321 |
| BACs in FPC contigs | 610 |
| Total size of FPC contigs [Mbp] | 48 |
| Total number of contigs | 715 |
| Total BACs included | 21,865 |
| Total size of the backbone [Mbp] | 274 |
| Total coverage of 1AS [%] | 99 |
The backbone consist mainly of contigs assembled with the LTC software (Frenkel et al., 2010) and is complemented with BACs from an earlier assembly with the FPC software (Soderlund et al., 1997, Table 1).
Figure 2Comparison of LTC and FPC assemblies.
a. The BAC clones constituting the contig ltc5279 are depicted at the top. Underneath, FPC contigs which cover corresponding regions are displayed. Gray lines connect the start points of corresponding BACs. Contig ltc5279 (approximately 2,161 kb in size) which is the fusion product of eight smaller FPC contigs. Overall, the relative positions of BAC clones within LTC and FPC contigs are very similar. b. Example of three small FPC contigs which are merged into one LTC contig (ltc132). This LTC contig also includes BACs which were singletons in the FPC assembly (blue). Note that in a and b the scales are different. c. Size distribution of overlaps of FPC contigs which were merged in the LTC assembly. The x-axis indicates the size range of overlaps of two FPC clones that were merged by LTC. The y-axis shows how many cases were identified in each size range. The gray series shows the size distribution of all overlaps. The blue series shows only those cases where additional singletons were included to merge FPC contigs while the red series shows the cases where no additional clones were used for the merging.
Figure 3Sequence composition of minimum tiling path BAC-end sequences from chromosome 1AS.
a. Pie chart indicating the composition of chromosome 1AS MTP BAC-ends. b. and c. Composition of the TE fraction. b. Composition of TE superfamilies. c. Composition of the TE fraction broken down into TE families. The three-letter code preceding the family name is according to the TE classification system proposed by Wicker et al., 2007.
Deletion bin-mapped wheat ESTs on wheat chromosome arm 1AS and estimates of gene density within the bins.
| Deletion bin | size [Mbp] | ESTs | Zipper hits | Zipper genes | gene density |
| 1AS-0.86–1.00 | 39 | 196 | 25 | 1–200 | 5.12 |
| 1AS-0.47–0.86 | 107 | 171 | 41 | 200–650 | 4.20 |
| 1ASC-0.47 | 129 | 16 | 1 | 650–749 | 0.77 |
| Total | 275 | 383 | 67 | 1–749 | 2.72 |
Number of wheat ESTs mapped specifically to respective deletion Bin (Sorrels et al., 2003).
Number of genes in the Brachypodium 1S reference zipper which have homology at the nucleotide level to bin-mapped ESTs.
Approximate range of zipper genes corresponding to wheat deletion bin.
Approximate gene density, assuming the same number of genes for wheat as in the 1AS syntenic region in Brachypodium.
Figure 4Physical map of wheat chromosome 1AS.
The figure integrates multiple sequence resources. a. Chromosome 1AS deletion bin map with the three bins shown in (yellow, green and gray). ESTs from the three deletion bins which were mapped to Brachypodium reference zipper genes are indicated with boxes with colour of the corresponding bin. If more than one EST mapped to the same Brachypodium gene, the boxes were stacked on top of each other. This information was used to estimate the boundaries of each deletion bin in the Brachypodium reference zipper (dashed lines). b. Brachypodium reference zipper. c. Physical map of the 1AS chromosme arm. BAC contigs are symbolised with blue lines (see enlarged legend at the right). The length of the line reflects the number of putative syntenic genes found on the contig, not its physical size. Syntenic genes are also symbolised by black boxes. The number of non-syntenic genes for each contig is indicated with a stack of red boxes. Grey boxes indicate place holders for contigs that contained no syntenic genes but were anchored by means other than synteny (e.g. genetic markers of centromere-specific repeats. d. Published genetic markers from chromosome 1AS that were used to deduce an estimated genetic map (marker and map names and genetic distances are detailed in Table 1).
Figure 5The three levels of anchoring used in the construction of the chromosome 1AS physical map.
On level 1, genes were anchored to physical BAC contigs using positive hybridisation probe matches, BAC-end sequences and Illumina contigs. Individual anchoring procedures are indicated by capital letters in circles and described in the text. For level 2 anchoring, all BAC contigs which contain genes which have their homologs in the 1AS syntenic region of Brachypodium, rice or sorghum were anchored to the reference zipper. This means that the order of genes in wheat was assumed to be the same as in Brachypodium, rice or sorghum. In the final step (level 3), data from genetic markers were used to anchor BAC contigs to previously published genetic maps.
Numbers of genes which were assigned to 1AS BAC contigs during level 1 anchoring.
| Approach | Total genes | Zipper |
| Hybridisation | 389 | 154 |
| BAC-ends | 180 | 32 |
| Illumina/BAC-end | 117 | 42 |
| Illumina/unigene | 61 | 26 |
| PCR | 0 | 4 |
| Total | 747 | 254 |
Genes that have homologs in the 1AS reference zipper and which could be used for level 2 anchoring.
Statistics on level 2 anchoring of BAC contigs in the 1AS physical map.
|
| |
| Non-centromeric contigs | 158 |
| Non-centromeric contigs size [Mbp] | 102.1 |
| 1AS fraction covered [%] | 37 |
| Syntenic genes | 254 |
| Non-syntenic genes | 326 |
|
| |
| Centromeric contigs | 26 |
| Centromeric contigs size [Mbp] | 10.7 |
| 1AS fraction covered [%] | 4 |
| Genes on centromeric contigs | 40 |
|
| |
| Total achored contigs | 185 |
| Total contig size [Mbp] | 112.8 |
| Total 1AS fraction covered [%] | 41 |
| total genes | 620 |
Genes which have their closest homolog in the 1AS syntenic region of Brachypodium, rice and sorghum.
Genes which have their closest homolog outside the 1AS syntenic region of Brachypodium, rice and sorghum.
Figure 6Small scale validation of the chromosome 1AS physical map using information from the previously published Pm3 powdery mildew resistance locus.
Two assembled physical contigs (ltc132 and ltc5245) were linked together using a previously published 178 kb sequence from chromosome 1AS (cv. Chinese Spring) covering the the Pm3 locus (Wicker et al. 2007). Approximate locations of NimbleGen transcriptome hybridisation probes are shown in blue. The Pm3 and the low molecular weight (LMW) glutenin loci are known to be closely linked (Wicker et al. 2003; Wang et al. 2010). The inset shows a phylogenetic analysis that compares glutenin UniGene sequences with previously published glutenin genes from 1AS (TmGluA3, green), 1BS (TaGluB) and 1DS (TaGluD).
Information contained in the definition lines of Illumina sequences that were used for the integrated 1AS sequence model in the order it is given in the definition line.
| Name | Example | Explanation |
| Identifier | 1AS_z_110 | Chromosome arm and zipper position |
| Illumina name | 1AS_c-5660802 | Original name of Illumina contig |
| Gene content | Bradi3g05850 282–1376 | Approximate position of gene(s) |
| Contig position | on ctg: 108–216 | Position of respective BAC in contig [kb] |
| NimbleGen probe | signal = Ta_S12867912 | Name of probe that gave a signal |