| Literature DB >> 25886949 |
Katherine W Jordan1, Shichen Wang2, Yanni Lun3,4, Laura-Jayne Gardiner5, Ron MacLachlan6, Pierre Hucl7, Krysta Wiebe8, Debbie Wong9, Kerrie L Forrest10, Andrew G Sharpe11, Christine Hd Sidebottom12, Neil Hall13, Christopher Toomajian14, Timothy Close15, Jorge Dubcovsky16,17, Alina Akhunova18,19, Luther Talbert20, Urmil K Bansal21, Harbans S Bariana22, Matthew J Hayden23, Curtis Pozniak24, Jeffrey A Jeddeloh25, Anthony Hall26, Eduard Akhunov27.
Abstract
BACKGROUND: Bread wheat is an allopolyploid species with a large, highly repetitive genome. To investigate the impact of selection on variants distributed among homoeologous wheat genomes and to build a foundation for understanding genotype-phenotype relationships, we performed population-scale re-sequencing of a diverse panel of wheat lines.Entities:
Mesh:
Year: 2015 PMID: 25886949 PMCID: PMC4389885 DOI: 10.1186/s13059-015-0606-4
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Figure 1Summary of re-sequencing panel. (a) Evolution of the hexaploid wheat genome. The tetraploid wheat T. turgidum (AABB) originated by the hybridization of T. urartu with the close unidentified relative of Ae. speltoides occurred about 0.58 to 0.82 million years ago according to the genome-wide divergence time estimate [10]. The origin of hexaploid wheat occurred about 10,000 years ago [11] by the hybridization of T. turgidum with Ae. tauschii (DD) [12]. Marcussen et al. [10] suggested that Ae. tauschii might have originated by homoploid hybrid speciation (shown by dashed arrows). (b) Geographic distribution of 62 accessions of wheat accessions. Pie charts indicate the proportion of genetic ancestry for K = 4 inferred using Structure. (c) Efficiency of homoeologous gene capture. The depth of read coverage was extracted for each of the three copies of 47,739 homoeologous gene sets. The histogram of the log2 transformed ratio of read coverage between A and B (red), A and D (blue), and B and D (green) genomes was plotted. Each plot shows a normal distribution with the overall mean at 0. (d) Overlap between the SNP and indel datasets generated by WEC and GBS. (e) Minor allele frequency of different functional classes of SNPs as a proportion of total SNPs within each genome and class. PTC: premature termination codons; SSD: splice-site disruptions.
Figure 2Distribution of genetic diversity, allele frequency, and recombination across the wheat genome. (a) Distribution of genetic diversity in the A (green), B (red), and D (blue) genomes: (π, top left), Tajima’s measure of site frequency spectrum (D, top right), historic recombination (ρ, bottom left), and LD (bottom right). (b) Distribution of nucleotide diversity π (shaded polygon), FST between cultivars and landraces (solid black line), and site frequency spectrum (D) along chromosomes 5A (top panel), 5B (middle panel) and 5D (bottom panel). Gray shaded boxes represent the approximate location of the centromere. Rug plots represent lower (red) and upper (blue) 2.5% tails of test statistic distribution. Black X above the plot represents upper 2.5% tail of ρ statistic. The location of domestication (Q) gene is shown by arrow. (c). Distribution of alleles of the AL8/78 genotype of Ae. tauschii along the chromosomes of the D genome in the 26 wheat landraces. The average frequency of AL8/78 alleles was calculated in a 3 Mb sliding window. The color scale shows the proportion of the AL8/78 alleles in each window (red - highest, blue - lowest).
Figure 3Genotype imputation. (a) Relationship between the accuracy of genotype imputation and the percentage of missing data, which is estimated after removing genotypes over a range of genotype calling probability thresholds. Imputation in Opata (solid lines) and Rialto (dashed lines) cultivars was performed using the reference panel of 60 lines (Opata and Rialto cultivars were excluded) genotyped using the 90 K iSelect assay. (b) Genotype imputation at disease resistance loci. Two GWAS regions overlapped with the positions of the previously mapped Lr37/Yr17/Sr38 (middle panel) and Lr68 (right panel) disease resistance loci; the markers associated with these loci showed highest similarity to CSS contigs 2AS-5264433 and 7BL-6748067, respectively. SNP sites directly genotyped using the 90 K SNP array are shown as red dots; imputed SNPs are shown as black dots.