| Literature DB >> 31300020 |
Hong Cheng1,2, Jing Liu2,3, Jia Wen2, Xiaojun Nie1, Luohao Xu3, Ningbo Chen2, Zhongxing Li4, Qilin Wang1, Zhuqing Zheng2, Ming Li2, Licao Cui1, Zihua Liu2, Jianxin Bian1, Zhonghua Wang1, Shengbao Xu1, Qin Yang1, Rudi Appels5, Dejun Han1, Weining Song6, Qixin Sun7,8, Yu Jiang9.
Abstract
BACKGROUND: Bread wheat is one of the most important and broadly studied crops. However, due to the complexity of its genome and incomplete genome collection of wild populations, the bread wheat genome landscape and domestication history remain elusive.Entities:
Keywords: Bread wheat; Genetic diversity; Haplotype; Introgression; Selection
Mesh:
Year: 2019 PMID: 31300020 PMCID: PMC6624984 DOI: 10.1186/s13059-019-1744-x
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Fig. 1Geographic distribution and population diversity of wheat accessions. a Schematic geographical distribution of the collection sites for 93 whole-genome sequencing (WGS) accessions and 92 whole-exome sequencing (WES) accessions. The locations of wild progenitors of bread wheat are shown in the inset. b–d Nucleotide diversity and population divergence across the wild progenitor, landraces, and varieties among the A subgenome, B subgenome, and D subgenome. The circle size and the value in each circle represent nucleotide diversity (π × 103), and the length of the line indicates divergence (Fst) among the populations. e The length of the deleted CNVs in divergent populations. The deleted CNVs were identified using the IWGSC RefSeq v1.0. The B subgenome contains more deleted CNVs in length than the A and D subgenomes in the wild emmer, landrace, and variety
Fig. 2Phylogenetic relationships and population structures. a The NJ tree was built with 1000 bootstraps using a total of 312,952 overlapping SNPs with exome capture data. Branch colors reflect different populations. Accessions are arranged according to the populations and their geographic locations. b Population structure of the 152 wheat accessions on AB subgenomes, including wild emmer (WEW), domesticated emmer (DEW), durum (DUM), landrace-west (LAW), landrace-east (LAE), variety (VAR). c Nucleotide distance (dxy) between each pairwise of wheat accessions by whole-genome resequencing. d NJ tree of 68 whole-genome resequencing accessions on the D subgenome. The tree was built using the 1,879,923 SNPs located in the genic region. e Population structure of the 68 wheat accessions on the D subgenome. f PCA plots of the first two components of 68 wheat accessions. The color and shape of dots separately indicate the population and location
Fig. 3Segmental ancestry inference of present-day bread wheat. a The distributions of long haploblocks with haplotype diversity along the AB subgenomes of bread wheat. The origins of haplotypes are shown in different colors. Segmental ancestry derived from the four populations of wild emmer, NL, SL-1, SL-2, and mix. b Haplotype patterns of chromosome 4A in diverse populations. Each column is an accession, each row is a phased haplotype. The haplotypes were constructed for each accession using all the SNPs on chromosome 4A. Alleles that are identical to or different from the ones in the IWGSC RefSeq v1.0 reference genome are indicated by blue and red, respectively. c ML tree of the longest haplotypes on chromosome 4A from ~ 170 to 440 Mb. All the accessions clustered into three groups corresponding to three distinct haplotypes in b
Fig. 4Genome-wide introgressions from wild relatives into landraces and varieties. a Map of the lengths and distributions of all the putative introgressed segments on 21 chromosomes. Introgression regions in different populations are distinguished by the color: blue for landrace, orange for variety, and green for the introgressed segments shared between landrace and variety. The length of these columns indicates the actual length of the introgression segments. Previously mapped QTLs overlapped with introgression regions are indicated to the right side of the chromosomes by colored rectangles, magenta for QTLs related to yield, blue for QTLs associated with disease resistance, and yellow for QTLs related to development. The QTLs indicate references and confidence intervals are provided in Additional file 4: Table S9. b ML tree of the 1B/1R region on the chromosome 1B of the IWGSC RefSeq 1.0 from 0 to approximately 240 Mb using sequences of each individual. C42 and C46 clustered with rye. c ML tree of the 23 Mb introgression fragment on chromosome 5B. Several bread wheat accessions clustered in a clade, segregating from all wild emmer accessions. d ML tree of a 0.45-Mb fragment on chromosome 5D introgressed from Ae. tauschii. This fragment was shared in six landrace accessions. Trees constructed by all accessions are provided in Additional file 1: Figures S21, S29, and S31
Fig. 5Genome-wide selective signals during domestication and improvement. Whole-genome screening of selective signals during domestication (a) and improvement (b). The ln π ratio values are plotted against the position on each of the 21 chromosomes. The horizontal gray dashed lines show the genome-wide threshold for selective sweeps, ln πwild emmer/πlandrace > 3.95, ln π/πlandrace > 4.29, and ln πlandrace/πvariety > 2.41. The previous reported QTLs that overlap with selection signals are highlighted with points in different colors, magenta for QTLs related to yield, blue for QTLs associated with disease resistance, and yellow for development related. All the windows overlapped with QTLs, and the QTLs indicate references and confidence intervals are provided in Additional file 9: Tables S15 and Additional file 10: S16. c–e The patterns of SNPs of the large regions with long stretches of elevated π ratio on chromosomes 3A, 5A, and 4B. Each column is an accession, and each row is an SNP site. Different colors donate the genotypes of SNPs, light blue for reference homozygous sites and red for homozygous non-reference sites