| Literature DB >> 35834607 |
Yun-Yu Chen1,2, Miriam Schreiber1,3, Micha M Bayer1, Ian K Dawson1,4, Peter E Hedley1, Li Lei5, Alina Akhunova5,6, Chaochih Liu5, Kevin P Smith5, Justin C Fay7, Gary J Muehlbauer5, Brian J Steffenson8, Peter L Morrell5, Robbie Waugh1,3, Joanne R Russell1.
Abstract
The distribution of recombination events along large cereal chromosomes is uneven and is generally restricted to gene-rich telomeric ends. To understand how the lack of recombination affects diversity in the large pericentromeric regions, we analysed deep exome capture data from a final panel of 815 Hordeum vulgare (barley) cultivars, landraces and wild barleys, sampled from across their eco-geographical ranges. We defined and compared variant data across the pericentromeric and non-pericentromeric regions, observing a clear partitioning of diversity both within and between chromosomes and germplasm groups. Dramatically reduced diversity was found in the pericentromeres of both cultivars and landraces when compared with wild barley. We observed a mixture of completely and partially differentiated single-nucleotide polymorphisms (SNPs) between domesticated and wild gene pools, suggesting that domesticated gene pools were derived from multiple wild ancestors. Patterns of genome-wide linkage disequilibrium, haplotype block size and number, and variant frequency within blocks showed clear contrasts among individual chromosomes and between cultivars and wild barleys. Although most cultivar chromosomes shared a single major pericentromeric haplotype, chromosome 7H clearly differentiated the two-row and six-row types associated with different geographical origins. Within the pericentromeric regions we identified 22 387 non-synonymous SNPs, 92 of which were fixed for alternative alleles in cultivar versus wild accessions. Surprisingly, only 29 SNPs found exclusively in the cultivars were predicted to be 'highly deleterious'. Overall, our data reveal an unconventional pericentromeric genetic landscape among distinct barley gene pools, with different evolutionary processes driving domestication and diversification.Entities:
Keywords: zzm321990Hordeum vulgarezzm321990; SNPs; diversity; domestication; evolution; pericentromeric regions
Mesh:
Substances:
Year: 2022 PMID: 35834607 PMCID: PMC9546296 DOI: 10.1111/tpj.15908
Source DB: PubMed Journal: Plant J ISSN: 0960-7412 Impact factor: 7.091
Figure 1Population structure of 815 barley accessions. (a) Principle coordinate analysis (PCO) based on 9845 randomly selected single‐nucleotide polymorphisms (SNPs). Samples are colour coded based on domestication status and row type. The proportion of variance explained by the PCOs are labelled beside the axes. The figure was produced with curlywhirly (https://ics.hutton.ac.uk/curlywhirly/). (b) Genetic admixture proportion inferred from faststructure based on the same 9845 SNPs for the PCO analysis. Colour blocks represent different estimated ancestral populations (K = 7). Samples were grouped based on domestication status and row type, as indicated at the black bars below. The figure was produced using structure plot (Ramasamy et al., 2014).
Figure 2Extensive genetic differentiation in the pericentromeric regions among Hordeum vulgare (barley) groups, showing all single‐nucleotide polymorphisms (SNPs) without minor allele frequency (MAF) filtering. The top track shows the chromosome diagrams, with the gradient of blue colours representing zone 1 (light blue), zone 2 (medium blue) and zone 3 (dark blue) regions, and the red bars representing the centromere, using the coordinates reported by Mascher et al. (2017) and physical distance. (a) Genetic diversity (π): red, wild barleys; orange, landraces; blue, cultivars. (b) Fixation index (F ST) between cultivars and wild barleys. (c) F ST between landraces and wild barleys. (d) F ST between cultivars and landraces. In (b) and (c), sites with F ST ≥ 0.8 were coloured red (with no such sites in panel d).
Figure 3Diagram of how different wild founder haplotypes give rise to horizontal F ST patterns. (a) In the simplest case, single‐nucleotide polymorphisms (SNPs) in cultivars and wild barleys are fixed completely at two different states and a track of F ST = 1 is formed. (b) Horizontal track with a lower F ST value is formed when some wild barleys share the fixed cultivated allele. (c) ‘Overlapping’ horizontal tracks of F ST formed when different wild barley alleles have varying degrees of differentiation from the cultivars. (d) ‘Break point’ variable horizontal tracks of F ST formed that represent rare recombination between two wild barley founder haplotypes. (e) Real exome sequence genotype data from a segment of barley chromosome 4H, zone 3, showing at least three wild barley founder haplotypes, separated by white space, in this region: the ancestors of the cultivars and one possible double crossover event between different wild founders (asterisk).
Linkage disequilibrium (LD) haplotype block structure for each group
| Group | Chr. | Chr. length (bp) | No. blocks | Block coverage (kb) | Chr. block coverage (%) | Largest block (kb) | No. SNPs in blocks |
|---|---|---|---|---|---|---|---|
| Cultivars | 1H | 558 535 432 | 932 | 505 269 | 90 | 161 870 | 22 405 |
| ( | 2H | 768 075 024 | 1418 | 707 970 | 92 | 184 043 | 33 234 |
| 3H | 699 711 114 | 1161 | 635 706 | 91 | 80 843 | 31 218 | |
| 4H | 647 060 158 | 691 | 610 364 | 94 | 258 652 | 20 001 | |
| 5H | 670 030 160 | 1351 | 615 478 | 92 | 186 594 | 36 651 | |
| 6H | 583 380 513 | 1041 | 542 069 | 93 | 149 053 | 26 062 | |
| 7H | 657 224 000 | 1221 | 597 940 | 91 | 89 407 | 29 601 | |
| Average | 1116 | 602 114 | 92 | 158 637 | 28 453 | ||
| Landraces | 1H | 558 535 432 | 1843 | 485 605 | 87 | 74 708 | 31 909 |
| ( | 2H | 768 075 024 | 2746 | 667 275 | 87 | 125 158 | 49 418 |
| 3H | 699 711 114 | 2613 | 611 705 | 87 | 134 606 | 49 199 | |
| 4H | 647 060 158 | 1476 | 602 320 | 93 | 185 970 | 34 126 | |
| 5H | 670 030 160 | 2457 | 591 257 | 88 | 185 621 | 50 045 | |
| 6H | 583 380 513 | 2170 | 508 486 | 87 | 130 284 | 40 095 | |
| 7H | 657 224 000 | 2501 | 572 238 | 87 | 76 166 | 46 584 | |
| Average | 2258 | 576 984 | 88 | 130 359 | 43 054 | ||
| Wild barleys | 1H | 558 535 432 | 5769 | 275 005 | 49 | 4476 | 41 438 |
| ( | 2H | 768 075 024 | 6835 | 373 400 | 49 | 6847 | 52 893 |
| 3H | 699 711 114 | 6686 | 364 791 | 52 | 81 241 | 49 423 | |
| 4H | 647 060 158 | 5153 | 392 599 | 61 | 10 417 | 45 920 | |
| 5H | 670 030 160 | 6684 | 326 365 | 49 | 55 927 | 49 417 | |
| 6H | 583 380 513 | 4588 | 316 772 | 54 | 17 857 | 35 907 | |
| 7H | 657 224 000 | 6932 | 306 958 | 47 | 7225 | 51 179 | |
| Average | 6092 | 336 556 | 51 | 26 284 | 46 597 |
Figure 4Gene haplotype analysis for different barley chromosome zones. Haplotypes of 32 222 genes with variants covered by exome sequencing were characterized. (a) Gene haplotype count by chromosome. (b) Gene haplotype count by chromosome zone. (c) Major haplotype frequency by chromosome. (d) Major haplotype frequency by chromosome zone. (e) Block size (bp) by chromosome. (f) Block size (bp) by chromosome zone. Key: blue, cultivars; orange, landraces; red, wild barleys.
Figure 5Signatures of positive selection in barley differentiated by chromosome and zone. (a) Selective sweep signal (μ) of barley genomes. Red colours represent genomic regions with μ values above the 95th percentile. The top track shows the chromosome diagrams, with the gradient of blue colours representing zone 1 (light blue), zone 2 (medium blue) and zone 3 (dark blue) regions, and the red bars representing the centromere, using the coordinates reported by Mascher et al. (2017). (b) Distribution of μ values by chromosome for different barley groups. (c) μ values by zone (data from all seven chromosomes combined) for different barley groups.
Figure 6Maximum‐likelihood (ML) trees for barley constructed using single‐nucleotide polymorphisms (SNPs) from zones 1 and 2, compared with ML tree constructed using zone‐3 SNPs. (a) Chromosome 4H. (b) Chromosome 7H.
Figure 7Pericentromeric genetic diversity in Hordeum vulgare (barley) visualized as haplogroups. Horizontal lines connecting through each chromosome represent barley accessions (colour coded by domestication status and row type). The vertical position of the line at any given chromosome represents the haplogroup number identified for that accession, based on the order presented in Table S5. The four panels show the diversity profile of: (a) all 815 accessions; (b) cultivars; (c) landraces; and (d) wild barleys.
Potential deleterious alleles fixed in domesticated gene pools
| Chr. | Position | Effect | Wild seq. | Cultivar seq. | Gene affected | Transcript affected | PROVEAN score | Annotation | Morex v.3 gene ID |
|---|---|---|---|---|---|---|---|---|---|
| 1H | 161 039 495 | Missense | Asn | Tyr |
| 1 | −3.819 | Galactosyltransferase | HORVU.MOREX.r3.1HG0031180 |
| 1H | 253 486 741 | Missense | Ser | Phe |
| 1, 2, 3 | −5.483 to −5.800 | ABC transporter G family member 24 | HORVU.MOREX.r3.1HG0038900 |
| 1H | 256 277 577 | Missense | Ala | Val |
| 1, 3, 4 | −3 | n/a | HORVU.MOREX.r3.1HG0039050 |
| 2H | 265 057 192 | Missense | Pro | Ser |
| 11, 31 | −2.511 | Pre‐mRNA‐splicing factor ATP‐dependent RNA helicase DEAH7 | HORVU.MOREX.r3.2HG0142570,HORVU.MOREX.r3.2HG0142550,HORVU.MOREX.r3.2HG0142540 (gene split in Morex v.3) |
| 2H | 269 489 889 | Missense | Cys | Arg |
| 2 | −3.955 | n/a | HORVU.MOREX.r3.2HG0142940 |
| 2H | 271 533 763 | Missense | Pro | Ser |
| 1, 2 | −6.607 to −6.973 | E3 ubiquitin protein ligase NEURL1B | HORVU.MOREX.r3.2HG0143100 |
| 2H | 273 026 038 | Missense | Glu | Asp |
| 4 | −2.911 | Peptide‐ | HORVU.MOREX.r3.2HG0143200 |
| 2H | 288 348 617 | Missense | Asp | Val |
| 1 | −7.99 | ATP‐dependent DNA helicase | HORVU.MOREX.r3.2HG0144360 |
| 2H | 302 860 598 | Missense | Ser | Thr |
| 2, 3 | −3 | n/a | no hit |
| 2H | 325 368 183 | Missense | Ser | Arg |
| 1, 2 | −5 | n/a | HORVU.MOREX.r3.2HG0146980 |
| 2H | 327 156 323 | Missense | His | Arg |
| 1 | −8 | n/a | no hit |
| 2H | 342 024 777 | Missense | Cys | Tyr |
| 1 | −10.236 | Tyrosine‐sulfated glycopeptide receptor 1 | HORVU.MOREX.r3.2HG0148300 |
| 2H | 352 826 802 | Missense | Lys | Met |
| 1 | −5.78 | AUGMIN subunit 3 | HORVU.MOREX.r3.2HG0149180 |
| 2H | 365 683 330 | Missense | Gly | Asp |
| 1 | −6.767 | n/a | HORVU.MOREX.r3.2HG0150130 |
| 2H | 397 248 990 | Missense | Ser | Thr |
| 3, 4 | −3 | P‐loop containing nucleoside triphosphate hydrolase | HORVU.MOREX.r3.2HG0152460 |
| 2H | 398 383 966 | Missense | Asn | Lys |
| 1 | −6 | n/a | no hit |
| 4H | 169 008 802 | Missense | Lys | Thr |
| 1, 2 | −4.900 to −4.933 | GRAS family transcription factor containing protein, expressed | HORVU.MOREX.r3.4HG0357830 |
| 4H | 195 116 684 | Missense | Pro | Leu |
| 1 | −3.439 | Putative inactive leucine‐rich repeat receptor‐like protein kinase | HORVU.MOREX.r3.4HG0360590 |
| 4H | 237 605 948 | Missense | Thr | Met |
| 1 | −5.473 | n/a | HORVU.MOREX.r3.4HG0363910 |
| 4H | 337 692 163 | Missense | Arg | Cys |
| 1, 2, 5, 6, 9, 10, 12, 15, 16, 17, 18, 19, 20, 21, 22 | −6.000 to −6.233 | Rho GTPase activator | HORVU.MOREX.r3.4HG0372920 |
| 4H | 340 149 652 | Missense | Leu | Val |
| 1, 2, 6, 8, 9 | −3 | n/a | no hit |
| 4H | 366 230 980 | Missense | Ser | Leu |
| 11, 12, 20 | −2.545 to −2.975 | β‐Adaptin‐like protein C | HORVU.MOREX.r3.4HG0374910 |
| 5H | 169 096 533 | Missense | Thr | Ile |
| 18 | −6 | Ureide permease 1‐like isoform X2 | HORVU.MOREX.r3.5HG0448790,HORVU.MOREX.r3.5HG0448780 (gene split in Morex v.3) |
| 5H | 200 493 783 | Missense | Ser | Tyr |
| 1 | −6 | n/a | no hit |
| 5H | 207 656 318 | Missense | Thr | Ile |
| 1 | −6 | n/a | HORVU.MOREX.r3.5HG0451070 |
| 5H | 261 369 954 | Missense | Gly | Ala |
| 1 | −4.628 | tRNA (guanine(37)‐ | HORVU.MOREX.r3.5HG0455140 |
| 6H | 231 545 723 | Missense | Thr | Ala |
| 1, 2, 3, 4, 5, 6, 7, 8, 9, 10, 12, 13 | −2.868 to −3.139 | Xaa‐Pro dipeptidase | HORVU.MOREX.r3.6HG0581810 |
| 6H | 238 482 883 | Stop lost | STOP | Trp |
| 4, 6, 9, 12, 18, 23, 25, 34, 61 | −3.011 to −3.292 | Probable magnesium transporter | HORVU.MOREX.r3.6HG0582120 |
| 6H | 291 363 394 | Missense | Thr | Ala |
| 4, 5, 6 | −2.682 | Vesicle transport protein | HORVU.MOREX.r3.6HG0586950,HORVU.MOREX.r3.6HG0586940 (gene split in Morex v.3) |