| Literature DB >> 34220925 |
James Sweet-Jones1, Vasileios Panagiotis Lenis2,3, Andrey A Yurchenko4, Nikolay S Yudin4, Martin Swain2, Denis M Larkin1,4.
Abstract
BACKGROUND: Advances in genetic tools applied to livestock breeding has prompted research into the previously neglected breeds adapted to harsh local environments. One such group is the Welsh mountain sheep breeds, which can be farmed at altitudes of 300 m above sea level but are considered to have a low productive value because of their poor wool quality and small carcass size. This is contrary to the lowland breeds which are more suited to wool and meat production qualities, but do not fare well on upland pasture. Herein, medium-density genotyping data from 317 individuals representing 15 Welsh sheep breeds were used alongside the whole-genome resequencing data of 14 breeds from the same set to scan for the signatures of selection and candidate genetic variants using haplotype- and SNP-based approaches.Entities:
Keywords: Wales; adaptation; sheep; signatures of selection; whole-genome resequencing
Year: 2021 PMID: 34220925 PMCID: PMC8253514 DOI: 10.3389/fgene.2021.612492
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Breed representation in the genotyping and resequencing datasets.
| Breed | Abbreviation | Genotyped samples | Horns | Base color | Fleece | HapFLK group | Resequencing group |
| Badger Faced Welsh Mountain | BFWM | 21 | Yes | Black and white | Firm | Upland | NA* |
| Balwen | − | 14 | No | Black and white | Firm | Upland | Upland |
| Beulah | – | 22 | No | Black and white | Fine | KHB | Lowland |
| Black Welsh Mountain | BWM | 24 | Yes | Black | Firm | Upland | Upland |
| Brecknock Hill Cheviot | BHC | 24 | No | White | Fine | Upland | Upland |
| Clun Forest | − | 17 | No | Black | Fine | Lowland | Lowland |
| Dolgellau Welsh Mountain | DWM | − | Yes | White | Firm | NA* | Upland |
| Hardy Speckled Faced | HSF | 24 | Yes | White | Fine | Lowland | Lowland |
| Hill Radnor | − | 21 | No | Gray-brown | Fine | Lowland | Lowland |
| Kerry Hill | − | 18 | No | White | Fine | KHB | Lowland |
| Llandovery White Faced | LWF | 24 | No | White | Fine | Upland | Upland |
| Llanwenog | − | 21 | No | Black | Fine | Lowland | Lowland |
| Lleyn | − | 22 | No | White | Fine | Lowland | Lowland |
| South Wales Welsh Mountain | SWWM | 17 | Yes | White | Firm | Upland | Upland |
| Talybont Welsh Mountain | TWM | 24 | No | White | Firm | Upland | Upland |
| Welsh Mountain Hill Flock | WMHF | 24 | Yes | White | Firm | Upland | NA |
| Average/Total | − | 21/317 | No | − | − | 102/40/173 | 7/7 |
FIGURE 1Output of genomic regions under strong selection in the KHB group from the HapFLK study. This includes the Manhattan plot (A) with a gene proximal to the most significant SNP of the highly selected peaks. Cluster plot (B) shows the tracks of haplotype diversity for the peak highlighted on the Manhattan plot by black borders where the red line represents the most significant SNP.
FIGURE 2Manhattan plots of Llandovery White-faced (A) and South Wales Welsh Mountain (B) genotyped breeds’ DCMS output. All other breeds are shown in Supplementary Figure 2. Selection thresholds for a suggestive (q < 0.05) and strong (q < 0.01) selection shown by blue and red lines, respectively. Significant selection peaks have been annotated with the name of the top-ranked gene of that region.
Candidate genomic regions and genes from the HapFLK analysis and corresponding DCMS region.
| OAR | Region start | Region end | Group | Candidate genes (rank) | Function | Overlapping DCMS result | |
| 1 | 258,588,662 | 275,499,089 | Upland | 0.0002 | |||
| 3 | 23,229,012 | 33,863,536 | Upland | 4.0 × 10–6 | Cholesterol efflux | Balwen | |
| 6 | 31,991,037 | 45,997,407 | Upland | 1.0 × 10–5 | Metabolism | LWF, WMHF, SWWM | |
| 10 | 35,062,152 | 51,872,705 | KHB | 8.0 × 10–5 | Neurodevelopment, cell adhesion | ||
| 13 | 46,620,758 | 72,941,741 | Upland | 2.0 × 10–6 | Innate immunity, angiogenesis | BFWM | |
| 13 | 47,164,400 | 54,366,328 | Lowland | 0.01 | Vision | HSF | |
| 15 | 32,152,675 | 65,710,885 | Upland | 4.0 × 10–7 | Mucous production |
FIGURE 3(A) Manhattan plot of DCMS q-values of the lowland (yellow) and upland (blue) breeds showing missense mutations found under selection, highlighted in red with the corresponding gene names. Selection thresholds for a suggestive (q < 0.05) and strong (q < 0.01) selection shown by blue and red lines, respectively. Underlined gene names show selection in both the lowland and upland breeds. (B) Allele frequencies for missense mutations identified by a strong F score represented by pie charts (green = reference allele, red = derived allele). This shows the location of missense mutations along their gene with nucleotide substitution highlighted by a red circle. Blue and yellow dotted lines point to the corresponding peak positions of the Manhattan plot. Amino acid substitution is shown, alongside the Polyphen score and q-value. (C) Haplostrip plots spanning genes containing the missense mutations but were selected on the basis of the H2/H1 and H12 haplotype statistics. Similar haplotypes are clustered together per population to demonstrate selection within these regions across the whole gene. These show the presence of reference (white) or derived (black) alleles making up different haplotypes. Populations of interest are highlighted in boxes corresponding to their colors on the Manhattan plot. “❖” is used to denote genes under selection by DCMS and HapFLK.
Candidate genes and missense mutations in the lowland and upland sheep breeds.
| OAR | Region start | Region end | SNP | Breed | Gene (rank) | Reference allele | Alternative allele | Mutation | PolyPhen | Function | |
| 2 | 2,887,916 | 2,887,916 | 2,887,916 | Lowland | A* | G | M47V | 0 | 0.006 | Cell cycle | |
| 2 | 28,318,917 | 28,318,992 | Upland | 0.003 | GTP binding | ||||||
| 3 | 10,725,499 | 10,725,501 | Upland | 0.007 | Cell stress | ||||||
| 4 | 7,476,204 | 7,476,211 | 7,476,211 | Upland | T* | C | K2619Q | 0.9 | 0.006 | Membrane transport | |
| 4 | 47,088,764 | 47,089,276 | Lowland | 0.004 | Cell adhesion | ||||||
| 7 | 84,989,884 | 84,989,669 | Upland | 0.004 | Angiogenesis | ||||||
| 8 | 21,197,663 | 21,198,137 | 21,197,663 | Upland | A | G* | L23P | 0.04 | 0.002 | Anti-oxidation | |
| 13 | 49,771,782 | 49,771,855 | Upland | 0.1 | Ubiquitinoylation | ||||||
| 13 | 52,848,831 | 52,848,943 | Upland | 0.006 | Cell recognition | ||||||
| 13 | 52,848,831 | 52,848,943 | Upland | 0.1 | Insulin pathway | ||||||
| 14 | 14,231,667 | 14,232,187 | Lowland | 0.004 | Coat color | ||||||
| 15 | 75,351,718 | 75,352,196 | Upland | 0.006 | Development | ||||||
| 20 | 10,644,119 | 10,644,602 | Upland | 0.009 | Fat deposition | ||||||
| 20 | 16,972,103 | 16,972,103 | Upland | 0.009 | Cyclic nucleotide transport | ||||||
| 22 | 15,368,530 | 15,368,926 | Upland | 2.0 × 10–8 | Cell cycle | ||||||
| 24 | 27,106,763 | 27,106,886 | Upland | 0.002 | Bile synthesis | ||||||
| 24 | 35,794,428 | 35,794,428 | Lowland/Upland | 0.007 | Insulin pathway | ||||||