| Literature DB >> 26029239 |
Beatriz Gutiérrez-Gil1, Juan J Arranz1, Pamela Wiener2.
Abstract
This review compiles the results of 21 genomic studies of European Bos taurus breeds and thus provides a general picture of the selection signatures in taurine cattle identified by genome-wide selection-mapping scans. By performing a comprehensive summary of the results reported in the literature, we compiled a list of 1049 selection sweeps described across 37 cattle breeds (17 beef breeds, 14 dairy breeds, and 6 dual-purpose breeds), and four different beef-vs.-dairy comparisons, which we subsequently grouped into core selective sweep (CSS) regions, defined as consecutive signals within 1 Mb of each other. We defined a total of 409 CSSs across the 29 bovine autosomes, 232 (57%) of which were associated with a single-breed (Single-breed CSSs), 134 CSSs (33%) were associated with a limited number of breeds (Two-to-Four-breed CSSs) and 39 CSSs (9%) were associated with five or more breeds (Multi-breed CSSs). For each CSS, we performed a candidate gene survey that identified 291 genes within the CSS intervals (from the total list of 5183 BioMart-extracted genes) linked to dairy and meat production, stature, and coat color traits. A complementary functional enrichment analysis of the CSS positional candidates highlighted other genes related to pathways underlying behavior, immune response, and reproductive traits. The Single-breed CSSs revealed an over-representation of genes related to dairy and beef production, this was further supported by over-representation of production-related pathway terms in these regions based on a functional enrichment analysis. Overall, this review provides a comparative map of the selection sweeps reported in European cattle breeds and presents for the first time a characterization of the selection sweeps that are found in individual breeds. Based on their uniqueness, these breed-specific signals could be considered as "divergence signals," which may be useful in characterizing and protecting livestock genetic diversity.Entities:
Keywords: breeds; candidate genes; cattle; diversity; domestication; selection signals; selective sweep
Year: 2015 PMID: 26029239 PMCID: PMC4429627 DOI: 10.3389/fgene.2015.00167
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Summary of the genome-wide selection mapping studies included in this review.
| Boitard and Rocha, | UMD3.1 | 50K-chip | 35,564 | 3 | 1 | BLO | Beef | HMM-allele frequency | Table 1 |
| Bovine HapMap Consortium, | Btau4.0 | AfC-10K 1536-chip | 33,326 | 67 | 19 | BMAS | Dairy, Beef, Dual-purpose | FST, iHS | Table 1; Table S8 |
| Druet et al., | UMD3.1 | HD-chip | 725,293 | 147 | 12 | BB, DBB, HOL, JER, LIM, HER, ANG, CHA, GUE, PIE, ROM, BRSW | Dairy, Beef, Dual-purpose | HMM-average heterozygosity | Tables S1–S12 |
| Flori et al., | Btau4.0 | 50K-chip | 42,486 | 6 | 3 | HOL, NORM, MONT | Dairy | FST | Table 2 |
| Glick et al., | Btau4.0 | 50K-chip | 41,814 | 25 | 1 | ISR-HOL | Dairy | REHH | Table 3 |
| Hayes et al., | UMD3.1 | TM-9323 | 9,323 | 15 | 2 | AUSHOL-vs.-AUSANG | Beef vs. Dairy | Differences in allele frequencies | Table 2 |
| Hosokawa et al., | Btau4.0 | 50K-chip | 40,635 | 11 | 2 | JAPBL-vs.-JAHOL | Beef vs. Dairy | Differences in allele frequencies | Table 2 |
| Kemper et al., | UMD3.1 | HD-chip (real and imputed) | 610,123 | 30 | 8 | HOL, JER, ANG, CHA, HER, LIM, MUG, SHOR | Dairy, Beef | HAPH, iHS, FST | Table 2, Table 6 |
| Lee et al., | UMD3.1 | WGS | 15,125,420 | 15 | 1 | HAN | Beef (high intramuscular fat content) | LD-ω, CLR | Table 1 |
| Lim et al., | UMD3.1 | AfM-10K | 4,522 | 2 | HAN | Beef (high intramuscular fat content) | EHH-iES | Table 1 | |
| Mancini et al., | Btau4.0 | 50K-chip | 29,848 | 5 | 5 | IT-BR, IT-HOL, PIE, MAR, IT-PEZZ | Dairy, Beef, Dual-purpose | FST | Table 1 |
| Pan et al., | Btau4.0 | 50K-chip | 40,130 | 16 | 1 | CHI-HOL | Dairy | EHH | Table 3 |
| Pintus et al., | Btau4.0 | 50K-chip | 42,514 | 53 | 2 | PIE-vs.-ITBR | Beef vs. Dairy | FST (LOWESS regression correction) | Table 1 |
| Porto-Neto et al., | UMD3.1 | HD-chip | 680,000 | 55 | 9 | HAN (against six European breeds, NEL, NDAM) | Beef (high intramuscular fat content) | FST | Table 2 |
| Qanbari et al., | Btau4.0 | 50K-chip | 41,398 | 12 | 1 | HOL | Dairy | REHH | Table 4 |
| Qanbari et al., | Btau4.0 | 50K-chip | 40,595 | 14 | 10 | HOL, BRSW, SIM, AUS-ANG, BELR | Dairy, Beef, Dual-purpose | iHS, FST | Table 2 |
| Qanbari et al., | Btau4.0 | WGS-based imputation | Sequenced data (15,182,131 SNPs), medium density panel (39,304), high density panel (645,189). | 140 | 1 | FLE | Dual-purpose | iHS, CLR | Tables S3, S4 |
| Ramey et al., | UMD3.1 | 50K-chip AFFXB1P-chip-2787037 | 52,942 (50kchip dataset); 2,575,339 SNPs (AFFXB1P) | 132 | 15 | ANG, BRA, CHA, HAN, HER, LIM, SAL, SHOR, SIM, BRSW, FINA, HOL, JER, WAG, BRA | Dairy, Beef, Dual-purpose | Extended low diversity haplotypes | Tables 2, 3, 4 |
| Rothammer et al., | UMD3.1 | 50K-chip | 47,651 | 61 | 8 | GAL, BB, RED-HOL, OBRA, MUR, FRGE, FLE, BRA | Beef | XP-EEH | Tables S4–S13 |
| Stella et al., | Btau4.0 | HMap-data | 32,689 | 215 | 5 | BRSW, GUE, HOL, NORW, JER | Dairy | CLL | Within-breed results |
| Wiener et al., | Btau4.0 | HMap-data | 31,312 | 30 | 2 | CHA-vs.-HOL | Beef vs. Dairy | Difference in allele frequencies | Table S2 |
High-throughput genotyping arrays used for genotyping in the different studies 50k chip, Illumina's Bovine SNP50 Genotyping BeadChip; 1536-chip, Illumina 1536 BeadArray assays; HD-chip, Illumina Bovine HD chip genotyping assay; AfC-10K, Affymetrix Custom 10K; TM-9323, 9323 SNPs (Parallele TM/Affymetrix TM); HMap-data, data from Bovine HapMap Consortium; WGS, Whole genome sequence; AfM-10K, Affymetrix MegAllele GeneChip Bovine Mapping 10K SNP array; AFFXB1P-chip-2787037, Affymetrix Axiom Genome-wide BOS 1 assay = 2,787,037 SNPs.
Where indicated the initial significant positions reported by the original authors were filtered in this review as described below (number of raw reported sweeps):
From an initial list of 398 positions, only those showing an average heterozygosity < 0.003 were considered herein.
From an initial list of 1251 breed-specific significant positions, only those with consecutive XP-EHH values (gap < 1 Mb) higher than 4 were considered herein.
From the initial results from the within-breed analysis results kindly provided by the authors (a total of 13,077 significant positions at P < 0.01, genome-wide), we selected, for each breed, the top 10% of the positions showing a significant CLL, and then we grouped the positions in signatures regions. Following the authors' criteria, adjacent signature regions were considered “distinct” if they were separated by at least 3 consecutive windows with non-significant CLL (P < 0.01 genome-wide).
From the initial list of 121 significant positions of top 1% MA_d values, the positions were grouped in regions of subsequent significant regions with no more than 1 Mb gaps.
Breed abbreviations: ANG, Angus; AUS-ANG, Australian Angus; BB, Belgian Blue; BLO, Blonde d'Aquitaine; CHA, Charolais; GAL, Galloway; HER, Hereford; HAN, Korean Hanwoo; LIM, Limousin; MAR, Marchigiana; MUG, Murray Gray; PIE, Piedmontese; REDA, Red Angus; ROM, Romagnola; SAL, Salers; SHOR, Shorthorn; WAG, Wagyu; AUSHOL-vs.-AUSANG, Australian Holstein vs. Australian Angus; CHA-vs.-HOL, Charolais vs. Holstein; JAPBL-vs.-JAHOL, Japanese Black vs. Japanese Holstein; PIE-vs.-ITBR, Piedmontese vs. Italian Brown; BRA, Braunvieh; BRSW, Brown Swiss; CHI-HOL, Chinese Holstein; FINA, Finnish Ayrshire; GUE, Guernsey; HOL, Holstein; ISR-HOL, Israeli Holstein; IT-BR, Italian Brown; IT-HOL, Italian Holstein; IT-PEZZ, Italian Pezzeta rosa; JER, Jersey; MONT, Montbéliarde; NORM, Normande; NORW, Norwegian Red; RED-HOL, Red Holstein; DBB, Dual-purpose Belgian blue; FLE, Fleckvieh; FRGE, Franken Gelbvieh; MUR, Murnau-Werdenfelser; OBRA, Original Braunvieh; SIM, Simmental.
Statistical methods used in the original studies reviewed in this work: CLL, Parametric composite log likelihood of the differences in allelic frequencies; CLR, Composite likelihood ratio of a model considering a section sweep as defined by Nielsen et al. (2005); EHH, Extended haplotype homozygosity; FST, Population differentiation statistic; HAPH, Haplotype homozygosity measurement; HMM allele frequency, Hidden Markov model-based test, local variations in the allele frequency; iHS, Integrated haplotype homozygosity score; LD ω, LD-based omega (ω) statistics; REHH, Relative extended haplotype homozygosity; XP-EEH, Cross population extended haplotype homozygosity.
Boettcher and Stella, personal communication.
Additional breeds analyzed in the original studies but not considered in this review: BMAS, Beefmaster; BELR, Belmont Red; NDAM, N'Dama; SHE, Sheko; NEL, Nelore; BRA, Brahman; GIR, Gir; SGER, Santa Gertrudis.
Figure 1Graphical representation of the Core Selection Sweep (CSS) regions defined in the present work across the 29 bovine autosomes, based on the selection sweeps reported by the 21 genome-wide selection-mapping scans reviewed in the present study. Note that the proximal end of each chromosome (centromere) is represented at the bottom of the plot. ANG, Angus; AUS-ANG, Australian Angus; AUSHOL-vs.-AUSANG, Australian Holstein vs. Australian Angus; BB, Belgian Blue; BLO, Blonde d'Aquitaine; BRA, Braunvieh; BRSW, Brown Swiss; CHA, Charolais; CHA-vs.-HOL, Charolais vs. Holstein; CHI-HOL, Chinese Holstein; DBB, Dual-purpose Belgian blue; FINA, Finnish Ayrshire; FLE, Fleckvieh; FRGE, Franken Gelbvieh; GAL, Galloway; GUE, Guernsey; HER, Hereford; HOL, Holstein; ISR-HOL, Israeli Holstein; IT-BR, Italian Brown; IT-HOL, Italian Holstein; JAPBL-vs.-JAHOL, Japanese Black vs. Japanese Holstein; JER, Jersey; HAN, Korean Hanwoo; LIM, Limousin; MAR, Marchigiana; MONT, Montbéliarde; MUR, Murnau-Werdenfelser; MUG, Murray Gray; NORM, Normande; NORW, Norwegian Red; OBRA, Original Braunvieh; PIE, Piedmontese; PIE-vs.-ITBR, Piedmontese vs. Italian Brown; REDA, Red Angus; RED-HOL, Red Holstein; ROM, Romagnola; SAL, Salers; SHOR, Shorthorn; SIM, Simmental; WAG, Wagyu.
Characterization of the four categories of core selection sweeps (CSSs) defined in this review based on the number of associated breeds.
| Number of CSS | 232 | 134 | 39 | 4 | 409 |
| Average CSS-interval length (Mb) | 73.01 | 202.95 | 264.05 | 1.21 | 541.22 |
| Total number of extracted genes | 839 | 1886 | 2440 | 17 | 5182 |
| Number of candidate genes | 67 | 83 | 139 | 2 | 291 |
| Color | 3 | 19 | 18 | – | 40 |
| Dairy-Mastitis | 36 | 38 | 54 | 1 | 129 |
| Beef | 21 | 19 | 55 | 1 | 96 |
| Stature/Body-size | 7 | 7 | 12 | – | 26 |
Genes extracted with the web-based BioMart tool available at ensembl.org and based on the UMD_3.1 bovine genome assembly.
Candidate genes identified through the candidate gene survey performed in this study for four phenotype classes under putative selection in modern cattle breeds.
Results from the gene enrichment analysis performed using WikiPathway analysis (WebGestalt software; Wang et al., .
| Single-breed-CSSs involving BEEF breeds (304 genes) | Interleukin-11 signaling pathway | 3 | C = 49; O = 3; E = 0.30; | |
| Osteopontin signaling | 2 | C = 18; O = 2; E = 0.11; R = 18.01; rawP = 0.0054; adjP = 0.0540 | ||
| RANKL-RANK signaling pathway | 3 | C = 66; O = 3; E = 0.41; R = 7.37; rawP = 0.0080; adjP = 0.0540 | ||
| Corticotropin-releasing hormone | 4 | C = 123; O = 4; E = 0.76; R = 5.27; rawP = 0.0072; adjP = 0.0540 | ||
| Endochondral ossification | 3 | C = 69; O = 3; E = 0.42; R = 7.08; rawP = 0.0089; adjP = 0.0534 | ||
| Osteoclast signaling | 2 | C = 19; O = 2; E = 0.12; R = 17.07; rawP = 0.0060; adjP = 0.0540 | ||
| B Cell receptor signaling pathway | 4 | C = 114; O = 4; E = 0.70; R = 5.69; rawP = 0.0056; adjP = 0.0540 | ||
| NOD pathway | 3 | C = 39; O = 3; E = 0.24; R = 12.47; rawP = 0.0018; adjP = 0.0540 | ||
| Androgen receptor signaling pathway | 4 | C = 91; O = 4; E = 0.56; R = 7.13; rawP = 0.0025; adjP = 0.0540 | ||
| Striated muscle contraction | 2 | C = 38; O = 2; E = 0.23; R = 8.53; rawP = 0.0230; adjP = 0.0690 | ||
| Single-breed-CSSs involving DAIRY breeds (239 genes) | Regulation of toll-like receptor signaling pathway | 6 | C = 154; O = 6; E = 0.69; R = 8.75; rawP = 7.00e-05;adjP = 0.0033 | |
| Estrogen signaling pathway | 3 | C = 30; O = 3; E = 0.13; R = 22.46; rawP = 0.0003; adjP = 0.0047 | ||
| Toll-like receptor signaling pathway | 5 | C = 116; O = 5; E = 0.52; R = 9.68; rawP = 0.0002; adjP = 0.0047 | ||
| Leptin signaling pathway | 4 | C = 81; O = 4; E = 0.36; R = 11.09; rawP = 0.0005; adjP = 0.0059 | ||
| Vitamin D synthesis | 2 | C = 10; O = 2; E = 0.04; R = 44.92; rawP = 0.0009; adjP = 0.0070 | ||
| TFs Regulate miRNAs related to cardiac hypertrophy | 2 | C = 10; O = 2; E = 0.04; R = 44.92; rawP = 0.0009; adjP = 0.0070 | ||
| Focal Adhesion | 5 | C = 185; O = 5; E = 0.82; R = 6.07; rawP = 0.0015; adjP = 0.0088 | ||
| Alpha 6 Beta 4 signaling pathway | 3 | C = 50; O = 3; E = 0.22; R = 13.48; rawP = 0.0015; adjP = 0.0088 | ||
| Complement activation, classical pathway | 2 | C = 17; O = 2; E = 0.08; R = 26.43; rawP = 0.0026; adjP = 0.0136 | ||
| Insulin signaling | 4 | C = 163; O = 4; E = 0.73; R = 5.51; rawP = 0.0062; adjP = 0.0291 | ||
| Single-breed-CSSs involving DUAL-PURPOSE breeds (195 genes) | Serotonin receptor 4-6-7 and NR3C Signaling | 3 | C = 18; O = 3; E = 0.06; R = 46.67; rawP = 3.50e-05; adjP = 0.0014 | |
| Calcium regulation in the cardiac cell | 5 | C = 151; O = 5; E = 0.54; R = 9.27; rawP = 0.0002; adjP = 0.0041 | ||
| Serotonin Receptor 2 and ELK-SRF-GATA4 signaling | 2 | C = 16; O = 2; E = 0.06; R = 35.01; rawP = 0.0015; adjP = 0.0205 | ||
| IL-7 signaling pathway | 2 | C = 25; O = 2; E = 0.09; R = 22.40; rawP = 0.0036; adjP = 0.0246 | ||
| IL-9 signaling pathway | 2 | C = 25; O = 2; E = 0.09; R = 22.40; rawP = 0.0036; adjP = 0.0246 | ||
| Myometrial relaxation and contraction pathways | 4 | C = 162; O = 4; E = 0.58; R = 6.91; rawP = 0.0028; adjP = 0.0246 | ||
| Monoamine GPCRs | 2 | C = 33; O = 2; E = 0.12; R = 16.97; rawP = 0.0062; adjP = 0.0261 | ||
| Serotonin HTR1 group and FOS pathway | 2 | C = 33; O = 2; E = 0.12; R = 16.97; rawP = 0.0062; adjP = 0.0261 | ||
| Hypothetical network for drug addiction | 2 | C = 35; O = 2; E = 0.12; R = 16.00; rawP = 0.0070; adjP = 0.0261 | ||
| EPO receptor signaling | 2 | C = 35; O = 2; E = 0.12; R = 16.00; rawP = 0.0070; adjP = 0.0261 |
Wikipathway analysis Statistics: C, the number of reference genes in the category; O, the number of genes in the gene set and also in the category; E, the expected number in the category; R, ratio of enrichment; rawP, p-value from hypergeometric test; adjP, p-value adjusted by the multiple test adjustment.