| Literature DB >> 25126940 |
Maria-Ines Fariello1, Bertrand Servin2, Gwenola Tosser-Klopp2, Rachel Rupp2, Carole Moreno2, Magali San Cristobal2, Simon Boitard3.
Abstract
The diversity of populations in domestic species offers great opportunities to study genome response to selection. The recently published Sheep HapMap dataset is a great example of characterization of the world wide genetic diversity in sheep. In this study, we re-analyzed the Sheep HapMap dataset to identify selection signatures in worldwide sheep populations. Compared to previous analyses, we made use of statistical methods that (i) take account of the hierarchical structure of sheep populations, (ii) make use of linkage disequilibrium information and (iii) focus specifically on either recent or older selection signatures. We show that this allows pinpointing several new selection signatures in the sheep genome and distinguishing those related to modern breeding objectives and to earlier post-domestication constraints. The newly identified regions, together with the ones previously identified, reveal the extensive genome response to selection on morphology, color and adaptation to new environments.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25126940 PMCID: PMC4134316 DOI: 10.1371/journal.pone.0103813
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Population groups from the Sheep HapMap dataset used for the detection of selection signatures.
| Group | Abbreviation | Size | Populations (Abbreviations) |
| Africa | AFR | 2 | Red Maasai (RMA) |
| Ethiopian Menz (EMZ) | |||
| Asia | ASI | 8 | Bangladeshi BGE (BGE) |
| Bangladeshi Garole (BGA) | |||
| Changthangi (CHA) | |||
| Deccani (IDC) | |||
| Garut (GUR) | |||
| Indian Garole (GAR) | |||
| Sumatra (SUM) | |||
| Tibetan (TIB) | |||
| Central Europe | CEU | 4 | Bundner Oberlander (BOS) |
| Engadine Red (ERS) | |||
| Valais Blacknose (VBS) | |||
| Valais Red (VRS) | |||
| Italy | ITA | 4 | Altamurana (ALT) |
| Comisana (COM) | |||
| Leccese (LEC) | |||
| Sardinian Ancestral Black (SAB) | |||
| Northern Europe | NEU | 6 | Galway (GAL) |
| German (GTX), New Zealand (NTX) and Scottish (STX) Texel | |||
| Irish Suffolk (ISF) | |||
| New Zealand Romney (NZR) | |||
| South West Asia | SWA | 4 | Afshari (AFS) |
| Moghani (MOG) | |||
| Norduz (NDZ) | |||
| Qezel (QEZ) | |||
| South West Europe | SWE | 4 | Autralian Merino (MER) |
| Churra (CHU) | |||
| Meat (LAM) and Milk (LAC) Lacaune |
Figure 1Localization of selection signatures identified in 7 groups of populations.
Candidate genes are indicated above their genomic localization. Only chromosomes harboring selection signatures are plotted.
Selection signatures in the 7 geographical groups.
| OAR | Begin (Mbp) | End (Mbp) | P-value | Q-value | Group | Test | Diff. pop. | Cand. gene | Nr. genes | Rank |
| 2 | 46.65 | 57.99 | 6.3e-10 | 7.1e-07 | ITA | hapFLK | COM | NPR2† | 85 | 15 |
| 2 | 51.41 | 53.44 | 4.1e-09 | 1.6e-04 | ITA | FLK | COM | 41 | 2 | |
| 2 | 74.00 | 74.86 | 7.4e-04 | 3.7e-02 | ITA | hapFLK | COM | 7 | ||
| 2 | 81.27 | 87.32 | 4.1e-09 | 2.3e-06 | ITA | hapFLK | COM | BNC2 | 18 | 1 |
| 2 | 110.08 | 112.08 | 1.5e-05 | 6.7e-02 | ASI | FLK | SUM TIB GUR | 11 | ||
| 2 | 113.36 | 122.24 | 7.0e-06 | 3.3e-03 | NEU | hapFLK | GTX NTX STX | MSTN† | 42 | 8 |
| 2 | 239.76 | 241.76 | 2.9e-05 | 9.3e-02 | SWA | FLK | AFS | RUNX3 | 33 | 1 |
| 3 | 84.40 | 86.40 | 2.5e-05 | 9.1e-02 | ASI | FLK | 15 | |||
| 3 | 120.91 | 125.49 | 5.3e-04 | 3.0e-02 | ITA | hapFLK | COM | KITLG | 5 | 5 |
| 3 | 122.07 | 130.85 | 6.8e-08 | 4.2e-04 | AFR | hapFLK | 25 | 1 | ||
| 3 | 151.42 | 156.93 | 3.3e-16 | 3.1e-12 | ITA | hapFLK | COM SAB | HMGA2† | 26 | 1 |
| 3 | 154.79 | 154.93 | 5.9e-04 | 4.3e-02 | AFR | hapFLK | 12 | 12 | ||
| 3 | 159.64 | 161.60 | 6.1e-04 | 3.3e-02 | ITA | hapFLK | COM | 6 | ||
| 3 | 167.85 | 171.67 | 1.5e-04 | 1.3e-02 | ITA | hapFLK | COM ALT SAB | 27 | ||
| 4 | 4.61 | 6.61 | 5.3e-06 | 2.1e-02 | SWA | FLK | MOG | 8 | ||
| 4 | 8.50 | 19.66 | 4.2e-06 | 1.1e-03 | CEU | hapFLK | VBS VRS | 49 | ||
| 4 | 15.11 | 17.11 | 8.4e-07 | 1.5e-02 | CEU | FLK | VBS | 7 | ||
| 4 | 26.46 | 28.46 | 2.4e-05 | 9.1e-02 | ASI | FLK | GUR IDC SUM | HDAC9 | 6 | 1 |
| 4 | 44.49 | 45.76 | 2.7e-04 | 3.4e-02 | NEU | hapFLK | NZR | 12 | ||
| 4 | 45.57 | 47.57 | 1.8e-06 | 2.4e-02 | ASI | FLK | SUM | 8 | ||
| 4 | 67.75 | 69.80 | 3.5e-07 | 2.3e-03 | SWA | FLK | MOG | HOXA | 18 | 2→10 |
| 5 | 29.40 | 31.40 | 1.1e-05 | 6.7e-02 | ASI | FLK | GAR | 3 | ||
| 5 | 47.35 | 49.35 | 1.4e-05 | 6.7e-02 | ASI | FLK | BGA | 35 | ||
| 5 | 78.16 | 78.76 | 4.2e-04 | 4.2e-02 | NEU | hapFLK | NZT | 16 | ||
| 6 | 5.62 | 7.62 | 3.1e-06 | 6.0e-02 | ITA | FLK | SAB | 11 | ||
| 6 | 33.22 | 41.02 | 3.4e-08 | 8.0e-05 | SWE | hapFLK | LAC LAM | ABCG2†/NCAPG | 27 | 2/17 |
| 6 | 34.71 | 39.12 | 1.6e-07 | 4.1e-05 | ITA | hapFLK | COM | 23 | 11/2 | |
| 6 | 35.94 | 38.31 | 2.1e-04 | 1.9e-02 | CEU | hapFLK | VRS VBS | 19 | 9/17 | |
| 6 | 67.98 | 70.36 | 4.3e-06 | 1.1e-03 | CEU | hapFLK | VBS | KIT† | 9 | 5 |
| 6 | 68.90 | 70.95 | 9.6e-07 | 5.3e-03 | SWA | FLK | 10 | 2 | ||
| 6 | 93.30 | 94.39 | 3.8e-04 | 2.7e-02 | CEU | hapFLK | VRS&VBS or ERS&BOS | FGF5† | 8 | 5 |
| 7 | 49.15 | 51.15 | 1.1e-05 | 9.7e-02 | CEU | FLK | VRS | 7 | ||
| 7 | 78.31 | 80.31 | 8.1e-07 | 1.5e-02 | CEU | FLK | VRS ERS | 13 | ||
| 8 | 23.97 | 25.97 | 2.9e-05 | 9.6e-02 | ASI | FLK | TIB | 6 | ||
| 9 | 29.46 | 31.55 | 3.7e-04 | 3.4e-02 | SWE | hapFLK | CHU MER | 1 | ||
| 9 | 37.79 | 46.03 | 1.9e-05 | 6.2e-03 | NEU | hapFLK | NZT ISF | 6 | ||
| 10 | 24.02 | 34.91 | 1.4e-14 | 1.1e-10 | CEU | hapFLK | BOS ERS VRS | RXFP2† | 68 | 9 |
| 10 | 29.42 | 29.71 | 9.6e-04 | 4.4e-02 | ITA | hapFLK | COM ALT | 14 | 2 | |
| 10 | 28.50 | 30.50 | 6.3e-06 | 7.5e-02 | CEU | FLK | BOS ERS | 14 | 1 | |
| 10 | 28.50 | 30.50 | 3.2e-05 | 9.7e-02 | SWA | FLK | NDZ | 14 | 1 | |
| 10 | 28.50 | 30.50 | 1.3e-06 | 5.4e-02 | SWE | FLK | MER | 14 | 1 | |
| 10 | 48.90 | 49.59 | 5.2e-04 | 3.1e-02 | CEU | hapFLK | 3 | |||
| 11 | 12.55 | 14.12 | 1.4e-04 | 2.2e-02 | NEU | hapFLK | 33 | |||
| 11 | 24.18 | 38.74 | 9.8e-09 | 8.0e-05 | SWE | hapFLK | LAC MER | 296 | ||
| 11 | 40.31 | 46.70 | 3.3e-06 | 5.5e-04 | ITA | hapFLK | SAB | 164 | ||
| 12 | 42.66 | 44.66 | 3.4e-07 | 7.6e-03 | ASI | FLK | SUM | 10 | ||
| 13 | 33.10 | 40.02 | 5.7e-06 | 1.8e-03 | AFR | hapFLK | 41 | |||
| 13 | 40.60 | 50.30 | 4.9e-07 | 4.9e-04 | AFR | hapFLK | BMP2† | 76 | 1 | |
| 13 | 43.34 | 51.28 | 2.7e-07 | 1.7e-04 | SWE | hapFLK | LAC LAM | PRNP | 49 | 8 |
| 13 | 56.11 | 57.17 | 2.5e-08 | 4.8e-04 | SWA | hapFLK | MOG | EDN3 | 19 | 1 |
| 13 | 55.33 | 57.43 | 8.4e-11 | 1.1e-06 | SWA | FLK | MOG | 19 | 1 | |
| 14 | 6.37 | 13.60 | 1.6e-04 | 1.4e-02 | ITA | hapFLK | SAB | 70 | ||
| 14 | 13.64 | 13.70 | 5.3e-04 | 4.9e-02 | NEU | hapFLK | ISF | MC1R | 48 | 33 |
| 14 | 13.70 | 16.46 | 1.2e-04 | 1.1e-02 | ITA | hapFLK | SAB | 37 | 21 | |
| 14 | 45.49 | 50.09 | 1.6e-04 | 2.5e-02 | NEU | hapFLK | NTX NZR | 117 | ||
| 15 | 48.87 | 50.87 | 1.5e-05 | 6.7e-02 | ASI | FLK | GAR IDC | 36 | ||
| 15 | 71.71 | 73.71 | 3.8e-06 | 1.6e-02 | SWA | FLK | MOG | ALX4/EXT2 | 13 | 1/3 |
| 16 | 33.20 | 35.10 | 1.8e-04 | 1.8e-02 | AFR | hapFLK | C6/C7 | 8 | 5/7 | |
| 16 | 63.97 | 65.97 | 1.1e-05 | 6.7e-02 | ASI | FLK | GAR IDC | 5 | ||
| 19 | 4.42 | 7.43 | 2.2e-04 | 1.9e-02 | CEU | hapFLK | VRS BOS | GLB1† | 17 | 14 |
| 19 | 30.42 | 35.09 | 3.2e-05 | 4.2e-03 | CEU | hapFLK | VBS BOS ERS | MITF† | 14 | 9 |
| 19 | 44.60 | 46.60 | 3.9e-06 | 3.9e-02 | ASI | FLK | GAR BGA | WNT5A | 4 | 1 |
| 20 | 36.74 | 38.52 | 2.8e-04 | 2.3e-02 | CEU | hapFLK | VRS | 10 | ||
| 22 | 18.90 | 24.36 | 1.5e-11 | 7.4e-08 | NEU | hapFLK | GTX | PITX3‡ | 85 | 5 |
| 23 | 42.50 | 46.96 | 2.2e-05 | 5.4e-03 | AFR | hapFLK | MC2R/MC5R | 35 | 1/2 | |
| 23 | 54.14 | 56.14 | 3.8e-07 | 7.6e-03 | ASI | FLK | GAR | 5 | ||
| 25 | 0.08 | 3.08 | 3.7e-04 | 2.4e-02 | ITA | hapFLK | SAB | 16 |
Regions identified with the hapFLK or FLK test, with the corresponding population group and most differentiated populations (except for the AFR group). Full names of groups and populations are given in Table 1. The number of genes included in each region and the rank of candidate genes within the region is also provided. Overlapping regions detected in different population groups or with different tests are grouped within horizontal lines (in this case candidate genes are the same for all overlapping regions and are only reported in the first one). : signatures of selection previously identified [4]. : this outlying region is not due to evolutionary processes (see details in the main text).
Figure 2Genome scan for selection signature in ancestral populations of the geographical groups.
Significant SNP at the 5% FDR level are plotted in darker color.
Selection signatures in ancestral populations.
| Estimated ancestral allele frequencies | |||||||||||||
| OAR | position | AFR | ASI | SWA | NEU | CEU | ITA | SWE | P-value | Q-value | Cand. gene | Nr. genes | Rank |
| 1 | 7192190 | 0.15 | 0.08 | 0.16 | 0.55 | 0.69 | 0.04 | 0.38 | 1.7e-06 | 5.3e-03 | TRPM8 | 19 | 8 |
| 1 | 237070498 | 0.87 | 0.95 | 0.91 | 0.48 | 0.24 | 0.77 | 0.35 | 1.4e-05 | 2.5e-02 | GYG1 | 16 | 5 |
| 1 | 239424807 | 0.46 | 0.68 | 0.06 | 0.21 | 0.15 | 0.11 | 0.17 | 3.4e-05 | 4.8e-02 | 9 | ||
| 1 | 239491620 | 0.53 | 0.41 | 0.94 | 0.86 | 0.93 | 0.93 | 0.88 | 4.3e-05 | 5.6e-02 | 9 | ||
| 2 | 45500785 | 0.43 | 0.91 | 0.23 | 0.76 | 0.87 | 0.87 | 0.93 | 2.2e-06 | 6.4e-03 | LPL | 6 | 3 |
| 2 | 182607165 | 0.99 | 0.97 | 0.18 | 0.64 | 0.73 | 0.83 | 0.64 | 3.4e-08 | 1.8e-04 | INSIG2 | 10 | 3 |
| 2 | 182672296 | 0.99 | 0.94 | 0.32 | 0.90 | 0.86 | 0.89 | 0.81 | 7.7e-07 | 2.8e-03 | 10 | ||
| 2 | 192231314 | 0.59 | 0.93 | 0.36 | 0.96 | 0.89 | 0.81 | 0.95 | 1.6e-05 | 2.8e-02 | 8 | ||
| 3 | 132478420 | 0.24 | 0.89 | 0.18 | 0.93 | 0.81 | 0.84 | 0.82 | 1.2e-06 | 3.9e-03 | HOXC | 54 | 1 |
| 3 | 180860403 | 0.71 | 0.53 | 0.28 | 0.82 | 0.31 | 0.12 | 0.13 | 1.7e-05 | 2.8e-02 | 22 | ||
| 5 | 15522700 | 0.68 | 0.63 | 0.92 | 0.27 | 0.76 | 0.99 | 0.78 | 9.8e-06 | 2.0e-02 | 51 | ||
| 7 | 89519883 | 0.63 | 0.61 | 0.19 | 0.89 | 0.18 | 0.60 | 0.95 | 6.1e-10 | 5.2e-06 | TSHR | 6 | 3 |
| 8 | 31748642 | 0.84 | 0.93 | 0.94 | 0.16 | 0.63 | 0.47 | 0.19 | 2.8e-05 | 4.1e-02 | PREP | 6 | 1 |
| 11 | 18248852 | 0.35 | 0.32 | 0.82 | 0.64 | 0.94 | 0.96 | 0.92 | 1.3e-05 | 2.5e-02 | NF1 | 23 | 1 |
| 11 | 18325488 | 0.87 | 0.93 | 0.00 | 0.35 | 0.04 | 0.03 | 0.04 | 3.3e-16 | 7.2e-12 | 24 | 4 | |
| 11 | 18335747 | 0.87 | 0.93 | 0.00 | 0.35 | 0.04 | 0.03 | 0.04 | 3.3e-16 | 7.2e-12 | 22 | 4 | |
| 11 | 18433474 | 0.87 | 0.93 | 0.02 | 0.35 | 0.07 | 0.02 | 0.05 | 3.8e-15 | 5.4e-11 | 22 | 1 | |
| 11 | 18440783 | 0.78 | 0.93 | 0.02 | 0.34 | 0.07 | 0.02 | 0.05 | 2.0e-14 | 2.2e-10 | 22 | 1 | |
| 11 | 25704651 | 0.97 | 0.96 | 0.97 | 0.42 | 0.94 | 0.94 | 0.96 | 8.5e-06 | 1.9e-02 | 73 | ||
| 11 | 26284826 | 0.99 | 0.97 | 0.94 | 0.38 | 0.93 | 0.95 | 0.79 | 3.2e-05 | 4.6e-02 | 100 | ||
| 11 | 26571629 | 0.92 | 0.94 | 0.98 | 0.29 | 0.89 | 0.88 | 0.86 | 1.8e-05 | 2.8e-02 | 115 | ||
| 11 | 26872280 | 0.78 | 0.71 | 0.93 | 0.15 | 0.89 | 0.90 | 0.90 | 2.2e-07 | 9.5e-04 | 111 | ||
| 13 | 12120674 | 0.29 | 0.84 | 0.97 | 0.91 | 0.97 | 0.92 | 0.84 | 7.7e-06 | 1.8e-02 | GATA3 | 6 | 1 |
| 13 | 62857560 | 0.52 | 0.62 | 0.65 | 0.98 | 0.67 | 0.92 | 0.36 | 3.6e-06 | 9.7e-03 | ASIP | 32 | 12 |
| 15 | 3706790 | 0.71 | 0.22 | 0.96 | 0.28 | 0.27 | 0.34 | 0.21 | 6.8e-06 | 1.7e-02 | 4 | ||
| 15 | 29856310 | 0.98 | 0.99 | 0.99 | 0.47 | 0.92 | 0.95 | 0.96 | 9.8e-06 | 2.0e-02 | 35 | ||
| 16 | 38696505 | 0.95 | 0.98 | 0.95 | 0.99 | 0.68 | 0.31 | 0.30 | 6.8e-07 | 2.7e-03 | PRLR | 18 | 2 |
| 17 | 4867509 | 0.91 | 0.95 | 0.85 | 0.54 | 0.18 | 0.58 | 0.17 | 1.8e-05 | 2.8e-02 | TMEM154 | 9 | 1 |
| 18 | 19342316 | 0.90 | 0.79 | 0.67 | 0.35 | 0.75 | 0.10 | 0.09 | 1.9e-07 | 9.3e-04 | ACAN | 31 | 4 |
| 18 | 66470371 | 0.99 | 0.97 | 0.90 | 0.90 | 0.18 | 0.04 | 0.08 | 1.9e-09 | 1.3e-05 | TRAF3 | 28 | 5 |
| 20 | 17381047 | 0.24 | 0.61 | 0.97 | 0.98 | 0.93 | 0.99 | 0.91 | 3.1e-08 | 1.8e-04 | VEGFA | 48 | 1 |
| 25 | 7517270 | 0.95 | 0.94 | 0.93 | 0.14 | 0.27 | 0.57 | 0.19 | 1.8e-05 | 2.8e-02 | wool QTL | 13 | |
SNP with significant FLK value at the 5% FDR level, with estimated allele frequencies in all ancestral groups. The number of genes included in each region (1Mb up-or-downstream the position) and the rank of candidate genes within the region is also provided. : signatures of selection previously identified [4].