| Literature DB >> 28288562 |
Luiz F Brito1, James W Kijas2, Ricardo V Ventura3,4, Mehdi Sargolzaei3,5, Laercio R Porto-Neto2, Angela Cánovas3, Zeny Feng6, Mohsen Jafarikia3,7, Flávio S Schenkel3.
Abstract
BACKGROUND: The detection of signatures of selection has the potential to elucidate the identities of genes and mutations associated with phenotypic traits important for livestock species. It is also very relevant to investigate the levels of genetic diversity of a population, as genetic diversity represents the raw material essential for breeding and has practical implications for implementation of genomic selection. A total of 1151 animals from nine goat populations selected for different breeding goals and genotyped with the Illumina Goat 50K single nucleotide polymorphisms (SNP) Beadchip were included in this investigation.Entities:
Keywords: Capra hircus; F-statistics; SNP; Selective sweep; hapFLK
Mesh:
Year: 2017 PMID: 28288562 PMCID: PMC5348779 DOI: 10.1186/s12864-017-3610-0
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Summary of genotyped animals and genetic diversity compared between nine goat populations
| Breed | Alpine | Boer | Boer | Cashmere | LaMancha | Nubian | Rangeland | Saanen | Toggenburg |
|---|---|---|---|---|---|---|---|---|---|
| Origin | Canada | Australia | Canada | Australia | Canada | Canada | Australia | Canada | Canada |
| Abbreviation | AL | BA | BC | CA | LA | NU | RA | SA | TO |
| Sample size | 403 | 61 | 67 | 48 | 81 | 54 | 66 | 318 | 53 |
| Purpose | Milk | Meat | Meat | Fiber | Milk | Milk/Meat | Meat | Milk | Milk |
| PN a | 0.946 | 0.969 | 0.924 | 0.981 | 0.939 | 0.902 | 0.995 | 0.945 | 0.911 |
| HO a | 0.385 | 0.365 | 0.363 | 0.384 | 0.384 | 0.338 | 0.413 | 0.379 | 0.353 |
| HE a | 0.388 | 0.356 | 0.357 | 0.372 | 0.382 | 0.335 | 0.411 | 0.382 | 0.336 |
| DST | 0.307 | 0.281 | 0.284 | 0.293 | 0.303 | 0.269 | 0.323 | 0.302 | 0.263 |
| FEH ± SD | 0.103 ± 0.058 | 0.141 ± 0.043 | 0.156 ± 0.048 | 0.104 ± 0.044 | 0.108 ± 0.046 | 0.214 ± 0.051 | 0.039 ± 0.036 | 0.117 ± 0.056 | 0.179 ± 0.055 |
| FVR ± SD | 0.006 ± 0.063 | −0.029 ± 0.065 | −0.014 ± 0.064 | −0.027 ± 0.053 | −0.001 ± 0.079 | −0.004 ± 0.082 | −0.001 ± 0.033 | 0.005 ± 0.090 | −0.041 ± 0.223 |
| FLEUT ± SD | 0.006 ± 0.093 | −0.028 ± 0.087 | −0.014 ± 0.105 | −0.027 ± 0.080 | 0.000 ± 0.134 | −0.005 ± 0.146 | 0.000 ± 0.034 | 0.006 ± 0.138 | −0.027 ± 0.373 |
| FROH ± SD | 0.031 ± 0.019 | 0.047 ± 0.015 | 0.057 ± 0.016 | 0.021 ± 0.009 | 0.039 ± 0.018 | 0.057 ± 0.018 | 0.009 ± 0.009 | 0.033 ± 0.019 | 0.046 ± 0.018 |
| FPED ± SD | 0.021 ± 0.040 | NA | 0.002 ± 0.016 | NA | 0.044 ± 0.050 | 0.017 ± 0.034 | NA | 0.040 ± 0.042 | 0.054 ± 0.053 |
P proportion of polymorphic SNPs, H and H expected and observed heterozygosity, respectively, D average pairwise genetic distance, SD standard deviation, NA not available, F , F , F , F and F inbreeding coefficients based on excess of homozygosity, VanRaden, Leutenneger, runs of homozygosity and pedigree, respectively
aestimates for the three Australian breeds were previously reported by Kijas et al. [24] using the same dataset
Fig. 1Goat breeds with different ears size. a LaMancha breed (short ears), b Toggenbourg breed (average ears) and c Nubian breed (long ears). Photo credits: Ontario Goat (http://ontariogoat.ca/)
Pearson correlations among alternative inbreeding coefficients
| FVR | FLEUT | FROH | FPED | |
|---|---|---|---|---|
| FEH | −0.132 | −0.318 | 0.901 | 0.320 |
| FVR | 0.969 | −0.133 | 0.067 | |
| FLEUT | −0.264 | −0.011 | ||
| FROH | 0.372 |
F , F , F , F and F inbreeding coefficients based on excess of homozygosity, VanRaden, Leutenegger, runs of homozygosity and pedigree, respectively
Descriptive analysis of the runs of homozygosity per breed and including all genotyped animals
| Breed | ||||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| AL | BA | BC | CA | LA | NU | RA | SA | TO | ||
| NSEG | Mean | 15.6 | 23.6 | 31.5 | 8.1 | 19.4 | 31.2 | 5.2 | 16.7 | 24.1 |
| SD | 8.5 | 7.2 | 7.9 | 2.9 | 7.1 | 7.2 | 3.4 | 8.4 | 9.1 | |
| Min | 0 | 9 | 0 | 4 | 6 | 13 | 0 | 0 | 2 | |
| Max | 44 | 45 | 59 | 16 | 38 | 46 | 19 | 43 | 40 | |
| KB | Mean | 74,510 | 113,200 | 137,900 | 49,630 | 93,710 | 138,100 | 22,800 | 79,720 | 111,700 |
| SD | 47,837 | 35,307 | 39,826 | 22,197 | 43,500 | 45,131 | 22,370 | 46,714 | 43,759 | |
| Min | 0 | 47,010 | 0 | 12,780 | 21,670 | 49,430 | 0 | 0 | 5731 | |
| Max | 307,100 | 192,100 | 287,600 | 98,230 | 231,900 | 332,000 | 160,400 | 242,500 | 179,200 | |
| KBAVER | Mean | 4503 | 4824 | 4298 | 5967 | 4719 | 4375 | 3859 | 4518 | 4570 |
| SD | 1187 | 740 | 677 | 1423 | 767 | 609 | 1933 | 1052 | 686 | |
| Min | 0 | 3618 | 0 | 2766 | 3096 | 3327 | 0 | 0 | 2866 | |
| Max | 7869 | 6647 | 5338 | 8595 | 6872 | 7217 | 12,130 | 8916 | 6108 | |
| NSNP | Mean | 100.2 | 100.2 | 91.9 | 126.7 | 100.8 | 92.9 | 91.4 | 100.2 | 97.2 |
| SD | 62.5 | 66.2 | 52.8 | 87.3 | 63.6 | 53.6 | 72.3 | 63.2 | 61.1 | |
| Min | 42 | 46 | 45 | 47 | 48 | 45 | 47 | 43 | 47 | |
| Max | 800 | 655 | 614 | 640 | 669 | 691 | 701 | 826 | 573 | |
| Density | Mean | 47.5 | 47.5 | 47.3 | 47.7 | 47.5 | 47.3 | 47.6 | 47.4 | 47.3 |
| SD | 2.1 | 2.3 | 2.3 | 1.9 | 2.1 | 2.3 | 1.9 | 2.0 | 2.2 | |
| Min | 29.8 | 23.5 | 21.5 | 35.6 | 26.8 | 27.9 | 40.5 | 28.5 | 26.7 | |
| Max | 50 | 50 | 50 | 50 | 49 | 50 | 49 | 50 | 50 | |
| PHOM | Mean | 0.96 | 0.99 | 0.96 | 0.99 | 0.96 | 0.96 | 0.98 | 0.96 | 0.96 |
| SD | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | 0.01 | |
NSEG average number of segments for the individual declared homozygous, KB average of total number of kb contained within homozygous segments, KB average size of homozygous segments, NSNP average number of SNPs in run, PHOM proportion of sites homozygous, AL Alpine, BA Australian Boer, BC Canadian Boer, CA Cashmere, LA LaMancha, NU Nubian, RA Rangeland, SA Saanen, TO Toggenburg, Min minimum, Max maximum, SD standard deviation
Fig. 2Proportion of runs of homozygosity segments in each length category for the nine goat populations. AUS: Australia; CAN: Canada
Fig. 3Distribution of SNP FST values within each of the nine goat populations. AUS: Australia; CAN: Canada
Fig. 4Whole genome scans for selection in the LaMancha breed compared with all other breeds using the smoothed FST approach. Smoothed FST values greater than average plus two or three standard deviations were coloured with red and yellow, respectively. Plots for the other breeds are presented in the Supplementary material section
Signatures of selection for the scenario 1 (FST1, all breed comparisons) identified using smoothed FST statistics and considering two (green) or three (red) standard deviations as significance threshold
Chr chromosome, AL Alpine, BA Australian Boer, BC Canadian Boer, CA Cashmere, LA LaMancha, NU Nubian, RA Rangeland, SA Saanen, TO Toggenburg
Signatures of selection for the scenario 2 (FST2, selected breeds based on selection purpose) identified using smoothed FST statistics and considering two (green) or three (red) standard deviations as significance threshold
Milk Alpine and Saanen breeds, Fiber Cashmere breed, Meat Australian and Canadian Boer breed, Chr chromosome
Signatures of selection for the scenario 3 (FST3, breed groups based on selection purpose) identified using smoothed FST statistics and considering two (green) or three (red) standard deviations as significance threshold
Milk Alpine, Saanen, LaMancha and Toggenburg breeds, Fiber Cashmere breed, Dual-purpose Nubian breed, Meat Canadian and Australian Boer and Rangeland breeds, Chr chromosome
Fig. 5Whole genome scans for selection using the haplotype based hapFLK metric and –log (P-values) were plotted in genomic order. Odd and even numbered chromosomes are shown in yellow and black, respectively. SNP number is given on the x axis, and the genome-wide threshold corresponding to P < 0.001, P < 0.005 and P < 0.01 is shown as horizontal blue, green and red lines, respectively
Fig. 6Genomic population tree using all SNPs that passed genotype quality control. AUS: Australia and CAN: Canada
Fig. 7Genomic population tree using significant SNPs the most significant region on chromosome 7 (CHI7:48.4–57.3 Mb). AUS: Australia and CAN: Canada
Genomic regions under differential selection for all goat breed comparisons (FST1) and list of the genes located in the region of the 10 most significant SNPs (based on smoothed FST values)
| Scenario | Breed | Chr | Size (Mb) | Ngenes | Peak SNP | Genes present in the region of the 10 peak SNPs |
|---|---|---|---|---|---|---|
| FST1 | AL | 10 | 43.4 | 401 | snp25170-scaffold259-4130319 |
|
| FST1 | BA | 13 | 19.2 | 137 | snp49043-scaffold7-3464730 |
|
| FST1 | BC | 3 | 12.9 | 133 | snp46905-scaffold654-1593334 |
|
| FST1 | BC | 20 | 21.4 | 63 | snp34011-scaffold40-2937263 |
|
| FST1 | BC | 24 | 2.6 | 7 | snp836-scaffold1022-339033 | No genes in the peak region |
| FST1 | LA | 7 | 23.3 | 353 | snp23253-scaffold2322-180232 |
|
| FST1 | NU | 6 | 11.4 | 53 | snp16822-scaffold1760-904920 |
|
| FST1 | NU | 11 | 6.2 | 37 | snp9748-scaffold135-1883590 |
|
| FST1 | NU | 22 | 9.6 | 47 | snp15314-scaffold163-1176183 |
|
| FST1 | RL | 6 | 3.6 | 7 | snp11723-scaffold1432-75297 |
|
| FST1 | SA | 6 | 13.2 | 86 | snp58130-scaffold94-7008249 |
|
AL Alpine, BA Australian Boer, BC Canadian Boer, CA Cashmere, LA LaMancha, NU Nubian, RA Rangeland, SA Saanen, TO Toggenburg, Chr chromosome, Ngenes total number of genes in the whole significant region
Genomic regions under differential selection based on breeds grouped per selection purpose and list of the genes located in the region of the 10 most significant SNPs (based on smoothed FST values)
| Scenario | Group | Chr | Size (Mb) | Ngenes | Peak SNP | Genes present in the region of the 10 peak SNPs |
|---|---|---|---|---|---|---|
| FST2 | Meat | 3 | 13.7 | 149 | snp25334-scaffold261-808539 |
|
| FST2 | Meat | 13 | 10.2 | 66 | snp48991-scaffold7-1097529 |
|
| FST2 | Meat | 20 | 9.6 | 17 | snp57443-scaffold916-509375 | No genes |
| FST2 | Meat | 25 | 1.4 | 3 | snp9539-scaffold1343-1758601 |
|
| FST3 | DP | 6 | 13.3 | 60 | snp16815-scaffold1760-560278 |
|
| FST3 | DP | 22 | 10.0 | 48 | snp15310-scaffold163-1010990 |
|
| FST3 | Meat | 3 | 7.7 | 96 | snp46886-scaffold654-758578 |
|
| FST3 | Meat | 6 | 11.3 | 52 | snp2176-scaffold1066-801755 | No genes |
| FST3 | Meat | 20 | 5.4 | 12 | snp33977-scaffold40-1436358 |
|
Ngenes total number of genes in the whole significant region, DP dual-purpose, F milk (Alpine and Saanen breeds), fiber (Cashmere breed) and meat (Australian and Canadian Boer breed), F milk (Alpine, Saanen, LaMancha and Toggenburg breeds), fiber (Cashmere breed), dual-purpose (Nubian breed) and meat (Canadian and Australian Boer and Rangeland breeds)
Genomic regions under differential selection based on hapFLK methodology and list of the genes located in the region of the 10 most significant SNPs
| Scenario | Breed | Chr | Size (Mb) | Ngenes | Peak SNP | Genes present in the region of the 10 peak SNPs |
|---|---|---|---|---|---|---|
| hapFLK | All breeds | 4 | 0.5 | 0 | snp5822-scaffold1205-140455 | No genes |
| hapFLK | All breeds | 6 | 0.0 | 0 | snp2181-scaffold1066-1048134 | No genes |
| hapFLK | All breeds | 7 | 8.6 | 255 | snp12700-scaffold1488-1106620 |
|
| hapFLK | All breeds | 7 | 8.9 | 92 | snp57917-scaffold938-380104 |
|
| hapFLK | All breeds | 13 | 0.0 | 0 | snp7739-scaffold1278-1991262 | No genes |
| hapFLK | All breeds | 19 | 0.3 | 18 | snp3321-scaffold1101-888266 |
|
| hapFLK | All breeds | 23 | 0.8 | 2 | snp23628-scaffold2385-317663 |
|
Chr chromosome, Ngenes total number of genes in the whole significant region
Linkage disequilibrium (r2) levels for all the breeds in the peak region of chromosome 7
| Average linkage disequilibrium (r2) | |||
|---|---|---|---|
| Breed | NSNP after QC | Adjacent SNPs (± SD) | Syntenic SNPs (± SD) |
| AL | 179 | 0.169 ± 0.218 | 0.054 ± 0.077 |
| BA | 173 | 0.364 ± 0.322 | 0.134 ± 0.179 |
| BC | 164 | 0.392 ± 0.316 | 0.154 ± 0.193 |
| CA | 178 | 0.171 ± 0.212 | 0.085 ± 0.120 |
| LA | 112 | 0.641 ± 0.358 | 0.263 ± 0.289 |
| NU | 154 | 0.253 ± 0.281 | 0.091 ± 0.137 |
| RA | 187 | 0.130 ± 0.194 | 0.031 ± 0.051 |
| SA | 179 | 0.229 ± 0.269 | 0.087 ± 0.119 |
| TO | 163 | 0.265 ± 0.289 | 0.126 ± 0.159 |
AL Alpine, BA Australian Boer, BC Canadian Boer, CA Cashmere, LA LaMancha, NU Nubian, Pop population, RA Rangeland, SA Saanen, TO Toggenburg, QC genotype quality control, SD standard deviation
Fig. 8Manhattan plot for genome-wide association studies for ear size (defined as short, medium or long). Blue and red line indicates 0.05 and 0.01 significance threshold levels (p-values), respectively