| Literature DB >> 28770057 |
Hafiz Ishfaq Ahmad1, Guiqiong Liu1, Xunping Jiang1, Chenhui Liu1, Yuqing Chong1, Huang Huarong1.
Abstract
Detecting signatures of selection can provide a new insight into the mechanism of contemporary breeding and artificial selection and further reveal the causal genes associated to the phenotypic variation. However, the signatures of selection on genes entailing for profitable traits between Chinese commercial and indigenous goats have been poorly interpreted. We noticed footprints of positive selection at MC1R gene containing SNPs genotyped in five Chinese native goat breeds. An experimental distribution of FST was built based on approximations of FST for each SNP across five breeds. We identified selection using the high FST outlier method and found that MC1R candidate gene show evidence of positive selection. Furthermore, adaptive selection pressure on specific codons was determined using different codon based on maximum-likelihood methods; signature of positive selection in mammalian MC1R was explored in individual codons. Evolutionary analyses were inferred under maximum likelihood models, the HyPhy package implemented in the DATAMONKEY Web Server. The results of codon selection displayed positive diversifying selection at the sites were mainly involved in development of genetic variations in coat color in various mammalian species. Positive diversifying selection inferred with recent evolutionary changes in domesticated goat MC1R provides new insights that the gene evolution may have been modulated by domestication events in goats.Entities:
Keywords: SNP; goat; maximum likelihood; positive selection
Year: 2017 PMID: 28770057 PMCID: PMC5528238 DOI: 10.1002/ece3.2919
Source DB: PubMed Journal: Ecol Evol ISSN: 2045-7758 Impact factor: 2.912
Heterozygosity (He) and Fixation Index F values for each of 13 genotyped SNPs calculated by LOSITAN in five goat populations
| SNP | Locus | Heterozygosity ( | Fixation index ( |
|
|---|---|---|---|---|
|
| Melanocortin receptor |
|
|
|
|
| Melanocortin receptor | 0.50487 | 0.36586 | 0.94737 |
|
| Agouti protein | 0.49878 | 0.09401 | 0.27772 |
|
| Leutinizing hormone | 0.29569 | 0.01821 | 0.01939 |
|
| Leutinizing hormone | 0.22927 | 0.01597 | 0.00865 |
|
| Gonadotrpinreleasing hormone | 0.17240 | 0.03570 | 0.07813 |
|
| Prostaglandin receptor | 0.49341 | 0.11916 | 0.37981 |
|
| Follicle stimulating hormone | 0.24826 | 0.42750 | 0.96298 |
|
| Mayogenic factor | 0.33079 | 0.17224 | 0.59913 |
|
| Inhibin | 0.44237 | 0.18577 | 0.58081 |
|
| Neuropeptide Y | 0.29587 | 0.10812 | 0.37418 |
|
| Insulin like growth factor | 0.45943 | 0.18686 | 0.58423 |
|
| Aralkylamine N‐acetyltransferase | 0.36705 | 0.13483 | 0.43468 |
Bold value showing loci with high F ST value. Significant: p* (Simulated F ST < sample F ST).
Figure 1Candidate locus agouti under positive selection keeping the 95% confidence interval. F ST, fixation index; H, Heterozygosity
Sites found under positive selection at MC1R gene
| REL | IFEL | SLAC | FEL | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Positive selection sites | dN‐dS | Bayes Factor | Positive selection sites | dN‐dS |
| Positive selection sites | dN‐dS |
| Positive selection sites | dN‐dS |
|
| 59 | 0.24 | 40.18 | 59 | 0.27 | .03 | 59 | 2.385 | .02 | 59 | 0.26 | .02 |
| 218 | 0.11 | 41.56 | 279 | 0.76 | .04 | 279 | 2.554 | .04 | 364 | 0.19 | .02 |
| 248 | 0.04 | 21.95 | 327 | 0.28 | .04 | ||||||
| 327 | 0.05 | 31.74 | |||||||||
| 364 | 0.15 | 255.7 | |||||||||
REL, IFEL, SLAC, and FEL test significance levels are given as p‐values, while empirical Bayes factors are given for REL. Significant values (p < .05), Bayes factors >20.
Mixed‐effect model evolution based episodic diversifying selection at MC1R gene
| Codon | α | β− | Pr[β = β−] | β+ | Pr[β = β+] |
| q‐value |
|---|---|---|---|---|---|---|---|
| 59 | 0 | 0 | 0.616 | 1.809 | 0.384 | .015 | 1 |
| 86 | 0.722 | 0 | 0.954 | 103.8 | 0.046 | .031 | 1 |
| 108 | 0.148 | 0 | 0.987 | 613.7 | 0.013 | .005 | 1 |
| 153 | 2.186 | 0.502 | 0.958 | 38.36 | 0.042 | .045 | 1 |
| 179 | 0.254 | 0 | 0.984 | 16.28 | 0.016 | .021 | 1 |
| 364 | 0 | 0 | 1e‐09 | 0.454 | 1 | .03 | 1 |
Table reports the distribution of synonymous (α) and nonsynonymous (β) substitution rates over sites inferred by MEME model where the branch proportion with β > α significantly >0. p‐values determined by χ2 distribution and q‐values obtained using Simes’ procedure to reduce false discovery rates under strict null neutral model.
Fast unconstrained Bayesian approximation inferring pervasive diversifying selection at MC1R gene
| Codon | α | β | β−α | Pr[β > α] | E.B. Factor | PSRF | Neff |
|---|---|---|---|---|---|---|---|
| 59 | 0.08 | 0.26 | 0.18 | 0.927 | 38.32 | 0.998 | 1,892.62 |
| 279 | 0.22 | 0.67 | 0.45 | 0.924 | 36.66 | 0.998 | 1,560.56 |
| 364 | 0.04 | 0.21 | 0.16 | 0.95 | 56.49 | 1.004 | 591.746 |
α, The mean of posterior distribution with respect to data set wide distribution of rates of the empirical Bayes estimate of synonymous substitution rates; β, Posterior mean of nonsynonymous substitution rate; Pr [β > α], Posterior mean of the site level probability of positive selection; PSRF, Potential scale reduction factor and the values close to indicate MCMC convergence; Neff, The effective sample size from pooled chain.
Figure 2Fast unconstrained Bayesian approximation inferring pervasive diversifying selection based on synonymous (α) and nonsynonymous rates (β) substitution rate with continuous model parameters that vary from one site to another, illustrated in posterior alignment‐wide distribution of substitution rates Mean values. β = 0.76, β−α = −0.24, ω = 5.49 Weights. Pr[α > β] = 0.708, Pr[α = β] = 0.042, Pr[α < β] = 0.250
Codon model selection based on Modified Bayesian Information Criterion (mBIC)
|
| Models | Credible | mBIC | ΔmBIC | dN/dS |
|---|---|---|---|---|---|
| 1 | 1 | 0 | 27,594.2 | 0.16/75 | |
| 2 | 3,302 | 0 | 27,204.6 | 389.6 | 0.07/45 |
| 3 | 6,680 | 1,948 | 27,114.7 | 89.9 | 0.06/39 |
| 4 | 536 | 1 | 27,121.3 | −6.66 | 0.05/20 |
N, Number of rate classes included in models; Models, Genetic algorithm models; Credible, All the models evaluated by genetic algorithm within 9.21 mBIC unit (best model has credible values 0.01 or >1); mBIC, Modified Bayesian Information Criterion; ΔmBIC, mBIC for N rate classes compared to N−1 rate classes; dN/dS, Maximum‐likelihood estimates for each rate class.
Figure 3Evolutionary rate cluster in structured genetic algorithm models (GA) inferred from gene alignment. Each cluster labeled with maximum‐likelihood estimate of its rate inferred under genetic algorithm. The nodes (residues) are annotated by their biochemical properties and Stanfel class, and the rates (edges) are labeled with model averaged rate estimates
Figure 4Evolutionary fingerprints of gene inferred from alignments on log scale, with the diagonal line corresponding the values α = β for neutral evolution. Dots in the circle indicating the ratio β/α and the area of circle represent the weight of rate classes. The points above the diagonal correspond positive selection and below the diagonal negative selection