| Literature DB >> 25148050 |
Ming Fang1, Weixuan Fu2, Dan Jiang3, Qin Zhang2, Dongxiao Sun2, Xiangdong Ding2, Jianfeng Liu2.
Abstract
The multiple-SNP analysis has been studied by many researchers, in which the effects of multiple SNPs are simultaneously estimated and tested in a multiple linear regression. The multiple-SNP association analysis usually has higher power and lower false-positive rate for detecting causative SNP(s) than single marker analysis (SMA). Several methods have been proposed to simultaneously estimate and test multiple SNP effects. In this research, a fast method called MEML (Mixed model based Expectation-Maximization Lasso algorithm) was developed for simultaneously estimate of multiple SNP effects. An improved Lasso prior was assigned to SNP effects which were estimated by searching the maximum joint posterior mode. The residual polygenic effect was included in the model to absorb many tiny SNP effects, which is treated as missing data in our EM algorithm. A series of simulation experiments were conducted to validate the proposed method, and the results showed that compared with SMMA, the new method can dramatically decrease the false-positive rate. The new method was also applied to the 50k SNP-panel dataset for genome-wide association study of milk production traits in Chinese Holstein cattle. Totally, 39 significant SNPs and their nearby 25 genes were found. The number of significant SNPs is remarkably fewer than that by SMMA which found 105 significant SNPs. Among 39 significant SNPs, 8 were also found by SMMA and several well-known QTLs or genes were confirmed again; furthermore, we also got some positional candidate gene with potential function of effecting milk production traits. These novel findings in our research should be valuable for further investigation.Entities:
Mesh:
Year: 2014 PMID: 25148050 PMCID: PMC4141689 DOI: 10.1371/journal.pone.0099544
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The profiles of the true SNP parameters (the top panel), the estimated 500 SNP heritabilities with MEML (the middle panel) and −log10 P with SMMA (the bottom panel), respectively.
The x-axis indicates the SNP numbers. In the top panel, the true heritabilities of small-effect SNPs are presented with diamonds on the top of their needles but not for large-effect SNPs. The dotted horizontal lines in the middle and the bottom panels present the thresholds with 1,000 permutations from the multiple-SNP and SMMA methods, respectively.
Figure 2The profiles of the estimated heritabilities of 500 SNPs for five milk production traits against on the selected SNPs.
The panels from the top to the bottom are the estimated heritiabilities for milk yield, fat yield, protein yield, fat percentage and protein percentage traits, respectively. The x-axis indicates the chromosome number (chromosome are divided by vertical dotted lines). The dotted horizontal line presents the threshold from 1,000 permutations.
The significant SNPs and their nearby genes in the analysis of Chinese dairy cattle data.
| Trait | SNP | Chr | Position (bp) | Heritability | Nearest Gene | |
| Gene | Distance(bp) | |||||
| MY | ARS-BFGL-NGS-4939 | 14 | 1801116 | 6.26E-02 |
| Within |
| Hapmap47777-BTA-91000 | X | 1.41E+08 | 9.81E-03 |
| 81004 | |
| ARS-BFGL-NGS-49079 | 9 | 6574398 | 9.05E-03 | NA | NA | |
| ARS-BFGL-NGS-103091 | 5 | 74518588 | 8.14E-03 |
| 45586 | |
| Hapmap60955-rs29022431 | 23 | 21292766 | 7.83E-03 | NA | NA | |
| ARS-BFGL-NGS-11319 | 2 | 6763227 | 7.69E-03 |
| 22898 | |
| Hapmap48369-BTA-50306 | 1 | 7627111 | 7.39E-03 | NA | NA | |
| FY | ARS-BFGL-NGS-4939 | 14 | 1801116 | 2.63E-02 |
| Within |
| Hapmap42263-BTA-60093 | 25 | 35342491 | 1.18E-02 |
| 42285 | |
| Hapmap40191-BTA-73919 | 5 | 71978791 | 7.54E-03 |
| 52073 | |
| PY | ARS-BFGL-BAC-6525 | 10 | 92127288 | 1.94E-02 |
| Within |
| ARS-BFGL-NGS-115291 | 4 | 4090824 | 1.70E-02 | NA | NA | |
| ARS-BFGL-NGS-39539 | 23 | 41457147 | 1.05E-02 |
| 154416 | |
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 1.02E-02 |
| Within | |
| ARS-BFGL-NGS-110497 | 26 | 45870133 | 9.98E-03 |
| Within | |
| ARS-BFGL-NGS-29581 | 4 | 1.14E+08 | 9.66E-03 |
| Within | |
| FP | ARS-BFGL-NGS-4939 | 14 | 1801116 | 0.179094 |
| Within |
| Hapmap50271-BTA-17442 | 5 | 81903458 | 2.68E-02 |
| Within | |
| ARS-BFGL-NGS-111443 | 5 | 94269370 | 1.21E-02 |
| 46923 | |
| Hapmap51303-BTA-74377 | 5 | 83790390 | 1.18E-02 |
| Within | |
| ARS-BFGL-NGS-118998 | 20 | 32030332 | 1.03E-02 |
| Within | |
| Hapmap39717-BTA-112973 | 2 | 26781358 | 8.62E-03 |
| Within | |
| BTB-00231742 | 5 | 77095345 | 8.15E-03 | NA | NA | |
| BTB-00285653 | 8 | 30036807 | 7.72E-03 |
| Within | |
| BTB-00777571 | 20 | 34017024 | 6.96E-03 | NA | NA | |
| ARS-BFGL-NGS-113507 | 11 | 98407974 | 6.38E-03 |
| Within | |
| PP | ARS-BFGL-NGS-4939 | 14 | 1801116 | 3.52E-02 |
| Within |
| BTA-39609-no-rs | 0 | 0 | 1.63E-02 |
|
| |
| Hapmap48524-BTA-92140 | 5 | 75684520 | 1.58E-02 |
| 24751 | |
| BTA-50402-no-rs | 20 | 34451383 | 1.50E-02 | NA | NA | |
| BTB-01844123 | X | 307557 | 1.30E-02 | NA | NA | |
| BTA-121739-no-rs | 6 | 38063313 | 1.17E-02 |
| Within | |
| Hapmap54188-rs29022489 | 6 | 75017253 | 1.11E-02 | NA | NA | |
| Hapmap24324-BTC-062449 | 6 | 37631640 | 1.07E-02 |
| 45459 | |
| ARS-BFGL-NGS-111443 | 5 | 94269370 | 1.04E-02 |
| 46923 | |
| ARS-BFGL-NGS-107037 | 10 | 46486647 | 9.55E-03 |
| Within | |
| ARS-BFGL-NGS-61452 | 4 | 75250982 | 9.15E-03 |
| 91623 | |
| ARS-BFGL-NGS-53343 | 6 | 29709875 | 8.08E-03 | NA | NA | |
| ARS-BFGL-NGS-117896 | 28 | 35874524 | 7.50E-03 |
| Within | |
| Hapmap42216-BTA-45665 | 19 | 45934555 | 7.45E-03 |
| Within | |
| Hapmap50621-BTA-21320 | 6 | 64425164 | 7.31E-03 | NA | NA | |
| ARS-BFGL-NGS-53398 | X | 21953655 | 7.30E-03 |
| Within | |
| Hapmap38455-BTA-100999 | 9 | 76346736 | 6.94E-03 |
| 63196 | |
| BTA-48480-no-rs | 2 | 95119968 | 6.70E-03 |
| 25151 | |
SNP are also detected by SMMA; NA: there is no assigned gene around the SNP in a distance of 200 kb; —: the SNPs with unknown positions.