| Literature DB >> 26294278 |
M Frąszczak1, J Szyda2,3.
Abstract
The goal of this study was to compare significant SNP selection approaches in the context of complex traits based on SNP estimates obtained by models: a model fitting a single SNP (M1), a model fitting a single SNP and a random polygenic effect (M2), the nonparametric CAR score (M3), a SNP-BLUP model with random effects of all SNPs fitted simultaneously (M4). There were 46,267 SNPs tested in a population of 2601 Holstein Friesian bulls, four traits (milk and fat yields, somatic cell score, non-return rate for heifers) were considered. The numbers of SNPs selected as significant differed among models. M1 selected a very large number of SNPs, except for a NRH in which no SNPs were significant. M2 and M3 both selected similar and low number of SNPs for each trait. M4 selected more SNPs than M2 and M3. Considering linkage disequilibrium between SNPs, for MY M2 and M3 selected SNPs more highly correlated with each other than in the case of M4, while for FY M3 selection contained more correlated SNPs than M2 and M4. In conclusion, if the research interest is to identify SNPs not only with strong, but also with moderate effects on a complex trait a multiple-SNP model is recommended. Such models are capable of accounting for at least a part of linkage disequilibrium between SNPs through the design matrix of SNP effects. Functional annotation of SNPs significant in M4 reveals good correspondence between selected polymorphisms and functional information as well as with QTL mapping results.Entities:
Keywords: Complex traits; GWAS; Mixed model; Significance testing
Mesh:
Year: 2015 PMID: 26294278 PMCID: PMC4830853 DOI: 10.1007/s13353-015-0305-6
Source DB: PubMed Journal: J Appl Genet ISSN: 1234-1983 Impact factor: 3.240
The numbers of SNPs selected as significant
| Trait | M1 | M2 | M3 | M4 |
|---|---|---|---|---|
| Fat yield | 2435 | 72 | 48 | 182 |
| Milk yield | 3398 | 66 | 78 | 153 |
| Somatic cell score | 2242 | 4 | 0 | 163 |
| Non-return rate for heifers | 0 | 0 | 0 | 125 |
Fig. 1The percentage of SNPs significant between models
Fig. 2Genomic location of significant SNPs common between models
Annotation of SNPs significant in M2, M3, and M4. Genes significant for both MY and FY are marked in bold. For significant genes all SNPs located within intron/exon regions are listed
| Gene acronym | Gene name | Ensemble ID | BTA | SNP | SNP annotation | SNP position |
|---|---|---|---|---|---|---|
| Fat yield | ||||||
| FOXH1 | Forkhead box H1 | ENSBTAG00000004761 | 14 | rs109146371 | 3390 bp upstream | 1,651,311 |
| CYHR1 | Cysteine/histidine-rich 1 | ENSBTAG00000035254 | 14 | rs109968515 | intron | 1,675,278 |
| VPS28 | Homologous to vacuolar protein sorting 28 | ENSBTAG00000026320 | 14 | rs17870736 | intron | 1,696,470 |
| DGAT1 | Diacylglycerol O-acyltransferase 1 | ENSBTAG00000026356 | 14 | rs109421300 | intron | 1,801,116 |
| MAF1 | Homologous to RNA polymerase III-inhibiting protein | ENSBTAG00000012242 | 14 | rs41256919 | exon – missense variant | 1,923,292 |
| SPATC1 | Spermatogenesis and centriole associated 1 | ENSBTAG00000026350 | 14 | rs41629750 | 4472 bp upstream | 2,002,873 |
| PLEC | Plectin | ENSBTAG00000011922 | 14 | rs109350371 | 460 bo upstream | 2,054,457 |
|
| Mitogen-activated protein kinase 15 | ENSBTAG00000019864 | 14 | rs110323635 | exon – missense variant | 2,239,085 |
| EEF1D | Elongation factor 1-delta | ENSBTAG00000014643 | 14 | rs109661298 | intron | 2,319,504 |
| ZC3H3 | Zinc finger CCCH-type containing 3 | ENSBTAG00000021472 | 14 | rs109617015 | intron | 2,386,688 |
| RHPN1 | Rhophilin, Rho GTPase binding protein 1 | ENSBTAG00000002104 | 14 | rs109529219 | intron | 2,468,020 |
| Novel gene | ENSBTAG00000003606 | 14 | rs110199901 | intron | 2,524,432 | |
|
| Lymphocyte antigen 6 complex, locus K | ENSBTAG00000000158 | 14 | rs110174651 | 297 bp downstream | 2,754,909 |
| LY6D | Lymphocyte antigen 6 complex, locus D | ENSBTAG00000034498 | 14 | rs110237430 | 978 bp downstream | 2,803,998 |
| LYPD2 | LY6/PLAUR domain containing 2 | ENSBTAG00000016210 | 14 | rs109476486 | 2653 bp upstream | 2,826,632 |
| BAI1 | Brain-specific angiogenesis inhibitor 1 | ENSBTAG00000006385 | 14 | rs109545018 | intron | 3,006,509 |
|
| G protein-coupled receptor 20 | ENSBTAG00000015985 | 14 | rs110411273 | 71 bp downstream | 3,640,788 |
| TSNARE1 | t-SNARE domain containing 1 | ENSBTAG00000009974 | 14 | rs109875744 | intron | 3,078,843 |
|
| PTK2 protein tyrosine kinase 2 | ENSBTAG00000009578 | 14 | rs109670279 | intron | 3,885,798 |
|
| Argonaute RISC catalytic component 2 | ENSBTAG00000001579 | 14 | rs109948273 | intron | 4,103,850 |
|
| Trafficking protein particle complex 9 | ENSBTAG00000013955 | 14 | rs109807697 | intron | 4,240,287 |
| COL22A1 | Collagen, type XXII, alpha 1 | ENSBTAG00000015374 | 14 | rs110444021 | intron | 5,225,467 |
| Milk yield | ||||||
| C8orf33 | Chromosome 14 open reading frame 33 | ENSBTAG00000000879 | 14 | rs109752439 | 87 bp downstream | 1,489,496 |
| FAM135B | Family with sequence similarity 135, member B | ENSBTAG00000018218 | 14 | rs110622450 | intron | 5,428,037 |
Fig. 3Position of genes on BTA14 marked by significant SNPs common for models 2–4