| Literature DB >> 24456127 |
Lesley-Ann Raven1, Benjamin G Cocks, Ben J Hayes.
Abstract
BACKGROUND: Genome wide association studies (GWAS) in most cattle breeds result in large genomic intervals of significant associations making it difficult to identify causal mutations. This is due to the extensive, low-level linkage disequilibrium within a cattle breed. As there is less linkage disequilibrium across breeds, multibreed GWAS may improve precision of causal variant mapping. Here we test this hypothesis in a Holstein and Jersey cattle data set with 17,925 individuals with records for production and functional traits and 632,003 SNP markers.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24456127 PMCID: PMC3905911 DOI: 10.1186/1471-2164-15-62
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Results of within breed and within gender genome wide association analysis
| | | | | |
| Fat | 806 | 984 | 25 (3.10) | 44 (4.0) |
| Milk | 2117 | 3969 | 461 (21.8) | 242 (23.5) |
| Protein | 4016 | 2606 | 557 (13.9) | 504 (24.1) |
| Fat% | 677 | 5093 | 66 (9.75) | 141 (16.1) |
| Protein% | 692 | 5444 | 150 (21.7) | 368 (34.2) |
| Fertility | 16 | 1 | 0 | 0 |
| Mamm. Syst. | 1721 | 51 | 0 | 0 |
| Survival | 4302 | 206 | 1 (0.5) | 10 (2.9) |
| SCC | 248 | 3600 | 42 (16.9) | 28 (19.0) |
| | | | | |
| Fat | 2100 | 3899 | 1543 (73.5) | 228 (82.0) |
| Milk | 5321 | 5488 | 1529 (28.74) | 1023 (31.6) |
| Protein | 3288 | 8730 | 2936 (89.3) | 178 (90.5) |
| Fat% | 2084 | 3422 | 1353 (64.9) | 372 (67.3) |
| Protein% | 3378 | 4374 | 1644 (48.7) | 1512 (54.1) |
| Fertility | 1567 | 10 | 0 | 0 |
| Mamm. Syst. | 14770 | 147 | 28 (19.0) | 79 (24.5) |
| Survival | 3522 | 220 | 20 (9.1) | 138 (14.5) |
| SCC | 729 | 701 | 26 (3.7) | 82 (7.0) |
The number of SNP which are significant in both genders within Jerseys and Holsteins are presented (percentage of significant SNP that are significant in both genders given in parenthesis), as well as the number of 100 kb intervals containing significant SNP for that trait in both genders (percentage of 100 kb windows with a significant SNP in either gender with significant SNP in both genders given in parenthesis). Milk production traits were tested at P < 10-8 and reproductive and functional traits at P < 10-5.
Correlations of SNP effects for very significant SNP (P < 10 ) within genders within each breed, and between breeds for milk production, reproduction and health traits
| Fat | 0.984 | 0.394 | 0.983 | 0.969 | |
| Milk | 0.973 | 0.082 | 0.978 | 0.973 | |
| Protein | 0.979 | 0.273 | 0.986 | 0.984 | |
| Fat% | 0.973 | 0.629 | 0.955 | −0.057 | |
| Protein% | 0.977 | 0.034 | 0.965 | −0.081 | |
| Survival | 0.979 | 0.997 | | | |
| Mamm. Syst | 0.979 | | | | |
| SCC | 0.999 | | | | |
| Fertility | 0.979 |
Results of within breed genome wide association analysis
| Fat | 2654 | 5718 | 124 (4.7) | 633 (7.5) |
| Milk | 23808 | 17573 | 1210 (6.9) | 6341(14.2) |
| Protein | 33671 | 21065 | 1327 (6.3) | 8804 (14.9) |
| Fat% | 7688 | 12141 | 2996 (38.9) | 2416 (43.1) |
| Protein% | 12141 | 7434 | 2032 (27.3) | 3243 (33.4) |
| Fertility | 2276 | 125 | 0 | 1 (0.8) |
| Mamm. Sys | 12088 | 3032 | 130 (4.3) | 707 (11.1) |
| Survival | 3930 | 6263 | 300 (7.6) | 739 (12.3) |
| SCC | 1693 | 3507 | 36 (2.1) | 101 (6.1) |
The number of SNP which are significant in both breeds are presented (percentage of significant SNP that are significant in both breeds given in parenthesis), as well as the number of 100 kb intervals containing significant SNP for that trait in both breeds (percentage of 100 kb windows with a significant SNP in either breed with significant SNP in both breeds given in parenthesis). Milk production traits were tested at P < 10-8 and reproductive and functional traits at P < 10-5.
Most significant SNP regions and candidate genes for production and functional traits in dairy cattle from a large multibreed GWAS
|
| ||||||||
|---|---|---|---|---|---|---|---|---|
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 255.551 | DGAT1 | | | diacylglycerol O-acyltransferase 1 | |
| BTB-00932332 | 26 | 22118554 | 24.142 | BRTC | | | Beta-transducin repeat containing | |
| BovineHD0500026662 | 5 | 93945655 | 23.718 | | MGST1 | | microsomal glutathione S-transferase 1 | |
| BovineHD1800017481 | 18 | 60506726 | 12.907 | LOC788871 | | | zinc finger protein 85-like | |
| BovineHD1200015001 | 12 | 54273515 | 12.918 | | | RNF219 | ring finger protein 219 | |
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 416.491 | DGAT1 | | | diacylglycerol O-acyltransferase 1 | |
| BovineHD2000009925 | 20 | 34582764 | 44.167 | | | LOC782462 | sorting nexin-13-like | |
| BovineHD0500026852 | 5 | 94562606 | 24.636 | EPS8 | | | epidermal growth factor receptor pathway substrate 8 | |
| BovineHD0600024338 | 6 | 88865430 | 22.814 | | | GC | group-specific component (vitamin D binding protein) | |
| BovineHD4100003579 | 5 | 32784231 | 19.562 | RPAP3 | | | RNA polymerase II associated protein 3 | |
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 144.166 | DGAT1 | | | diacylglycerol O-acyltransferase 1 | |
| BovineHD4100005296 | 6 | 87180731 | 31.840 | CSN2 | | | casein beta | |
| BovineHD2600004009 | 26 | 15654751 | 21.778 | PLCE1 | | | phospholipase C, epsilon 1 | |
| BovineHD0500029843 | 5 | 104307736 | 19.303 | CD27 | | | CD27 molecule | |
| BovineHD2600011254 | 26 | 40809501 | 18.214 | | | PPAPDC1A | phosphatidic acid phosphatase type 2 domain containing 1A | |
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 1684.117 | DGAT1 | | | diacylglycerol O-acyltransferase 1 | |
| BovineHD0500026662 | 5 | 93945655 | 93.370 | | MGST1 | | microsomal glutathione S-transferase 1 | |
| BovineHD2000009925 | 20 | 34582764 | 61.199 | | | LOC782462 | sorting nexin-13-like | |
| ARS-BFGL-NGS-57448 | 27 | 36155097 | 32.124 | GINS4 | | | GINS complex subunit 4 (Sld5 homolog) | |
| BovineHD0200034371 | 2 | 119076939 | 13.851 | | SP110 | | SP110 nuclear body protein | |
| ARS-BFGL-NGS-4939 | 14 | 1801116 | 398.865 | DGAT1 | | | diacylglycerol O-acyltransferase 1 | |
| BovineHD2000009927 | 20 | 34587828 | 180.352 | | | LOC782462 | sorting nexin-13-like | |
| BovineHD0600023879 | 6 | 87160102 | 70.809 | | CAS1A | | casein alpha s1 | |
| BovineHD0300005054 | 3 | 15451257 | 44.404 | GBA | | | glucosidase, beta, acid | |
| BovineHD4100003579 | 5 | 32784231 | 34.001 | RPAP3 | | | RNA polymerase II associated protein 3 | |
| BovineHD1800016761 | 18 | 57548213 | 7.445 | LOC786539 | | | carcinoembryonic antigen-related cell adhesion molecule 18-like | |
| BovineHD0600024357 | 6 | 88922396 | 6.712 | | | NPFFR2 | neuropeptide FF receptor 2 | |
| BovineHD0500024481 | 5 | 86451512 | 6.441 | | | SOX5 | SRY (sex determining region Y)-box 5 | |
| BovineHD0400015668 | 4 | 57384888 | 6.371 | | | LOC100298628 | HIG1 domain family, member 1D-like pseudogene | |
| BovineHD0100005095 | 1 | 16862494 | 6.215 | LOC100140852 | | | magnesium transporter NIPA2-like | |
| BovineHD2600006024 | 26 | 23379482 | 9.478 | SUFU | | | suppressor of fused homolog (Drosophila) | |
| BovineHD0200030267 | 2 | 105149497 | 8.263 | SMARCAL1 | | | SWI/SNF related, matrix assoc., actin dep. Reg. of chromatin, subfamily a-like 1 | |
| BovineHD1100008368 | 11 | 28171549 | 8.175 | PRKCE | | | protein kinase C, epsilon | |
| BovineHD0200035718 | 2 | 123208383 | 8.069 | | SDC3 | | syndecan 3 | |
| BovineHD2000008146 | 20 | 27565339 | 7.890 | | | ISL1 | ISL LIM homeobox 1 | |
| BovineHD1100031193 | 11 | 107227039 | 67.300 | | NLRP6 | | NLR family, pyrin domain containing 6 | |
| Hapmap40387-BTA-107848 | 9 | 104960154 | 35.305 | | WDR27 | | WD repeat domain 27 | |
| BovineHD3000041679 | 30 | 144726003 | 33.731 | | | TBL1X | transducin (beta)-like 1X-linked | |
| BovineHD2000006692 | 20 | 22290945 | 31.258 | | MIER3 | | mesoderm induction early response 1, family member 3-like | |
| BovineHD1700000492 | 17 | 2237102 | 30.497 | | | NPY2R | neuropeptide Y receptor Y2 | |
| ARS-BFGL-NGS-92033 | 1 | 2582894 | 12.503 | | MIS18A | | MIS18 kinetochore protein homolog A (S. pombe) | |
| ARS-BFGL-NGS-57102 | 3 | 31219298 | 10.716 | | CTTNBP2NL | | CTTNBP2 N-terminal like | |
| BTA-104934-no-rs | 7 | 67940770 | 10.324 | LARP1 | | | La ribonucleoprotein domain family, member 1 | |
| BovineHD1100009904 | 11 | 32859722 | 9.169 | NRXN1 | | | neurexin 1 | |
| BovineHD1600008712 | 16 | 30646220 | 8.556 | ADCK3 | aarF domain containing kinase 3 |
Comparison of the proportion of SNP occurring within genes in the whole genome (632 K SNP) and SNP significant (P < 10 ) in the multibreed and single breed analyses
| | ||||
|---|---|---|---|---|
| 0.319 | 0.381 | 0.361 | 0.324 | |
| 0.319 | 0.348 | 0.330 | 0.323 | |
| 0.319 | 0.363 | 0.329 | 0.327 | |
| 0.319 | 0.374 | 0.356 | 0.332 | |
| 0.319 | 0.327 | 0.332 | 0.311 | |
| 0.319 | 0.375 | 0.200 | 0.399 | |
| 0.319 | 0.296 | 0.344 | 0.392 | |
| 0.319 | 0.150 | 0.328 | 0.191 | |
Figure 1Association analysis of SNP surrounding on BTA20. Associations of SNP in a 30 Mb region of BTA20 with protein percentage in Holsteins (Black dots), Jerseys (Orange dots) and in a multibreed GWAS (lower panel). The yellow vertical line represents the position of the GHR gene.
Figure 2Association analysis of SNP surrounding on BTA6. Associations of SNP in a 12 Mb region of BTA6 with protein percentage in Holsteins (Black dots), Jerseys (Orange dots) and in a multibreed GWAS (lower panel). The yellow vertical line represents the position of the ABCG2 gene.
Figure 3Association analysis of SNP surrounding on BTA14. Associations of SNP in a 7 Mb region of BTA14 for a) fat percentage and b) protein percentage for Holsteins (Black dots), Jerseys (Orange dots) and in a multibreed GWAS (lower panel). The yellow vertical line represents the position of the DGAT1 gene.
Figure 4Association analysis of SNP on BTA19. Associations of SNP on BTA19 encompassing a cluster of fat metabolism genes for a) fat percentage and b) protein percentage in Holsteins (black dots, upper figure), Jerseys (orange dots, upper figure), and in a multibreed population (lower panel of each Figure).
Figure 5Association of SNP on BTA5. Associations of SNP in a 20 Mb QTL region of BTA5 with fat percentage in Holsteins (black dots, upper figure), Jerseys (orange dots, upper figure) and in a multibreed population (lower panel).
Figure 6Principal component analysis of the genomic relationship matrix, with animals coloured by breed and sex.