Zexi Cai, Bernt Guldbrandtsen, Mogens Sandø Lund, Goutam Sahana.
Abstract
BACKGROUND: Genome-wide association studies (GWAS) have been successfully implemented in cattle research and breeding. However, moving from associations to identifying the causal variants and revealing the underlying mechanisms has proven complicated. In dairy cattle populations, long-range linkage disequilibrium (LD) arising from close familial relationships among the studied individuals poses a particular challenge: it makes it difficult to determine whether one or multiple quantitative trait loci (QTL) are segregating in a genomic region associated with a phenotype. This study had two objectives: 1) to distinguish between multiple QTL segregating in a genomic region, and 2) to use external information to prioritize candidate genes and candidate variants for a QTL.
Keywords: Candidate genes; Closely linked association signals; Dairy cattle; GWAS; Milk traits
Year: 2018 PMID: 29751743 PMCID: PMC5948690 DOI: 10.1186/s12863-018-0620-0
Source DB: PubMed Journal: BMC Genet ISSN: 1471-2156 Impact factor: 2.797
Lead SNPs from genome-wide associated regions for fat yield in Nordic Holstein cattle. Base positions are given as position in UMD 3.1.1 [49]
| BTA | Base position | Imputation accuracy | Effect | –log10(p) | Region | Gene | Annotation |
|---|---|---|---|---|---|---|---|
| 2 | 126979882 | 0.9972 | −1.31 | 11.46 | 126041707~127230070 | | downstream |
| 2 | 85991577 (b) | 0.9542 | 1.30 | 8.91 | 85042155~86241732 | | intron |
| 3 | 7226390 | 0.9998 | −1.09 | 9.01 | 6264604~7476473 | | intron |
| 5 | 93948357 | 0.9906 | 3.28 | 62.41 | 93698481~94198475 | | intron |
| 5 | 20284735 (b) | 0.9692 | −1.30 | 9.79 | 20035379~20534779 | 5S_rRNA | intergenic |
| 6 | 95497933 | 0.9996 | −1.45 | 14.76 | 95248213~95747954 | | intergenic |
| 6 | 32950721 (b) | 0.4975 | 6.33 | 11.39 | 32367171~33200834 | ENSBTAG00000047255 | intron |
| 7 | 57287990 | 0.8807 | −1.66 | 20.11 | 57038213~57538309 | | intron |
| 9 | 38715137 | 0.9809 | −1.47 | 8.89 | 38345408~38965425 | | intron |
| 11 | 88771449 | 0.9876 | 1.16 | 10.43 | 88521462~89021477 | ENSBTAG00000047976 | intergenic |
| 11 | 15323223 (b) | 0.8962 | −1.32 | 9.81 | 14855568~15573444 | | intron |
| 11 | 55681193 (c) | 0.9948 | −1.60 | 9.91 | 55423855~55931229 | | intron |
| 12 | 68965758 | 0.9957 | −1.10 | 8.93 | 68502223~69216445 | ENSBTAG00000045195 | intergenic |
| 14 (a) | 1802265 | 0.9398 | −6.93 | 240.56 | 1549133~2049435 | | missense |
| 14 (a) | 1802266 | 0.9362 | −6.93 | 240.56 | 1549133~2049435 | | missense |
| 15 | 65891100 | 0.9992 | 1.50 | 12.99 | 65363764~66141839 | | intergenic |
| 15 | 25044706 (b) | 0.9908 | −1.17 | 9.80 | 24795472~25295470 | | intron |
| 17 | 62543160 | 0.9898 | 1.14 | 10.49 | 62224291~62793298 | | intron |
| 18 | 18970551 | 0.9442 | −1.19 | 10.30 | 18341203~19220732 | | intergenic |
| 19 | 27522927 | 0.8500 | −1.32 | 10.86 | 26625240~27773016 | | intergenic |
| 20 | 22609736 | 0.9813 | 1.53 | 14.23 | 21664412~22859809 | | intergenic |
| 26 | 20547445 | 0.9993 | −1.76 | 21.46 | 20297497~20797570 | | intron |
| 26 | 42408595 (b) | 0.9998 | −1.21 | 10.30 | 41409014~42658925 | | intron |
| 29 | 23609412 | 0.7717 | 2.06 | 10.73 | 22613737~23859451 | ENSBTAG00000047094 | intergenic |
| Total number of significant SNPs | 54,435 | | | | | | |
(a) Fourteen additional SNPs on chromosome 14 located near the DGAT1 gene had the same highest P value (details not presented). (b) SNP identified in the second round. (c) SNP identified in the third round.
Lead SNPs from genome-wide associated regions for protein yield in Nordic Holstein cattle. Base positions are given as position in UMD 3.1.1 [49]
| BTA | Base position | Imputation accuracy | Effect | –log10(p) | Region | Gene | Annotation |
|---|---|---|---|---|---|---|---|
| 1 | 63177947 | 0.9885 | −1.94 | 12.35 | 61838881~63178271 | ENSBTAG00000046854 (near) | intergenic |
| 2 | 124837669 | 0.9886 | 1.59 | 12.63 | 124834921~124837676 | | intron |
| 3 | 17160521 | 0.9717 | −1.15 | 8.76 | 15377852~17160986 | | upstream |
| 5 | 93511826 | 0.8626 | −1.37 | 14.25 | 93087740~93511841 | | intergenic |
| 5 | 21792183 (a) | 0.9813 | −1.37 | 10.39 | 20072875~21792190 | | intergenic |
| 5 | 87923795 (b) | 0.9926 | 1.50 | 8.97 | 85996611~87924188 | | intergenic |
| 6 | 88477501 | 0.9962 | −2.60 | 25.98 | 87154594~88477524 | | intron |
| 6 | 48477272 (a) | 0.7329 | 1.49 | 13.59 | 46907022~48477298 | ENSBTAG00000045570 (near) | intergenic |
| 6 | 88749792 (b) | 0.3222 | −1.66 | 12.13 | 86831016~88749854 | | intergenic |
| 7 | 41372989 | 0.9999 | −1.54 | 18.14 | 41085164~41373119 | | intergenic |
| 7 | 72100619 (a) | 0.9077 | 1.59 | 13.29 | 70118741~72100721 | | intergenic |
| 8 | 93065787 | 0.8573 | 1.65 | 10.07 | 91065857~93066321 | | intron |
| 8 | 31538155 (a) | 1.0000 | 1.91 | 9.62 | 30388755~31538625 | | intergenic |
| 9 | 33267855 | 0.8655 | −1.46 | 11.96 | 32627954~33267856 | | intergenic |
| 10 | 93933304 | 0.8370 | −1.36 | 9.90 | 92043775~93933391 | | intron |
| 11 | 35512708 | 0.9999 | −1.45 | 11.82 | 33534073~35512724 | ENSBTAG00000027786 (near) | intergenic |
| 13 | 37208792 | 0.9279 | −1.69 | 10.90 | 35572625~37208843 | | intergenic |
| 13 | 77657858 (a) | 0.9906 | 1.17 | 9.52 | 76677111~77908632 | | intron |
| 14 | 1835440 | 0.7471 | 2.84 | 48.66 | 1448510~1836166 | | intron |
| 14 | 67981742 (a) | 0.7652 | 1.78 | 11.60 | 65984180~67981772 | | intergenic |
| 16 | 32262983 | 0.9290 | −1.52 | 12.79 | 30519873~32263130 | | intron |
| 18 | 57015407 | 0.9754 | 2.56 | 17.71 | 55015676~57015473 | | intron |
| 18 | 15272231 (a) | 0.6697 | −1.16 | 9.53 | 15032157~15272234 | | downstream |
| 19 | 27522927 | 0.8500 | −1.42 | 12.55 | 26422519~27522980 | | downstream |
| 19 | 61014793 (a) | 0.8505 | −1.08 | 8.65 | 60995058~61014874 | | intergenic |
| 20 | 69006609 | 0.9920 | −1.29 | 11.27 | 68120719~69006661 | | intergenic |
| 20 | 9282667 (a) | 0.6747 | 1.71 | 10.93 | 7765154~9282923 | | intron |
| 23 | 10974968 | 0.9304 | −1.18 | 10.68 | 9127211~10975139 | | intergenic |
| 25 | 36403719 | 1.0000 | 1.33 | 10.25 | 36112575~36403849 | | intergenic |
| 26 | 37695494 | 0.9122 | −1.41 | 14.76 | 36684176~37695588 | | intergenic |
| 27 | 36304978 | 0.9834 | 1.06 | 8.52 | 35875452~36305040 | | intron |
| 29 | 17620617 | 0.9576 | 1.47 | 10.37 | 15650574~17620644 | | intron |
| 29 | 35459126 (a) | 0.9999 | 1.61 | 10.11 | 33464929~35459620 | | intron |
| Total number of significant SNPs | 38,439 | | | | | | |
(a) SNP identified in the second round. (b) SNP identified in the third round.
Lead SNPs from genome-wide associated regions for milk yield in Nordic Holstein cattle. Base positions are given as position in UMD 3.1.1 [49]
| BTA | Base position | Imputation accuracy | Effect | –log10(p) | Region | Gene | Annotation |
|---|---|---|---|---|---|---|---|
| 2 | 80753895 | 0.9454 | 1.13 | 9.95 | 79587952~80754103 | | intergenic |
| 3 | 56402959 | 0.9308 | −1.36 | 11.68 | 56392727~56402961 | ENSBTAG00000001873 (near) | intergenic |
| 4 | 101547644 | 0.7008 | −1.66 | 12.65 | 100921921~101547648 | | upstream |
| 5 | 93953487 | 0.9726 | −2.10 | 29.52 | 91953587~93953619 | | upstream |
| 5 | 23022794 (a) | 0.7617 | 1.38 | 12.93 | 21101742~23022797 | | intergenic |
| 5 | 85080296 (b) | 0.7619 | −1.28 | 11.24 | 84425435~85080331 | ENSBTAG00000009778 (near) | intergenic |
| 6 | 88847595 | 0.9009 | −1.78 | 21.61 | 88612186~88847596 | | intergenic |
| 6 | 46901490 (a) | 0.7413 | −1.28 | 11.45 | 46181675~46901554 | | intergenic |
| 6 | 38027010 (b) | 0.9950 | −4.75 | 9.47 | 36909885~38027583 | | missense |
| 7 | 65370850 | 0.9848 | −1.36 | 13.58 | 65256765~65370922 | | intergenic |
| 8 | 73877814 | 0.8453 | −1.37 | 11.14 | 71877875~73877845 | ENSBTAG00000010829 (near) | upstream |
| 8 | 42062591 (a) | 0.9595 | −1.27 | 10.07 | 40245362~42062776 | | intergenic |
| 9 | 33478527 | 0.8801 | −1.25 | 9.23 | 31790030~33478670 | | intergenic |
| 10 | 1989907 | 0.9469 | −1.15 | 9.92 | 448434~1990092 | ENSBTAG00000047622 (near) | intergenic |
| 13 | 36822330 | 0.9933 | −1.66 | 10.74 | 36663680~36822395 | | intron |
| 14 (#) | 1802667 | 0.7975 | 5.98 | 178.35 | 1702853~1797137 | | intron |
| 14 | 67577503 (a) | 0.8898 | −2.16 | 11.04 | 66624772~67828111 | | intergenic |
| 15 | 54392611 | 0.9577 | 1.57 | 16.58 | 52771707~54393036 | | intron |
| 16 | 28384260 | 0.9984 | 1.64 | 10.50 | 28012864~28384605 | | intergenic |
| 17 | 66510224 | 0.9438 | 1.83 | 11.63 | 66119023~66510712 | | intron |
| 18 | 46583346 | 0.9829 | 1.86 | 11.97 | 44583383~46583963 | | upstream |
| 19 | 27442452 | 0.7904 | −1.26 | 9.71 | 26592355~27442492 | bta-mir-497 (near) | upstream |
| 20 | 29996719 | 0.9580 | −2.95 | 31.02 | 27997007~29996870 | | intergenic |
| 23 | 25076472 | 0.9797 | −1.34 | 9.23 | 23690289~25076491 | | intron |
| 26 | 37716420 | 0.9790 | −1.43 | 12.28 | 36730021~37966463 | | intergenic |
| 28 | 34972377 | 0.9991 | −1.29 | 9.81 | 33464705~34972672 | | intergenic |
| Total number of significant SNPs | 57,808 | | | | | | |
(#) Eight additional SNPs on chromosome 14 had the same highest P value. (a) SNP identified in the second round. (b) SNP identified in the third round.
Fig. 1 Manhattan plot for association of SNPs with fat yield in Nordic Holstein cattle. The red horizontal line indicates the genome-wide significance level [−log10(P) = 8.5]
Genetic variance explained by the QTL and by the remaining SNPs
| Trait (a) | Number of QTL | V(G1)/Vp (%) (b) | V(G2)/Vp (%) (c) |
|---|---|---|---|
| Fat1 | 16 | 22.77 | 62.59 |
| Fat2 | 23 | 25.12 | 60.01 |
| Prot1 | 21 | 10.85 | 74.05 |
| Prot2 | 33 | 15.34 | 68.89 |
| Milk1 | 20 | 18.85 | 66.67 |
| Milk2 | 26 | 21.29 | 63.97 |
(a) Fat, Prot and Milk denote fat yield, protein yield and milk yield, respectively; suffix 1 indicates that the lead SNP list included only the lead SNPs from the first round, and suffix 2 indicates that it included all lead SNPs found by our approach. (b) Percentage of genetic variance explained by the QTL. (c) Percentage of genetic variance explained by the SNPs other than the QTL.
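The two percentage columns in the table above can be combined to see how much of the SNP-explained variance is attributed to the QTL for each lead SNP list. The following is a minimal sketch of that arithmetic. It assumes that both columns share the same denominator Vp (taken here to be the phenotypic variance, which the source does not spell out) and can therefore be added; the ratio it computes is derived for illustration and is not a figure reported in the source. The function name `qtl_share` is hypothetical.

```python
# Values copied from the table: trait -> (number of QTL, V(G1)/Vp %, V(G2)/Vp %)
VARIANCE_TABLE = {
    "Fat1": (16, 22.77, 62.59),
    "Fat2": (23, 25.12, 60.01),
    "Prot1": (21, 10.85, 74.05),
    "Prot2": (33, 15.34, 68.89),
    "Milk1": (20, 18.85, 66.67),
    "Milk2": (26, 21.29, 63.97),
}


def qtl_share(vg1_pct: float, vg2_pct: float) -> float:
    """Percentage of the SNP-explained variance attributed to the QTL.

    Assumption: both inputs are percentages of the same Vp, so their sum
    is the total share explained by all SNPs together.
    """
    total = vg1_pct + vg2_pct
    if total <= 0:
        raise ValueError("combined variance share must be positive")
    return 100.0 * vg1_pct / total


if __name__ == "__main__":
    for trait, (n_qtl, vg1, vg2) in VARIANCE_TABLE.items():
        print(f"{trait}: {n_qtl} QTL, total {vg1 + vg2:.2f}%, "
              f"QTL share {qtl_share(vg1, vg2):.2f}%")
```

Comparing the suffix 1 and suffix 2 rows of each trait with this ratio shows how the share attributed to QTL shifts when lead SNPs from later rounds are added to the list.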
Fig. 2 Manhattan plot for association of SNPs with protein yield in Nordic Holstein cattle. The red horizontal line indicates the genome-wide significance level [−log10(P) = 8.5]
Fig. 3 Manhattan plot for association of SNPs with milk yield in Nordic Holstein cattle. The red horizontal line indicates the genome-wide significance level [−log10(P) = 8.5]
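The significance level in the Manhattan plot captions is given on the −log10 scale; a value of 8.5 corresponds to a raw P value of roughly 3.2 × 10⁻⁹. The snippet below is a minimal sketch of that conversion and of checking a lead SNP score against the threshold. The function names, and the choice to count a score exactly at the threshold as significant, are assumptions made for illustration rather than details from the source; the two example scores are taken from the fat yield table.

```python
import math

# -log10(P) level drawn as the red horizontal line in the Manhattan plots
GENOME_WIDE_THRESHOLD = 8.5


def neg_log10(p_value: float) -> float:
    """Convert a raw P value to the -log10 scale used on the plot's y-axis."""
    return -math.log10(p_value)


def p_from_neg_log10(score: float) -> float:
    """Convert a -log10(P) score back to a raw P value."""
    return 10.0 ** (-score)


def is_significant(score: float, threshold: float = GENOME_WIDE_THRESHOLD) -> bool:
    # Assumption: a score exactly equal to the threshold counts as significant.
    return score >= threshold


if __name__ == "__main__":
    print(f"threshold as raw P value: {p_from_neg_log10(GENOME_WIDE_THRESHOLD):.2e}")
    for score in (62.41, 8.89):  # lead SNP scores from the fat yield table
        print(score, is_significant(score))
```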
Genes related to “abnormal milk composition” phenotype in the mammalian phenotype database [24] overlapped with milk QTL identified in the present study
| Gene name | Location | Phenotype |
|---|---|---|
| | BTA6: 87,141,556–87,159,096 | abnormal milk composition |
| | BTA6: 87,179,502–87,188,025 | abnormal milk composition |
| | BTA6: 87,378,398–87,392,750 | abnormal milk composition |
| | BTA14: 1,795,351–1,804,562 | abnormal milk composition |
Genes related to “abnormal mammary gland development” in the mammalian phenotype database [24] that overlapped with milk QTL identified in the present study
| Gene name | Location | Phenotype |
|---|---|---|
| | BTA15: 65,779,325–65,815,261 | decreased mammary gland tumor incidence |
| | BTA15: 65,824,442–65,854,386 | abnormal mammary gland development |
| | BTA14: 67,677,676–67,987,801 | increased mammary gland tumor incidence |
| | BTA14: 1,795,351–1,804,562 | abnormal mammary gland development |
| | BTA26: 20,966,010–21,008,277 | abnormal mammary gland growth during pregnancy |
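Deciding whether a gene location overlaps a QTL region comes down to an interval comparison on the same chromosome. The sketch below illustrates one way to do this with the location strings used in the gene tables. It is an illustration only: the source does not describe the procedure the authors used, and the names `Interval`, `parse_location` and `overlaps` are hypothetical. The example region is the BTA14 region listed for the fat yield lead SNPs.

```python
import re
from typing import NamedTuple


class Interval(NamedTuple):
    chrom: str
    start: int
    end: int


def parse_location(text: str) -> Interval:
    """Parse a string such as 'BTA14: 1,795,351–1,804,562' into an Interval."""
    chrom, span = text.split(":")
    # Accept either an en dash or a plain hyphen between the coordinates.
    start_s, end_s = re.split(r"[–-]", span.strip())
    return Interval(chrom.strip(),
                    int(start_s.replace(",", "")),
                    int(end_s.replace(",", "")))


def overlaps(a: Interval, b: Interval) -> bool:
    """True if both closed intervals lie on the same chromosome and share a base."""
    return a.chrom == b.chrom and a.start <= b.end and b.start <= a.end


if __name__ == "__main__":
    gene = parse_location("BTA14: 1,795,351–1,804,562")
    qtl_region = Interval("BTA14", 1549133, 2049435)  # fat yield region on BTA14
    print(gene, overlaps(gene, qtl_region))
```

Treating both intervals as closed means that a gene ending exactly at the first base of a region still counts as overlapping; a half-open convention would be an equally defensible choice.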
Fig. 4 VEP annotation of SNPs in linkage disequilibrium (LD > 0.20) with the lead SNPs. a Summary of all annotations. b Summary of annotations that change the protein-coding sequence