| Literature DB >> 34276758 |
Huatao Liu1, Hailiang Song1, Yifan Jiang1, Yao Jiang1, Fengxia Zhang1, Yibing Liu1, Yong Shi1, Xiangdong Ding1, Chuduan Wang1.
Abstract
The body shape of a pig is the most direct production index, which can fully reflect the pig's growth status and is closely related to important economic traits. In this study, a genome-wide association study on seven body size traits, the body length (BL), height (BH), chest circumference (CC), abdominal circumference (AC), cannon bone circumference (CBC), rump width (RW), and chest width (CW), were conducted in Yorkshire pigs. Illumina Porcine 80K SNP chips were used to genotype 589 of 5,572 Yorkshire pigs with body size records, and then the chip data was imputed to sequencing data. After quality control of imputed sequencing data, 784,267 SNPs were obtained, and the averaged linkage disequilibrium (r 2) was 0.191. We used the single-trait model and the two-trait model to conduct single-step genome wide association study (ssGWAS) on seven body size traits; a total of 198 significant SNPS were finally identified according to the P-value and the contribution to the genetic variance of individual SNP. 11 candidate genes (CDH13, SIL1, CDC14A, TMRPSS15, TRAPPC9, CTNND2, KDM6B, CHD3, MUC13, MAPK4, and HMGA1) were found to be associated with body size traits in pigs; KDM6B and CHD3 jointly affect AC and CC, and MUC13 jointly affect RW and CW. These genes are involved in the regulation of bone growth and development as well as the absorption of nutrients and are associated with obesity. HMGA1 is proposed as a strong candidate gene for body size traits because of its important function and high consistency with other studies regarding the regulation of body size traits. Our results could provide valuable information for pig breeding based on molecular breeding.Entities:
Keywords: SNP effect; body size traits; pigs; ssGWAS; two-trait model
Year: 2021 PMID: 34276758 PMCID: PMC8283822 DOI: 10.3389/fgene.2021.629049
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Descriptive statistics for body weight and seven body size traits.
| Trait1 | N-obs2 | Mean | S.D. | CV(%) | Min value | Max value |
| BL(cm) | 5573 | 108.89 | 6.18 | 5.67 | 88 | 134 |
| BH(cm) | 5573 | 62.87 | 2.92 | 4.64 | 51 | 75 |
| CC(cm) | 5573 | 104.58 | 5.75 | 5.50 | 85 | 126 |
| AC(cm) | 4898 | 113.52 | 6.31 | 5.56 | 94 | 137 |
| CW(cm) | 5572 | 29.75 | 2.31 | 7.76 | 19 | 38 |
| RW(cm) | 5573 | 31.64 | 2.13 | 6.73 | 22 | 40 |
| CBC(cm) | 5573 | 17.98 | 1.03 | 5.73 | 13 | 23 |
| BW(kg) | 5573 | 101.31 | 12.59 | 12.43 | 61 | 150 |
Genetic correlations between seven body size traits.
| Trait1 | BL | BH | CC | AC | CW | RW | CBC |
| BL | −0.010(0.088) | 0.033(0.092) | −0.014(0.092) | 0.014(0.078) | −0.286(0.078) | 0.206(0.078) | |
| BH | 0.171(0.104) | 0.071(0.106) | −0.221(0.091) | −0.217(0.090) | −0.105(0.096) | ||
| CC | 0.255(0.093) | 0.127(0.095) | 0.197(0.096) | ||||
| AC | 0.153(0.096) | 0.204(0.095) | 0.202(0.096) | ||||
| CW | 0.015(0.086) | ||||||
| RW | −0.032(0.085) | ||||||
| CBC |
Significant SNPs and associated genes for seven body size traits.
| Trait1 | Chromosome | Position (bp) | SNP effect (%) | Gene | Distance | Gene function | |
| BL | 6 | 5671575 | 2.35E−07 | 0.00186 | +13217 | cadherin 13 | |
| 1 | 6435744 | 1.5E−06 | 0.00017 | NA | NA | ||
| 1 | 6472959 | 1.5E−06 | 0.00080 | PRKN | +38323 | parkin RBR E3 ubiquitin protein ligase | |
| 17 | 33632497 | 2.62E−06 | 0.00289 | ENSSSCG00000028461 | −47405 | signal regulatory protein alpha | |
| 13 | 25520933 | 4.45E−06 | 0.00004 | ULK4 | −8396 | unc-51 like kinase 4 | |
| 16 | 1276330 | 4.57E−06 | 0.00031 | NA | NA | ||
| 14 | 137476010 | 6.47E−06 | 0.00054 | NA | NA | ||
| 8 | 28933773 | 7.46E−06 | 0.00144 | NWD2 | −23316 | NACHT and WD repeat domain containing 2 | |
| 13 | 166328893 | 8.39E−06 | 0.00039 | NA | NA | ||
| BH | 16 | 886074 | 2.84E−06 | 0.00817 | CTNND2 | +28239 | alpha-2-macroglobulin like 1 |
| 8 | 7942460 | 3.01E−06 | 0.00083 | NA | NA | ||
| 3 | 26586077 | 4.62E−06 | 0.00117 | CLEC19A | −45911 | C-type lectin domain containing 19A | |
| 5 | 62690928 | 6.5E−06 | 0.00004 | A2ML1 | −42827 | alpha-2-macroglobulin like 1 | |
| 4 | 128701315 | 7.54E−06 | 0.00152 | NA | NA | ||
| 14 | 33580513 | 9.85E−06 | 0.00060 | HSPB8 | +45615 | heat shock protein family B (small) member 8 | |
| CBC | 4 | 117759672 | 2.16E−07 | 0.00279 | −34935 | cell division cycle 14A | |
| 13 | 182971424 | 1.83E−06 | 0.00420 | −29625 | transmembrane serine protease 15 | ||
| 17 | 12868538 | 1.85E−06 | 0.00635 | PSD3 | −43049 | pleckstrin and Sec7 domain containing 3 | |
| 1 | 1201299 | 2.3E−06 | 0.00025 | ENSSSCG00000041157 | −47914 | NA | |
| 1 | 1205821 | 2.3E−06 | 0.00018 | ENSSSCG00000050693 | −42855 | NA | |
| 1 | 1220233 | 2.3E−06 | 0.00039 | ENSSSCG00000045916 | −18409 | NA | |
| 1 | 1367723 | 2.3E−06 | 0.00064 | ENSSSCG00000043714 | +5537 | NA | |
| 18 | 21663467 | 0.000003 | 0.00400 | GRM8 | −14659 | glutamate metabotropic receptor 8 | |
| 14 | 9698552 | 3.19E−06 | 0.00026 | ENSSSCG00000049499 | 9436 | NA | |
| 5 | 7020488 | 3.46E−06 | 0.00023 | PMM1 | −49963 | phosphomannomutase 1 | |
| 12 | 50490164 | 4.08E−06 | 0.00233 | SPNS3 | −47230 | sphingolipid transporter 3 (putative) | |
| 3 | 12869355 | 4.1E−06 | 0.00259 | ENSSSCG00000036217 | +18272 | NA | |
| 4 | 10221008 | 5.38E−06 | 0.00076 | ASAP1 | −37722 | ArfGAP with SH3 domain, ankyrin repeat and PH domain 1 | |
| 2 | 124456560 | 5.76E−06 | 0.00036 | PRR16 | +6055 | proline rich 16 | |
| 1 | 13806583 | 7.02E−06 | 0.00142 | ENSSSCG00000004081 | −2527 | NA | |
| AC | 8 | 3249196 | 1.88E−06 | 0.00134 | AFAP1 | −46660 | actin filament associated protein 1 |
| 9 | 14578071 | 2.31E−06 | 0.00128 | NA | NA | ||
| 14 | 13670622 | 2.71E−06 | 0.00048 | PRSS55 | −4581 | serine protease 55 | |
| 5 | 15137502 | 2.96E−06 | 0.00513 | RHEBL1 | −39837 | RHEB like 1 | |
| 4 | 5362087 | 4.48E−06 | 0.00139 | ENSSSCG00000044937 | +36176 | NA | |
| 7 | 26363076 | 5.4E−06 | 0.00093 | NA | NA | ||
| 14 | 43227411 | 5.77E−06 | 0.00024 | ENSSSCG00000033385 | −49062 | KIAA1671 ortholog | |
| 16 | 522752 | 6.96E−06 | 0.00014 | −3796 | catenin delta 2 | ||
| CC | 3 | 63528527 | 1.32E−07 | 0.00015 | ENSSSCG00000008250 | −41861 | catenin alpha 2 |
| 6 | 19429624 | 3.27E−07 | 0.00022 | Metazoa_SRP | −49801 | Metazoan signal recognition particle RNA | |
| 1 | 3149903 | 7.78E−07 | 0.00047 | PDE10A | −28771 | phosphodiesterase 10A A | |
| 6 | 120477523 | 1.95E−06 | 0.00173 | FHOD3 | −40874 | formin homology 2 domain containing 3 | |
| 10 | 56219300 | 2.01E−06 | 0.00203 | ITGB1 | −46293 | integrin subunit beta 1 | |
| 17 | 18990746 | 2.63E−06 | 0.00004 | ANKEF1 | −32351 | ankyrin repeat and EF-hand domain containing 1 | |
| 17 | 18997949 | 2.63E−06 | 0.00008 | ANKEF1 | −37701 | ankyrin repeat and EF-hand domain containing 1 | |
| 2 | 122228151 | 3.03E−06 | 0.00105 | ENSSSCG00000051343 | −14167 | NA | |
| 2 | 122235537 | 3.03E−06 | 0.00074 | ENSSSCG00000051343 | −21553 | NA | |
| 16 | 33630686 | 3.58E−06 | 0.00018 | NA | NA | ||
| 16 | 33638300 | 3.58E−06 | 0.00079 | NA | NA | ||
| 16 | 5533970 | 4.64E−06 | 0.00394 | ENSSSCG00000016791 | +16579 | NA | |
| 12 | 5297390 | 5.41E−06 | 0.00092 | RNF157 | −48707 | ring finger protein 157 | |
| 4 | 64552365 | 5.7E−06 | 0.00666 | ENSSSCG00000042029 | −24706 | NA | |
| 10 | 43341283 | 7.22E−06 | 0.00064 | CUBN | −39687 | cubilin | |
| 8 | 21799389 | 8.84E−06 | 0.00030 | ENSSSCG00000050984 | −18261 | NA | |
| 10 | 60737384 | 9.42E−06 | 0.00449 | ENSSSCG00000011121 | −24200 | CUGBP Elav-like family member 2 | |
| RW | 8 | 137165913 | 5.64E−07 | 0.00049 | NA | NA | |
| 9 | 2330339 | 2.74E−06 | 0.00381 | SYT9 | −11533 | synaptotagmin 9 | |
| 1 | 38033383 | 4.19E−06 | 0.00149 | NKAIN2 | −12912 | sodium/potassium transporting ATPase interacting 2 | |
| 3 | 63682227 | 5.03E−06 | 0.00010 | NA | NA | ||
| 11 | 32555905 | 6.78E−06 | 0.00015 | DIAPH3 | +46764 | diaphanous related formin 3 | |
| 16 | 48600234 | 7.1E−06 | 0.00118 | ENSSSCG00000046085 | −23005 | NA | |
| 16 | 48696355 | 7.1E−06 | 0.00155 | ENSSSCG00000039883 | +49947 | NA | |
| 1 | 100210738 | 7.97E−06 | 0.00095 | +8278 | mitogen-activated protein kinase 4 | ||
| 1 | 100335688 | 7.97E−06 | 0.00017 | −49706 | mitogen-activated protein kinase 4 | ||
| CW | 8 | 132277288 | 8.17E−07 | 0.00009 | PTPN13 | −27877 | protein tyrosine phosphatase non-receptor type 13 |
| MAPK10 | −27281 | mitogen-activated protein kinase 10 | |||||
| 7 | 115471416 | 9.52E−07 | 0.00653 | PPP4R4 | −18233 | protein phosphatase 4 regulatory subunit 4 | |
| 14 | 37118119 | 1.09E−06 | 0.00289 | ENSSSCG00000051786 | −2275 | NA | |
| 14 | 37165658 | 1.09E−06 | 0.00051 | ENSSSCG00000051786 | −49874 | NA | |
| 14 | 37230969 | 1.09E−06 | 0.00039 | ENSSSCG00000051786 | −9755 | NA | |
| 2 | 80016213 | 2.07E−06 | 0.00192 | COL23A1 | −46865 | collagen type XXIII alpha 1 chain | |
| 14 | 139878474 | 2.34E−06 | 0.00437 | TCERG1L | −46513 | transcription elongation regulator 1 like | |
| 12 | 49725382 | 2.67E−06 | 0.00112 | TRPV1 | −33424 | transient receptor potential cation channel subfamily V member 1 | |
| 16 | 35012960 | 3.46E−06 | 0.00029 | DDX4 | −46102 | DEAD-box helicase 4 | |
| 7 | 115132809 | 4.65E−06 | 0.00069 | ENSSSCG00000002464 | −31787 | proline rich membrane anchor 1 | |
| 16 | 73800572 | 4.82E−06 | 0.00153 | U6 | +41611 | U6 spliceosomal RNA | |
| 16 | 73812833 | 4.82E−06 | 0.00172 | U6 | +29350 | U6 spliceosomal RNA | |
| 16 | 73816240 | 4.82E−06 | 0.00175 | U6 | +25343 | U6 spliceosomal RNA | |
| 9 | 107845695 | 5.74E−06 | 0.00014 | ENSSSCG00000032905 | −7136 | NA | |
| 8 | 76456715 | 6.42E−06 | 0.00057 | ENSSSCG00000042273 | −45304 | NA | |
| 3 | 131731345 | 6.59E−06 | 0.00001 | ENSSSCG00000049751 | −23407 | NA | |
| 3 | 131738702 | 6.59E−06 | 0.00016 | ENSSSCG00000049751 | −30764 | NA | |
| 3 | 131744661 | 6.59E−06 | 0.00016 | ENSSSCG00000049751 | −36723 | NA | |
| 3 | 131756951 | 6.59E−06 | 0.00025 | ENSSSCG00000049751 | −43896 | NA | |
| 3 | 131758601 | 6.59E−06 | 0.00024 | ENSSSCG00000049751 | −45546 | NA | |
| 5 | 61521505 | 7.76E-06 | 0.00033 | ENSSSCG00000033403 | −14845 | C-type lectin domain family 7 member A-like | |
| 16 | 67601687 | 8.2E-06 | 0.00052 | ENSSSCG00000049229 | −42425 | NA | |
| 7 | 92897102 | 4.65E−06 | 0.00109 | +25885 | high mobility group AT-hook 1 | ||
| 15 | 11796106 | 9.96E−06 | 0.00074 | NA | NA |
FIGURE 1Manhattan plot of the genome-wide association study on three body size traits by using single-trait model ssGWAS. BL, Body length; BH, Body height; CBC, Cannon bone circumference. In the Manhattan plots, negative log10 P-values of the quantified SNPs were plotted against their genomic positions. The x-axis represents the chromosomes and the y-axis represents the observed −log10(P-value). Different colors indicate various chromosomes. Each trait has a significant threshold of FDR adjusted, for (A) BL, it was 9.26 × 10–6. Similarly, (B) BH was 1.08 × 10–5, and (C) CBC was 1.02 × 10–5.
FIGURE 2Manhattan plot of the genome-wide association study on four body size traits by using two-trait model ssGWAS. AC, Abdominal circumference; CC, Chest circumference; RW, Rump width; CW, Chest width. AC and CC are a pair of traits, RW and CW are a pair of traits. In the Manhattan plots, negative log10 P-values of the quantified SNPs were plotted against their genomic positions. The x-axis represents the chromosomes and the y-axis represents the observed –log10(P-value). Different colors indicate various chromosomes. Each trait has a significant threshold of FDR adjusted, for (A) AC, it was 9.74 × 10–6. Similarly, (B) CC was 1.05 × 10–5, (C) RW was 9.60 × 10–6, and (D) CW was 1.01 × 10–5.
FIGURE 3Manhattan plot of the genome-wide association study on seven body size traits and Venn plot of SNPs according to the contribution of SNP to genetic variance by using ssGWAS. BL, Body length; BH, Body height; CBC, Cannon bone circumference; AC, Abdominal circumference; CC, Chest circumference; RW, Rump width; CW, Chest width. BL, BH, and CBC were single-trait models, AC, CC, RW, and CW were two-trait models. AC and CC are a pair of traits, RW and CW are a pair of traits. In the Manhattan plots (A–G), the proportion of genetic variance of the quantified SNPs were plotted against their genomic positions. The x-axis represents the chromosomes and the y-axis represents the percentage of SNP explaining the genetic variance. Different colors indicate different chromosomes. Venn plot (H) of SNPs for the two pairs of body size traits, AC and CC, RW, and CW are a pair of traits, respectively.
Overview of ssGWAS location for the percentage that explains the proportion of genetic variance.
| Trait1 | 20 SNPs distributions of maximum effect | SNPs of the maximum effect | Top 20 SNPs effect (%) | Number of nearest gene | Candidate gene |
| BL | SSC17 | 17_7477978 | 0.117 | 4 | |
| BH | SSC2, SSC5, SSC16 | 2_46827557 | 0.08.7 | 8 | SIL1 |
| CBC | SSC7, SSC4 | 7_55099416 | 0.101 | 17 | TRAPPC9 |
| AC | SSC12 | 12_53181656 | 0.128. | 8 | KDM6B CHD3 |
| CC | SSC12 | 12_53169477 | 0.129 | 8 | KDM6B CHD3 |
| RW | SSC6, SSC7, SSC12, SSC13, SSC17 | 17_13172524 | 0.099 | 15 | MUC13 |
| CW | SSC6, SSC7, SSC13, SSC17 | 6_39554872 | 0.070 | 28 | MUC13 |