| Literature DB >> 28500056 |
Jordi Leno-Colorado1, Nick J Hudson2, Antonio Reverter3, Miguel Pérez-Enciso4,5.
Abstract
Ascertaining the molecular and physiological basis of domestication and breeding is an active area of research. Due to the current wide distribution of its wild ancestor, the wild boar, the pig (Sus scrofa) is an excellent model to study these processes, which occurred independently in East Asia and Europe ca. 9000 yr ago. Analyzing genome variability patterns in terms of metabolic pathways is attractive since it considers the impact of interrelated functions of genes, in contrast to genome-wide scans that treat genes or genome windows in isolation. To that end, we studied 40 wild boars and 123 domestic pig genomes from Asia and Europe when metabolic pathway was the unit of analysis. We computed statistical significance for differentiation (Fst) and linkage disequilibrium (nSL) statistics at the pathway level. In terms of Fst, we found 21 and 12 pathways significantly differentiated at a q-value < 0.05 in Asia and Europe, respectively; five were shared across continents. In Asia, we found six significant pathways related to behavior, which involved essential neurotransmitters like dopamine and serotonin. Several significant pathways were interrelated and shared a variable percentage of genes. There were 12 genes present in >10 significant pathways (in terms of Fst), comprising genes involved in the transduction of a large number of signals, like phospholipase PCLB1, which is expressed in the brain, or ITPR3, which has an important role in taste transduction. In terms of nSL, significant pathways were mainly related to reproductive performance (ovarian steroidogenesis), a similarly important target trait during domestication and modern animal breeding. Different levels of recombination cannot explain these results, since we found no correlation between Fst and recombination rate. However, we did find an increased ratio of deleterious mutations in domestic vs. wild populations, suggesting a relaxed functional constraint associated with the domestication and breeding processes. Purifying selection was, nevertheless, stronger in significantly differentiated pathways than in random pathways, mainly in Europe. We conclude that pathway analysis facilitates the biological interpretation of genome-wide studies. Notably, in the case of pig, behavior played an important role, among other physiological and developmental processes.Entities:
Keywords: behavior; domestication; genome sequence data; pathway analysis; pig
Mesh:
Year: 2017 PMID: 28500056 PMCID: PMC5499126 DOI: 10.1534/g3.117.042671
Source DB: PubMed Journal: G3 (Bethesda) ISSN: 2160-1836 Impact factor: 3.154
Figure 1(A) Heatmap of the European individuals using the molecular relationship matrix, computed using all available autosomal single nucleotide polymorphisms (SNPs). (B) Heatmap of the Asian pigs. In Europe, breed codes are DU, Duroc; IB, Iberian; LR, Landrace; LW, Large White; MG, Mangalitza; PI, Pietrain; and YU, Yucatan minipig. In Asia, breed codes are BX, Bamaxiang; HT, Hetao; LA, Laiwu; LU, Luchuan; MI, Minzhu; MS, Meishan; ST, Sichuan; TT, Tibet; WU, Wuzhishan; and YT, Yunnan. Colors are used to differentiate among the populations: ASDM (Asian domestic, blue), ASWB (Asian wild boar, purple), EUDM (European domestic, green), and EUWB (European wild boar, dark red).
Significant pathways (q-value < 0.05) obtained in the Fst analysis in Asia and/or Europe
| Biological Process | Pathway Name | NCBI ID | Number of Genes | ||||
|---|---|---|---|---|---|---|---|
| Behavior | Opioid signaling | 1337511 | 46 | 0.0005 | 0.010 | 0.1300 | 0.686 |
| Behavior | Glutamatergic synapse | 213816 | 75 | 0.0005 | 0.010 | 0.0005 | 0.018 |
| Behavior | Dopaminergic synapse | 469195 | 88 | 0.0005 | 0.010 | 0.0460 | 0.495 |
| Behavior | Serotonergic synapse | 525344 | 75 | 0.0005 | 0.010 | 0.1710 | 0.734 |
| Behavior | Long-term depression | 84497 | 41 | 0.0005 | 0.010 | 0.0230 | 0.442 |
| Behavior | Adrenergic signaling in cardiomyocytes | 908279 | 104 | 0.0005 | 0.010 | 0.0440 | 0.495 |
| Biological regulation | Renin secretion | 1223594 | 42 | 0.0140 | 0.245 | 0.0005 | 0.018 |
| Biological regulation | Phosphatidylinositol signaling system | 84464 | 70 | 0.0470 | 0.438 | 0.0005 | 0.018 |
| Cell communication | Cell–cell communication | 1336387 | 61 | 0.1820 | 0.570 | 0.0005 | 0.018 |
| Cell communication | Integrin cell surface interactions | 1337048 | 51 | 0.0005 | 0.010 | 0.0005 | 0.018 |
| Cellular process | Assembly of the primary cilium | 1336230 | 123 | 0.0005 | 0.010 | 0.0390 | 0.479 |
| Cellular process | Hippo signaling pathway | 749791 | 30 | 0.0005 | 0.010 | 0.1590 | 0.725 |
| Cellular process | Wnt signaling pathway | 84473 | 89 | 0.0005 | 0.010 | 0.2010 | 0.753 |
| Cellular process | Axon guidance | 84477 | 92 | 0.0005 | 0.010 | 0.0330 | 0.456 |
| DNA repair | DNA double-strand break repair | 1336358 | 106 | 0.0005 | 0.010 | 0.1100 | 0.685 |
| DNA repair | Nonhomologous end-joining | 92864 | 12 | 0.0005 | 0.010 | 0.0260 | 0.442 |
| Immune response | Complement cascade | 1336947 | 23 | 0.1350 | 0.535 | 0.0005 | 0.018 |
| Immune response | Fc-γ receptor (FCGR) dependent phagocytosis | 1336978 | 36 | 0.0005 | 0.010 | 0.0370 | 0.467 |
| Immune response | Chagas disease (American trypanosomiasis) | 147807 | 83 | 0.0005 | 0.010 | 0.5210 | 0.898 |
| Metabolic process | Glycosaminoglycan metabolism | 1336589 | 81 | 0.0005 | 0.010 | 0.2230 | 0.757 |
| Metabolic process | Phospholipase D signaling pathway | 1311111 | 104 | 0.1600 | 0.542 | 0.0005 | 0.018 |
| Metabolic process | G alpha (s) signaling events | 1337504 | 88 | 0.0005 | 0.010 | 0.2840 | 0.810 |
| Metabolic process | Pancreatic secretion | 169304 | 59 | 0.0005 | 0.010 | 0.0005 | 0.018 |
| Metabolic process | Insulin secretion | 777548 | 60 | 0.0005 | 0.010 | 0.0005 | 0.018 |
| Muscle contraction | Muscle contraction | 1337146 | 117 | 0.5250 | 0.856 | 0.0005 | 0.018 |
| Muscle contraction | Vascular smooth muscle contraction | 96246 | 85 | 0.0440 | 0.430 | 0.0005 | 0.018 |
| Regulation of transcription | Nuclear signaling by ERBB4 | 1337437 | 23 | 0.0005 | 0.010 | 0.1310 | 0.686 |
| Transport | ABC transporters | 84452 | 27 | 0.0005 | 0.010 | 0.0005 | 0.018 |
NCBI ID, National Center for Biotechnology Information database identifier.
P-value obtained from permutations.
Figure 2Gene P-values (−log10) of significant genes at the 1% nominal level in Europe (red bars), in Asia (blue bars,) or both continents (black bars) from the significant differentiated (Fst) pathways involved in behavior. When a gene was significant in both continents, the smallest P-value is plotted. Gene symbols are provided when available; otherwise numbers indicate ensembl ENSSSCG id.
Significant pathways (q-value < 0.05) obtained in the nSL analysis for the four populations, according to continent Europe (EU)/Asia (AS) and domestic (DM)/wild (WB) status
| Biological Process | Pathway Name | NCBI ID | Number of Genes | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| Behavior/reproduction | Ovarian steroidogenesis | 791446 | 30 | 5e−04 | 0.032 | 0.012 | 1 | 5e−04 | 0.037 | 0.524 | 1 |
| Biological regulation | Biological oxidations | 1336802 | 120 | 5e−04 | 0.032 | 0.148 | 1 | 5e−04 | 0.037 | 1 | 1 |
| Hormone synthesis | Steroid hormone biosynthesis | 84375 | 32 | 5e−04 | 0.032 | 0.029 | 1 | 5e−04 | 0.037 | 0.457 | 1 |
| Immune response | Inflammatory mediator regulation of TRP channels | 948291 | 73 | 5e−04 | 0.032 | 0.259 | 1 | 0.120 | 1.000 | 1 | 1 |
| Lipid metabolic process | Arachidonic acid metabolism | 1336691 | 39 | 5e−04 | 0.032 | 5e−04 | 0.110 | 5e−04 | 0.037 | 0.998 | 1 |
| Lipid metabolic process | Arachidonic acid metabolism | 84417 | 47 | 5e−04 | 0.032 | 0.041 | 1 | 5e−04 | 0.037 | 0.977 | 1 |
| Lipid metabolic process | Linoleic acid metabolism | 84418 | 24 | 5e−04 | 0.032 | 5e−04 | 0.110 | 5e−04 | 0.037 | 0.755 | 1 |
NCBI ID, National Center for Biotechnology Information database identifier; ASDM, Asian domestic; ASWB, Asian wild boar; EUDM, European domestic; EUWB, European wild boar.
P-value obtained from permutations.
Figure 3Significant genes at the 1% nominal level either in Europe, Asia or both, present in the significant pathways obtained from the nSL (linkage disequilibrium) analysis. Gene symbols are provided when available; otherwise numbers indicate ensembl ENSSSCG id.
Figure 4Coassociation network among the 31 interconnected significant pathways. Each node represents a pathway that is connected by an edge if partial correlation with another pathway is significant and >0.8 (in absolute value). Node size is proportional to number of genes in the pathways. Node shapes represent pathway source: triangles for REACTOME and circles for KEGG. Colors indicate the population with lowest Fst (statistical significance for differentiation) P-value: pink for Asia, blue for Europe, and green for equal significance. Node line width to pathway variability: thin and thick lines for pathways with variability below and above average, respectively. Black and red edges represent positive and negative correlations between pathways, respectively. The three main pathway clusters are identified with letters A–C.
Genes present in 10 or more significant pathways (Fst)
| Gene Symbol | Ensembl Gene ID | Pathways | Significant Pathways | Genomic Position | ||
|---|---|---|---|---|---|---|
| PLCB3 | ENSSSCG00000013034 | 35 | 15 | 0.128 | 0.741 | 2: 6911684–6927124 |
| PLCB1 | ENSSSCG00000007056 | 35 | 15 | 1e−16 | 5e−04 | 17: 19691509–19860912 |
| PRKCA | ENSSSCG00000017268 | 46 | 13 | 0.248 | 0.887 | 12: 13502757–13602445 |
| PRKACA | ENSSSCG00000013771 | 47 | 13 | 0.830 | 0.583 | 2: 65350514–65371241 |
| PLA2G4A | ENSSSCG00000023351 | 21 | 11 | 0.388 | 2e−04 | 9: 140460880–140623439 |
| PRKCG | ENSSSCG00000003256 | 36 | 11 | 0.985 | 0.157 | 6: 52982004–53001741 |
| ITPR3 | ENSSSCG00000001518 | 27 | 11 | 0.938 | 9e−04 | 7: 34443056–34510838 |
| ENSSSCG00000000175 | ENSSSCG00000000175 | 35 | 11 | 0.579 | 0.385 | 5: 15129052–15145294 |
| ENSSSCG00000023437 | ENSSSCG00000023437 | 25 | 10 | 0.435 | 0.003 | 13: 67554829–67604679 |
| ADCY2 | ENSSSCG00000017101 | 32 | 10 | 0.024 | 0.249 | 16: 80358753–80624210 |
| MAPK1 | ENSSSCG00000010081 | 74 | 10 | 0.696 | 0.633 | 14: 53590167–53614842 |
| ADCY3 | ENSSSCG00000008578 | 32 | 10 | 0.999 | 0.994 | 3: 121107128–121201171 |
| ENSSSCG00000007833 | ENSSSCG00000007833 | 39 | 10 | 7e−06 | 0.003 | 3: 23052200–23174810 |
| ADCY4 | ENSSSCG00000001988 | 32 | 10 | 0.999 | 0.463 | 7: 80227590–80243075 |
ID, identifier; Fst, statistical significance for differentiation.
Deleterious and tolerated SNPs grouped according to Europe (EU)/Asia (AS) continent and domestic (DM)/wild (WB) status
| Continent | Population | SNP Type | Nonsignificant Genes of Nonsignificant Pathways | Significant Genes of Nonsignificant Pathways | Nonsignificant Genes of Significant Pathways | Significant Genes of Significant Pathways | |
|---|---|---|---|---|---|---|---|
| Asia | ASDM | Tolerated | 8534 | 1,195 | 2,625 | 1,176 | 0.756 |
| Deleterious | 6,825 | 954 | 2,144 | 927 | |||
| λASDM | 0.800 | 0.798 | 0.817 | 0.788 | |||
| ASWB | Tolerated | 6,245 | 843 | 2,029 | 636 | 0.033 | |
| Deleterious | 4,432 | 587 | 1,320 | 391 | |||
| λASWB | 0.710 | 0.696 | 0.651 | 0.615 | |||
| ASDM/ASWB | λASDM/λASWB | 1.127 | 1.147 | 1.255 | 1.282 | ||
| Europe | EUDM | Tolerated | 5,191 | 716 | 1,675 | 620 | 0.023 |
| Deleterious | 3,429 | 483 | 1,200 | 349 | |||
| λEUDM | 0.661 | 0.675 | 0.716 | 0.563 | |||
| EUWB | Tolerated | 2,654 | 416 | 893 | 346 | 0.001 | |
| Deleterious | 1,153 | 198 | 382 | 103 | |||
| λEUWB | 0.434 | 0.476 | 0.428 | 0.298 | |||
| EUDM/EUWB | λEUDM/λEUWB | 1.523 | 1.418 | 1.675 | 1.891 | ||
| Total | Total | Tolerated | 22,624 | 3,170 | 7,222 | 2,778 | 0.003 |
| Deleterious | 15,839 | 2,222 | 5,046 | 1,770 | |||
| λTotal | 0.700 | 0.701 | 0.699 | 0.637 |
SNP, single nucleotide polymorphism; ASDM, Asian domestic; ASWB, Asian wild boar; EUDM, European domestic; EUWB, European wild boar.
λ corresponds to the ratio of deleterious vs. tolerated SNPs.
P-value obtained from χ2 test of the 2 × 2 table containing nonoverlapping SNP sets (nonsignificant genes from nonsignificant pathways vs. significant genes from significant pathways).