| Literature DB >> 30335783 |
Pablo Augusto de Souza Fonseca1,2, Samir Id-Lahoucine1, Antonio Reverter3, Juan F Medrano4, Marina S Fortes5, Joaquim Casellas6, Filippo Miglior1,7, Luiz Brito1, Maria Raquel S Carvalho2, Flávio S Schenkel1, Loan T Nguyen5, Laercio R Porto-Neto3, Milton G Thomas8, Angela Cánovas1.
Abstract
The identification of biological processes related to the regulation of complex traits is a difficult task. Commonly, complex traits are regulated through a multitude of genes contributing each to a small part of the total genetic variance. Additionally, some loci can simultaneously regulate several complex traits, a phenomenon defined as pleiotropy. The lack of understanding on the biological processes responsible for the regulation of these traits results in the decrease of selection efficiency and the selection of undesirable hitchhiking effects. The identification of pleiotropic key-regulator genes can assist in developing important tools for investigating biological processes underlying complex traits. A multi-breed and multi-OMICs approach was applied to study the pleiotropic effects of key-regulator genes using three independent beef cattle populations evaluated for fertility traits. A pleiotropic map for 32 traits related to growth, feed efficiency, carcass and meat quality, and reproduction was used to identify genes shared among the different populations and breeds in pleiotropic regions. Furthermore, data-mining analyses were performed using the Cattle QTL database (CattleQTLdb) to identify the QTL category annotated in the regions around the genes shared among breeds. This approach allowed the identification of a main gene network (composed of 38 genes) shared among breeds. This gene network was significantly associated with thyroid activity, among other biological processes, and displayed a high regulatory potential. In addition, it was possible to identify genes with pleiotropic effects related to crucial biological processes that regulate economically relevant traits associated with fertility, production and health, such as MYC, PPARG, GSK3B, TG and IYD genes. These genes will be further investigated to better understand the biological processes involved in the expression of complex traits and assist in the identification of functional variants associated with undesirable phenotypes, such as decreased fertility, poor feed efficiency and negative energetic balance.Entities:
Mesh:
Substances:
Year: 2018 PMID: 30335783 PMCID: PMC6193631 DOI: 10.1371/journal.pone.0205295
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Fig 1Flowchart presenting the methodological pipeline performed to identify the potential key-regulatory genes for pleiotropic effect on fertility and production traits in beef cattle.
Fig 2Venn diagram displaying the comparison of genes among the different independent populations analysed.
In yellow, the genes identified in Brahman and Tropical Composite breeds (Hawken et al., 2012) [22]. In blue, the genes identified in Brahman breed (Nguyen et al., 2017) [20]. In red, the genes identified in Brangus breed (Cánovas et al., 2014) [13].
Top 10 enriched biological processes (BP) and enriched KEGG pathways identified by STRING database using the 89 genes shared among the reported results from Cánovas et al. (2014) [13], Hawken et al. (2012) [22] and Nguyen et al. (2017) [20] and mapped in regions with 5 or more categories of QTL.
| Top 10 enriched GO (Biological Function) | |||
|---|---|---|---|
| ID | Description | Bonferroni P-value | Implicated genes |
| GO:0048513 | Organ development | 8.67E-05 | ABI1, ABL1, ACTB, ACTL6B, ADAMTS18, ATF2, CAPN3, CCDC85C, CHD7, CUX1, DCHS1, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LIN7A, MYC, NTRK1, POU3F1, PPARG, PRKG1, RBM20, RP1, RSPO2, SPINT1, TBL1XR1, TG, ZNF148 |
| GO:0048731 | System development | 0.0002 | ABI1, ACTB, ACTL6B, ADAMTS18, ADCYAP1, ATF2, BARHL2, BMP7, CAPN3, CCDC85C, CHD7, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LGALS3, LIN7A, MYC, OPCML, PDLIM5, PPARG, PRICKLE1, PRKG1, RBM20, RP1, RSPO2, SCG2, SPINT1, TBL1XR1, TG, TRPC7, ZNF148 |
| GO:0007275 | Multicellular organismal development | 0.0003 | ABI1, ACTB, ACTL6B, ADAMTS18, ADCYAP1, ATF2, BARHL2, BMP7, CAPN3, CCDC85C, CHD7, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LGALS3, LIN7A, MYC, OPCML, PDLIM5, PPARG, PRICKLE1, PRKG1, RBM20, RP1, RSPO2, SCG2, SMAD2, SPINT1, TBL1XR1, TG, TLL2, TRPC7, ZNF148 |
| GO:0048856 | Anatomical structure development | 0.0003 | ABI1, ACTB, ACTL6B, ADAMTS18, ADCYAP1, ATF2, BARHL2, BMP7, CAPN3, CASR, CCDC85C, CHD7, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LIN7A, LMOD2, MYC, OPCML, PDLIM5, PPARG, PRICKLE1, PRKG1, RBM20, RP1, RSPO2, SCG2, SMAD2, SPINT1, TBL1XR1, TG, TRPC7, ZNF148 |
| GO:0044707 | Single-multicellular organism process | 0.0004 | ABI1, ACTB, ACTL6B, ADAMTS18, ADCYAP1, ATF2, BARHL2, BMP7, BRPF3, CAPN3, CASR, CCDC85C, CHD7, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LGALS3, LIN7A, LMOD2, MYC, NPAS3, OPCML, PDLIM5, PGAM2, PNKD, PPARG, PRICKLE1, RBM20, RP1, RSPO2, SCG2, SMAD2, SPINT1, SST, TBL1XR1, TG, TLL2, TRPC7, ZNF148 |
| GO:0009888 | Tissue development | 0.0007 | ABI1, ATF2, CHD7, CUX1, DCHS1, FKBP1A, FOLR1, GSK3B, HHEX, IGF1R, LGALS3, MYC, NTRK1, POU3F1, PPARG, PRICKLE1, RP1, RSPO2, SPINT1, TBL1XR1, TDGF1 |
| GO:0032502 | Developmental process | 0.001 | ABI1, ACTB, ACTL6B, ADAMTS18, ADCYAP1, ATF2, BARHL2, BMP7, CASR, CCDC85C, CHD7, EMX2, FKBP1A, FOLR1, FSHR, GSK3B, HHEX, IGF1R, LIN7A, LMOD2, MYC, NPAS3, OPCML, PDLIM5, PPARG, PRICKLE1, PRKG1, RBM20, RP1, RSPO2, SCG2, SMAD2, SPINT1, TBL1XR1, TG, TLL2, TRPC7, ZNF148 |
| GO:0060429 | Epithelium development | 0.001 | ABI1, CHD7, CUX1, DCHS1, FOLR1, HHEX, IGF1R, LGALS3, MYC, NTRK1, POU3F1, PPARG, PRICKLE1, RSPO2, SMAD2, SPINT1 |
| GO:0035239 | Tube morphogenesis | 0.003 | BMP7, CHD7, DCHS1, FOLR1, HHEX, MYC, PRICKLE1, RSPO2, SMAD2, SPINT1 |
| GO:0006468 | Receptor activity | 0.003 | ABI1, ABL1, ATF2, DAPK2, GSK3B, IGF1R, MYC, NTRK1, PRKG1, SCG2, SMAD2, STK32B, STK33, TDGF1 |
| KEGG: 5202 | Transcriptional misregulation in cancer | 0.00183 | HHEX, IGF1R, MYC, NTRK1, PPARG, SPINT1, SUPT3H |
| KEGG: 5200 | Pathways in cancer | 0.00943 | ABL1, DAPK2, GSK3B, IGF1R, MYC, NTRK1, PPARG, SMAD2 |
| KEGG: 5216 | Thyroid cancer | 0.0193 | MYC, NTRK1, PPARG |
| KEGG: 4390 | Hippo signaling pathway | 0.0286 | ACTB, BMP7, GSK3B, MYC, SMAD2 |
Fig 3Gene network displaying the connections between the markers shared among the three independent populations and mapped in pleiotropic regions.
The nodes represent individual genes. The coloured lines linking the nodes represents the interactions between the genes. The interactions between the genes can be divided in three types: 1) Known interaction: from curated databases (light blue) and experimentally determined (purple); 2) Predicted interactions: gene neighbourhood (green), gene fusions (red) and gene co-occurrence (dark blue); 3) Others: text mining (yellow), co-expression (black) and protein homology (violet). The interactions were based on human data, since the human database is more curate and complete than the bovine database.
Fig 4Proportion of each QTL category (reproduction, production, milk, meat and carcass quality, exterior appearance, and health) mapped in a 2 Mb interval (1Mb upstream and 1 Mb downstream) from each gene present in the main network (38 genes).
Inclusion criteria and/or expression pattern (differentially expressed), in Cánovas et al. (2014) [13] and Nguyen et al. (2017) [20], for the 38 genes mapped in the main network identified by STRING database.
| Gene Symbol | Coordinate | Brangus (Cánovas et al., 2014) [ | Tropical composite and Brahman | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SNP | TS | TF | Hyp | Pit | Ov | Ut | End | Lv | Ad | ldm | Pit | Ov | ||
| GSK3B | 1:65265851–65292071 | X | X | |||||||||||
| CASR | 1:67255165–67344655 | X | X | |||||||||||
| SST | 1:80250205–80251648 | X | X | X | X | X | X | X | ||||||
| TBL1XR1 | 1:90428333–90609529 | X | X | |||||||||||
| ATF2 | 2:21725894–21830562 | X | X | |||||||||||
| SMARCAL1 | 2:105135339–105189135 | X | X | |||||||||||
| SCG2 | 2:112549869–112555375 | X | X | |||||||||||
| NTRK1 | 3:14019229–14037583 | X | X | X | ||||||||||
| ADH4 | 6:26853175–26885436 | X | X | X | X | X | ||||||||
| PDLIM5 | 6:31333660–31568270 | X | X | |||||||||||
| FBP1 | 8:82460863–82491694 | X | X | |||||||||||
| HTR1E | 9:63760625–63852410 | X | X | |||||||||||
| IYD | 9:88612923–88624698 | X | X | X | ||||||||||
| LGALS3 | 10:67843328–67861114 | X | ||||||||||||
| FSHR | 11:31110744–31305197 | X | X | |||||||||||
| ABL1 | 11:101011169–101152856 | X | X | |||||||||||
| ABI1 | 13:18094943–18182433 | X | X | |||||||||||
| BMP7 | 13:59424987–59510393 | X | X | |||||||||||
| FKBP1A | 13:60276502–60303717 | X | X | X | ||||||||||
| TG | 14:9253697–9263933 | X | X | X | ||||||||||
| MYC | 14:13769242–13775688 | X | X | |||||||||||
| TGS1 | 14:24747192–24772996 | X | X | |||||||||||
| CHD7 | 14:28043739–28172246 | X | X | |||||||||||
| ACACB | 17:66101179–66217542 | X | X | |||||||||||
| MYO18B | 17:67768278–68002792 | X | X | |||||||||||
| VAT1L | 18:5045212–5211278 | X | X | |||||||||||
| IGF1R | 21:7967701–8268340 | X | X | |||||||||||
| TDGF1 | 22:53432382–53437166 | X | X | X | X | X | X | X | ||||||
| PPARG | 22:57367072–57432321 | X | X | X | ||||||||||
| BRPF3 | 23:10096561–10132302 | X | X | |||||||||||
| SUPT3H | 23:18223167–18623659 | X | X | |||||||||||
| ADCYAP1 | 24:36114443–36121104 | X | X | X | ||||||||||
| SMAD2 | 24:47963393–48022086 | X | X | |||||||||||
| ACTL6B | 25:36459674–36469794 | X | X | |||||||||||
| ACTB | 25:39343633–39347047 | X | ||||||||||||
| PRKG1 | 26:6901760–8343635 | X | X | |||||||||||
| HHEX | 26:14120258–14126069 | X | X | |||||||||||
| DPF2 | 29:44205003–44217694 | X | X | X | ||||||||||
SNP: Genes expressed in at least one tissue among the two physiological states (pre- and post-puberty) and mapped near the markers associated with fertility traits by GWAS; TF: transcriptional factor; TS: genes identified with high probability to show a binding site for TF differentially expressed; Hyp: hypothalamus; Pit: pituitary; Ov: ovary; Ut: uterus; End: endometrium; Lv: liver; Ad: adipose tissue; ldm: longissimus dorsi muscle.
Fig 5Biological processes (BP) significantly enriched in the main network identified by STRING database.
The number inside the parenthesis indicated the number of genes associated with each BP.
Fig 6Chromosome specific plots displaying pleiotropic effect around the genes shared among all breeds.
The x-axis corresponds to the genomic position in each chromosome and the y-axis to the -log(p-value). The -log(p-value) showed in the y-axis corresponds to the p-values adjusted to multiple-testing (FDR<0.01) obtained by Bolormaa et al. (2014) [10] for the pleiotropic analysis. The grey diamond corresponds to the start coordinate of each gene. The horizontal dashed lines indicate the nominal threshold of -log(p-value)>2. All the genes mapped in an interval of 1 Mb of a marker with significant signal for pleiotropic effect were considered as a genes in pleiotropic regions.
Top 10 genes based on the number of interactions identified in the STRING database and NetworkAnalyst analyses.
In bold are shown the genes present in both top 10 lists.
| Top 10 genes for numbers of interactions with other genes | |||
|---|---|---|---|
| STRING database | NetworkAnalyst | ||
| Gene symbol | Number of interaction | Gene symbol | Number of interaction |
| 14 | 714 | ||
| 9 | 313 | ||
| 8 | 276 | ||
| 8 | 243 | ||
| SST | 7 | ATF2 | 241 |
| ACACB | 7 | 198 | |
| 5 | 124 | ||
| 5 | ABI1 | 64 | |
| ADCYAP1 | 5 | 58 | |
| 5 | LGALS3 | 56 | |
Fig 7Interactome displaying the protein-protein interactions for the genes present in the main gene network identified by STRING database with other proteins across the genome.
Larger nodes (highlighted in red and green) represent the genes with the highest number of connections. Genes in red are the genes associated with positive and negative regulation of cellular metabolic processes.