| Literature DB >> 27806696 |
Hamid Beiki1,2, Ardeshir Nejati-Javaremi3, Abbas Pakdel4, Ali Masoudi-Nejad5, Zhi-Liang Hu2, James M Reecy2.
Abstract
BACKGROUND: Genome sequencing and subsequent gene annotation of genomes has led to the elucidation of many genes, but in vertebrates the actual number of protein coding genes are very consistent across species (~20,000). Seven years after sequencing the cattle genome, there are still genes that have limited annotation and the function of many genes are still not understood, or partly understood at best. Based on the assumption that genes with similar patterns of expression across a vast array of tissues and experimental conditions are likely to encode proteins with related functions or participate within a given pathway, we constructed a genome-wide Cattle Gene Co-expression Network (CGCN) using 72 microarray datasets that contained a total of 1470 Affymetrix Genechip Bovine Genome Arrays that were retrieved from either NCBI GEO or EBI ArrayExpress.Entities:
Mesh:
Year: 2016 PMID: 27806696 PMCID: PMC5094014 DOI: 10.1186/s12864-016-3176-2
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Composition of the 1470 Affymetrix Genechip Bovine Genome Arrays used in this study. Arrays were classified according to the experimental conditions (a) and distribution (b)
Fig. 2Network topology of the CGCN. a Visualizing the CGCN (based on TOM similarity matrix) using heatmap plot. Light color represents low overlap and progressively darker red color represents higher overlap. Blocks of darker colors along the diagonal are the modules. The gene dendrogram and module assignment are also shown along the left side and the top. b Scale free topology evaluation of CGCN using Scale-Free Topology Fitting Index [18]
Modules identified in the network and their top over-represented Biological process GO terms
| Module | # probes | # genes | GO term | Genes in GO terma |
|
|---|---|---|---|---|---|
| Turquoise | 2569 | 2050 | system process | 241/652 | 3.30E-31 |
| Blue | 806 | 690 | nucleic acid metabolic process | 211/2322 | 6.89E-10 |
| Brown | 606 | 526 | organic acid metabolic process | 113/565 | 1.38E-36 |
| Yellow | 566 | 521 | RNA processing | 83/455 | 6.26E-26 |
| Green | 557 | 471 | RNA metabolic process | 151/2064 | 3.36E-09 |
| Red | 519 | 431 | cell cycle process | 117/604 | 4.93E-50 |
| Black | 454 | 384 | negative regulation of peptidase activity | 22/129 | 2.76E-07 |
| Pink | 348 | 304 | RNA metabolic process | 117/2064 | 7.20E-17 |
| Magenta | 292 | 267 | gene expression | 111/2568 | 1.56E-08 |
| Purple | 275 | 361 | ribonucleoprotein complex biogenesis | 44/266 | 3.10E-24 |
| Green yellow | 271 | 214 | chromatin modification | 26/336 | 3.79E-07 |
| Tan | 269 | 225 | anatomical structure morphogenesis | 72/1323 | 9.65E-14 |
| Salmon | 239 | 216 | translation | 85/398 | 1.40E-67 |
| Cyan | 202 | 181 | oxidation-reduction process | 35/619 | 1.15E-08 |
| Midnight blue | 200 | 191 | hydrogen ion transmembrane transport | 14/60 | 1.14E-10 |
| Light cyan | 188 | 151 | leukocyte activation | 34/365 | 7.29E-20 |
| Grey60 | 184 | 158 | leukocyte activation | 30/365 | 3.28E-13 |
| Light green | 169 | 160 | purine ribonucleoside triphosphate metabolic process | 17/133 | 2.08E-09 |
| Light yellow | 165 | 149 | sterol biosynthetic process | 15/31 | 2.21E-18 |
| Royal blue | 164 | 146 | mitochondrial translation | 8/43 | 9.87E-06 |
| Dark red | 158 | 139 | proteasomal protein catabolic process | 12/245 | 6.00E-3 |
| Dark green | 146 | 128 | response to endoplasmic reticulum stress | 15/146 | 2.23E-09 |
| Dark turquoise | 120 | 88 | tricarboxylic acid metabolic process | 3/27 | 5.75E-3 |
| Dark grey | 102 | 80 | inflammatory response | 28/269 | 1.40E-23 |
| Orange | 98 | 78 | antigen processing and presentation of peptide antigen | 12/26 | 1.00E-18 |
| Dark orange | 95 | 85 | regulation of mRNA metabolic process | 3/69 | 1.40E-2 |
| White | 80 | 63 | defense response to virus | 15/138 | 4.68E-15 |
| Sky blue | 67 | 53 | transcription from RNA polymerase I promoter | 3/32 | 4.91E-4 |
| Saddle brown | 62 | 53 | defense response to bacterium | 7/94 | 1.16E-11 |
| Steel blue | 61 | 48 | acute inflammatory response | 6/46 | 7.97E-07 |
| Pale turquoise | 34 | 24 | L-serine metabolic process | 3/7 | 2.55E-06 |
| Violet | 29 | 23 | amino acid activation | 7/36 | 1.84E-12 |
aThe two values listed in this column refer to the number of genes associated with the over-represented GO term in the module and the number of genes associated with the same GO term in Affymetrix Genechip Bovine Genome Array
bThe p-value indicated the probability that a module contains equal or larger number of genes associated with the GO term under a hypergeometric distribution after Bonferroni step-down correction
Fig. 3Manual category of ove-represented BP GO terms in CGCN modules
Fig. 4Heatmap visualization of module one gene interactions based on cattle protein interaction network [21]
Fig. 5Functional analysis of the white module genes. Over-represented GO/pathway terms were grouped based on kappa statistics. The size of each category within a pie chart represents the number of included terms. Only the most significant GO/terms within groups were labeled. GO/pathway terms are represented as nodes, and the node size represents the term enrichment significance, while the edges represent significant similarity between categories. a Representative biological processes interactions among module genes. b Representative molecular function interactions among module genes. c Representative Ractome analysis interactions among module genes
Functional enrichment analysis of close neighbors (TOM > 0.01) of hub genes with no functional annotation in CGCNa
| Gene | # | Top BP GO term |
| Top Reactome pathway |
|
|---|---|---|---|---|---|
| LOC509513 | 1 | T cell activation | 1.58E-20 | Adaptive Immune System | 1.04E-21 |
| 2 | T cell aggregation | 1.58E-20 | Immunoregulatory interactions between a Lymphoid and a non-Lymphoid cell | 3.55E-19 | |
| 3 | lymphocyte aggregation | 1.84E-20 | Immune System | 7.66E-19 | |
| LOC100848208 | 1 | RNA processing | 8.95E-07 | Mitochondrial translation elongation | 2.12E-08 |
| 2 | mRNA metabolic process | 1.87E-04 | Mitochondrial translation termination | 2.12E-08 | |
| 3 | mitochondrial translation | 2.18E-04 | Mitochondrial translation | 2.50E-08 | |
| LOC100850151 | 1 | RNA processing | 3.82E-15 | Cell Cycle | 8.78E-11 |
| 2 | mRNA processing | 6.36E-11 | Processing of Capped Intron-Containing Pre-mRNA | 3.91E-10 | |
| 3 | mRNA metabolic process | 1.56E-10 | Cell Cycle, Mitotic | 2.79E-08 | |
| MGC137211 | 1 | carboxylic acid metabolic process | 1.32E-41 | Metabolism | 4.96E-41 |
| 2 | monocarboxylic acid metabolic process | 1.05E-23 | Biological oxidations | 2.20E-18 | |
| 3 | alpha-amino acid metabolic process | 6.84E-20 | Metabolism of lipids and lipoproteins | 3.72E-18 |
aJust three top over-represented biological process (BP) and Reactome pathways are listed, More information related to these hubs and other un-known genes in the network are provided in Additional file 6: Table S5
b,cThe p-value indicated the probability that a module contains equal or larger number of genes associated with the GO term under a hypergeometric distribution after Bonferroni step-down correction