| Literature DB >> 27878450 |
Yong Zeng1,2, Fei-Yan Deng2,3, Wei Zhu2,4, Lan Zhang2, Hao He2, Chao Xu2, Qing Tian2, Ji-Gang Zhang2, Li-Shu Zhang1, Hong-Gang Hu1, Hong-Wen Deng5,6.
Abstract
Human monocyte is an important cell type which is involved in various complex human diseases. To better understand the biology of human monocytes and facilitate further studies, we developed the first comprehensive proteome knowledge base specifically for human monocytes by integrating both in vivo and in vitro datasets. The top 2000 expressed genes from in vitro datasets and 779 genes from in vivo experiments were integrated into this study. Altogether, a total of 2237 unique monocyte-expressed genes were cataloged. Biological functions of these monocyte-expressed genes were annotated and classified via Gene Ontology (GO) analysis. Furthermore, by extracting the overlapped genes from in vivo and in vitro datasets, a core gene list including 541 unique genes was generated. Based on the core gene list, further gene-disease associations, pathway and network analyses were performed. Data analyses based on multiple bioinformatics tools produced a large body of biologically meaningful information, and revealed a number of genes such as SAMHD1, G6PD, GPD2 and ENO1, which have been reported to be related to immune response, blood biology, bone remodeling, and cancer respectively. As a unique resource, this study can serve as a reference map for future in-depth research on monocytes biology and monocyte-involved human diseases.Entities:
Keywords: gene ontology; gene-disease association; human monocytes; network analysis; proteomics knowledgebase
Mesh:
Substances:
Year: 2016 PMID: 27878450 PMCID: PMC5291777 DOI: 10.1007/s13238-016-0342-x
Source DB: PubMed Journal: Protein Cell ISSN: 1674-800X Impact factor: 14.870
Figure 1Basic information and disease association of human monocyte proteins. (A) Distribution of protein molecular weight. X-axis lists the specific Mw ranges. Y-axis shows the number of proteins distributed in each range. (B) Distribution of Protein Isoelectric Point. X axis lists the specific pI ranges. Y axis shows the number of proteins distributed in each range. (C) Top 10 terms in cellular component category. X axis shows the top 10 terms in the Cellular Component category. Y axis shows the number of genes enriched in each specific GO term. (D) 20 main diseases related to human monocytes. X axis shows the names of 20 diseases. Y axis shows the number of monocyte-expressed genes involved in each specific disease
Top 20 terms in molecular function category
| Terms | Number of genes |
|
|---|---|---|
| GO:0000166~nucleotide binding | 520 | 1.81E-44 |
| GO:0017076~purine nucleotide binding | 435 | 9.56E-34 |
| GO:0032555~purine ribonucleotide binding | 416 | 5.60E-32 |
| GO:0032553~ribonucleotide binding | 416 | 5.60E-32 |
| GO:0001882~nucleoside binding | 319 | 2.50E-14 |
| GO:0001883~purine nucleoside binding | 316 | 4.90E-14 |
| GO:0030554~adenyl nucleotide binding | 311 | 9.50E-14 |
| GO:0032559~adenyl ribonucleotide binding | 293 | 1.60E-12 |
| GO:0005524~ATP binding | 290 | 1.46E-12 |
| GO:0005198~structural molecule activity | 168 | 2.70E-19 |
| GO:0003723~RNA binding | 160 | 3.14E-11 |
| GO:0005509~calcium ion binding | 142 | 0.045 |
| GO:0005525~GTP binding | 136 | 6.68E-30 |
| GO:0019001~guanyl nucleotide binding | 136 | 1.43E-28 |
| GO:0032561~guanyl ribonucleotide binding | 136 | 1.43E-28 |
| GO:0008092~cytoskeletal protein binding | 136 | 3.09E-16 |
| GO:0019899~enzyme binding | 132 | 2.36E-13 |
| GO:0042802~identical protein binding | 131 | 4.65E-07 |
| GO:0008233~peptidase activity | 108 | 2.02E-04 |
| GO:0070011~peptidase activity, acting on L-amino acid peptides | 105 | 1.32E-04 |
Notes: Based on the number of enriched genes, the first column shows the names of the top 20 terms in the molecular function category. The second column shows the number of genes enriched in each specific term. The third column shows the p value of each term
20 main biological processes related to monocytes
| Terms | Number of genes |
|
|---|---|---|
| GO:0008104~protein localization | 214 | 1.59E-20 |
| GO:0045184~establishment of protein localization | 200 | 6.49E-23 |
| GO:0007242~intracellular signaling cascade | 199 | 0.002737 |
| GO:0015031~protein transport | 198 | 1.23E-22 |
| GO:0043933~macromolecular complex subunit organization | 197 | 2.97E-26 |
| GO:0006508~proteolysis | 191 | 1.73E-06 |
| GO:0065003~macromolecular complex assembly | 187 | 1.14E-25 |
| GO:0046907~intracellular transport | 181 | 9.74E-24 |
| GO:0006796~phosphate metabolic process | 166 | 2.48E-04 |
| GO:0006793~phosphorus metabolic process | 166 | 2.48E-04 |
| GO:0010941~regulation of cell death | 162 | 3.28E-08 |
| GO:0043067~regulation of programmed cell death | 161 | 4.34E-08 |
| GO:0042981~regulation of apoptosis | 158 | 1.08E-07 |
| GO:0006952~defense response | 153 | 1.49E-15 |
| GO:0016192~vesicle-mediated transport | 151 | 9.85E-18 |
| GO:0006955~immune response | 151 | 1.10E-10 |
| GO:0007049~cell cycle | 151 | 4.01E-07 |
| GO:0009611~response to wounding | 150 | 6.20E-21 |
| GO:0009057~macromolecule catabolic process | 145 | 1.03E-05 |
| GO:0010033~response to organic substance | 143 | 2.71E-07 |
Notes: 20 terms which represent the main biological processes of monocytes are presented in the first column. The second column shows the number of genes enriched in each specific term. The third column shows the p value of each term
Figure 2Main groups and hierarchical relationship of Reactome pathways. Reactome pathways are classified into different groups based on the similarity of their biological functions. The bottom color gradation provides the P value comparison for significant pathways. The dendritic structure represents the hierarchical relation of pathways
Figure 3Network reconstruction of functional terms. Each dot represents one functional term. The same color dots represent terms which have similar biological functions. Accordingly, all terms were classified into different functional modules including “defense response,” “response to stress,” “hemostasis” and “cellular localization”
Five representative diseases and the involved genes related to monocytes
| Arthritis | Carcinoma | HIV infections | Leukemia | Osteoporosis | |
|---|---|---|---|---|---|
| HLA-A | 1 | 1 | 1 | 1 | |
| HLA-B | 1 | 1 | 1 | 1 | |
| HLA-C | 1 | 1 | 1 | 1 | |
| CAT | 1 | 1 | 1 | ||
| ENO1 | 1 | 1 | 1 | ||
| GSTP1 | 1 | 1 | 1 | ||
| SOD2 | 1 | 1 | 1 | ||
| ACTG1 | 1 | 1 | |||
| ANXA1 | 1 | 1 | |||
| ANXA2 | 1 | 1 | |||
| ANXA4 | 1 | 1 | |||
| ATIC | 1 | 1 | |||
| CD14 | 1 | 1 | |||
| CDC42 | 1 | 1 | |||
| HLA-DRB5 | 1 | 1 | |||
| HSPD1 | 1 | 1 | |||
| IDH2 | 1 | 1 | |||
| ITGB2 | 1 | 1 | |||
| LGALS3 | 1 | 1 | |||
| MPO | 1 | 1 | |||
| MVP | 1 | 1 | |||
| NME1 | 1 | 1 | |||
| PLEK | 1 | 1 | |||
| PRTN3 | 1 | 1 | |||
| PSMA4 | 1 | 1 | |||
| PSMA5 | 1 | 1 | |||
| PTPRC | 1 | 1 | |||
| RAN | 1 | 1 | |||
| S100A8 | 1 | 1 | |||
| SERPINA1 | 1 | 1 | |||
| TAP1 | 1 | 1 | |||
| VIM | 1 | 1 |
Notes: In this table, 5 diseases closely related to human monocytes are selected and shown in the first row. Among these 5 diseases, genes involved in 2 or more diseases are listed in the first column. The number “1” means that the corresponding gene is involved in the specified disease