| Literature DB >> 34899868 |
Minjie Sheng1, Haiying Cai1, Qin Yang1, Jing Li1, Jian Zhang2,3,4,5,6, Lihua Liu1.
Abstract
Lymphoma is a serious type of cancer, especially for adolescents and elder adults, although this malignancy is quite rare compared with other types of cancer. The cause of this malignancy remains ambiguous. Genetic factor is deemed to be highly associated with the initiation and progression of lymphoma, and several genes have been related to this disease. Determining the pathogeny of lymphoma by identifying the related genes is important. In this study, we presented a random walk-based method to infer the novel lymphoma-associated genes. From the reported 1,458 lymphoma-associated genes and protein-protein interaction network, raw candidate genes were mined by using the random walk with restart algorithm. The determined raw genes were further filtered by using three screening tests (i.e., permutation, linkage, and enrichment tests). These tests could control false-positive genes and screen out essential candidate genes with strong linkages to validate the lymphoma-associated genes. A total of 108 inferred genes were obtained. Analytical results indicated that some inferred genes, such as RAC3, TEC, IRAK2/3/4, PRKCE, SMAD3, BLK, TXK, PRKCQ, were associated with the initiation and progression of lymphoma.Entities:
Keywords: enrichment theory; lymphoma; permutation test; protein-protein interaction network; random walk with restart algorithm
Year: 2021 PMID: 34899868 PMCID: PMC8655984 DOI: 10.3389/fgene.2021.792754
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Entire procedure to mine the novel candidate genes related to lymphoma. The validated lymphoma-associated genes were retrieved from DisGeNET. From STRING, a protein–protein interaction network was constructed. These genes and the network were fed into the random walk with restart algorithm to extract the candidate genes with high probabilities. These genes were further filtered by using three screening tests to select the final inferred genes. The enrichment analysis is conducted on all inferred genes and some genes are analyzed individually.
FIGURE 2Box plot of the number of interacting lymphoma-associated genes with high confidence scores of inferred genes. Several genes can interact with over twenty lymphoma-associated genes with high confidence scores (≥900), indicating the strong associations between inferred genes with lymphoma-associated genes.
FIGURE 3Enriched gene ontology (GO) terms on inferred genes. Thirteen GO termed are enriched on 108 inferred genes, including eight biological processes (BP) terms, four molecular function (MF) terms and one cellular component (CC) term.
Some important inferred lymphoma-associated genes.
| Ensembl id | Gene symbol | Description | Probability | Z-score | Maximum linkage score | Maximum enrichment score | Reference |
|---|---|---|---|---|---|---|---|
| ENSP00000304283 | RAC3 | Rac Family Small GTPase 3 | 9.700E−05 | 5.0457 | 998 | 0.9984 |
|
| ENSP00000370912 | TEC | Tec Protein Tyrosine kinase | 2.900E−05 | 3.2291 | 990 | 0.9979 |
|
| ENSP00000256458 | IRAK2 | Interleukin 1 Receptor Associated kinase 2 | 3.530E−05 | 3.3867 | 999 | 0.9976 |
|
| ENSP00000390651 | IRAK4 | Interleukin 1 Receptor Associated kinase 4 | 4.190E−05 | 4.3253 | 999 | 0.9967 |
|
| ENSP00000306124 | PRKCE | Protein kinase C Epsilon | 4.260E−05 | 3.9254 | 984 | 0.9967 |
|
| ENSP00000261233 | IRAK3 | Interleukin 1 Receptor Associated kinase 3 | 3.540E−05 | 3.4101 | 999 | 0.9965 |
|
| ENSP00000332973 | SMAD3 | SMAD Family Member 3 | 8.280E−05 | 7.1221 | 999 | 0.9965 |
|
| ENSP00000259089 | BLK | BLK Proto-Oncogene, Src Family Tyrosine kinase | 4.630E−05 | 6.2082 | 983 | 0.9964 |
|
| ENSP00000264316 | TXK | TXK Tyrosine kinase | 2.890E−05 | 2.6307 | 915 | 0.9963 |
|
| ENSP00000263125 | PRKCQ | Protein kinase C Theta | 4.750E−05 | 4.8540 | 999 | 0.9963 |
|