| Literature DB >> 33787392 |
Jinwu Peng1,2,3, Zhili Duan4, Yamin Guo5, Xiaona Li4, Xiaoqin Luo4, Xiumin Han5, Junming Luo2,4.
Abstract
OBJECTIVES: Liver echinococcosis is a severe zoonotic disease caused by Echinococcus (tapeworm) infection, which is epidemic in the Qinghai region of China. Here, we aimed to explore biomarkers and establish a predictive model for the diagnosis of liver echinococcosis.Entities:
Keywords: CTLA4; FCGR2B; Liver hydatid disease; Qinghai region of China; echinococcosis; microarray profiling
Mesh:
Substances:
Year: 2021 PMID: 33787392 PMCID: PMC8020228 DOI: 10.1177/0300060521993980
Source DB: PubMed Journal: J Int Med Res ISSN: 0300-0605 Impact factor: 1.671
Figure 1.Identification of differentially expressed genes (DEGs) between hydatid disease and healthy human liver tissues. (a) Boxplot showing the intensity of raw data and normalized data in the microarray profiling analysis; NB and PB indicate sample numbers. (b) Volcano plot filtering map displaying differential expression of mRNA in liver tissues from patients with hydatid disease (n = 6) compared with that of adjacent normal tissue (the cut edge) (n = 6). Red and green represent upregulated and downregulated mRNAs, respectively; black represents no difference. The x-axis indicates the Log2 fold change (FC) and the y-axis shows −log10 (P-value). (c) Heatmap indicating DEGs; the x-axis shows samples and the y-axis shows DEGs, in which red and green represent upregulated and downregulated DEGs, respectively.
Figure 2.Gene Ontology (GO) and Kyoto Encyclopedia of Genes and Genomes (KEGG) analyses were performed using data of differentially expressed genes in lesional and nonlesional (healthy control) liver tissue from patients with hydatid disease based on the KOBAS database (http://kobas.cbi.pku.edu.cn/). The top 30 significant cellular processes and signaling pathways were demonstrated by (a) GO, and (b) KEGG enrichment scatterplot. The y-axis shows the name of cellular process or signaling pathway and the x-axis indicates the gene ratio; the size of the dot indicates the number of genes.
Figure 3.Protein–protein interaction (PPI) network construction and random forest model construction. (a) The PPI analysis was performed by using the STRING online database (https://string-db.org/cgi/input.pl). The PPI network with the top 20 differentially expressed genes (DEGs) is shown. The color change from yellow to red indicates the gradually increasing degree of expression. (b) A random forest model of the candidate hub genes in the PPI network was constructed using the package “randomForest” in R, in which the sample type (hydatid disease or not) served as the outcome variable, and the expression levels of selected 20 candidate hub genes were the prediction variables. The variable importance, as shown in the figure, was evaluated by the MeanDecreaseAccuracy and MeanDecreaseGini scores. (c) The accuracy of the model was assessed by the area under the ROC curve (AUC) analysis. The x-axis shows the false positive rate, and the y-axis the true positive rate.
The top 20 most important genes by maximal clique centrality (MCC) analysis in the protein–protein interaction network.
| Gene symbol | Score |
|---|---|
|
| 4.0038 × 1021 |
|
| 4.0036 × 1021 |
|
| 4.0036 × 1021 |
|
| 4.0036 × 1021 |
|
| 4.0036 × 1021 |
|
| 4.0036 × 1021 |
|
| 4.0035 × 1021 |
|
| 4.0031 × 1021 |
|
| 4.0027 × 1021 |
|
| 4.0026 × 1021 |
|
| 4.0022 × 1021 |
|
| 3.9978 × 1021 |
|
| 3.9916 × 1021 |
|
| 3.9910 × 1021 |
|
| 3.9890 × 1021 |
|
| 3.901 × 1021 |
|
| 3.9006 × 1021 |
|
| 3.7932 × 1021 |
|
| 3.6281 × 1021 |
|
| 3.5804 × 1021 |