| Literature DB >> 35419455 |
Qin Yuan1, Zilu Wen2, Ke Yang1, Shulin Zhang3,4, Ning Zhang5, Yanzheng Song6, Fuxue Chen1.
Abstract
Pulmonary tuberculosis (TB) is a chronic infectious disease that is caused by respiratory infections, principally Mycobacterium tuberculosis. Increasingly, studies have shown that circular (circ)RNAs play regulatory roles in different diseases through different mechanisms. However, their roles and potential regulatory mechanisms in pulmonary TB remain unclear. In this study, we analyzed circRNA sequencing data from adjacent normal and diseased tissues from pulmonary TB patients and analyzed the differentially expressed genes. We then constructed machine learning models and used single-factor analysis to identify hub circRNAs. We downloaded the pulmonary TB micro (mi)RNA (GSE29190) and mRNA (GSE83456) gene expression datasets from the Gene Expression Omnibus database and performed differential expression analysis to determine the differentially expressed miRNAs and mRNAs. We also constructed a circRNA-miRNA-mRNA interaction network using Cytoscape. Gene Ontology enrichment analysis and Kyoto Encyclopedia of Genes and Genomes pathway analysis were used to predict the biological functions of the identified RNAs and determine hub genes. Then, the STRING database and cytoHubba were used to construct protein-protein interaction networks. The results showed 125 differentially expressed circRNAs in the adjacent normal and diseased tissues of pulmonary TB patients. Among them, we identified three hub genes associated with the development of pulmonary TB: hsa_circ_0007919 (upregulated), hsa_circ_0002419 (downregulated), and hsa_circ_0005521 (downregulated). Through further screening, we determined 16 mRNAs of potential downstream genes for hsa-miR-409-5p and hsa_circ_0005521 and established an interaction network. This network may have important roles in the occurrence and development of pulmonary TB. We constructed a model with 100% prediction accuracy by machine learning and single-factor analysis. We constructed a protein-protein interaction network among the top 50 hub mRNAs, with FBXW7 scoring the highest and SOCS3 the second highest. These results may provide a new reference for the identification of candidate markers for the early diagnosis and treatment of pulmonary TB.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35419455 PMCID: PMC9001091 DOI: 10.1155/2022/1717784
Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411
Figure 1Sequencing data and differential expression analysis. (a) Scatter plot analysis of circRNA expression; (b) volcano plot of differential circRNA expression; (c) heat map of cluster analysis of the differentially expressed circRNAs.
The top 10 differentially expressed genes between normal and diseased tissues.
| circBase_ID | ID | log2FC (H/L) |
| Regulate |
|---|---|---|---|---|
| _ | chr6:32489682|32549615 | 18.39 | 0.016744549 | Up |
| _ | chr6:31239376|31324219 | 15.76 | 5.20E-16 | Up |
| _ | chr6:29913011|29976954 | 15.27 | 6.44E-15 | Up |
| _ | chr12:103657104|103664086 | 11.4 | 0.007540201 | Up |
| hsa_circ_0007919 | chr17:953290|1003975 | 11.37 | 0.001677147 | Up |
| hsa_circ_0049335 | chr19:10906048|10909248 | 11.25 | 0.010820527 | Up |
| _ | chr5:94224584|94267696 | 3.67 | 0.021390567 | Up |
| hsa_circ_0006272 | chr10:70497602|70502326 | 3.45 | 0.026522304 | Up |
| hsa_circ_0071410 | chr4:169812073|169837178 | 3.41 | 0.012791713 | Up |
| hsa_circ_0006006 | chr2:173435454|173460751 | 3.36 | 0.022707405 | Up |
| _ | chr2:152109269|152112257 | -11.18 | 0.045132825 | Down |
| hsa_circ_0005281 | chr17:80721841|80730383 | -11.21 | 0.047352003 | Down |
| _ | chr11:36415396|36440853 | -11.36 | 0.017056129 | Down |
| _ | chr8:108296910|108315595 | -11.59 | 0.043394725 | Down |
| hsa_circ_0002419 | chr12:78443773|78452895 | -11.69 | 0.001451403 | Down |
| hsa_circ_0001961 | chr10:52907842|52916908 | -11.71 | 0.046807666 | Down |
| hsa_circ_0037054 | chr15:100185766|100215663 | -11.9 | 0.030332661 | Down |
| hsa_circ_0013225 | chr1:94667276|94697199 | -12.02 | 0.038719442 | Down |
| hsa_circ_0057608 | chr2:197777606|197786910 | -12.05 | 0.024486018 | Down |
| hsa_circ_0005521 | chr1:215759838|215768813 | -12.11 | 0.001277288 | Down |
Feature screening information.
| Feature-screening method | Number of circRNAs | Number of 100% correct algorithms |
|---|---|---|
| 125 circRNA | 125 | 3/13 |
| CfsSubsetEval-BestFirst | 14 | 3/13 |
| PrincipalComponents-Ranker-T | 5 | 0/13 |
| CorrelationAttributeEval-Ranker-T | 10 | 4/13 |
| GainRatioAttributeEval-Ranker-T | 12 | 0/13 |
| InfoGainAttributeEval-Ranker-T | 12 | 0/13 |
| OneRAttributeEval-Ranker-T | 16 | 2/13 |
| ReliefFAttributeEval-Ranker-T | 12 | 2/13 |
| SymmetricalUncertAttributeEval-Ranker-T | 12 | 0/13 |
Figure 2Comparison of the average accuracy of machine learning models built by different feature screening methods.
Frequency of occurrence of 29 circRNAs in four classes of feature-screened circRNAs.
| Number | CircRNA | Number of occurrences |
|---|---|---|
| 1 | hsa_circ_0007919 | 4/4 |
| 2 | chr10:15590454|15628663 | 4/4 |
| 3 | hsa_circ_0002419 | 4/4 |
| 4 | chr10:76729418|76748870 | 3/4 |
| 5 | hsa_circ_0005521 | 3/4 |
| 6 | chr12:97886239|97954825 | 3/4 |
| 7 | chr10:97141442|97170534 | 2/4 |
| 8 | hsa_circ_0034293 | 2/4 |
| 9 | hsa_circ_0013048 | 2/4 |
| 10 | hsa_circ_0007769 | 2/4 |
| 11 | chr6:62362160|62442669 | 2/4 |
| 12 | chrX:51070573|51099879 | 2/4 |
| 13 | chr8:68934271|68973014 | 2/4 |
| 14 | hsa_circ_0002286 | 2/4 |
| 15 | hsa_circ_0006272 | 1/4 |
| 16 | chr12:103657104|103664086 | 1/4 |
| 17 | chr11:36415396|36440853 | 1/4 |
| 18 | chr16:65005482|65026937 | 1/4 |
| 19 | chr1:21231376|21268823 | 1/4 |
| 20 | hsa_circ_0008336 | 1/4 |
| 21 | hsa_circ_0008223 | 1/4 |
| 22 | hsa_circ_0057105 | 1/4 |
| 23 | chr7:115750762|115752092 | 1/4 |
| 24 | hsa_circ_0066452 | 1/4 |
| 25 | hsa_circ_0084708 | 1/4 |
| 26 | hsa_circ_0080947 | 1/4 |
| 27 | hsa_circ_0003961 | 1/4 |
| 28 | hsa_circ_0042103 | 1/4 |
| 29 | hsa_circ_0076948 | 1/4 |
Univariate analysis of 14 circRNAs.
| circRNA | SMO(%) | IBK(%) | HoeffdingTree(%) | Logistic(%) |
|---|---|---|---|---|
| hsa_circ_0007919 | — | 83 | 83 | 83 |
| chr10:15590454|15628663 | 83 | 94 | 83 | 83 |
| hsa_circ_0002419 | 94 | 94 | 94 | 94 |
| chr10:76729418|76748870 | 83 | — | 83 | 83 |
| hsa_circ_0005521 | 89 | 89 | 89 | 89 |
| chr12:97886239|97954825 | 89 | 83 | 89 | 89 |
| chr10:97141442|97170534 | — | 94 | 83 | — |
| hsa_circ_0034293 | — | 83 | 89 | — |
| hsa_circ_0013048 | — | 89 | — | — |
| hsa_circ_0007769 | 83 | 89 | 83 | — |
| chr6:62362160|62442669 | — | 83 | 83 | 83 |
| chrX:51070573|51099879 | 83 | — | — | — |
| chr8:68934271|68973014 | 83 | 83 | — | — |
| hsa_circ_0002286 | 83 | — | 83 | 83 |
Figure 3Cross-analysis of the top differentially expressed genes and genes identified by univariate analysis.
Statistical analysis of downstream genes of the identified circRNAs.
| circRNA | miRNA | mRNA |
|---|---|---|
| hsa_circ_0007919 | 13 | 1429 |
| hsa_circ_0002419 | 15 | 2201 |
| hsa_circ_0005521 | 46 | 5401 |
Figure 4Analytical screening of downstream miRNAs and mRNAs. (a) Screening of downstream miRNAs using Jvenn cross-analysis of potential miRNAs and the differentially expressed miRNAs. (b) Screening of downstream mRNAs using Jvenn cross-analysis of potential mRNAs and the differentially expressed mRNAs. (c) The circRNA–miRNA–mRNA interaction network.
Figure 5Biological function enrichment analysis. (a) Biological process (BP) enrichment analysis. (b) Cellular component (CC) enrichment analysis. (c) Molecular function (MF) enrichment analysis. (d) KEGG pathway enrichment analysis.
Figure 6Screening the hub mRNAs. (a) Intersection analysis of potential downstream mRNAs of hsa_circ_0007919, hsa_circ_0002419, and hsa_circ_0005521. (b) PPI network of the top 50 hub mRNAs.
Scores of the top 50 hub mRNAs.
| mRNA | MCC |
|---|---|
| FBXW7 | 403816 |
| SOCS3 | 403476 |
| TCEB1 | 403213 |
| ASB16 | 403206 |
| KLHL21 | 403202 |
| KLHL42 | 403200 |
| SOCS6 | 362929 |
| SOCS5 | 362928 |
| FEM1B | 362886 |
| FEM1C | 362882 |
| TP53 | 139047 |
| MDM2 | 135767 |
| CASP3 | 124579 |
| EGFR | 123489 |
| HSP90AA1 | 121780 |
| CDKN1A | 114332 |
| FOXO1 | 94090 |
| PARP1 | 92418 |
| EZH2 | 59540 |
| CYCS | 57445 |
| SIAH2 | 40341 |
| RNF4 | 40323 |
| RNF19B | 40320 |
| MAP2K1 | 22088 |
| JAK1 | 16144 |
| GSK3B | 12798 |
| BMI1 | 11425 |
| MET | 11221 |
| PIK3R1 | 9141 |
| WEE1 | 5886 |
| SNAI2 | 5763 |
| E2F3 | 3748 |
| YWHAZ | 2970 |
| SUZ12 | 1957 |
| AGO3 | 1848 |
| TNRC6B | 1825 |
| CCNE2 | 1690 |
| KMT2D | 1119 |
| BAX | 864 |
| SOD2 | 777 |
| HNRNPA2B1 | 748 |
| HNRNPU | 736 |
| LSM3 | 730 |
| SF3B3 | 727 |
| PRPF8 | 725 |
| HNRNPF | 724 |
| SUGP1 | 720 |
| NOTCH2 | 499 |
| KLF4 | 442 |
| ILK | 265 |