| Literature DB >> 29767230 |
Shouhua Chen1, Xiaobin Fan2, He Gu1, Lili Zhang1, Wenhua Zhao3.
Abstract
The present study aimed to screen all types of RNAs involved in the development of papillary thyroid carcinoma (PTC). RNA‑sequencing data of PTC and normal samples were used for screening differentially expressed (DE) microRNAs (DE‑miRNAs), long non‑coding RNAs (DE‑lncRNAs) and genes (DEGs). Subsequently, lncRNA‑miRNA, miRNA‑gene (that is, miRNA‑mRNA) and gene‑gene interaction pairs were extracted and used to construct regulatory networks. Feature genes in the miRNA‑mRNA network were identified by topological analysis and recursive feature elimination analysis. A support vector machine (SVM) classifier was built using 15 feature genes, and its classification effect was validated using two microarray data sets that were downloaded from the Gene Expression Omnibus (GEO) database. In addition, Gene Ontology function and Kyoto Encyclopedia Genes and Genomes pathway enrichment analyses were conducted for genes identified in the ceRNA network. A total of 506 samples, including 447 tumor samples and 59 normal samples, were obtained from The Cancer Genome Atlas (TCGA); 16 DE‑lncRNAs, 917 DEGs and 30 DE‑miRNAs were screened. The miRNA‑mRNA regulatory network comprised 353 nodes and 577 interactions. From these data, 15 feature genes with high predictive precision (>95%) were extracted from the network and were used to form an SVM classifier with an accuracy of 96.05% (486/506) for PTC samples downloaded from TCGA, and accuracies of 96.81 and 98.46% for GEO downloaded data sets. The ceRNA regulatory network comprised 596 lines (or interactions) and 365 nodes. Genes in the ceRNA network were significantly enriched in 'neuron development', 'differentiation', 'neuroactive ligand‑receptor interaction', 'metabolism of xenobiotics by cytochrome P450', 'drug metabolism' and 'cytokine‑cytokine receptor interaction' pathways. Hox transcript antisense RNA, miRNA‑206 and kallikrein‑related peptidase 10 were nodes in the ceRNA regulatory network of the selected feature gene, and they may serve import roles in the development of PTC.Entities:
Mesh:
Substances:
Year: 2018 PMID: 29767230 PMCID: PMC6059698 DOI: 10.3892/mmr.2018.9009
Source DB: PubMed Journal: Mol Med Rep ISSN: 1791-2997 Impact factor: 2.952
Differentially expressed lncRNAs between papillary thyroid carcinomas samples and normal tissue samples.
| lncRNA | logFC | P-value | FDR |
|---|---|---|---|
| NHEG1 | −2.63948 | 2.87×10−02 | 9.36×10−02 |
| FAM27B | −2.49586 | 2.48×10−05 | 2.57×10−04 |
| C12orf77 | −2.04917 | 2.05×10−07 | 3.01×10−06 |
| RNU11 | −1.85277 | 1.24×10−06 | 1.45×10−05 |
| TERC | −1.66684 | 9.66×10−22 | 1.70×10−19 |
| PWRN2 | −1.58396 | 9.41×10−04 | 5.52×10−03 |
| DIO3OS | −1.45712 | 7.29×10−08 | 1.28×10−06 |
| TCL6 | −1.19462 | 1.84×10−11 | 8.10×10−10 |
| SMCR5 | −1.08884 | 4.66×10−10 | 1.37×10−08 |
| MYCNOS | −1.0739 | 1.65×10−10 | 5.81×10−09 |
| HYMAI | −1.05663 | 2.20×10−08 | 4.84×10−07 |
| C9orf170 | −1.03293 | 1.43×10−07 | 2.29×10−06 |
| SFTA1P | 1.325098 | 4.13×10−18 | 3.63×10−16 |
| C16orf82 | 1.437648 | 6.26×10−03 | 2.75×10−02 |
| UCA1 | 2.279467 | 3.41×10−14 | 2.00×10−12 |
| HOTAIR | 2.707475 | 2.77×10−03 | 1.43×10−02 |
FC, fold change; FDR, false discovery rate; lncRNA, long non-coding RNA.
Figure 1.Sample clustering results by DEGs, DE-microRNAs and DE-long non-coding RNAs. (A) DEGs, (B) DE-microRNAs and (C) DE-long non-coding RNAs. Blue represents normal samples and red represents papillary thyroid carcinoma samples. DE, differentially expressed; DEG, differentially expressed gene.
Differentially expressed lncRNAs that are regulated by differentially expressed microRNAs.
| lncRNA | microRNA |
|---|---|
| C16orf82 | hsa-miR-18b and hsa-miR-876 |
| DIO3OS | hsa-miR-7, hsa-miR-206 and hsa-miR-18b |
| HOTAIR | hsa-miR-9-3 and hsa-miR-206 |
| MYCNOS | hsa-miR-551b, hsa-miR-18b and hsa-miR-7 |
| PWRN2 | hsa-miR-9-3 and hsa-miR-206 |
| SFTA1P | hsa-miR-206 and hsa-miR-9-3 |
| TCL6 | hsa-miR-9-3, hsa-miR-18b, hsa-miR-490 and |
| hsa-miR-7 | |
| UCA1 | hsa-miR-206 |
lncRNA, long non-coding RNA; miR, microRNA.
Figure 2.Differentially expressed miRNA-lncRNA regulatory network. Diamonds represent miRNAs; triangles represent lncRNAs; green color indicates downregulated nodes; and red color indicates upregulated nodes. lncRNA, long non-coding RNA; miR, microRNA; miRNA, microRNA.
Figure 3.miRNA-mRNA regulatory network in papillary thyroid carcinomas. Circles represent mRNAs/genes; diamonds represent miRNAs; green color indicates downregulated nodes; red color indicates upregulated nodes; white circles represent genes with >5 interactions with miRNA-targeted genes; blue lines are miRNA-mRNA interactions; and red lines are gene-gene interactions. miRNA, microRNA.
Figure 4.Clustering results of the feature genes with top 100 betweenness centrality values of different samples in the TCGA downloaded dataset.
Figure 5.Classification effects of the extracted feature genes on PTC samples. (A) Feature elimination of all 100 feature genes. The vertical red line shows that when there were 15 genes, the accuracy could reach >95%. (B) Sample distribution distinguished by the support vector machine classifier. PTC, papillary thyroid carcinomas.
15 feature genes and related DE-microRNAs.
| Gene | logFC | P-value | FDR | BC | Related DE-microRNA |
|---|---|---|---|---|---|
| 1.1311 | 3.99×10−13 | 1.00×10−11 | 0.0247 | hsa-miR-18b and hsa-miR-206 | |
| −1.0414 | 1.09×10−44 | 2.17×10−42 | 0.0879 | hsa-miR-187 and hsa-miR-506 | |
| 1.0542 | 3.72×10−08 | 5.42×10−07 | 0.1683 | hsa-miR-187, hsa-miR-18b and hsa-miR-873 | |
| −1.1509 | 1.06×10−10 | 2.07×10−09 | 0.3746 | hsa-miR-187, hsa-miR-18b, hsa-miR-506 and hsa-miR-873 | |
| −1.1511 | 1.84×10−09 | 3.10×10−08 | 0.0856 | hsa-miR-206 and hsa-miR-506 | |
| −1.3015 | 8.89×10−31 | 8.19×10−29 | 0.0603 | hsa-miR-206 and hsa-miR-506 | |
| 1.2698 | 2.92×10−14 | 8.30×10−13 | 0.1310 | hsa-miR-18b, hsa-miR-206 and hsa-miR-873 | |
| 1.2011 | 9.98×10−42 | 1.66×10−39 | 0.0247 | hsa-miR-18b and hsa-miR-206 | |
| −1.3458 | 3.08×10−35 | 3.91×10−33 | 0.0247 | hsa-miR-18b and hsa-miR-206 | |
| 1.4452 | 5.51×10−48 | 1.37×10−45 | 0.0614 | hsa-miR-506 and hsa-miR-873 | |
| −1.1062 | 2.25×10−12 | 5.21×10−11 | 0.0357 | hsa-miR-187 and hsa-miR-206 | |
| 2.0281 | 1.86×10−58 | 1.03×10−55 | 0.0942 | hsa-miR-18b, hsa-miR-206 and hsa-miR-506 | |
| −1.3570 | 4.04×10−32 | 4.07×10−30 | 0.0345 | hsa-miR-206 and hsa-miR-506 | |
| 1.9145 | 3.25×10−38 | 4.64×10−36 | 0.0350 | hsa-miR-18b and hsa-miR-506 | |
| 1.7752 | 5.37×10−49 | 1.39×10−46 | 0.0247 | hsa-miR-18b and hsa-miR-206 |
BC, betweenness centrality; DE, differentially expressed; FC, fold change; FDR, false discover rate; miR, microRNA.
Figure 6.Classification confusion matrix and scatter plot of support vector machine classifier on GEO downloaded datasets. (A) The left fig. illustrates the accuracy of the SVM classifier to classify the normal and tumor sample in GSE33630 dataset; The right fig. showed the distribution of the samples in GSE33630 dataset. (B) The left Figure showed the accuracy of the SVM classifier to classify the normal and tumor sample in GSE60542 dataset; The right Figure showed the distribution of the samples in GSE60542 dataset.
Classifying characteristics of the support vector machine classifier on all downloaded data sets.
| Data set | n | Correct rate | Sensitivity | Specificity | PPV | NPV | AUROC |
|---|---|---|---|---|---|---|---|
| TCGA | 506 | 0.961 | 0.9619 | 0.949 | 0.993 | 0.921 | 0.998 |
| GSE33630 | 94 | 0.968 | 0.959 | 0.978 | 0.979 | 0.956 | 0.991 |
| GSE60542 | 65 | 0.985 | 0.969 | 1.000 | 1.000 | 0.969 | 0.973 |
AUROC, area under receiver operating characteristic curve; TCGA, The Cancer Genome Atlas; NPV, negative predictive value; PPV, positive predictive value.
Figure 7.ceRNA regulatory network of all DE-miRNAs, DE-lncRNAs and DEGs. Circles represent mRNAs (genes); diamonds represent miRNAs; triangles represent lncRNAs. Green nodes represent downregulated expression; red nodes represent upregulated expression; white circles represent genes with >5 interactions to miRNA targeted genes. Blue lines indicate miRNA-gene and miRNA-lncRNA interactions; red lines indicate gene-gene interactions. DE, differentially expressed; DEG, differentially expressed gene; lncRNA, long non-coding RNA; miR, microRNA; miRNA, microRNA.
Enriched functions for the feature genes.
| Term | Count | P-value | Genes |
|---|---|---|---|
| GO:0007267~cell-cell signaling | 25 | 4.05×10−5 | |
| GO:0007268~synaptic transmission | 16 | 1.04×10−4 | |
| GO:0019226~transmission of nerve impulse | 17 | 1.87×10−4 | |
| GO:0051969~regulation of transmission of nerve impulse | 10 | 6.47×10−4 | |
| GO:0006811~ion transport | 26 | 6.79×10−4 | |
| GO:0031644~regulation of neurological system process | 10 | 8.62×10−4 | |
| GO:0043062~extracellular structure organization | 10 | 1.35×10−3 | |
| GO:0007610~behavior | 18 | 1.65×10−3 | |
| GO:0050877~neurological system process | 34 | 2.03×10−3 | |
| GO:0048666~neuron development | 14 | 3.56×10−3 | |
| GO:0007423~sensory organ development | 11 | 4.17×10−3 | |
| GO:0007389~pattern specification process | 12 | 4.22×10−3 | |
| GO:0044057~regulation of system process | 13 | 4.55×10−3 | |
| GO:0030182~neuron differentiation | 16 | 5.17×10−3 | |
| GO:0001501~skeletal system development | 13 | 5.76×10−3 | |
| GO:0051240~positive regulation of multicellular organismal process | 11 | 6.39×10−3 | |
| GO:0000904~cell morphogenesis involved in differentiation | 11 | 6.39×10−3 | |
| GO:0048667~cell morphogenesis involved in neuron differentiatio | 10 | 7.03×10−3 | |
| GO:0007601~visual perception | 10 | 8.63×10−3 | |
| GO:0050953~sensory perception of light stimulus | 10 | 8.63×10−3 |
GO, gene ontology.
Figure 8.ceRNA regulatory network of DE-miRNAs, DE-lncRNAs and selected feature genes. The feature genes were those used for constructing the SVM classifier. Circles represent mRNAs (genes); diamonds represent miRNAs; triangles represent lncRNAs. Green nodes indicate downregulated expression; red nodes indicate upregulated expression. Solid lines represent miRNA-mRNA interactions; dotted lines represent lncRNA-miRNA interactions. DE, differentially expressed; lncRNA, long non-coding RNA; miR, microRNA; miRNA, microRNA.