| Literature DB >> 27171078 |
Abstract
MicroRNA(miRNA)-mRNA interactions are important for understanding many biological processes, including development, differentiation and disease progression, but their identification is highly context-dependent. When computationally derived from sequence information alone, the identification should be verified by integrated analyses of mRNA and miRNA expression. The drawback of this strategy is the vast number of identified interactions, which prevents an experimental or detailed investigation of each pair. In this paper, we overcome this difficulty by the recently proposed principal component analysis (PCA)-based unsupervised feature extraction (FE), which reduces the number of identified miRNA-mRNA interactions that properly discriminate between patients and healthy controls without losing biological feasibility. The approach is applied to six cancers: hepatocellular carcinoma, non-small cell lung cancer, esophageal squamous cell carcinoma, prostate cancer, colorectal/colon cancer and breast cancer. In PCA-based unsupervised FE, the significance does not depend on the number of samples (as in the standard case) but on the number of features, which approximates the number of miRNAs/mRNAs. To our knowledge, we have newly identified miRNA-mRNA interactions in multiple cancers based on a single common (universal) criterion. Moreover, the number of identified interactions was sufficiently small to be sequentially curated by literature searches.Entities:
Keywords: breast cancer; colorectal/colon cancer; esophageal squamous cell carcinoma; feature extraction; hepatocellular carcinoma; miRNA–mRNA interaction; non-small cell lung cancer; principal component analysis; prostate cancer
Mesh:
Substances:
Year: 2016 PMID: 27171078 PMCID: PMC4881522 DOI: 10.3390/ijms17050696
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Summary of the investigated mRNA/miRNA expressions. Probes identified and not identified by PCA-based unsupervised FE are denoted as selected and non-selected, respectively. For more details, see Materials and Methods.
| Number of Samples | Number of Probes | ||||
|---|---|---|---|---|---|
| Cancers | GEO ID | Tumors | Controls | Selected | Non-Selected |
| HCC | |||||
| mRNA | GSE45114 | 24 | 25 | 269 | 22,963 |
| miRNA | GSE36915 | 68 | 21 | 58 | 1087 |
| NSCLC | |||||
| mRNA | GSE18842 | 46 | 45 | 1098 | 53,504 |
| miRNA | GSE15008 | 187 | 174 | 268 | 3428 |
| ESCC | |||||
| mRNA | GSE38129 | 30 | 30 | 189 | 22,088 |
| miRNA | GSE19337 | 76 | 76 | 37 | 1217 |
| Prostate cancer | |||||
| mRNA | GSE21032 | 150 | 29 | 399 | 43,020 |
| miRNA | GSE84318 | 27 | 27 | 23 | 700 |
| Colon/colorectal cancer | |||||
| mRNA | GSE41258 | 186 | 54 | 309 | 21,974 |
| miRNA | GSE48267 | 30 | 30 | 12 | 839 |
| Breast cancer | |||||
| mRNA | GSE29174 | 110 | 11 | 980 | 33,600 |
| miRNA | GSE28884 | 173 | 16 | 18 | 2258 |
Comparison between BH-adjusted p-values and FDR q-values. Under both criteria, mRNA/miRNA identifications with p-values below 0.01 were regarded as significant.
| HCC | NSCLC | ESCC | ||||
|---|---|---|---|---|---|---|
| mRNA | miRNA | mRNA | miRNA | mRNA | miRNA | |
| FDR | 262 | 38 | 978 | 0 | 189 | 0 |
| BH | 269 | 58 | 1091 | 268 | 189 | 37 |
| FDR | 399 | 7 | 305 | 12 | 861 | 0 |
| BH | 399 | 23 | 309 | 12 | 908 | 18 |
Results (confusion matrix) of linear discriminant analysis between patients and healthy controls (hc). L: number of PC loadings used in the discrimination; the p-values and odds ratios were computed by Fisher’s exact test. *: . Columns: true classes, rows: predicted classes.
| mRNA | miRNA | |||
|---|---|---|---|---|
| HCC | hc | HCC | hc | |
| HCC | 20 | 0 | 64 | 0 |
| hc | 4 | 25 | 4 | 21 |
| ( | (4, | (10, *, | ||
| NSCLC | hc | NSCLC | hc | |
| NSCLC | 46 | 0 | 171 | 12 |
| hc | 0 | 45 | 16 | 162 |
| ( | (2,*, | (5, *, | ||
| ESCC | hc | ESCC | hc | |
| ESCC | 28 | 2 | 63 | 11 |
| hc | 2 | 28 | 13 | 65 |
| ( | (2, | (6, *, | ||
| Pancreatic cancer | hc | Pancreatic cancer | hc | |
| Pancreatic cancer | 139 | 4 | 22 | 3 |
| hc | 11 | 25 | 5 | 24 |
| ( | (8, *, | (4, | ||
| Colorectal cancer | hc | Colon cancer | hc | |
| Colon/Colorectal cancer | 178 | 5 | 27 | 3 |
| hc | 8 | 49 | 3 | 27 |
| ( | (8, *, | (4, | ||
| Breast cancer | hc | Breast cancer | hc | |
| Breast cancer | 110 | 0 | 169 | 5 |
| hc | 0 | 11 | 4 | 11 |
| ( | (3, | (18, | ||