| Literature DB >> 23874370 |
Y-h Taguchi1, Yoshiki Murakami.
Abstract
The discovery and characterization of blood-based disease biomarkers are clinically important because blood collection is easy and involves relatively little stress for the patient. However, blood generally reflects not only targeted diseases, but also the whole body status of patients. Thus, the selection of biomarkers may be difficult. In this study, we considered miRNAs as biomarker candidates for several reasons. First, since miRNAs were discovered relatively recently, they have not yet been tested extensively. Second, since the number of miRNAs is relatively limited, selection is expected to be easy. Third, since they are known to play critical roles in a wide range of biological processes, their expression may be disease specific. We applied a newly proposed method to select combinations of miRNAs that discriminate between healthy controls and each of 14 diseases that include 5 cancers. A new feature selection method is based on principal component analysis. Namely this method does not require knowledge of whether each sample was derived from a disease patient or a healthy control. Using this method, we found that hsa-miR-425, hsa-miR-15b, hsa-miR-185, hsa-miR-92a, hsa-miR-140-3p, hsa-miR-320a, hsa-miR-486-5p, hsa-miR-16, hsa-miR-191, hsa-miR-106b, hsa-miR-19b, and hsa-miR-30d were potential biomarkers; combinations of 10 of these miRNAs allowed us to discriminate each disease included in this study from healthy controls. These 12 miRNAs are significantly up- or downregulated in most cancers and other diseases, albeit in a cancer- or disease-specific combinatory manner. Therefore, these 12 miRNAs were also previously reported to be cancer- and disease-related miRNAs. Many disease-specific KEGG pathways were also significantly enriched by target genes of up-/downregulated miRNAs within several combinations of 10 miRNAs among these 12 miRNAs. We also selected miRNAs that could discriminate one disease from another or from healthy controls. These miRNAs were found to be largely overlapped with miRNAs that discriminate each disease from healthy controls.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23874370 PMCID: PMC3715582 DOI: 10.1371/journal.pone.0066714
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Performance of several feature extraction methods for scenario III.
| Accuracy | # of miRNAs | ||||||
|
|
|
| PCA | SAM |
| PCA | SAM |
| 2.0 | 0.5 | 0.99 | 0.99 | 0.99 | 9.0 | 8.7 | 8.0 |
| 2.0 | 1.0 | 0.95 | 0.95 | 0.95 | 8.1 | 7.9 | 7.5 |
| 2.0 | 1.5 | 0.88 | 0.88 | 0.88 | 7.7 | 9.2 | 7.5 |
| 2.0 | 2.0 | 0.83 | 0.82 | 0.82 | 6.8 | 9.3 | 6.9 |
| 1.5 | 0.5 | 0.98 | 0.98 | 0.98 | 8.7 | 8.0 | 7.2 |
| 1.5 | 1.0 | 0.90 | 0.88 | 0.90 | 7.9 | 8.0 | 7.1 |
| 1.5 | 1.5 | 0.82 | 0.81 | 0.82 | 7.1 | 8.5 | 6.7 |
| 1.5 | 2.0 | 0.77 | 0.76 | 0.77 | 6.4 | 8.9 | 6.5 |
| 1.0 | 0.5 | 0.95 | 0.89 | 0.94 | 8.2 | 6.2 | 6.5 |
| 1.0 | 1.0 | 0.82 | 0.75 | 0.81 | 6.9 | 6.4 | 6.0 |
| 1.0 | 1.5 | 0.71 | 0.66 | 0.71 | 6.2 | 7.1 | 5.9 |
| 1.0 | 2.0 | 0.66 | 0.63 | 0.66 | 5.4 | 8.1 | 5.4 |
| 0.5 | 0.5 | 0.82 | 0.50 | 0.80 | 6.9 | 0.0 | 4.5 |
| 0.5 | 1.0 | 0.66 | 0.50 | 0.65 | 5.3 | 0.1 | 3.9 |
| 0.5 | 1.5 | 0.59 | 0.50 | 0.57 | 3.9 | 2.1 | 3.5 |
| 0.5 | 2.0 | 0.55 | 0.51 | 0.55 | 3.5 | 4.4 | 3.4 |
Accuracy and the number of correctly selected miRNAs among 10 miRNAs with distinct expression between the 2 classes (averaged over 100 trials) for t-test-, PCA-, and SAM-based feature extractions. Scenario III was employed. Upper rows indicate easier classification problems. and represent the amplitudes of mean and standard deviation of the first 10 miRNAs that exhibit distinct expression between the 2 categories.
miRNAs selected to distinguish patients with cancers or other diseases from healthy controls by PCA-based feature extraction.
| lung cancer | other pancreatic tumors and diseases | pancreatitis | ovarian cancer | COPD (chronic obstructive pulmonary disease) | ductal pancreatic cancer | gastric cancer | sarcoidosis | prostate cancer | acute myocardial infarction | periodontitis | multiple sclerosis | melanoma | Wilm’s tumor | |||
| A | hsa-miR-425 | + | + | + | − | + | + | − | + | + | − | − | − | − | − | |
| A | hsa-miR-191 | + | + | + | − | + | + | + | - | + | − | − |
| + | − | |
| hsa-miR-185 | − | − | − | − | − | − | − | + | − | − | − | − | + | − | ||
| hsa-miR-140-3p | + | + | + | + | + | − | + | + | + | − | + | + | + | + | ||
| B | hsa-miR-15b | − | − | − | + | + | + | − | − | − | + | − | − | − | + | |
| B | hsa-miR-16 | − | + | + | + | + | + | + | − | + | − | + | + | − | − | |
| hsa-miR-320a | + | − | − | − | + | − | − | + | + | + | + | + | + | + | ||
| hsa-miR-486-5p | − | + | + | − | − | + | − | + | − | + | − | − | − | − | ||
| C | hsa-miR-92a | ± | + | − | + | − | ± | + | + | − | + | − | + | ± | + | |
| C | hsa-miR-19b | ± |
|
|
|
| ± |
|
|
|
|
| - | ± |
| |
| hsa-miR-106b |
| + | + | − | + |
| − |
| − | − | − | − |
| − | ||
| hsa-miR-30d |
|
|
|
|
|
|
| + |
|
|
|
|
|
| ||
+ (−) indicates that the miRNA was expressed in patients with cancers or other diseases (healthy controls).
indicates that the miRNA was not selected within the top 10 most significant miRNAs contributing to discrimination. A–C: miRNAs belonging to common clusters, which were defined by an inter-miRNA distance of 1 kbp. Coincidence within clusters A and C are underlined. See Fig. 5 for actual amount of expression/suppression.
Performance of PCA-based LDA for discrimination between patients with cancers or other diseases and healthy controls.
| cancer orother disease | PC | Accuracy | Specificity | Sensitivity | Precision |
| Lung cancer | 5 | 0.784 (+) | 0.800 (+) | 0.750 (+) | 0.632 |
| Other pancreatic tumors and diseases | 7 | 0.814 (+) | 0.771 | 0.875 (+) | 0.724 |
| Pancreatitis | 8 | 0.833 (+) | 0.786 (−) | 0.921 (+) | 0.700 |
| Ovarian cancer | 6 | 0.800 | 0.786 (−) | 0.867 (+) | 0.464 |
| COPD | 2 | 0.713 (−) | 0.671 (−) | 0.833 (+) | 0.465 |
| Ductal pancreatic cancer | 2 | 0.765 (−) | 0.743 (−) | 0.800 (+) | 0.667 |
| Gastric cancer | 9 | 0.855 (+) | 0.857 (+) | 0.846 | 0.524 |
| Sarcoidosis | 10 | 0.835 (−) | 0.800 (+) | 0.889 (−) | 0.741 |
| Prostate cancer | 5 | 0.806 (+) | 0.800 (+) | 0.826 (+) | 0.576 |
| Acute myocardial infarction | 7 | 0.789 (−) | 0.900 | 0.757 (−) | 0.964 |
| Periodontitis | 10 | 0.807 (+) | 0.814 (+) | 0.778 (−) | 0.519 |
| Multiple sclerosis | 10 | 0.892 (+) | 0.871 (+) | 0.957 (+) | 0.710 |
| Melanoma | 10 | 0.867 (−) | 0.857 (+) | 0.886 (−) | 0.756 |
| Wilm’s tumor | 7 | 0.867 | 0.886 | 0.600 | 0.273 |
+ (−) indicates that the performance was better (worse) than that of Keller et al [13]. PC is the number of PCs used for PCA-based LDA. LOOCV was applied. See the Table in the study by Keller et al [13] on page 14 of the Supplementary Materials.
Figure 1Individual contributions of miRNAs to discrimination between patients with cancers or other diseases and normal controls.
The height of the bars indicates the amount of contribution from each miRNA in discriminating patients with cancers or other diseases from healthy controls. A positive (negative) value indicates that the miRNA was expressed in patients with cancer and other diseases (healthy controls). The order of cancers or other diseases is the same as that in Table 2 (top to bottom): lung cancer (black), other pancreatic tumors and diseases (red), pancreatitis (green), ovarian cancer (blue), COPD (cyan), ductal pancreatic cancer (pink), gastric cancer (yellow), sarcoidosis (grey), prostate cancer (black), acute myocardial infarction (red), periodontitis (green), multiple sclerosis (blue), melanoma (cyan), and Wilm’s tumor (pink).
miRNAs in Table 2 whose up- and/or downregulation in any cancer was reported in the study by Bandyopadhyay et al. [41].
| miRNA | Cancer type | Expression | Mean fold change |
| hsa-miR-425 | Central nervous system | Downregulated | 13.6-fold reduction |
| hsa-miR-15b | Colon | Upregulated | 1.5-fold increase |
| hsa-miR-185 | Bladder (urothelial) | Upregulated | 1.30-fold increase |
| hsa-miR-185 | Kidney | Upregulated | 1.42-fold increase |
| hsa-miR-92-2 | Pancreas | Upregulated | |
| hsa-miR-92-2 | Prostate | Upregulated | |
| hsa-miR-140 | Central nervous system | Downregulated | 2.7-fold reduction |
| hsa-miR-140 | Colon | Downregulated | 11.4-fold reduction |
| hsa-miR-140 | Hematologic | Downregulated | 3.5-fold reduction |
| hsa-miR-140 | Lung | Downregulated | |
| hsa-miR-140 | Ovary | Downregulated | 3.51-fold reduction |
| hsa-miR-16-1 | Uterus/endometrial _cancer | Upregulated | At least 2-fold increase |
| hsa-miR-16-2 | B cell CLL | Downregulated/Deleted | |
| hsa-miR-16a | B cell CLL | Downregulated | |
| hsa-miR-191 | Breast | Upregulated | |
| hsa-miR-191 | Central nervous system | Downregulated | 4.4-fold reduction |
| hsa-miR-191 | Colon | Upregulated | 1.4-fold increase |
| hsa-miR-191 | Lung | Upregulated | |
| hsa-miR-106b | Lung | Upregulated | 12-fold increase in small lung cancer cell line SKLC-2. |
| hsa-miR-30d | Central nervous system | Downregulated | 3.2-fold reduction |
Any miRNAs listed in the Additional File in the study by Bandyopadhyay et al. [41].
Cancer-specific KEGG pathways enriched in miRNA target genes.
| DIANA-mirpath | Starbase | ||||
| –log10
| adjusted | ||||
| up | down | up | Down | ||
| KEGG ID | description | Lung cancer | |||
| hsa05223 | Non-small cell lung cancer | 3.55 | 7.31 | 4.26e-02 | – |
| hsa05222 | Small cell lung cancer | 2.9 | 3.64 | – | – |
| hsa05200 | Pathways in cancer | – | – | – | 5.69e-02 |
| KEGG ID | description | Ductal pancreatic cancer | |||
| hsa05212 | Pancreatic cancer | 9.71 | 4.66 | 1.12e-02 | – |
| hsa05200 | Pathways in cancer | – | – | 1.10e-02 | – |
| KEGG ID | description | Pancreatitis | |||
| hsa05212 | Pancreatic cancer | 13.28 | 12.36 | 2.14e-05 | 1.10e-02 |
| hsa05200 | Pathways in cancer | – | – | 5.11e-05 | 1.27e-03 |
| KEGG ID | description | Other pancreatic tumors and diseases | |||
| hsa05212 | Pancreatic cancer | 11.93 | 15.51 | 2.20 e-04 | 1.01e-03 |
| hsa05200 | Pathways in cancer | – | – | 3.67 e-05 | 2.90e-03 |
| KEGG ID | description | Prostate cancer | |||
| hsa05200 | Pathways in cancer | – | 1.53e-04 | 1.44e-02 | |
| hsa05215 | Prostate cancer | 8.12 | 8.98 | – | 4.04e-04 |
| KEGG ID | description | Melanoma | |||
| hsa05218 | Melanoma | 6.32 | 9.21 | – | – |
A list of cancer-specific KEGG pathways enriched in up- and/or downregulated miRNA target genes between normal controls and corresponding cancer patients. DIANA-mirpath gave -values while Starbase gave adjusted -values.
miRNAs used for discrimination between diseases.
| 1 | 2 | 3 | 4 | 5 | 6 | 7 | 8 | 9 | 10 | 11 | 12 | 13 | 14 | |
| Lung cancer | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | |||
| Multiple selerosis | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | ||
| Other pancreatic tumors and diseases | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 320a | 425 | 486-5p | 92a | |||
| Pancreatitis | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | ||
| Ductal pancreatic cancer | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 320a | 425 | 486-5p | 92a | |||
| Gastric cancer | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b |
| 30d | 320a | 425 | 486-5p | 92a | |
| Sarcoidosis | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p |
| 92a | ||
| Melanoma | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | |||
| Wilm’s tumor |
| 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p |
| 92a |
| Prostate cancer | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | ||
| Acute myocardial infarction | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a | ||
| Periodontitis | 15b | 185 | 140-3p | 320a | 486-5p | 16 | 92a | 425 | 106b | 191 | 19b |
| 30d | |
| Ovarian cancer | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b |
| 30d | 320a | 425 | 486-5p | 92a | |
| COPD | 106b | 140-3p | 15b | 16 | 185 | 191 | 19b | 30d | 320a | 425 | 486-5p | 92a |
Each row lists the miRNAs used for discrimination between the diseases. A set of 10 miRNAs among the miRNAs listed in each row was used for discrimination between the disease shown in the left most column and any of other 13 diseases or normal control. Since 10 miRNAs were selected for each of 14 discrimination analyses, a total of 140 miRNAs were selected as biomarkers. However, there are at most 14 miRNAs listed in each row. In addition to this, miRNAs shown in each row overlapped significantly with each other. This means that miRNAs to be used as biomarkers to discriminate between diseases are highly disease-independent. More detailed information about which 10 miRNAs discriminated between each pair of diseases or control/disease can be found in Table S2. All miRNAs excluding the mRNAs underlined are also in Table 2.