| Literature DB >> 24498150 |
Li Li1, Chang Liu1, Fang Wang1, Wei Miao1, Jie Zhang1, Zhiqian Kang1, Yihan Chen2, Luying Peng1.
Abstract
It has become increasingly clear that the current taxonomy of clinical phenotypes is mixed with molecular heterogeneity, which potentially affects the treatment effect for involved patients. Defining the hidden molecular-distinct diseases using modern large-scale genomic approaches is therefore useful for refining clinical practice and improving intervention strategies. Given that microRNA expression profiling has provided a powerful way to dissect hidden genetic heterogeneity for complex diseases, the aim of the study was to develop a bioinformatics approach that identifies microRNA features leading to the hidden subtyping of complex clinical phenotypes. The basic strategy of the proposed method was to identify optimal miRNA clusters by iteratively partitioning the sample and feature space using the two-ways super-paramagnetic clustering technique. We evaluated the obtained optimal miRNA cluster by determining the consistency of co-expression and the chromosome location among the within-cluster microRNAs, and concluded that the optimal miRNA cluster could lead to a natural partition of disease samples. We applied the proposed method to a publicly available microarray dataset of breast cancer patients that have notoriously heterogeneous phenotypes. We obtained a feature subset of 13 microRNAs that could classify the 71 breast cancer patients into five subtypes with significantly different five-year overall survival rates (45%, 82.4%, 70.6%, 100% and 60% respectively; p = 0.008). By building a multivariate Cox proportional-hazards prediction model for the feature subset, we identified has-miR-146b as one of the most significant predictor (p = 0.045; hazard ratios = 0.39). The proposed algorithm is a promising computational strategy for dissecting hidden genetic heterogeneity for complex diseases, and will be of value for improving cancer diagnosis and treatment.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24498150 PMCID: PMC3907466 DOI: 10.1371/journal.pone.0087601
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1The graphic algorithm flow for the proposed SPC-based two-way clustering.
miRNA clusters using CTWC
| Objects | Clusters | ||||||||||||||||||
| mG1(s1) | mG2 | mG3 | mG4 | mG5 | mG6 | mG7 | mG8 | mG9 | mG10 | mG11 | |||||||||
| mG1(S2) | mG12 | mG13 | mG14 | mG15 | mG16 | mG17 | mG18 | mG19 | mG20 | mG21 | mG22 | mG23 | mG24 | mG25 | mG26 | mG27 | mG28 | mG29 | |
| mG1(S3) | mG30 | mG31 | mG32 | mG33 | mG34 | mG35 | mG36 | mG37 | mG38 | mG39 | mG40 | ||||||||
| mG1(S4) | mG41 | mG42 | mG43 | mG44 | mG45 | mG46 | mG47 | mG48 | |||||||||||
| mG1(S5) | mG49 | mG50 | mG51 | mG52 | mG53 | mG54 | mG55 | ||||||||||||
| mG1(S6) | mG56 | mG57 | mG58 | mG59 | mG60 | mG61 | |||||||||||||
| mG1(S7) | mG62 | mG63 | mG64 | mG65 | mG66 | mG67 | mG68 | mG69 | mG70 | mG71 | mG72 | mG73 | mG74 | mG75 | mG76 | ||||
Figure 2The five partitions of breast cancer were identified using as the disease feature set in the breast cancer dataset.
In the figure, each microRNA corresponds to a row, and each breast cancer sample corresponds to column. The 71 breast cancer samples were divided into five subtypes (Subtype 1, Subtype 2, Subtype 3, Subtype 4 and Subtype 5). Red areas indicate increased expression, and green areas decreased expression. Each column represents a single breast cancer sample, and each row represents a single microRNA. The dendrogram at the top shows the degree to which each breast cancer subtype is related to the others with respect to microRNA expression.
Figure 3Survival curves for five subtypes of the breast cancer patients in the breast cancer dataset.
Multivariate Cox proportional-hazards analysis based on the signature microRNAs relevant to survival time.
| Variable | Estimated coefficient | Wald | p value | Hazard ratio(95%CI) |
| hsa-miR-146b | -3.232 | 4.021 | 0.045 | 0.039(.002–.930) |