| Literature DB >> 24892101 |
Milan Vukićević1, Sandro Radovanović1, Miloš Milovanović1, Miroslav Minović1.
Abstract
Rapid growth and storage of biomedical data enabled many opportunities for predictive modeling and improvement of healthcare processes. On the other side analysis of such large amounts of data is a difficult and computationally intensive task for most existing data mining algorithms. This problem is addressed by proposing a cloud based system that integrates metalearning framework for ranking and selection of best predictive algorithms for data at hand and open source big data technologies for analysis of biomedical data.Entities:
Mesh:
Year: 2014 PMID: 24892101 PMCID: PMC4032768 DOI: 10.1155/2014/859279
Source DB: PubMed Journal: ScientificWorldJournal ISSN: 1537-744X
Figure 1Projected growth of DNA sequence data in the 21st century [7].
Sub-problems and RCs for generic clustering algorithm design.
| Sub-problem | Reusable components |
|---|---|
| Initialize representatives | DIANA, RANDOM, XMEANS, GMEANS, PCA, KMEANS++, SPSS |
|
| |
| Measure distance | EUCLIDEAN, CITY, CORREL, COSINE |
|
| |
| Update representatives | MEAN, MEDIAN, ONLINE |
|
| |
| Evaluate clusters | AIC, BIC, SILHOU, COMPACT, XB, CONN |
Figure 2Extended metalearning system for clustering biomedical data.
Meta-algorithm performance.
| Algorithm/Error | RMSE | MAE |
|---|---|---|
| RBFN | 0.143 | 0.109 (±0.092) |
| LR | 0.111 | 0.086 (±0.070) |
| LMSR | 0.265 | 0.094 (±0.248) |
| NN | 0.101 | 0.064 (±0.078) |
| SVM |
|
|
Figure 3Main process for finding the best model for prediction of AMI.
Figure 4Stream for application of metalearning system on new cases.
Figure 5Cloud based system [16] integrated with KaaS extension [17] for analysis of biomedical data.
Figure 6Cloud based system for predictive modeling of biomedical data.