| Literature DB >> 31178711 |
Hany Alashwal1, Mohamed El Halaby2, Jacob J Crouse3, Areeg Abdalla2, Ahmed A Moustafa4.
Abstract
Clustering is a powerful machine learning tool for detecting structures in datasets. In the medical field, clustering has been proven to be a powerful tool for discovering patterns and structure in labeled and unlabeled datasets. Unlike supervised methods, clustering is an unsupervised method that works on datasets in which there is no outcome (target) variable nor is anything known about the relationship between the observations, that is, unlabeled data. In this paper, we focus on studying and reviewing clustering methods that have been applied to datasets of neurological diseases, especially Alzheimer's disease (AD). The aim is to provide insights into which clustering technique is more suitable for partitioning patients of AD based on their similarity. This is important as clustering algorithms can find patterns across patients that are difficult for medical practitioners to find. We further discuss the implications of the use of clustering algorithms in the treatment of AD. We found that clustering analysis can point to several features that underlie the conversion from early-stage AD to advanced AD. Furthermore, future work can apply semi-clustering algorithms on AD datasets, which will enhance clusters by including additional information.Entities:
Keywords: Alzheimer’s disease; clustering; machine learning techniques; neurological diseases; unsupervised learning
Year: 2019 PMID: 31178711 PMCID: PMC6543980 DOI: 10.3389/fncom.2019.00031
Source DB: PubMed Journal: Front Comput Neurosci ISSN: 1662-5188 Impact factor: 2.380
Types of machine learning methods.
| Type of data | Data points have labels. | Data points do not have corresponding labels. | A subset of the data points is labeled. |
| Learning process | Analyzing the training data to learn a function that can be used for predicting the labels of new examples. | Modeling the structure or the distribution of the data in order to find patterns and gain new insights from the data. | Utilizing unlabeled data with labeled data to learn better models. |
| Applications | Fraud detection, detecting spam emails, predicting real estate prices. | Clustering customers' data and market segmentation, learning rule associations, image segmentation, gene clustering. | When it is expensive to annotate every data point (e.g., using humans), this type of learning is suitable. Examples: web content classification, medical predictions. |
Firstly, the nature of the data is stated, then the objective of the type of learning is discussed, and finally some real-world examples are mentioned.
Figure 1A summary of the number of articles and their corresponding year of publication.
Figure 2The frequency of usage of clustering algorithms on Alzheimer’s disease data.