Literature DB >> 30016722

An unsupervised machine learning method for discovering patient clusters based on genetic signatures.

Christian Lopez1, Scott Tucker2, Tarik Salameh3, Conrad Tucker4.   

Abstract

INTRODUCTION: Many chronic disorders have genomic etiology, disease progression, clinical presentation, and response to treatment that vary on a patient-to-patient basis. Such variability creates a need to identify characteristics within patient populations that have clinically relevant predictive value in order to advance personalized medicine. Unsupervised machine learning methods are suitable to address this type of problem, in which no a priori class label information is available to guide this search. However, it is challenging for existing methods to identify cluster memberships that are not just a result of natural sampling variation. Moreover, most of the current methods require researchers to provide specific input parameters a priori.
METHOD: This work presents an unsupervised machine learning method to cluster patients based on their genomic makeup without providing input parameters a priori. The method implements internal validity metrics to algorithmically identify the number of clusters, as well as statistical analyses to test for the significance of the results. Furthermore, the method takes advantage of the high degree of linkage disequilibrium between single nucleotide polymorphisms. Finally, a gene pathway analysis is performed to identify potential relationships between the clusters in the context of known biological knowledge. DATASETS AND
RESULTS: The method is tested with a cluster validation and a genomic dataset previously used in the literature. Benchmark results indicate that the proposed method provides the greatest performance out of the methods tested. Furthermore, the method is implemented on a sample genome-wide study dataset of 191 multiple sclerosis patients. The results indicate that the method was able to identify genetically distinct patient clusters without the need to select parameters a priori. Additionally, variants identified as significantly different between clusters are shown to be enriched for protein-protein interactions, especially in immune processes and cell adhesion pathways, via Gene Ontology term analysis.
CONCLUSION: Once links are drawn between clusters and clinically relevant outcomes, Immunochip data can be used to classify high-risk and newly diagnosed chronic disease patients into known clusters for predictive value. Further investigation can extend beyond pathway analysis to evaluate these clusters for clinical significance of genetically related characteristics such as age of onset, disease course, heritability, and response to treatment.
Copyright © 2018 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Clustering analysis; Genomic similarity; Multiple sclerosis; Unsupervised machine learning

Mesh:

Year:  2018        PMID: 30016722      PMCID: PMC6621561          DOI: 10.1016/j.jbi.2018.07.004

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  49 in total

1.  IntAct: an open source molecular interaction database.

Authors:  Henning Hermjakob; Luisa Montecchi-Palazzi; Chris Lewington; Sugath Mudali; Samuel Kerrien; Sandra Orchard; Martin Vingron; Bernd Roechert; Peter Roepstorff; Alfonso Valencia; Hanah Margalit; John Armstrong; Amos Bairoch; Gianni Cesareni; David Sherman; Rolf Apweiler
Journal:  Nucleic Acids Res       Date:  2004-01-01       Impact factor: 16.971

Review 2.  Clustering algorithms in biomedical research: a review.

Authors:  Rui Xu; Donald C Wunsch
Journal:  IEEE Rev Biomed Eng       Date:  2010

3.  Some new indexes of cluster validity.

Authors:  J C Bezdek; N R Pal
Journal:  IEEE Trans Syst Man Cybern B Cybern       Date:  1998

4.  Optimization by simulated annealing.

Authors:  S Kirkpatrick; C D Gelatt; M P Vecchi
Journal:  Science       Date:  1983-05-13       Impact factor: 47.728

5.  An ImmunoChip study of multiple sclerosis risk in African Americans.

Authors:  Noriko Isobe; Lohith Madireddy; Pouya Khankhanian; Takuya Matsushita; Stacy J Caillier; Jayaji M Moré; Pierre-Antoine Gourraud; Jacob L McCauley; Ashley H Beecham; Laura Piccio; Joseph Herbert; Omar Khan; Jeffrey Cohen; Lael Stone; Adam Santaniello; Bruce A C Cree; Suna Onengut-Gumuscu; Stephen S Rich; Stephen L Hauser; Stephen Sawcer; Jorge R Oksenberg
Journal:  Brain       Date:  2015-03-28       Impact factor: 13.501

Review 6.  Biclustering on expression data: A review.

Authors:  Beatriz Pontes; Raúl Giráldez; Jesús S Aguilar-Ruiz
Journal:  J Biomed Inform       Date:  2015-07-06       Impact factor: 6.317

7.  Comparative pharmacogenetics of multiple sclerosis: IFN-β versus glatiramer acetate.

Authors:  Olga G Kulakova; Ekaterina Yu Tsareva; Dmitrijs Lvovs; Alexander V Favorov; Alexey N Boyko; Olga O Favorova
Journal:  Pharmacogenomics       Date:  2014-04       Impact factor: 2.533

8.  Integrative clustering of multiple genomic data types using a joint latent variable model with application to breast and lung cancer subtype analysis.

Authors:  Ronglai Shen; Adam B Olshen; Marc Ladanyi
Journal:  Bioinformatics       Date:  2009-09-16       Impact factor: 6.937

9.  Comparison of clustering methods for investigation of genome-wide methylation array data.

Authors:  Harry Clifford; Frank Wessely; Satish Pendurthi; Richard D Emes
Journal:  Front Genet       Date:  2011-12-07       Impact factor: 4.599

10.  Exploration of machine learning techniques in predicting multiple sclerosis disease course.

Authors:  Yijun Zhao; Brian C Healy; Dalia Rotstein; Charles R G Guttmann; Rohit Bakshi; Howard L Weiner; Carla E Brodley; Tanuja Chitnis
Journal:  PLoS One       Date:  2017-04-05       Impact factor: 3.240

View more
  17 in total

Review 1.  Artificial intelligence in radiotherapy.

Authors:  Sarkar Siddique; James C L Chow
Journal:  Rep Pract Oncol Radiother       Date:  2020-05-06

Review 2.  Precision medicine to prevent glaucoma-related blindness.

Authors:  Sayoko E Moroi; David M Reed; David S Sanders; Ahmed Almazroa; Lawrence Kagemann; Neil Shah; Nakul Shekhawat; Julia E Richards
Journal:  Curr Opin Ophthalmol       Date:  2019-05       Impact factor: 3.761

Review 3.  Gut microbiome, big data and machine learning to promote precision medicine for cancer.

Authors:  Giovanni Cammarota; Gianluca Ianiro; Anna Ahern; Carmine Carbone; Andriy Temko; Marcus J Claesson; Antonio Gasbarrini; Giampaolo Tortora
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2020-07-09       Impact factor: 46.802

4.  Machine Learning for Detection of Correct Peripherally Inserted Central Catheter Tip Position from Radiology Reports in Infants.

Authors:  Manan Shah; Derek Shu; V B Surya Prasath; Yizhao Ni; Andrew H Schapiro; Kevin R Dufendach
Journal:  Appl Clin Inform       Date:  2021-09-08       Impact factor: 2.762

Review 5.  Predicting Major Adverse Cardiovascular Events in Acute Coronary Syndrome: A Scoping Review of Machine Learning Approaches.

Authors:  Sara Chopannejad; Farahnaz Sadoughi; Rafat Bagherzadeh; Sakineh Shekarchi
Journal:  Appl Clin Inform       Date:  2022-05-26       Impact factor: 2.762

Review 6.  Applications of machine learning to diagnosis and treatment of neurodegenerative diseases.

Authors:  Monika A Myszczynska; Poojitha N Ojamies; Alix M B Lacoste; Daniel Neil; Amir Saffari; Richard Mead; Guillaume M Hautbergue; Joanna D Holbrook; Laura Ferraiuolo
Journal:  Nat Rev Neurol       Date:  2020-07-15       Impact factor: 42.937

7.  Machine Learning Driven Profiling of Cerebrospinal Fluid Core Biomarkers in Alzheimer's Disease and Other Neurological Disorders.

Authors:  Giovanni Bellomo; Antonio Indaco; Davide Chiasserini; Emanuela Maderna; Federico Paolini Paoletti; Lorenzo Gaetani; Silvia Paciotti; Maya Petricciuolo; Fabrizio Tagliavini; Giorgio Giaccone; Lucilla Parnetti; Giuseppe Di Fede
Journal:  Front Neurosci       Date:  2021-03-31       Impact factor: 4.677

Review 8.  A systematic review of the applications of artificial intelligence and machine learning in autoimmune diseases.

Authors:  I S Stafford; M Kellermann; E Mossotto; R M Beattie; B D MacArthur; S Ennis
Journal:  NPJ Digit Med       Date:  2020-03-09

9.  Characterisation, identification, clustering, and classification of disease.

Authors:  A J Webster; K Gaitskell; I Turnbull; B J Cairns; R Clarke
Journal:  Sci Rep       Date:  2021-03-08       Impact factor: 4.379

10.  Machine Learning Demonstrates High Accuracy for Disease Diagnosis and Prognosis in Plastic Surgery.

Authors:  Angelos Mantelakis; Yannis Assael; Parviz Sorooshian; Ankur Khajuria
Journal:  Plast Reconstr Surg Glob Open       Date:  2021-06-24
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.