Multi-label classification of chronically ill patients with bag of words and supervised dimensionality reduction algorithms.

Stefano Bromuri, Damien Zufferey, Jean Hennebert, Michael Schumacher.

Abstract

OBJECTIVE: This research is motivated by the issue of classifying illnesses of chronically ill patients for decision support in clinical settings. Our main objective is to propose multi-label classification of multivariate time series contained in medical records of chronically ill patients, by means of quantization methods, such as bag of words (BoW), and multi-label classification algorithms. Our second objective is to compare supervised dimensionality reduction techniques to state-of-the-art multi-label classification algorithms. The hypothesis is that kernel methods and locality preserving projections make such algorithms good candidates to study multi-label medical time series.
METHODS: We combine BoW and supervised dimensionality reduction algorithms to perform multi-label classification on health records of chronically ill patients. The considered algorithms are compared with state-of-the-art multi-label classifiers on two real-world datasets. The Portavita dataset contains 525 diabetes type 2 (DT2) patients, with co-morbidities of DT2 such as hypertension, dyslipidemia, and microvascular or macrovascular issues. The MIMIC II dataset contains 2635 patients affected by thyroid disease, diabetes mellitus, lipoid metabolism disease, fluid electrolyte disease, hypertensive disease, thrombosis, hypotension, chronic obstructive pulmonary disease (COPD), liver disease and kidney disease. The algorithms are evaluated using multi-label evaluation metrics such as hamming loss, one error, coverage, ranking loss, and average precision.
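For readers unfamiliar with the evaluation metrics named above, the following is a minimal sketch of four of them using their common textbook definitions. It is not taken from the paper: the function names and the toy data are illustrative assumptions, every patient is assumed to have at least one true and one false label, and average precision is omitted for brevity.

```python
from typing import List, Sequence

Labels = Sequence[Sequence[int]]    # one row of 0/1 labels per patient
Scores = Sequence[Sequence[float]]  # one row of per-label scores per patient


def hamming_loss(y_true: Labels, y_pred: Labels) -> float:
    """Fraction of (patient, label) slots that are predicted wrongly."""
    total = sum(len(row) for row in y_true)
    wrong = sum(t != p for tr, pr in zip(y_true, y_pred) for t, p in zip(tr, pr))
    return wrong / total


def one_error(y_true: Labels, scores: Scores) -> float:
    """Fraction of patients whose highest-scored label is not a true label."""
    misses = 0
    for truth, s in zip(y_true, scores):
        top = max(range(len(s)), key=lambda j: s[j])
        misses += truth[top] == 0
    return misses / len(y_true)


def coverage(y_true: Labels, scores: Scores) -> float:
    """Average 0-based rank of the lowest-ranked true label."""
    total = 0
    for truth, s in zip(y_true, scores):
        order = sorted(range(len(s)), key=lambda j: -s[j])
        rank = {label: r for r, label in enumerate(order)}
        total += max(rank[j] for j, t in enumerate(truth) if t)
    return total / len(y_true)


def ranking_loss(y_true: Labels, scores: Scores) -> float:
    """Average fraction of (true, false) label pairs ranked in the wrong order."""
    total = 0.0
    for truth, s in zip(y_true, scores):
        pos = [j for j, t in enumerate(truth) if t]
        neg = [j for j, t in enumerate(truth) if not t]
        bad = sum(s[p] <= s[n] for p in pos for n in neg)
        total += bad / (len(pos) * len(neg))
    return total / len(y_true)


# Hypothetical toy data: 2 patients, 3 candidate diagnoses each.
toy_truth: List[List[int]] = [[1, 0, 1], [0, 1, 0]]
toy_scores: List[List[float]] = [[0.9, 0.4, 0.2], [0.3, 0.2, 0.6]]
# Hard predictions obtained by thresholding the scores at 0.5 (an assumption).
toy_pred: List[List[int]] = [[int(v >= 0.5) for v in row] for row in toy_scores]
```

Lower is better for all four metrics. Hamming loss uses hard predictions, while the other three depend only on how the labels are ranked by score.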
RESULTS: Non-linear dimensionality reduction approaches behave well on medical time series quantized using the BoW algorithm, with results comparable to state-of-the-art multi-label classification algorithms. Chaining the projected features has a positive impact on the performance of the algorithm with respect to pure binary relevance approaches.
CONCLUSIONS: The evaluation highlights the feasibility of representing medical health records using the BoW for multi-label classification tasks. The study also highlights that dimensionality reduction algorithms based on kernel methods, locality preserving projections or both are good candidates to deal with multi-label classification tasks in medical time series with many missing values and high label density.
Copyright © 2014 Elsevier Inc. All rights reserved.
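As an illustration of the bag-of-words idea the abstract relies on, the sketch below quantizes a single univariate time series into a histogram of codeword counts. It is a simplified assumption, not the authors' pipeline: the abstract does not specify window length, codebook construction, or how missing values are handled. Here the codebook is given by hand (in practice it would typically be learned by clustering, for example with k-means), and windows containing a missing value are simply skipped.

```python
import math
from typing import List, Optional, Sequence


def sliding_windows(series: Sequence[Optional[float]], width: int) -> List[list]:
    """All contiguous sub-sequences of the given width."""
    return [list(series[i:i + width]) for i in range(len(series) - width + 1)]


def nearest_codeword(window: Sequence[float], codebook: Sequence[Sequence[float]]) -> int:
    """Index of the codeword closest to the window in Euclidean distance."""
    return min(range(len(codebook)), key=lambda k: math.dist(window, codebook[k]))


def bow_histogram(
    series: Sequence[Optional[float]],
    codebook: Sequence[Sequence[float]],
    width: int,
) -> List[float]:
    """Normalized histogram of codeword occurrences over the series."""
    counts = [0] * len(codebook)
    for window in sliding_windows(series, width):
        # Assumption: windows with any missing measurement are ignored.
        if any(value is None for value in window):
            continue
        counts[nearest_codeword(window, codebook)] += 1
    total = sum(counts)
    return [c / total for c in counts] if total else [0.0] * len(codebook)


# Hypothetical example: a lab measurement with one missing value (None).
example_series = [5.0, 5.2, None, 7.9, 8.1, 8.0]
example_codebook = [[5.0, 5.0], [8.0, 8.0]]  # hand-picked "low" and "high" patterns
example_histogram = bow_histogram(example_series, example_codebook, width=2)
```

The resulting fixed-length histogram can be fed to any multi-label classifier regardless of how long or irregular the original record was, which is the property that makes such a representation attractive for records with many missing values.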

Keywords:  Clinical data; Complex patient; Diabetes type 2; Dimensionality reduction; Kernel methods; Multi-label classification

Year:  2014        PMID: 24879897     DOI: 10.1016/j.jbi.2014.05.010

Source DB:  PubMed          Journal:  J Biomed Inform        ISSN: 1532-0464            Impact factor:   6.317


  3 in total

1.  Translational Radiomics: Defining the Strategy Pipeline and Considerations for Application-Part 2: From Clinical Implementation to Enterprise. (Review)

Authors:  Faiq Shaikh; Benjamin Franc; Erastus Allen; Evis Sala; Omer Awan; Kenneth Hendrata; Safwan Halabi; Sohaib Mohiuddin; Sana Malik; Dexter Hadley; Rasu Shrestha
Journal:  J Am Coll Radiol       Date:  2018-02-01       Impact factor: 5.532

2.  Type 2 Diabetes Patients Benefit from the COMODITY12 mHealth System: Results of a Randomised Trial.

Authors:  Przemysław Kardas; Krzysztof Lewandowski; Stefano Bromuri
Journal:  J Med Syst       Date:  2016-10-08       Impact factor: 4.460

3.  Identification of social determinants of health using multi-label classification of electronic health record clinical notes.

Authors:  Rachel Stemerman; Jaime Arguello; Jane Brice; Ashok Krishnamurthy; Mary Houston; Rebecca Kitzmiller
Journal:  JAMIA Open       Date:  2021-02-09