Literature DB >> 29654417

Healthcare Text Classification System and its Performance Evaluation: A Source of Better Intelligence by Characterizing Healthcare Text.

Saurabh Kumar Srivastava1, Sandeep Kumar Singh1, Jasjit S Suri2.   

Abstract

A machine learning (ML)-based text classification system has several classifiers. The performance evaluation (PE) of the ML system is typically driven by the training data size and the partition protocols used. Such systems lead to low accuracy because the text classification systems lack the ability to model the input text data in terms of noise characteristics. This research study proposes a concept of misrepresentation ratio (MRR) on input healthcare text data and models the PE criteria for validating the hypothesis. Further, such a novel system provides a platform to amalgamate several attributes of the ML system such as: data size, classifier type, partitioning protocol and percentage MRR. Our comprehensive data analysis consisted of five types of text data sets (TwitterA, WebKB4, Disease, Reuters (R8), and SMS); five kinds of classifiers (support vector machine with linear kernel (SVM-L), MLP-based neural network, AdaBoost, stochastic gradient descent and decision tree); and five types of training protocols (K2, K4, K5, K10 and JK). Using the decreasing order of MRR, our ML system demonstrates the mean classification accuracies as: 70.13 ± 0.15%, 87.34 ± 0.06%, 93.73 ± 0.03%, 94.45 ± 0.03% and 97.83 ± 0.01%, respectively, using all the classifiers and protocols. The corresponding AUC is 0.98 for SMS data using Multi-Layer Perceptron (MLP) based neural network. All the classifiers, the best accuracy of 91.84 ± 0.04% is shown to be of MLP-based neural network and this is 6% better over previously published. Further we observed that as MRR decreases, the system robustness increases and validated by standard deviations. The overall text system accuracy using all data types, classifiers, protocols is 89%, thereby showing the entire ML system to be novel, robust and unique. The system is also tested for stability and reliability.

Entities:  

Keywords:  Classifiers; Healthcare text classification; Machine learning; Misrepresentation ratio; Reliability; Stability

Mesh:

Year:  2018        PMID: 29654417     DOI: 10.1007/s10916-018-0941-6

Source DB:  PubMed          Journal:  J Med Syst        ISSN: 0148-5598            Impact factor:   4.460


  10 in total

1.  An introduction to kernel-based learning algorithms.

Authors:  K R Müller; S Mika; G Rätsch; K Tsuda; B Schölkopf
Journal:  IEEE Trans Neural Netw       Date:  2001

2.  Symptomatic vs. asymptomatic plaque classification in carotid ultrasound.

Authors:  Rajendra U Acharya; Oliver Faust; A P C Alvin; S Vinitha Sree; Filippo Molinari; Luca Saba; Andrew Nicolaides; Jasjit S Suri
Journal:  J Med Syst       Date:  2011-01-18       Impact factor: 4.460

3.  An Approach for Learning Expressive Ontologies in Medical Domain.

Authors:  Ana B Rios-Alvarado; Ivan Lopez-Arevalo; Edgar Tello-Leal; Victor J Sosa-Sosa
Journal:  J Med Syst       Date:  2015-06-16       Impact factor: 4.460

4.  Atherosclerotic plaque tissue characterization in 2D ultrasound longitudinal carotid scans for automated classification: a paradigm for stroke risk assessment.

Authors:  U Rajendra Acharya; Muthu Rama Krishnan Mookiah; S Vinitha Sree; David Afonso; Joao Sanches; Shoaib Shafique; Andrew Nicolaides; L M Pedro; J Fernandes E Fernandes; Jasjit S Suri
Journal:  Med Biol Eng Comput       Date:  2013-01-06       Impact factor: 2.602

5.  An ensemble heterogeneous classification methodology for discovering health-related knowledge in social media messages.

Authors:  Suppawong Tuarob; Conrad S Tucker; Marcel Salathe; Nilam Ram
Journal:  J Biomed Inform       Date:  2014-03-16       Impact factor: 6.317

6.  Twitter mining for fine-grained syndromic surveillance.

Authors:  Paola Velardi; Giovanni Stilo; Alberto E Tozzi; Francesco Gesualdo
Journal:  Artif Intell Med       Date:  2014-01-31       Impact factor: 5.326

7.  Ovarian tumor characterization and classification using ultrasound-a new online paradigm.

Authors:  U Rajendra Acharya; S Vinitha Sree; Luca Saba; Filippo Molinari; Stefano Guerriero; Jasjit S Suri
Journal:  J Digit Imaging       Date:  2013-06       Impact factor: 4.056

8.  Text Messaging (SMS) Helping Cancer Care in Patients Undergoing Chemotherapy Treatment: a Pilot Study.

Authors:  Timóteo Matthies Rico; Karina Dos Santos Machado; Vanessa Pellegrini Fernandes; Samanta Winck Madruga; Patrícia Tuerlinckx Noguez; Camila Rose Guadalupe Barcelos; Mateus Madail Santin; Cristiane Rios Petrarca; Samuel Carvalho Dumith
Journal:  J Med Syst       Date:  2017-10-09       Impact factor: 4.460

9.  Computer-aided diagnosis of psoriasis skin images with HOS, texture and color features: A first comparative study of its kind.

Authors:  Vimal K Shrivastava; Narendra D Londhe; Rajendra S Sonawane; Jasjit S Suri
Journal:  Comput Methods Programs Biomed       Date:  2016-01-20       Impact factor: 5.428

10.  Patient involvement in health care decision making: a review.

Authors:  Shaghayegh Vahdat; Leila Hamzehgardeshi; Somayeh Hessam; Zeinab Hamzehgardeshi
Journal:  Iran Red Crescent Med J       Date:  2014-01-05       Impact factor: 0.611

  10 in total
  4 in total

1.  Identification of Potential Type II Diabetes in a Large-Scale Chinese Population Using a Systematic Machine Learning Framework.

Authors:  Mingyue Xue; Yinxia Su; Chen Li; Shuxia Wang; Hua Yao
Journal:  J Diabetes Res       Date:  2020-09-24       Impact factor: 4.011

2.  RSMOTE: improving classification performance over imbalanced medical datasets.

Authors:  Mehdi Naseriparsa; Ahmed Al-Shammari; Ming Sheng; Yong Zhang; Rui Zhou
Journal:  Health Inf Sci Syst       Date:  2020-06-12

3.  A Machine Learning Based Framework to Identify and Classify Non-alcoholic Fatty Liver Disease in a Large-Scale Population.

Authors:  Weidong Ji; Mingyue Xue; Yushan Zhang; Hua Yao; Yushan Wang
Journal:  Front Public Health       Date:  2022-04-04

4.  Classification and prediction of diabetes disease using machine learning paradigm.

Authors:  Md Maniruzzaman; Md Jahanur Rahman; Benojir Ahammed; Md Menhazul Abedin
Journal:  Health Inf Sci Syst       Date:  2020-01-03
  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.