Niloofar Mohammadzadeh1, Reza Safdari2, Alireza Baraani3, Farshid Mohammadzadeh4. 1. Health Information Management, Tehran University of Medical Sciences. Tehran, Iran. 2. Health Information Management Department, Health Information Management Research Center, Tehran University of Medical Sciences, Tehran, Iran. 3. Department of Computer Engineering, Sharif University of Technology, Tehran, Iran. 4. Shahabe Danesh University, Iran.
Abstract
OBJECTIVE: Intelligent data analysis has ability to prepare and present complex relations between symptoms and diseases, medical and treatment consequences and definitely has significant role in improving follow-up management of chronic heart failure (CHF) patients, increasing speed and accuracy in diagnosis and treatments; reducing costs, designing and implementation of clinical guidelines. THE AIM: The aim of this article is to describe intelligent data analysis methods in order to improve patient monitoring in follow and treatment of chronic heart failure patients as the best approach for CHF follow up management. METHODS: Minimum data set (MDS) requirements for monitoring and follow up of CHF patient designed in checklist with six main parts. All CHF patients that discharged in 2013 from Tehran heart center have been selected. The MDS for monitoring CHF patient status were collected during 5 months in three different times of follow up. Gathered data was imported in RAPIDMINER 5 software. RESULTS: Modeling was based on decision trees methods such as C4.5, CHAID, ID3 and k-Nearest Neighbors algorithm (K-NN) with k=1. Final analysis was based on voting method. Decision trees and K-NN evaluate according to Cross-Validation. CONCLUSION: Creating and using standard terminologies and databases consistent with these terminologies help to meet the challenges related to data collection from various places and data application in intelligent data analysis. It should be noted that intelligent analysis of health data and intelligent system can never replace cardiologists. It can only act as a helpful tool for the cardiologist's decisions making.
OBJECTIVE: Intelligent data analysis has ability to prepare and present complex relations between symptoms and diseases, medical and treatment consequences and definitely has significant role in improving follow-up management of chronic heart failure (CHF) patients, increasing speed and accuracy in diagnosis and treatments; reducing costs, designing and implementation of clinical guidelines. THE AIM: The aim of this article is to describe intelligent data analysis methods in order to improve patient monitoring in follow and treatment of chronic heart failurepatients as the best approach for CHF follow up management. METHODS: Minimum data set (MDS) requirements for monitoring and follow up of CHFpatient designed in checklist with six main parts. All CHFpatients that discharged in 2013 from Tehran heart center have been selected. The MDS for monitoring CHFpatient status were collected during 5 months in three different times of follow up. Gathered data was imported in RAPIDMINER 5 software. RESULTS: Modeling was based on decision trees methods such as C4.5, CHAID, ID3 and k-Nearest Neighbors algorithm (K-NN) with k=1. Final analysis was based on voting method. Decision trees and K-NN evaluate according to Cross-Validation. CONCLUSION: Creating and using standard terminologies and databases consistent with these terminologies help to meet the challenges related to data collection from various places and data application in intelligent data analysis. It should be noted that intelligent analysis of health data and intelligent system can never replace cardiologists. It can only act as a helpful tool for the cardiologist's decisions making.
Entities:
Keywords:
Data Analysis; Follow Up Study; Heart Failure
Chronic heart failure (CHF) is a major public health problem in most countries (1). In Europe, CHF is the leading cause of elderly hospitalization and 1-2% of all hospitalizations due to this disease (2). Total heart failure costs are estimated at 44.6 billion in 2015 only in the United States (3). Symptoms and morbidity associated with CHF effect patients’ daily activities and quality of life. High rate of heart failure readmission in early months after discharge are common. So finding the opportunities to reduce readmission and improve the quality of patients life is very essential (4).CHF disease management programs especially follow up plans with patient center approach can improve survival, quality of life and decrease re hospitalization. Also note treatment burden of patients in follow up plan is very important. Patients’ needs to adapt to new life style changes, learn about self-management strategies, engage with others, monitor the treatments, and integrate treatments into social circumstances and overcoming barriers such as accessibility to health care and poor continuity of care (5, 6).Information technology tools like telemedicine and tele care systems use effectively to provide require services for chronic diseasepatients. These systems would be helpful to improve the monitoring of patients through continuous assessment of symptoms and signs of disease and checking compliance with self-management programs, thus they can improve treatment, and resource usage (7, 8).Use of intelligent systems lead to facilitate, accelerate and improve health care services, decision support in the areas of diagnosis and treatment particularly in home care and telemedicine. Data analysis with artificial intelligence methods are important tools that can provide access to hidden knowledge in large databases of clinical and administrative data in hospitals. Also improve decision support, prevention, treatment and diagnosis planning and provides useful knowledge for health care personnel (9).Intelligent data analysis has ability to prepare and present complex relations between symptoms, diseases, and medical and treatment consequences (10). It is obvious that intelligent data analysis definitely have significant role in improving health care, prevention, treatment, and follow-up management of CHFpatients, increase quality of health care, high speed and accuracy in diagnosis and treatment, reduce costs, clinical outcomes assessment, design and implementation of clinical guidelines (11). The aim of this article is to describe intelligent data analysis method in order to improve patient monitoring in follow and treatment of chronic heart failurepatients as the best approach for CHF follow up management.
2. METHODS
In this research according to standard guideline about diagnosis and management of chronic heart failure in America (12), Europe (13), Australia (14) and consult with a cardiologist, minimum data set requirements for monitoring and follow up of CHFpatient were designed in checklist with six main parts include demographic, clinical, examination, exercise test, drug, procedure and imaging. These minimum data sets were distributed among 16 cardiologists and their comments for each item were collected after discussion group. All CHFpatients that discharged in 2013 from Tehran heart center have been selected. Then MDS for monitoring CHFpatient status were collected during 5 months in three different times of follow up. Gathered data was imported in RAPIDMINER 5 software for intelligent data analysis.
3. RESULTS
Important factors that studied in this research for follow up CHFpatients include:Demographic: age, sex, hypertension, LDL, diabetes, cigarette, family history;Clinical: chest pain, PND, dyspnea, edema;Examination: NA, K, HB, BUN, CR;Exercise Test: functional capacity, ST Low, chest pain during exercise test;Drugs: diuretics, digoxin, statins, aspirin, CA blockers, ace inhibitors, ARB, beta blockers, nitrates 6-Imaging / Procedure include: a) Angiography :coronary artery involvement; b) Echo :EF, left ventricular size, mitral valve regurgitation, pulmonary artery pressure, right ventricular function; c) Scan SPECT :ischemia, viability and (d) ECG :ECG normality, ST-T abnormality, LVH, LA abnormal.From total amount of patients in the sample 44 percent of chronic heart failurepatient (36 people) were female and 56 percent (45 people) were male.12percent (10 people) were 17-49 year old, 46percent (37 people) were 50-65 years and 42 percent (34 people) were aged over 66 years. 54 percent (44 people) had blood pressure, 40 percent (32 people) had diabetes, 53 percent (43 people) had decrease hemoglobin, and 75 percent (61 people) had hyper urea (BUN).In order to manage missing value in this project, records with most blank values or missing values were excluded from the intelligent data analysis. Details of some of the follow up data were not recorded in information system in Tehran heart center, so these variables had not positive effect on the data analysis. Because of this, data analysis performed based on variables which information was to be recorded in the hospital information system and other attributes were deleted. Data discretization was performed after data cleansing and preprocessing process in order to provide suitable basis for more efficient performance of learning model algorithms. Attributes that selected for discretization were show in Table1.
Table 1
Variables discretization in chronic heart failure follow up
Variables discretization in chronic heart failure follow upModeling was based on decision trees methods such as C4.5, CHAID, Id3 with k=10 and k-Nearest Neighbors algorithm (K-NN) with k=1. Final analysis was based on voting method. Decision trees and K-NN evaluate according to Cross-Validation. Cross-Validation is a model validation technique for assessing how the results of a statistical analysis will generalize to an independent data set. It is mainly used in settings where the goal is prediction, and one wants to estimate how accurately a predictive model will perform in practice. In this method data dived into two parts (train and test) and data in both parts are repeated k times and assessed regularly (15).All relevant data of first follow up was attached to set role function in RAPIDMINER 5 and patient status in second follow up includes follow up or non-follow up (death, hospitalization) with this function as the label (feature target) was considered. Then necessary changes with select attribute and rename functions have been done on data for reach structural integrity data. At the end, data was attached to x-validation function. Inside this function was used C4.5 decision tree for modeling. Figure1 show the Inside the x-validation function in the first follows up.
Figure 1
Decision tree in first follow up (C4.5)
Decision tree in first follow up (C4.5)The same procedure was done on data of second follow up but this time the patient condition in third status was considered as a label. Also CHAID decision tree was used for modeling. In third phase of follow up of CHFpatient, the patient condition is not specified, so gathered data created before and prediction models were evaluated. If there was an agreement between models, predicted value were considered as label and if the response was different, non-follow up label was considered because of the importance of the patient’s condition, that was admitted in hospital in one of the before phases. Also ID3 decision tree was used for modeling in this section. All these described phases of data analysis with classification methods and C4.5, CHAID, ID3 decision tree algorithms were show in Figure 2.
Figure 2
Data analysis with classification methods and C4.5, CHAID, ID3 decision tree algorithms
Data analysis with classification methods and C4.5, CHAID, ID3 decision tree algorithmsAccuracy of model based on different algorithmsFor the K-NN model used similar structure due to the properties of the tree structure shown. K-NN model was done in three parts of follow up like decision trees models; the difference was that inside the x-validation function K-NN beside the decision algorithms were used. In these models with see the output can understand the similarity of the patient symptoms to each other. With retrieval of k- neighbor on each attribute in many cases results lead to more care and visit the physician. This result according to general approach in this project was acceptable but it should be noted that the conclusions generated by K-NN model should be modified by decision trees.Decision Tree Based on C4.5 Algorithm in First Follow up of CHFPatient
4. DISCUSSION
Rezaei Hachesu and et al (2013) use of data mining techniques to determine and predict length of stay of cardiac patients. They studied factors include clinical and demographic such as sex, age, CR, diabetes, Cholesterol, Hemoglobin, Triglycerides, lipoprotein, hypertension, family history, marital status, smoking, cardiovascular disease, valvular heart disease, systolic and diastolic blood pressure, exercise tolerance test, drugs such as: statins, nitrates, diuretic, beta blockers, CA blockers and so on (16). Some of the items examined in Rezaei Hachesu study, with a minimum of data obtained in this study are similar but only clinical and demographic factors and drug usage in order to investigate the effect of them on length of hospitalization for heart disease in general not specifically chronic heart failure had been studied. However common factors in two studies shows that studied factors, in addition to the impact of deteriorating chronic heart failurepatients after discharge can affect length of stay of these patients. So, consideration of these factors in intelligent data analysis is very important.Karaolis and et al (2010) were assessment of the risk factors of coronary heart events based on data mining with decision trees. They said age, sex, family history, smoking, hypertension and diabetes are most risk factors that should be considered (17). All of the data obtained in Karaolis et al study are matching with the minimum data sets for follow up and monitoring of chronic heart failurepatient status in demographic sections of this research.Rajeswari and et al (2011) with artificial neural network techniques were studied following items as risk factors for ischemic heart disease. Age, sex, cholesterol, LDL, smoking, life style, diet habits, family history, heart disease, systolic blood pressure, sleep disorders and genetic factors (18). Some of these factors are similar to the results of our research. This indicates that it is important to examine these factors in follow up of chronic heart failurepatients.Dessai (2013) suggest intelligent heart disease prediction system with using probabilistic neural network. In this system 13 attributes based on k-mean clustering algorithms and artificial neural networks have been studied. This research stated that the application of intelligent system compared to the existing methods has significant impact on improving the predicting and classifying of heart diseases (19). Chen and et al (2011) suggest heart disease prediction system (HDPS). In this system also data of 13 attributes of patients were collected and use artificial neural network algorithm. Accuracy prediction in this model was 80 percent (20). In both of systems 13 attributes were studied but in this research 44 attributes in six sections studied in three different times after discharge of chronic heart failurepatients. The performances of the proposed model in addition to prediction are including data processing, evaluation, reasoning, negotiation for make a decision. Also this model has learning ability and will significantly help to making appropriate decisions about the chronic heart failurepatients after discharge. Follow up plan management is very essential and has significant role in reduce mortality and morbidity, decrease costs and improve use of health care resources.Data analysis with artificial intelligent techniques is main tools that can provide access to hidden knowledge in large database of clinical and administrative data in hospitals. Also provide useful knowledge for health care personnel to improve decision support, increase quality of preventive; treatment and diagnosis planning. Nowadays data analysis without tools based on intelligent analysis is very difficult and sometimes impossible due to advances in the field of health and increasing complexity of health data. Intelligent data analysis increase efficiency of knowledge discovery process and avoids reporting trivial results to the user.Our emphasis in this study were on non-follow up patient status include death or/and hospitalization. In data analysis system learns from its mistakes and errors lead patient to visit the physician. In other words error generated in data analysis was such that patient who had not been seen in survey their status by cardiologist lead to visit the doctor. Of course gradually and over time the number of errors will lower. All algorithms are able to predict patient statues with various degrees of accuracy. According to various aspects of data, different decision trees models used in data analysis and finally system can be voting from these decision trees. Combining decision trees based on voting algorithms can learn multiple models and predictions can reduce error of decision trees on real environment problems. Also in order to increase prediction accuracy, greater volume of data should be collected.
5. CONCLUSION
Physicians in these days rely on evidence-based medicine more than facts that generated by software. However, intelligent data analysis because of diversity of health data and specific circumstances of health domain confront enormous challenges. Limits the number of disease-related data and lack of electronic guidelines and expert system that help to physician in decision making are some challenges that this research is also facing them.Creating and using standard terminologies and databases consistent with these terminologies help to meet the challenges related to data collection from various places and application data in intelligent data analysis.It should be noted that intelligent analysis of health data and intelligent system can never replace with cardiologists. It can act as a helpful tool for the cardiologist’s decisions making. Lack of physician trust in intelligent analysis is the main problems of using intelligent system in the world. One of the positive points of this project is involvement of cardiologist besides the software and health information management experts in order to decrease this challenge. Explain benefit of intelligent system to cardiologist, emphasizing the helpful role of these types of systems, report generated results and feedbacks to them can decrease physician resistance and improve their attitudes and trust.
Authors: Mariell Jessup; William T Abraham; Donald E Casey; Arthur M Feldman; Gary S Francis; Theodore G Ganiats; Marvin A Konstam; Donna M Mancini; Peter S Rahko; Marc A Silver; Lynne Warner Stevenson; Clyde W Yancy Journal: Circulation Date: 2009-03-26 Impact factor: 29.690
Authors: John J V McMurray; Stamatis Adamopoulos; Stefan D Anker; Angelo Auricchio; Michael Böhm; Kenneth Dickstein; Volkmar Falk; Gerasimos Filippatos; Cândida Fonseca; Miguel Angel Gomez-Sanchez; Tiny Jaarsma; Lars Køber; Gregory Y H Lip; Aldo Pietro Maggioni; Alexander Parkhomenko; Burkert M Pieske; Bogdan A Popescu; Per K Rønnevik; Frans H Rutten; Juerg Schwitter; Petar Seferovic; Janina Stepinska; Pedro T Trindade; Adriaan A Voors; Faiez Zannad; Andreas Zeiher Journal: Eur Heart J Date: 2012-05-19 Impact factor: 29.983
Authors: Joseph S Ross; Jersey Chen; Zhenqiu Lin; Héctor Bueno; Jeptha P Curtis; Patricia S Keenan; Sharon-Lise T Normand; Geoffrey Schreiner; John A Spertus; Maria T Vidán; Yongfei Wang; Yun Wang; Harlan M Krumholz Journal: Circ Heart Fail Date: 2009-11-10 Impact factor: 8.790
Authors: Larry A Allen; Lynne W Stevenson; Kathleen L Grady; Nathan E Goldstein; Daniel D Matlock; Robert M Arnold; Nancy R Cook; G Michael Felker; Gary S Francis; Paul J Hauptman; Edward P Havranek; Harlan M Krumholz; Donna Mancini; Barbara Riegel; John A Spertus Journal: Circulation Date: 2012-03-05 Impact factor: 29.690
Authors: Sven Van Poucke; Zhongheng Zhang; Martin Schmitz; Milan Vukicevic; Margot Vander Laenen; Leo Anthony Celi; Cathy De Deyne Journal: PLoS One Date: 2016-01-05 Impact factor: 3.240