
Detecting Lifestyle Risk Factors for Chronic Kidney Disease With Comorbidities: Association Rule Mining Analysis of Web-Based Survey Data.

Suyuan Peng1,2, Feichen Shen2, Andrew Wen2, Liwei Wang2, Yadan Fan3, Xusheng Liu4, Hongfang Liu2.   

Abstract

BACKGROUND: The rise in the number of patients with chronic kidney disease (CKD) and consequent end-stage renal disease necessitating renal replacement therapy has placed a significant strain on health care. The rate of progression of CKD is influenced by both modifiable and unmodifiable risk factors. Identification of modifiable risk factors, such as lifestyle choices, is vital in informing strategies toward renoprotection. Modification of unhealthy lifestyle choices lessens the risk of CKD progression and associated comorbidities, although the lifestyle risk factors and modification strategies may vary with different comorbidities (eg, diabetes, hypertension). However, there are limited studies on suitable lifestyle interventions for CKD patients with comorbidities.
OBJECTIVE: The objectives of our study are to (1) identify the lifestyle risk factors for CKD with common comorbid chronic conditions using a US nationwide survey in combination with literature mining, and (2) demonstrate the potential effectiveness of association rule mining (ARM) analysis for the aforementioned task, which can be generalized for similar tasks associated with noncommunicable diseases (NCDs).
METHODS: We applied ARM to identify lifestyle risk factors for CKD progression with comorbidities (cardiovascular disease, chronic pulmonary disease, rheumatoid arthritis, diabetes, and cancer) using questionnaire data for 450,000 participants collected from the Behavioral Risk Factor Surveillance System (BRFSS) 2017. The BRFSS is a Web-based resource, which includes demographic information, chronic health conditions, fruit and vegetable consumption, and sugar- or salt-related behavior. To enrich the BRFSS questionnaire, the Semantic MEDLINE Database was also mined to identify lifestyle risk factors.
RESULTS: The results suggest that lifestyle modification for CKD varies among different comorbidities. For example, the lifestyle modification of CKD with cardiovascular disease needs to focus on increasing aerobic capacity by improving muscle strength or functional ability. For CKD patients with chronic pulmonary disease or rheumatoid arthritis, lifestyle modification should be high dietary fiber intake and participation in moderate-intensity exercise. Meanwhile, the management of CKD patients with diabetes focuses on exercise and weight loss predominantly.
CONCLUSIONS: We have demonstrated the use of ARM to identify lifestyle risk factors for CKD with common comorbid chronic conditions using data from BRFSS 2017. Our methods can be generalized to advance chronic disease management with more focused and optimized lifestyle modification of NCDs. ©Suyuan Peng, Feichen Shen, Andrew Wen, Liwei Wang, Yadan Fan, Xusheng Liu, Hongfang Liu. Originally published in the Journal of Medical Internet Research (http://www.jmir.org), 10.12.2019.


Keywords:  Behavioral Risk Factor Surveillance System; association rule mining; chronic kidney disease; noncommunicable diseases

Year:  2019        PMID: 31821152      PMCID: PMC6930505          DOI: 10.2196/14204

Source DB:  PubMed          Journal:  J Med Internet Res        ISSN: 1438-8871            Impact factor:   5.428


Introduction

Chronic kidney disease (CKD) is a progressive disease associated with high rates of mortality, morbidity, and disability [1,2]. Renal replacement therapies have been performed on approximately 8 million adults in the United States, with significant economic burdens [3]. The rate of progression of CKD from one major stage to another varies based on both unmodifiable (eg, age, race/ethnicity, family history) and modifiable (eg, hypertension, dyslipidemia, cigarette smoking, overweight/obesity, physical inactivity, dietary patterns) risk factors. Modifiable lifestyle risk factors account for 24% of the excess risk of CKD [4]. Observational and nonrandomized prospective studies have suggested that patients who modify their unhealthy lifestyles have fewer hospitalizations, are more likely to adhere to established CKD treatment goals (anemia or mineral and bone disease), and may have improved rates of survival [5-7]. Therefore, recognition of those lifestyle risk factors is vital in informing strategies to achieve renoprotection. Lifestyle modification for CKD patients involves long-term habit changes, requires considerable effort from patients, and may take years to be effective. Evidence does exist that supports the value of lifestyle intervention for treating hypertension or diabetes and preventing cardiovascular events, but studies on suitable lifestyle interventions for patients with CKD are sparse. In addition, lifestyle risk factors for CKD with different comorbidities may vary. For example, lifestyle interventions for CKD with mineral and bone disorder include adequate calcium and vitamin D consumption, exercise, and fall prevention. The lifestyle risk factors for CKD with diabetes include unhealthy diet, sedentary lifestyle, and obesity. The lifestyle risk factors and modification strategies for CKD suggested by different guidelines may also vary [8-10], which poses a major challenge for clinical practice and research. 
With the advance of digital health care strategies, a large amount of data can be leveraged for identifying lifestyle risk factors. Popular approaches for identifying lifestyle risk factors include epidemiological or statistical approaches with an implicit assumption that risk factors are linearly associated with a disease. However, this assumption oversimplifies the complex relationships between risk factors and diseases. In this paper, we explore the use of a popular data mining technique, association rule mining (ARM), to determine more nuanced relationships between lifestyle risk factors and CKD with comorbidities. ARM is commonly used for performing unsupervised exploratory data analysis over a wide range of research and commercial domains, including biology and bioinformatics (eg, biological sequence analysis, analysis of gene expression data) [11-13]. Rules produced by ARM are able to summarize the impact of several factors in combination in a nonhierarchical fashion.

Methods

Materials

Behavioral Risk Factor Surveillance System

We conducted an ARM analysis using the 2017 Behavioral Risk Factor Surveillance System (BRFSS), which was published in July 2018 [14]. The BRFSS is an annual health-related telephone survey conducted by the Centers for Disease Control and Prevention that is designed to measure the health-related risk behaviors, chronic health conditions, and use of preventive services of adult residents (≥18 years) of the United States (including all 50 states, the District of Columbia, Guam, and Puerto Rico). More than 400,000 adults are interviewed each year, making it the largest telephone-based survey in the world and enabling it to be a powerful tool for health promotion activities. The BRFSS system consists of 29 modules and 358 variables that collect information about health status, healthy days or health-related quality of life, health care access, exercise, inadequate sleep, chronic health conditions, oral health, tobacco and e-cigarette use, alcohol consumption, immunization status, falls, seat belt use, drinking and driving, breast and cervical cancer screening, prostate cancer screening, colorectal cancer screening, and HIV/AIDS [15]. The validity of BRFSS variables for indexing chronic disease conditions has been previously demonstrated [15,16]. The BRFSS 2017 contains a total of 450,016 responses and 17,547 CKD cases.

Semantic MEDLINE Database

The Semantic MEDLINE Database (SemMedDB) [17] is a repository of semantic predications (subject-predicate-object triples) extracted from the titles and abstracts of all PubMed citations, which is widely used to conduct literature-based knowledge discovery in the biomedical domain [18-21]. The predications are extracted by SemRep [22], which is a semantic interpreter developed by the National Library of Medicine. Specifically, the semantic predications consist of UMLS (Unified Medical Language System) metathesaurus concepts as arguments (eg, subject and object) and a semantic relationship (eg, “treat”) from an extended version of the UMLS Semantic Network as a predicate. The June 30, 2017, version of this database contains more than 83 million semantic predications. Although SemMedDB provides structured predications, further inference work is needed to filter out noisy data and discover new knowledge. In this study, we treated the SemMedDB as a knowledge resource and extracted a subgraph that contains all triples related to CKD for enriching the survey data.

Charlson Comorbidity Index

We evaluated the noncommunicable diseases (NCDs) of each participant by using the classification of Charlson Comorbidity Index (CCI) [23], consisting of 17 comorbidities, developed and validated as a measure of 1-year mortality risk and burden of disease. In addition to CKD, we investigated five NCDs: cardiovascular disease, chronic pulmonary disease, rheumatoid arthritis, diabetes, and non-skin cancer. Institutional review board approval was not necessary for this study due to the nature of the study (secondary analysis of an anonymized dataset).

Methodology Overview

We applied ARM for the CKD population using the 2017 BRFSS data to generate rules for detecting lifestyle risk factors for CKD progression, including demographic information, lifestyle behaviors, clinical symptoms, and chronic disease conditions. Correlation analysis was performed to assess differences in lifestyle risk factors in the status of comorbidity-related CKD. To enrich the BRFSS data, SemMedDB was mined to identify lifestyle risk factors for CKD presented in publications. The workflow is shown in Figure 1. The arules package (version v1.6-4) for R (version 3.5.2) was used for ARM analysis.
Figure 1

Workflow of this study. ARM: association rule mining; BRFSS: Behavioral Risk Factor Surveillance System; CKD: chronic kidney disease; NCD: noncommunicable disease; SemMedDB: Semantic MEDLINE Database.


BRFSS Input Data Preparation

We first selected 58 variables (involving 18 modules) related to behaviors from the BRFSS 2017 data by utilizing domain expert knowledge (from two nephrologists: S Peng and X Liu), with a focus on the presence of a condition or behavior rather than the questions about obvious feelings (as shown in Multimedia Appendix 1). If the given condition of interest was present in the patient, it was marked as 1, otherwise 0 or NA. For example, completion of the flu vaccine series was defined by a participant answering “yes” to the question: “During the past 12 months, have you had either a flu shot or a flu vaccine that was sprayed in your nose?” (possible answers were “yes,” “no,” “don’t know/not sure,” and “refused”). Only those who answered “yes” were annotated as 1 and included in the analysis. Records with responses of “no,” “unknown,” or “refused” were annotated as 0; those with missing data were completely excluded from the analysis to minimize underestimation. For each patient, we extracted all variables that were marked as 1 and prepared the input.
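The encoding rule described above can be sketched in Python as follows. This is only an illustrative sketch, not the authors' preprocessing code: the response labels, the `flushot` variable name, and the choice to drop a whole record when any selected variable is missing are assumptions made for the example (`diffwalk` is one of the variable codes that appears in the study's rules).

```python
from typing import Optional

# Hypothetical response labels; the real BRFSS codebook uses numeric codes.
POSITIVE = {"yes"}
NEGATIVE = {"no", "don't know/not sure", "refused"}


def encode_response(answer: Optional[str]) -> Optional[int]:
    """Map a raw answer to 1 (present), 0 (absent or unknown), or None (missing)."""
    if answer is None:
        return None
    answer = answer.strip().lower()
    if answer in POSITIVE:
        return 1
    if answer in NEGATIVE:
        return 0
    return None  # unrecognized answers are treated as missing


def to_transaction(record: dict) -> Optional[frozenset]:
    """Return the set of variables marked 1, or None if any value is missing."""
    encoded = {var: encode_response(ans) for var, ans in record.items()}
    if any(value is None for value in encoded.values()):
        return None  # assumption: records with missing data are excluded entirely
    return frozenset(var for var, value in encoded.items() if value == 1)


records = [
    {"flushot": "Yes", "diffwalk": "No"},
    {"flushot": "Refused", "diffwalk": "Yes"},
    {"flushot": None, "diffwalk": "Yes"},  # missing answer, so the record is dropped
]
transactions = [t for t in map(to_transaction, records) if t is not None]
```

Each surviving record becomes one transaction whose items are the variables marked 1, which is the input shape that association rule mining expects.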

Association Rule Mining of the Chronic Kidney Disease Cohort in BRFSS

We then applied the Apriori algorithm [24] on the input data for 58 variables among 17,547 CKD patients. Apriori is a popular algorithm for mining association rules that is divided into two steps: (1) finding frequent itemsets and (2) constructing rules from frequent itemsets. An association rule is an implication between disjoint itemsets: m ⇒ n. The left-hand side of the rule is the antecedent and the right-hand side the consequent. An itemset containing k items is called a k-itemset. If T is a transaction, m is an itemset, and m ⊆ T, then T contains m. The support of the rule m ⇒ n is the fraction of transactions that contain both m and n (equation 1 in Figure 2). A frequent itemset is one whose support is at least some threshold, conventionally denoted as minSup.
Figure 2

Equations.
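The equations in Figure 2 are not reproduced in this text. As a hedged reconstruction, the three measures are conventionally written as below, which matches the verbal definitions given in this section; the symbol D for the set of all transactions is an assumption introduced here.

```latex
\begin{align}
\mathrm{support}(m \Rightarrow n) &= \frac{\left|\{\, T \in D : m \cup n \subseteq T \,\}\right|}{|D|} \\
\mathrm{confidence}(m \Rightarrow n) &= \frac{\mathrm{support}(m \Rightarrow n)}{\mathrm{support}(m)} \\
\mathrm{lift}(m \Rightarrow n) &= \frac{\mathrm{support}(m \Rightarrow n)}{\mathrm{support}(m)\,\mathrm{support}(n)}
\end{align}
```

Here support(m) and support(n) denote the fraction of transactions in D that contain the itemset m or n alone. Lift is symmetric in m and n, whereas confidence is not.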

The rule m ⇒ n with confidence c (equation 2 in Figure 2) means that a fraction c of the transactions containing m also contain n. Confidence therefore measures how often the items in n appear in transactions that contain m, and it can also be referred to as the strength of the rule. The threshold of confidence is conventionally denoted as minConf. Lift (equation 3 in Figure 2) is an index that indicates the relative magnitude of the probability of observing n under the condition of m, compared with the overall probability of observing n. When lift = 1, the two itemsets m and n occur independently of each other. When the lift value is greater than 1, the two itemsets are positively dependent on one another; the higher the value, the stronger the association.

We used the following heuristic for generating the final association rules to be analyzed. We first selected itemsets with support value larger than the average of support values of all itemsets and lift value greater than 1. We then kept itemsets with lift value larger than the average lift values of those selected itemsets as our final association rules. We focused our analysis of the association rules of five NCDs as determined by the CCI: cardiovascular disease, chronic pulmonary disease, rheumatoid arthritis, diabetes, and non-skin cancer.
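The two Apriori steps and the selection heuristic can be illustrated with a minimal, dependency-free Python sketch. The study itself used the arules package for R; the code below is not that implementation. The function names, the toy transactions, and the restriction to single-item consequents are assumptions made for illustration.

```python
from itertools import combinations


def frequent_itemsets(transactions, min_sup):
    """Level-wise (Apriori-style) search; returns {itemset: support}."""
    n = len(transactions)
    items = sorted({item for t in transactions for item in t})
    current = [frozenset([item]) for item in items]
    result = {}
    while current:
        # Keep the candidates of this level that reach the support threshold.
        level = {}
        for cand in current:
            sup = sum(1 for t in transactions if cand <= t) / n
            if sup >= min_sup:
                level[cand] = sup
        result.update(level)
        # Join frequent k-itemsets into (k+1)-candidates and prune any
        # candidate with an infrequent k-subset (the Apriori property).
        next_level = set()
        for a, b in combinations(list(level), 2):
            union = a | b
            if len(union) == len(a) + 1 and all(
                frozenset(sub) in level for sub in combinations(union, len(a))
            ):
                next_level.add(union)
        current = list(next_level)
    return result


def rules_from(itemsets):
    """Yield (antecedent, consequent, support, confidence, lift) tuples."""
    for itemset, sup in itemsets.items():
        if len(itemset) < 2:
            continue
        for item in itemset:  # assumption: one item in the consequent
            rhs = frozenset([item])
            lhs = itemset - rhs
            conf = sup / itemsets[lhs]
            lift = conf / itemsets[rhs]
            yield lhs, rhs, sup, conf, lift


def select_rules(rules):
    """Keep rules above mean support with lift > 1, then above the mean lift."""
    rules = list(rules)
    if not rules:
        return []
    mean_sup = sum(r[2] for r in rules) / len(rules)
    stage1 = [r for r in rules if r[2] > mean_sup and r[4] > 1]
    if not stage1:
        return []
    mean_lift = sum(r[4] for r in stage1) / len(stage1)
    return [r for r in stage1 if r[4] > mean_lift]


# Toy transactions built from variable codes that appear in the study's rules.
toy = [frozenset(t) for t in (
    ["x.rfhype5", "diabete3"],
    ["x.rfhype5", "diabete3"],
    ["x.rfhype5", "diabete3", "diffwalk"],
    ["diffwalk"],
    ["x.rfhype5"],
    ["diffwalk"],
)]
itemsets = frequent_itemsets(toy, min_sup=0.1)
selected = select_rules(rules_from(itemsets))
```

Because both cutoffs in `select_rules` are strict comparisons against an average, rules that tie exactly at the mean are dropped, which is worth keeping in mind on very small inputs such as the toy data.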

Correlation Analysis of Comorbidities and Risk Factors

To assess differences of lifestyle risk factors in the status of comorbidity-related CKD, correlation analysis was performed. We retrieved five subcohorts of NCDs (cardiovascular disease, chronic pulmonary disease, rheumatoid arthritis, diabetes, non-skin cancer) from the CKD cohort and evaluated the contribution and correlation of lifestyle risk factors.
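The Results report that a Spearman rank correlation test was used for this analysis. As a hedged illustration of what that coefficient computes, the sketch below ranks each variable (averaging the ranks of ties, which matters for binary survey variables) and then takes the Pearson correlation of the ranks. The function names and toy vectors are assumptions, and the sketch does not compute a P value.

```python
def average_ranks(values):
    """Rank values from 1 to n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with the one at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation; undefined (division by zero) for a constant input."""
    n = len(x)
    mx, my = sum(x) / n, sum(y) / n
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman rank correlation as the Pearson correlation of average ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Toy binary indicators (1 = condition present) for four participants.
has_comorbidity = [1, 1, 0, 0]
is_inactive = [1, 0, 1, 0]
rho = spearman(has_comorbidity, is_inactive)
```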

Literature Enrichment Analysis

We mined SemMedDB for lifestyle risk factors that were not present in the BRFSS system by also using ARM. We first retrieved all relevant triplets (subject, predicate, and object) related to CKD. For example, CKD>, <CKD, Coexists_With, Diabetes>, and <CKD, Causes, Hypertensive Disease> are some example triplets retrieved. We then selected terms from the triplets that were relevant to lifestyle behavior, symptoms, and diseases based on a list of relevant semantic types (see Multimedia Appendix 2). We filtered out 47 terms with generic meaning (eg, patients, agent, woman, child, author, disease). We then applied the Apriori algorithm on the extracted pairwise terms to mine frequent itemsets and generate rules. Based on our previous work [25], our item matrix was very sparse, with a density of 0.00026. To mine sufficiently interesting rules, we set the minimum support and minimum confidence by making sure every rule was presented at least two times and selected itemsets with lift value greater than 1. We then kept itemsets with lift value larger than the average lift values of those selected itemsets as our final association rules. Specifically, we focused our analysis of the association rules on six specific semantic types (daily or recreational activity, food, hazardous or poisonous substance, individual behavior, mental or behavioral dysfunction, finding) to detect lifestyle risk factors present in publications that the BRFSS questionnaire does not mention.
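A minimal Python sketch of this enrichment step is shown below, under stated assumptions: the triples are toy examples (two are taken from the text above, the others are invented), only a few of the 47 generic terms are listed, and dropping a whole pair when either term is generic is one possible reading of the filtering step. It turns triples into unordered term pairs, derives the minimum support that corresponds to a minimum count of two, and keeps the pairs that occur at least twice.

```python
from collections import Counter

# Toy triples in (subject, predicate, object) form; not real SemMedDB output.
triples = [
    ("CKD", "Coexists_With", "Diabetes"),
    ("CKD", "Causes", "Hypertensive Disease"),
    ("Diabetes", "Coexists_With", "CKD"),
    ("Patients", "Process_Of", "CKD"),
]

# A few of the generic terms named in the text; the full list has 47 entries.
GENERIC_TERMS = {"patients", "agent", "woman", "child", "author", "disease"}


def to_pairs(triples):
    """Drop the predicate and keep each triple as an unordered term pair."""
    pairs = []
    for subject, _predicate, obj in triples:
        if subject.lower() in GENERIC_TERMS or obj.lower() in GENERIC_TERMS:
            continue  # assumption: a pair with a generic term is discarded
        if subject != obj:
            pairs.append(frozenset((subject, obj)))
    return pairs


def min_support_for_count(n_transactions, min_count=2):
    """Smallest support that guarantees at least min_count occurrences."""
    return min_count / n_transactions


pairs = to_pairs(triples)
counts = Counter(pairs)
min_sup = min_support_for_count(len(pairs))
kept = {pair: c for pair, c in counts.items() if c / len(pairs) >= min_sup}
```

Ignoring the predicate means that a pair is counted regardless of the direction or polarity of the relationship, a simplification that the Limitations section discusses.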

Results

Characteristics of Patients With Chronic Kidney Disease Cohort

Overall, a total of 17,547 participants were reported to have CKD in the BRFSS 2017 data; 80.09% (14,053/17,547) were white and 60.13% (10,551/17,547) were men. The mean age was 64.42 (SD 13.81) years. The characteristics of the CKD cohort are presented in Table 1.
Table 1

Characteristics of participants in the BRFSS (Behavioral Risk Factor Surveillance System) 2017 with chronic kidney disease (N=17,547).

| Characteristics | Participants |
| --- | --- |
| Age (years), mean (SD) | 64.42 (13.81) |
| Male, n (%) | 10,551 (60.13) |
| Completed interview, n (%) | 15,348 (87.47) |
| Ever served on active duty in the United States Armed Forces, n (%) | 2940 (16.76) |
| Income categories [a], n (%) | |
| Less than $15,000 | 2608 (14.86) |
| $15,000 to less than $25,000 | 3455 (19.69) |
| $25,000 to less than $35,000 | 1792 (10.21) |
| $35,000 to less than $50,000 | 1955 (11.14) |
| $50,000 or more | 4734 (26.98) |
| Education level, n (%) | |
| Did not graduate middle school | 1933 (11.02) |
| Did not graduate high school | 5213 (29.71) |
| Attended college or technical school | 5151 (29.36) |
| Graduated from college or technical school | 5181 (29.53) |
| Marital status, n (%) | |
| Married | 7904 (45.04) |
| Divorced | 3047 (17.36) |
| Widowed | 3821 (21.78) |
| Separated | 494 (2.82) |
| Never married | 1801 (10.26) |
| A member of an unmarried couple | 374 (2.13) |
| Race, n (%) | |
| White | 14,053 (80.09) |
| Black or African American | 1763 (10.05) |
| American Indian or Alaskan Native | 535 (3.05) |
| Asian | 261 (1.49) |
| Native Hawaiian or other Pacific Islander | 159 (0.91) |
| Other race | 351 (2.00) |
| No preferred race | 50 (0.28) |
| Comorbidity, n (%) | |
| CHD [b] or myocardial infarction | 4828 (28.12) |
| Stroke | 15,204 (86.65) |
| COPD [c], emphysema, or chronic bronchitis | 3763 (21.45) |
| Asthma | 2888 (16.46) |
| Rheumatoid arthritis | 10,798 (61.98) |
| Diabetes | 6642 (37.85) |
| Cancer | 3974 (22.65) |

[a] The rest of the people refused to answer this question.

[b] CHD: coronary heart disease.

[c] COPD: chronic obstructive pulmonary disease.

For heuristics, we set a lower bound of 0.1 for support and computed the average for all selected support values. As a result, we set the average support (0.150) as a threshold and selected 12,141 frequent itemsets. Among the 12,141 frequent itemsets, we then picked the average lift 1.094 as the threshold to finalize 7677 association rules. Figure 3 shows the curve between ranked associations and interestingness metrics (support and lift). The threshold was also marked on the curve.
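As a rough worked example, the support threshold can be translated into a minimum number of CKD participants per selected itemset. This assumes that support is computed over all 17,547 CKD participants and that 0.150 is a rounded value, so the exact cutoff may differ slightly.

```latex
\[
\text{minimum count} \approx \lceil 0.150 \times 17{,}547 \rceil = \lceil 2632.05 \rceil = 2633
\]
```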
Figure 3

Support and lift value selection.

Among the 7677 association rules, we retrieved subsets that related to five adverse conditions included in NCDs from CCI, including cardiovascular disease, chronic pulmonary disease, rheumatoid arthritis, diabetes, and non-skin cancer. For each of the input conditions, we then selected the top 10 association rules with the highest lift score regardless of whether the disease appeared on the left or right side. From the top rules of each comorbidity, we determined that (1) CKD patients with comorbidity of cardiovascular disease have symptoms of high blood pressure, high cholesterol, asthma, function limitation, and lower aerobic and strengthening level; (2) CKD patients with a comorbidity of chronic pulmonary disease tend to have clinical manifestations of being overweight, hypertension, unhealthy diet (french fries or fried potatoes, less consumption of fruit and vegetables), and lower aerobic and strengthening level; (3) CKD patients with a comorbidity of rheumatoid arthritis are associated with hypertension, overweight, asthma, difficulty walking/doing errands alone, and less leisure-time physical activities; (4) CKD patients with a comorbidity of diabetes have a variety of clinical manifestations, including hypertension, high cholesterol, overweight, less leisure-time physical activities, and lower aerobic and strengthening level; and (5) CKD patients with non-skin cancer are associated with age (older than 65 years), asthma, less muscle strengthening, and lower aerobic level. Examples of the top rules with the highest lift scores are shown in Table 2 (see Multimedia Appendix 3 for details on the results of the top 10 rules for subsets).
Table 2

Examples of the top rule for subsets.

| Comorbidities | Keywords [a] | Top rule | Lift | Count |
| --- | --- | --- | --- | --- |
| Cardiovascular disease | ‘x.michd’ | {diffwalk,x.casthm1,x.rfhype5,x.rfsmok3} => {x.michd} | 1.43 | 2783 |
| Chronic pulmonary disease | ‘chccopd1,’ ‘x.casthm1’ | {diffalon,x.casthm1,x.drdxar1,x.rfsmok3} => {diffwalk} | 1.93 | 2643 |
| Rheumatoid arthritis | ‘x.drdxar1’ | {diffalon,x.casthm1,x.drdxar1,x.rfsmok3} => {diffwalk} | 1.93 | 2676 |
| Diabetes | ‘diabete3’ | {diffwalk,x.casthm1,x.rfchol1,x.rfhype5,x.rfsmok3} => {diabete3} | 1.53 | 2726 |
| Cancer | ‘chcocncr’ | {chcocncr,x.casthm1} => {x.age65yr} | 1.2 | 2697 |

[a] We used the variable code to represent each variable. The meaning of the code is shown in Multimedia Appendix 1.

We conducted a correlation analysis using variables present in the top 10 rules for the five NCDs and CKD with a total of 25 variables. Figure 4 shows the heatmap of the correlation coefficient values of those variables with the five NCDs, CKD, and CKD with the comorbidities. Spearman rank correlation test was used for the analysis.
Figure 4

Heatmap for correlation analysis of comorbidities and lifestyle risk factors.

The correlation analysis showed that people with NCDs including CKD have less physical activity in their leisure time and consume fewer fruits and vegetables. Hypertension, high cholesterol, age older than 65 years, male sex, difficulty walking, and difficulty concentrating had positive correlations with cardiovascular disease and rheumatoid arthritis but negative correlations with CKD comorbid conditions.

For the literature enrichment analysis of SemMedDB, we set the support threshold to 0.00047 and the confidence threshold to 0.0001 to ensure every rule was presented at least two times, and then ranked the results with lift value greater than 1 in descending order to finalize important association rules. Among all 1323 association rules, 140 keywords from six specific semantic types were selected as a lifestyle word list (daily or recreational activity, food, hazardous or poisonous substance, individual behavior, mental or behavioral dysfunction, finding) to detect novel rules related to lifestyle risk factors present in publications. Multimedia Appendix 4 shows the top 20 rules. Associations found using this method indicated that iron deficiency, depressed mood, sedentary lifestyle, and malnutrition were associated with anemia, hyperparathyroidism, obesity, and atherosclerosis, respectively, which the BRFSS questionnaire does not mention (Table 3).
Table 3

Lifestyle-related top rules of SemMedDB.

| Top rules | Lift | Count |
| --- | --- | --- |
| {Iron deficiency} => {Anemia} | 20.92 | 3 |
| {Depressed mood} => {Hyperparathyroidism; Secondary} | 16.24 | 2 |
| {Obesity} => {Sedentary} | 15.18 | 2 |
| {Obesity} => {Hypercholesterolemia} | 15.18 | 2 |
| {Malnutrition} => {Atherosclerosis} | 9.33 | 3 |

Discussion

Comparison With Other Studies and Reviews

The association rules indicated that CKD patients comorbid with cardiovascular disease are more likely to have symptoms of high blood pressure, high cholesterol, asthma, function limitation, and lower aerobic and strengthening levels. The proper assessment of overall progressive risk in patients with CKD requires an adequate assessment of the presence and severity of other major risk factors. CKD is an independent risk factor for the development of cardiovascular disease; CKD is considered a cardiovascular disease risk equivalent [26,27]. Damaged kidneys may release too much renin, which helps to control blood pressure but increases the risk for heart attack, congestive heart failure (CHF), and stroke. CHF is responsible for up to 50% of deaths in patients with renal failure [3,28]. The signs and symptoms of heart failure include shortness of breath (dyspnea), fatigue, and weakness, consistent with our findings. The Physicians’ Health Study and other observational studies suggest that increased physical activity, higher cardiorespiratory fitness, and lower sedentary time are associated with reduced incidence of CHF [29]. Evidence shows that exercise training results in improved physical performance and functioning in patients with CKD [30]. Hence, the highlight of lifestyle modification of CKD with cardiovascular disease is to increase aerobic capacity by improving muscle strength or functional ability. Findings also point to similar risk factors for CKD with chronic pulmonary disease or rheumatoid arthritis. The relationship between rheumatoid arthritis and chronic pulmonary disease (especially for chronic obstructive pulmonary disease) was found recently, in which people with rheumatoid arthritis were at 47 percent greater risk of hospitalization for chronic obstructive pulmonary disease than those in the control group [31]. 
Our findings support this evidence in the CKD cohort. The mechanisms that link CKD with these two comorbidities are speculative at present and might involve inflammation, autoimmunity, or genetic predispositions shared between them. The lifestyle risk factors of these two comorbidities include hypertension, overweight, unhealthy diet (french fries or fried potatoes, less consumption of fruit and vegetables), and physical inactivity. Because the items did not determine exactly when symptoms of CKD or other NCDs originated, there are two possible interpretations of the result. One possible interpretation is that participants began reducing their physical activity and intake of fruits and vegetables when they developed CKD or chronic conditions. Symptoms of chronic conditions, such as hypertension, bone pain, peripheral neuropathy, side effects from medicines and fluid retention, itch, or sleep disturbance, can all negatively affect daily physical activity level, especially for CKD patients. Fruits and vegetables are a rich source of carbohydrates, vitamins, potassium, magnesium, and dietary fiber, whereas legumes and dried beans are important vegetable proteins. However, the limitation of potassium, fructose [32,33], or dietary protein intake has been common practice to control uremia. Despite the known benefits of fruit and vegetable consumption, intake remains poor in both the general and CKD populations [34]. An alternative interpretation is that lower vegetable and fruit consumption contributes to the development or maintenance of CKD or other NCDs. This interpretation has greater plausibility because it is consistent with other epidemiological studies and existing biological knowledge. However, fruits and vegetables should not be omitted from the everyday diet; this practice may lead to nutrient deficiency and low fiber-related constipation, which contribute to further accumulation of uremic toxins.
The national “2 fruits and 5 vegetables” campaign guides Australians toward healthy fruit and vegetable consumption, which is applicable to CKD [35]. Also, regular participation in moderate-intensity exercise may enhance certain aspects of immune function and exert anti-inflammatory effects. Therefore, the lifestyle modification of CKD with chronic pulmonary disease or rheumatoid arthritis should be high dietary fiber intake and participation in moderate-intensity exercise to decrease inflammation and oxidative stress. CKD is associated with insulin resistance and, in advanced CKD, decreased insulin degradation. In the association rules for CKD with diabetes, the results pointed to hypertension, high cholesterol, overweight, less leisure-time physical activities, and lower aerobic and strengthening as lifestyle risk factors rather than an unhealthy diet. Hence, the lifestyle modification of CKD with diabetes is consistent with the prevention of type 2 diabetes (predominantly exercise and weight loss), which can successfully decrease the development of CKD with diabetes. CKD is recognized as a disease that may complicate cancer and its therapy (eg, immunotherapy). Cancer can cause CKD either directly or indirectly through the adverse effects of therapies; conversely, CKD may be a risk factor for cancer [36,37]. We found that age older than 65 years and physical inactivity were associated with CKD with non-skin cancer. The BRFSS questionnaire does not incorporate the therapeutics of cancer; therefore, the lifestyle risk factors of CKD with cancer cannot be evaluated in our research.

Enrichment of the BRFSS Questionnaire

The BRFSS does not specifically target CKD or NCDs; therefore, many clinical manifestations were not considered, including potentially relevant items such as anorexia, nausea, vomiting, fatigue, anemia, and bone disease. To enrich the questionnaire, we used the SemMedDB to find lifestyles that related to the clinical manifestations specifically with CKD from publications. The results indicated that iron deficiency, depressed mood, sedentary lifestyle, and malnutrition are associated with anemia, hyperparathyroidism, obesity, and atherosclerosis, respectively, which the BRFSS questionnaire did not mention. CKD can affect a patient’s health-related quality of life in many ways. The diagnosis alone might cause fear or anxiety. Anemia, frailty, coexisting comorbidities, and depression are also major contributory factors to quality of life in CKD. Meat and meat alternatives are the main source of protein in the CKD diet. Healthy choices include lean cuts of meat, skinless poultry, eggs, fish, seafood, and plant-based protein foods such as legumes, dried beans, nuts, and seeds. The questionnaire of the BRFSS does not contain the variables of meat or protein consumption, nor does it contain information on micronutrient deficiency.

Effectiveness of Association Rule Mining in the Noncommunicable Disease Domain

The results of the correlation analysis found that hypertension, high cholesterol, age older than 65 years, male sex, difficulty walking, and attention deficit disorder were positively correlated with cardiovascular disease and rheumatoid arthritis, but negatively correlated with corresponding CKD comorbidities (CKD with cardiovascular disease/rheumatoid arthritis). The ARM results suggest that patients with CKD older than 65 years are more likely to have signs or symptoms of hypertension, asthma, and difficulty walking, which is inconsistent with the aforementioned findings. This inconsistency is caused by the differences between the two methods: a correlation describes the pairwise relationship between two variables, in which a change in one variable is accompanied by a change in the other. Association rules are of the form {X1, ..., Xn} → Y, meaning that if all of the signs or symptoms X1, ..., Xn are found in a disease, another sign or symptom (Y) is likely to be found as well. Epidemiological studies and existing domain knowledge are inconsistent with the result of correlation analysis but consistent with the results of ARM. A wide range of disorders may develop as a consequence of the loss of renal function with CKD. These include disorders of fluid and electrolyte balance, as well as abnormalities related to hormonal or systemic dysfunction. Treatment strategies should be modified based on the needs of the individual patient. Variations and inconsistencies are inevitable in clinical practice; therefore, recognizing modifiable risk factors in medical interventions is important for providing effective chronic disease management. ARM has several applications in the medical domain, and it has been used for detecting risk factors for diabetes and cardiovascular disease [38,39].
This study illustrates how ARM approaches could be used in risk factor detection of CKD and demonstrates the potential effectiveness of ARM analysis for NCDs. ARM methods, such as Apriori, have also been used on electronic health record data to identify associations among clinical concepts. The strength of the ARM approach compared with a more conventional correlation analysis is that it has identified sizeable groups that can easily be defined and identified for intervention at a practice level in real time to allow more focused and immediate correction of bias in chronic disease management.

Limitations

This research used a large representative sample, was based on items that asked about diagnosed disease, and included a number of relevant covariates; however, some aspects of this study should be noted as limitations. First, of 17,547 CKD patients, only 15,348 completed the interview. The dataset was skewed toward the white race and male gender, which may affect the generalizability of lifestyle interventions to other races and to females. Other research found similar results [40], in which lower response rates (<40%) were associated with the underrepresentation of racial/ethnic minorities (eg, Hispanics), women, and younger individuals in the BRFSS survey. Second, CKD and cancer can influence each other either directly or indirectly through the adverse effects of therapies. Because the BRFSS questionnaire relies on self-reporting, it does not provide enough information to examine this relationship. The lifestyle risk factors of CKD with cancer could be confirmed in further research using direct physical examination or biochemical indexes. Third, the semantic predications use UMLS Metathesaurus concepts as arguments, so we cannot tell whether “sedentary lifestyle” and “depressed mood” can be treated as equivalent to “leisure-time physical activity calculate variables” and “ever been told you have depressive disorder,” respectively. As such, whether these differences are involved in the observed associations for CKD needs to be considered in further epidemiological research. More robust observational or quasi-experimental studies would be needed to fully support the long-term impact of interventions for modifiable risk factors. Finally, for the semantic predication triples extracted from SemMedDB, we ignored the semantic meaning of the predicates and kept only subjects and objects as pairwise associations. However, we also found some predications with negative meanings.
For example, several triples with CKD as the object, such as <renal failure, Neg_Coexists_With, CKD>, contain predicates with negative meaning, such as does not treat, has no association with, does not coexist with, or does not manifest. We did not remove those triples entirely because we found inconsistencies: both positive and negative relationships may be reported for the same factor. For example, according to a 2015 study conducted by Wong [41], a positive relationship between abnormal blood pressure and CKD was found in SemMedDB; however, in a 1992 study conducted by Taniguchi et al [42], a negative relationship between the same two items was detected. SemMedDB only maintains information contained in titles and abstracts; therefore, it is difficult to address inconsistencies without reading through the full text. In the future, we will count positive and negative associations for each pairwise term and assign weights for different predications for a better semantic representation.
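The counting step proposed as future work could be sketched roughly as follows. The triples, the convention that negated predicates carry a "NEG_" prefix (checked case-insensitively), and the net-score weight are all assumptions made for illustration; the weighting scheme in particular is hypothetical and is not specified in this study.

```python
from collections import defaultdict

# Hypothetical predication triples (subject, predicate, object); not taken from SemMedDB.
triples = [
    ("abnormal blood pressure", "COEXISTS_WITH", "CKD"),
    ("abnormal blood pressure", "NEG_COEXISTS_WITH", "CKD"),
    ("abnormal blood pressure", "PREDISPOSES", "CKD"),
    ("renal failure", "Neg_Coexists_With", "CKD"),
]

def is_negated(predicate):
    # Assumption: negated predicates start with "NEG_", in any letter case.
    return predicate.upper().startswith("NEG_")

def pair_weights(triples):
    """Count positive and negative predications per (subject, object) pair.

    Returns {pair: (positive, negative, weight)}, where weight is the
    hypothetical net score (positive - negative) / (positive + negative).
    """
    counts = defaultdict(lambda: {"pos": 0, "neg": 0})
    for subj, pred, obj in triples:
        counts[(subj, obj)]["neg" if is_negated(pred) else "pos"] += 1
    weights = {}
    for pair, c in counts.items():
        total = c["pos"] + c["neg"]
        weights[pair] = (c["pos"], c["neg"], (c["pos"] - c["neg"]) / total)
    return weights

weights = pair_weights(triples)
for pair, (pos, neg, w) in weights.items():
    print(pair, pos, neg, round(w, 3))
```

A pair with a weight near +1 is reported mostly in positive predications, a weight near -1 indicates mostly negated predications, and values near 0 flag the kind of conflicting evidence described above.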

Conclusion

This study related lifestyle risk factors to CKD with five comorbid chronic conditions using the largest national US survey available and provided a suggestion for BRFSS questionnaire enrichment. Various lifestyle risk factors result in the presence of different comorbid conditions for CKD patients, and different signs and symptoms may be observed. The findings illustrate how ARM approaches could be used in risk factor detection of chronic diseases to allow more focused and optimized chronic disease management.
  32 in total

1.  Excess risk of chronic kidney disease among African-American versus white subjects in the United States: a population-based study of potential explanatory factors.

Authors:  Michelle E Tarver-Carr; Neil R Powe; Mark S Eberhardt; Thomas A LaVeist; Raynard S Kington; Josef Coresh; Frederick L Brancati
Journal:  J Am Soc Nephrol       Date:  2002-09       Impact factor: 10.121

2.  The interaction of domain knowledge and linguistic structure in natural language processing: interpreting hypernymic propositions in biomedical text.

Authors:  Thomas C Rindflesch; Marcelo Fiszman
Journal:  J Biomed Inform       Date:  2003-12       Impact factor: 6.317

3.  2013 AHA/ACC guideline on lifestyle management to reduce cardiovascular risk: a report of the American College of Cardiology/American Heart Association Task Force on Practice Guidelines.

Authors:  Robert H Eckel; John M Jakicic; Jamy D Ard; Janet M de Jesus; Nancy Houston Miller; Van S Hubbard; I-Min Lee; Alice H Lichtenstein; Catherine M Loria; Barbara E Millen; Cathy A Nonas; Frank M Sacks; Sidney C Smith; Laura P Svetkey; Thomas A Wadden; Susan Z Yanovski
Journal:  J Am Coll Cardiol       Date:  2013-11-12       Impact factor: 24.094

4.  2017 ACC/AHA/AAPA/ABC/ACPM/AGS/APhA/ASH/ASPC/NMA/PCNA Guideline for the Prevention, Detection, Evaluation, and Management of High Blood Pressure in Adults: Executive Summary: A Report of the American College of Cardiology/American Heart Association Task Force on Clinical Practice Guidelines.

Authors:  Paul K Whelton; Robert M Carey; Wilbert S Aronow; Donald E Casey; Karen J Collins; Cheryl Dennison Himmelfarb; Sondra M DePalma; Samuel Gidding; Kenneth A Jamerson; Daniel W Jones; Eric J MacLaughlin; Paul Muntner; Bruce Ovbiagele; Sidney C Smith; Crystal C Spencer; Randall S Stafford; Sandra J Taler; Randal J Thomas; Kim A Williams; Jeff D Williamson; Jackson T Wright
Journal:  Circulation       Date:  2018-10-23       Impact factor: 29.690

5.  A new method of classifying prognostic comorbidity in longitudinal studies: development and validation.

Authors:  M E Charlson; P Pompei; K L Ales; C R MacKenzie
Journal:  J Chronic Dis       Date:  1987

6.  Chronic kidney disease care program improves quality of pre-end-stage renal disease care and reduces medical costs.

Authors:  Shu-Yi Wei; Yong-Yuan Chang; Lih-Wen Mau; Ming-Yen Lin; Herng-Chia Chiu; Jer-Chia Tsai; Chih-Jen Huang; Hung-Chun Chen; Shang-Jyh Hwang
Journal:  Nephrology (Carlton)       Date:  2010-02       Impact factor: 2.506

7.  Risk of Incident Chronic Obstructive Pulmonary Disease in Rheumatoid Arthritis: A Population-Based Cohort Study.

Authors:  Katherine Mcguire; J Antonio Aviña-Zubieta; John M Esdaile; Mohsen Sadatsafavi; Eric C Sayre; Michal Abrahamowicz; Diane Lacaille
Journal:  Arthritis Care Res (Hoboken)       Date:  2018-04-02       Impact factor: 4.794

8.  Epidemiology of chronic kidney disease in cancer patients: lessons from the IRMA study group.

Authors:  Vincent Launay-Vacher
Journal:  Semin Nephrol       Date:  2010-11       Impact factor: 5.299

9.  Prevalence of chronic kidney disease and decreased kidney function in the adult US population: Third National Health and Nutrition Examination Survey.

Authors:  Josef Coresh; Brad C Astor; Tom Greene; Garabed Eknoyan; Andrew S Levey
Journal:  Am J Kidney Dis       Date:  2003-01       Impact factor: 8.860

10.  Predicate Oriented Pattern Analysis for Biomedical Knowledge Discovery.

Authors:  Feichen Shen; Hongfang Liu; Sunghwan Sohn; David W Larson; Yugyung Lee
Journal:  Intell Inf Manag       Date:  2016-05