| Literature DB >> 26835413 |
Sunmoo Yoon1, Jose Gutierrez2.
Abstract
PURPOSE: Disability is a potential risk for stroke survivors. This study aims to identify disability risk factors associated with stroke and their relative importance and relationships from a national behavioral risk factor dataset.Entities:
Keywords: Stroke; data mining; patient outcome; visualization
Year: 2015 PMID: 26835413 PMCID: PMC4729578 DOI: 10.9734/BJMMR/2016/21601
Source DB: PubMed Journal: Br J Med Med Res ISSN: 2231-0614
Fig. 1WHO’s international classification of functioning, disability, and health (ICH) (Left), operationalized concepts of ICF (Right)
Fig. 2Steps of data mining for building a disability association model for post-stroke patients
Characteristics of stroke survivors in 2011 a national survey (N=19,458)
| Characteristics | Sample | Characteristics | Sample |
|---|---|---|---|
|
| |||
| Personal factors | Personal factors | ||
| Age, years, mean | 66.5 (SD=15.2) | ||
| Sex (female) | 12,137 (62%) | Current everyday | 2,704 (14%) |
| Current someday | 1,041 (5%) | ||
| White | 14,825 (76%) | Former smoke | 7,480 (39%) |
| Black | 2,103 (11%) | Never smoke | 8,100 (42%) |
| Other | 675 (3%) | ||
| Multiracial | 532 (3%) | Yes | 5,455 (28%) |
| Hispanic | 985 (5%) | No | 12,484 (64%) |
| Never attend | 42 (0%) | Excellent | 756 (4%) |
| Grades 1–8 years | 1,246 (6%) | Very good | 2,744 (14%) |
| Grades 9–11 years | 2,226 (12%) | Good | 5,499 (28%) |
| Grade 12 | 6,923 (36%) | Fair | 5,699 (29%) |
| College 1–3 years | 5,071 (26%) | Poor | 4,637(24%) |
| College >= 4 years | 3,919 (20%) | ||
| Physical, days, mean | 11 (SD=12.5) | ||
| Employed for wages | 2,208 (11%) | Mental, days, mean | 6 (SD=10.2) |
| Self-employed | 705 (4%) | Both, days, mean | 10 (SD=11.2) |
| Out of work > 1 year | 543 (3%) | ||
| Out of work < 1 year | 306 (2%) | Yes | 10,827 (55%) |
| A homemaker | 1,238 (6%) | No | 8,631 (44%) |
| A student | 58 (0%) | Not sure | 125 (1%) |
| Retired | 9,909 (51%) | Refused | 19 (0%) |
| Unable to work | 4,382 (23%) | ||
| Married | 8,049 (41%) | Yes | 7,537 (39%) |
| Divorced | 3,580 (18%) | No | 11,894 (61%) |
| Widowed | 5,790 (30%) | ||
| Separated | 536 (3%) | ||
| Never Married | 1,249 (7%) | ||
| Have coverage | 18,085 (93%) | ||
| Myocardial infarction | 5,731 (30%) | No coverage | 1,318 (7%) |
| Coronary heart disease | 4,890 (26%) | ||
| Asthma | 3,826 (20%) | Cost barrier | 2,775 (14%) |
| Injury by fall | 2,371 (44%) | No cost barrier | 16,605 (86%) |
Fig. 3Infographics of correlates of disability among post-stroke patients (number and size representing β calculated by linear regression and M5’s methods representing relative importance using weka software, model fit: correlation coefficeint 0.47, root mean squared error 10.74,)
Fig. 4Infographics of correlation models for disability among post-stroke patients generated using C4.5 (J48) algorithm using Weka software
Variables associated with disability among stroke survivors
| Category | Variable | Rank | Model | Literature | |
|---|---|---|---|---|---|
|
| |||||
| Personal factors | |||||
| Use of assistive device | 1 | ☒ | accuracy | ||
| Insulin use | 6 | 69% | |||
| Asthma | 10 | ☒ | AUC 73% | ||
| Chronic illness | 11 | ||||
| Pain | 16 | [ | |||
| Daibetes | 22 | ☒ | [ | ||
| Myocardial infarction | 23 | ☒ | [ | ||
| Coronary heart disease | 26 | ☒ | |||
| Cancer | |||||
| Snoring | [ | ||||
| Depression | [ | ||||
| Gender | 3 | ☒ | accuracy | [ | |
| Employment | 7 | ☒ | 61% | [ | |
| Marital status | 18 | AUC 66% | [ | ||
| Veteran experience | 19 | ||||
| Income level | 21 | ☒ | [ | ||
| Age | 27 | [ | |||
| Race | 28 | ☒ | [ | ||
| Education | ☒ | [ | |||
| Excercise | 2 | ☒ | accuracy | [ | |
| Smoking | 5 | ☒ | 65% | [ | |
| Quit smoking | 5 | AUC 66% | [ | ||
| Last smoking | 12 | ||||
| Drinking | 14 | [ | |||
| HIV risk behavior | 15 | ||||
| 100 cigarettes in life | 17 | ☒ | |||
| Sleep duration | 20 | [ | |||
| Quality of resting | 24 | ☒ | [ | ||
| # of fall in 3 months | 25 | ||||
| Preventative screening | |||||
| Immunization behaviors | |||||
| Health care access | Medical cost barriers | 8 | ☒ | accuracy | |
| Insurance coverage | ☒ | 56% | [ | ||
| # of health care providers | ☒ | AUC 56% | |||
| Social/family support | Satisfy of life | 4 | accuracy 56% AUC 51% | [ | |
| Frequency of emotional support | [ | ||||
| # of adults in a family | ☒ | [ | |||
Rank of relative importance calculated by data mining linear regression with M5’s methods range from 1 to 28. Blank means the variable were not seleced by M5 algorithm.
Variables included in the association model for stroke disability detected by C4.5 algorithm. The association models are presented in Fig. 3