Yanshan Wang, Liwei Wang, Majid Rastegar-Mojarad, Sijia Liu, Feichen Shen, Hongfang Liu.
Abstract
Family history is an important component of modern clinical care, especially in the era of precision medicine. Family history information in the Electronic Health Record (EHR) system is usually stored in structured format as well as in free-text format. In this study, we systematically analyzed a family history text corpus from 3 million clinical notes for patients receiving their primary care at Mayo Clinic. Family members, medical problems, and their associations were analyzed and reported. Our findings showed high agreement between positive/negated medical problems mentioned in the diagnosis report and the family history, as measured by observed agreement and random agreement. We also found that the family history of some medical problems existed as much as 10 to 15 years prior to the diagnosis date of such problems. Finally, two patient cases were studied to show the medical problems in the diagnosis and family history along a timeline.
Year: 2017 PMID: 28815117 PMCID: PMC5543380
Source DB: PubMed Journal: AMIA Jt Summits Transl Sci Proc
A list of family members.
| Aunt | Brother | Child | Cousin | Daughter |
| Father | Grandfather | Grandmother | Grandparent | Mother |
| Parent | Sibling | Sister | Son | Uncle |
Examples of family history from the FH Corpus.
| Prostate cancer - three Brothers |
| Liver Disease--- Brother |
| Pancreatic Cancer--2 Brothers |
| HTN-- Father |
| Elevated Chol/Trigs.---Brother, Sister |
| MI, Stroke-- Father age 54 |
| FATHER |
| Father alive, 74, High blood pressure |
| MOTHER |
| Mother died at 74 from COPD (smoker), CHF, overweight |
| SISTERS |
| 5 sisters alive |
| SONS |
| 2 sons alive |
| GRANDPARENTS |
| Maternal Grandmother, deceased, *** years - died of “old age” |
| Maternal Grandfather, deceased, unknown |
| Paternal Grandmother, deceased, 70’s, COPD (smoker) |
| Paternal Grandfather, deceased, 70’s, colon cancer |
| This is *** third pregnancy. Her first two pregnancies, through her previous husband, resulted in full-term females who are currently healthy at the ages of *** and ***. *** has a healthy ***-year-old brother whose partner is currently pregnant. She has a healthy ***-year-old brother who has no children. She has a healthy ***-year-old sister who has a healthy ***-month-old daughter. ***s father is healthy at the age of ***. Her mother suffers from type II diabetes at the age of ***. ***s mother has six siblings of whom one, a sister, has at least seven healthy children and has had three miscarriages, for which no reason was given. ***s partner, ***, is reportedly healthy at the age of ***. He has a healthy ***-year-old brother who has a healthy *** son. His mother is healthy at the age of ***. His father, age ***, reportedly has an adult-onset arrhythmia. There were no reports of mental retardation, learning disabilities, or birth defects. No family members had babies that were still born or died early. There were no reports of cancer before the age of ***. The remainder of the family history was non-contributory to today’s discussion. There is no consanguinity reported between these families. |
*** indicates de-identified information.
Figure 1: Distribution of age in the FH Cohort.
Figure 2: Distribution of family members in the FH Corpus.
Top 10 medical problems in the family history section.
| Medical Problem | Frequency |
|---|---|
| Hypertension | 49,460 (5.5%) |
| Depression | 44,900 (5.0%) |
| Cancer | 44,306 (4.9%) |
| Diabetes | 35,281 (3.9%) |
| High Blood Pressure | 33,437 (3.7%) |
| Alcohol Abuse | 28,580 (3.2%) |
| Heart Disease | 27,758 (3.1%) |
| High Cholesterol | 25,592 (2.8%) |
| CAD | 24,837 (2.7%) |
| Breast Cancer | 23,266 (2.6%) |
Figure 3: Heat map of frequencies of co-occurrence of top 10 medical problems and family members.
Figure 4: Top 5 medical problems for each family member in the FH Corpus.
Comparison of frequencies of medical problems mentioned in diagnosis and family history.
| Mentioned in Family History (rows) vs. Mentioned in Diagnosis (columns) | Positive | Negated | Not Found |
|---|---|---|---|
| Positive | 58,463 | 2,887 | 427,306 |
| Negated | 11,141 | 1,348 | 111,649 |
| Not Found | 1,617,480 | 153,044 | - |
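The abstract reports agreement in terms of observed agreement and random agreement. As a minimal sketch of how such figures can be derived from the counts above, the following Python computes both on the Positive/Negated cells only. Restricting to these four cells is an assumption made here (the Not Found/Not Found cell is not reported), and the chance-agreement formula is the standard product-of-marginals form; the record does not state which definition the authors used, so the printed values are illustrative rather than the paper's reported numbers.

```python
# Counts copied from the comparison table.
# Key order: (family history label, diagnosis label).
# Assumption: only the Positive/Negated cells are used, because the
# "Not Found"/"Not Found" cell is not reported.
table = {
    ("positive", "positive"): 58_463,
    ("positive", "negated"): 2_887,
    ("negated", "positive"): 11_141,
    ("negated", "negated"): 1_348,
}


def agreement(counts):
    """Return (observed agreement, chance agreement) for a 2x2 table."""
    labels = ("positive", "negated")
    total = sum(counts.values())

    # Observed agreement: share of mentions where both sources use the same label.
    observed = sum(counts[(label, label)] for label in labels) / total

    # Marginal proportions for family history (rows) and diagnosis (columns).
    row = {l: sum(counts[(l, c)] for c in labels) / total for l in labels}
    col = {l: sum(counts[(r, l)] for r in labels) / total for l in labels}

    # Chance agreement: probability of matching labels if the two sources
    # were independent with these marginals.
    expected = sum(row[l] * col[l] for l in labels)
    return observed, expected


observed, expected = agreement(table)
print(f"observed agreement: {observed:.3f}")
print(f"random agreement:   {expected:.3f}")
```

Comparing the two values indicates how much of the raw agreement could be explained by the skewed marginals alone, since positive mentions dominate both sources.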
Comparison of the most frequent medical problems in diagnosis and family history.
| (A) Positive in diagnosis, negated in family history: Medical Problem | (A) Frequency | (B) Negated in diagnosis, positive in family history: Medical Problem | (B) Frequency |
|---|---|---|---|
| Hypertension | 780 | Breast Cancer | 181 |
| Depression | 751 | Hypertension | 167 |
| Diabetes | 539 | Depression | 119 |
| Cancer | 447 | Cancer | 116 |
| Asthma | 391 | Asthma | 99 |
| Coronary artery disease | 378 | Stroke | 96 |
| Colon cancer | 280 | Coronary artery disease | 96 |
| Hyperlipidemia | 178 | Diabetes | 88 |
| Breast cancer | 173 | Obstructive sleep apnea | 81 |
| Attention deficit disorder | 160 | Sleep apnea | 68 |
| Prostate cancer | 160 | Attention deficit disorder | 58 |
| Anxiety | 158 | Myocardial infarction | 54 |
| Headaches | 157 | Allergies | 53 |
| Seizure | 145 | Myocardial ischemia | 47 |
| Skin cancer | 131 | Anxiety | 34 |
| Osteoporosis | 129 | Restless legs | 34 |
| Migraine headaches | 124 | Colon cancer | 31 |
| Sleep apnea | 123 | Heart Disease | 31 |
| Melanoma | 119 | Headaches | 26 |
| Pain | 115 | Diarrhea | 25 |
Figure 5: Number of years between the diagnosis date of a medical problem and the first date that medical problem was mentioned in the patient’s family history.
Figure 6: Timeline charts of two patient cases. The x-axis represents the timeline of clinical notes and the y-axis the medical problems. A turquoise dot represents a medical problem in a diagnosis, and a blue dot indicates the first date that medical problem was diagnosed. A yellow diamond represents a medical problem in the family history, and a red diamond indicates the first date that medical problem appeared in the family history.