| Literature DB >> 31615985 |
Hyojung Paik1,2,3,4, Matthew J Kan1,2, Nadav Rappoport1,2, Dexter Hadley1,2, Marina Sirota1,2, Bin Chen1,2, Udi Manber1,5, Seong Beom Cho6, Atul J Butte7,8.
Abstract
The identification of novel disease associations using big-data for patient care has had limited success. In this study, we created a longitudinal disease network of traced readmissions (disease trajectories), merging data from over 10.4 million inpatients through the Healthcare Cost and Utilization Project, which allowed the representation of disease progression mapping over 300 diseases. From these disease trajectories, we discovered an interesting association between schizophrenia and rhabdomyolysis, a rare muscle disease (incidence < 1E-04) (relative risk, 2.21 [1.80-2.71, confidence interval = 0.95], P-value 9.54E-15). We validated this association by using independent electronic medical records from over 830,000 patients at the University of California, San Francisco (UCSF) medical center. A case review of 29 rhabdomyolysis incidents in schizophrenia patients at UCSF demonstrated that 62% are idiopathic, without the use of any drug known to lead to this adverse event, suggesting a warning to physicians to watch for this unexpected risk of schizophrenia. Large-scale analysis of disease trajectories can help physicians understand potential sequential events in their patients.Entities:
Mesh:
Year: 2019 PMID: 31615985 PMCID: PMC6794302 DOI: 10.1038/s41597-019-0220-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Overview of the main study data and analytics. (a) Overview of the data set build and analytics. We prepared the study data set by combining five editions of the California State Inpatient Database (CA SID) sets, which were released annually (2006–2010). In summary, we removed admissions for which primary diagnoses were not related to disease (pregnancy, injury, external causes, administration). The merged CA SID covers longitudinal records (1980–2010) for >1.4 million patients without data redundancy. (b) Construction of DAGs. The principal inpatient diagnoses for patients were selected and 682 significant disease association pairs (RA > 1, FDR < 0.1) were identified. The average temporal correlation was determined within pairs and then concatenated by using a greedy algorithm to create DAGs (i.e., trajectories) with 117 starting nodes.
Fig. 2Selected disease trajectories (DAGs). Selected DAGs (117 in total) showing trajectories for one-year intervals between primary diagnosis codes for California inpatient admissions. Areas of shapes are directly proportional to the number of patients, where circles represent primary diagnoses and squares represent deaths. Nodes are colored by mean age of patients. The thickness of edges is determined by the number of patients traveling along the edge (see Legend). (a) DAG of “chronic liver disease and cirrhosis.” Prominent nodes include “liver abscess and sequelae of chronic liver disease,” “septicemia,” and “death.” (b) DAG of “acute myocardial infarction.” The majority of readmissions were for “heart failure” and “other forms of chronic ischemic heart disease,” and “acute myocardial infarction” and “heart failure” were the nodes most associated with death. Note that some of these second diagnoses occur in a larger population of patients beyond those coming in with acute myocardial infarction, and this can lower their mean age of incidence. (c) DAG of “pneumonia, organism NOS.” Common readmissions were for “septicemia,” “heart failure,” “other disease of lung,” “pneumonitis due to solids and liquids,” and “other bacterial pneumonia.” These diagnoses were all strongly associated with death. (d) DAG of “episodic mood disorders,” which includes diagnosis codes for uncharacterized mental disorders. The majority of hospitalizations were readmitted for diagnoses of “schizophrenia” and a significant proportion of these individuals were later hospitalized for “disorders of muscle ligament and fascia;” 93% of the admissions for “disorders of muscle ligament and fascia” were more specifically coded for rhabdomyolysis (ICD-9-CM code = 728.88).
Fig. 3The identified novel association between schizophrenia and rhabdomyolysis. Our scaled analysis of digitalized medical records from millions of patients reveals a novel association between schizophrenia and rhabdomyolysis. (a) DAG of schizophrenia and disorders of muscle, ligament, and fascia enriched with rhabdomyolysis based on the data of CA SID. (b) DAG of schizophrenia and disorders of muscle, ligament, and fascia enriched with rhabdomyolysis based on the EHRs of the UCSF Medical Center. (c) The identical DAG of (b) using the subset of the EHRs of the UCSF Medical Center consisting of inpatient records. (d) The proportion of the diagnoses of rhabdomyolysis in schizophrenia across different data sets. Black bars indicate the fraction of rhabdomyolysis among the muscle disease (ICD-9-CM code = 728) in schizophrenia. White bars represent the rate of rhabdomyolysis in muscle disease-diagnosed patients. (e) Lab test evidence for the diagnoses of rhabdomyolysis in (B). The levels of creatine kinase in patients who were diagnosed with rhabdomyolysis after schizophrenia are extremely high. (f) The level of creatine kinase in patients who were diagnosed with schizophrenia, but not diagnosed with rhabdomyolysis in (b). Because those patients already suffer with illness, the creatine kinase levels are mildly increased but these values are far from those of (f). The blue and red lines are the reference range for normal levels of creatine kinase (38–174 units/L [reference for male, blue line]; 96–140 units/L [reference for female, red line]). The colors of dots indicate normality or abnormality marks assigned by healthcare providers (black for normal and orange for abnormal tags).