| Literature DB >> 28783162 |
Kanix Wang1,2, Hallie Gaitsch2, Hoifung Poon3, Nancy J Cox4, Andrey Rzhetsky2,5.
Abstract
In this study, we used insurance claims for over one-third of the entire US population to create a subset of 128,989 families (481,657 unique individuals). We then used these data to (i) estimate the heritability and familial environmental patterns of 149 diseases and (ii) infer the genetic and environmental correlations for disease pairs from a set of 29 complex diseases. The majority (52 of 65) of our study's heritability estimates matched earlier reports, and 84 of our estimates appear to have been obtained for the first time. We used correlation matrices to compute environmental and genetic disease classifications and corresponding reliability measures. Among unexpected observations, we found that migraine, typically classified as a disease of the central nervous system, appeared to be most genetically similar to irritable bowel syndrome and most environmentally similar to cystitis and urethritis, all of which are inflammatory diseases.Entities:
Mesh:
Year: 2017 PMID: 28783162 PMCID: PMC5577363 DOI: 10.1038/ng.3931
Source DB: PubMed Journal: Nat Genet ISSN: 1061-4036 Impact factor: 38.330
Figure 1Information on study population, results of model selection, and analysis of heritability of 149 diseases. (A) Distribution of study population across population density septile; septile 1 corresponds to the most-rural counties and septile 7 most-urban. (B) Number of children in a family as a function of population density septile; septile notations are the same as in the (A). (C) Parent/child age distribution in studied families. (D) Model selection results, using univariate models GF, GS, GCF, GCSF, GC, and GCS, where G stands for additive genetics, F for common family environment, S for common sibling environment, and C for environment common for parental couple; plot shows frequency of corresponding model becoming the “best” (rank 1) as compared by DIC, second best (rank 2), and so on; clearly, the GCS model wins in the majority of cases. (E) Disease heritability estimates with one standard deviation; diseases, heritability for which appears to be measured for the first time are marked with asterisk; heritability values are sorted in decreasing order; color of the bar indicates biological system associated with the disease, see key in the upper right corner; keys to disease acronyms are given in the Supplementary Table 1 and 2. (F) Estimates of disease heritability values against estimates of disease prevalence; the linear correlation is significantly negative, Pearson’s r = −0.212 (95% CI [−0.36 −0.05]), and p = 0.00915. (G) Comparison of our estimates of heritability with the previously published estimates; see Supplement Table 3 for detailed numbers.
Figure 2Genetic and environmental correlations between diseases. (A) Matrix of pairwise genetic correlations (upper half) and corresponding environmental interactions (lower half) colored by sign and magnitude (see legend) The disease color labels indicate biological systems associated; the size of the squares indicates statistical significance, see key on the right. Cells with asterisks indicate pairwise interactions that remained significant at a false discovery rate of 1%.[15] The color boxes within the matrix indicate opposite-sign correlation values for the same pair of diseases. Posterior probabilities of two correlation values (genetic and environmental) for the same pair of diseases having the same sign were 1.869 × 10−14 (ADHD and benign skin neoplasm), 3.376 × 10−14 (ADHD and non-melanoma skin cancer), 4.523 × 10−9 (adjustment disorder and general hypertension), 8.715 × 10−4 (migraine and type 1 diabetes mellitus), 9.251 × 10−5 (benign skin neoplasm and type 1 diabetes mellitus), 6.401 × 10−33 (benign skin neoplasm and general hypertension), 3.712 × 10−17 (non-melanoma skin cancer and general hypertension), 3.933 × 10−4 (allergic rhinitis and type 1 diabetes mellitus). (B) Distribution of (Genetic correlation − Environmental correlation) values for the same pair of diseases. (C) Individual distributions of genetic and environmental correlations superimposed on the same plot. (D) Comparison of our family-based estimates of genetic correlations between diseases compared to previously published GWAS-based values, the complete data on values and references is provided in the Supplement Table 5. Linear fit with a slope of 1.08 (SE=0.167) is indicated by the dotted line.
Disease prevalence and heritability estimates for 30 most prevalent diseases in our study.
| Disease | Prevalence | h2 | h2 SD |
|---|---|---|---|
| Cardiac Dysrhythmia | 0.045 | 0.240 | 0.011 |
| General Hypertension | 0.173 | 0.462 | 0.009 |
| Esophageal Disease | 0.077 | 0.292 | 0.008 |
| Functional Digestive Disorder | 0.051 | 0.203 | 0.009 |
| Type II Diabetes Mellitus | 0.066 | 0.561 | 0.010 |
| Allergic Rhinitis | 0.108 | 0.445 | 0.006 |
| Asthma | 0.063 | 0.457 | 0.008 |
| Atopic Contact Dermatitis | 0.095 | 0.202 | 0.006 |
| Chronic Sinusitis | 0.047 | 0.523 | 0.008 |
| Eye Inflammation | 0.045 | 0.292 | 0.009 |
| Osteoarthritis | 0.068 | 0.256 | 0.012 |
| Cellulitis | 0.061 | 0.226 | 0.007 |
| Ear Infection | 0.106 | 0.244 | 0.007 |
| Eye Infection | 0.053 | 0.200 | 0.009 |
| Fungal Infection | 0.063 | 0.211 | 0.007 |
| UTI | 0.083 | 0.227 | 0.007 |
| Viral Warts HPV | 0.038 | 0.289 | 0.009 |
| Acne | 0.036 | 0.501 | 0.010 |
| Keratosis | 0.058 | 0.344 | 0.015 |
| General Spondylosis Spine Disorder | 0.081 | 0.325 | 0.008 |
| Muscle Ligament Disorder | 0.121 | 0.268 | 0.006 |
| Synovium Tendon Bursa Disorder | 0.039 | 0.180 | 0.009 |
| Benign Colon Neoplasm | 0.039 | 0.173 | 0.019 |
| Benign Skin Neoplasm | 0.067 | 0.547 | 0.007 |
| Non-Melanoma Skin Cancer | 0.054 | 0.520 | 0.008 |
| Anxiety Phobic Disorder | 0.063 | 0.432 | 0.007 |
| Depression | 0.038 | 0.579 | 0.006 |
| Substance Abuse | 0.045 | 0.422 | 0.010 |
| Breast Disorder | 0.044 | 0.166 | 0.010 |
| Disease of the Female Reproductive Organs | 0.105 | 0.235 | 0.009 |