| Literature DB >> 28388634 |
Tian Ge1,2,3, Chia-Yen Chen2,3,4, Benjamin M Neale2,3,4, Mert R Sabuncu1,5, Jordan W Smoller2,3.
Abstract
Heritability estimation provides important information about the relative contribution of genetic and environmental factors to phenotypic variation, and provides an upper bound for the utility of genetic risk prediction models. Recent technological and statistical advances have enabled the estimation of additive heritability attributable to common genetic variants (SNP heritability) across a broad phenotypic spectrum. Here, we present a computationally and memory efficient heritability estimation method that can handle large sample sizes, and report the SNP heritability for 551 complex traits derived from the interim data release (152,736 subjects) of the large-scale, population-based UK Biobank, comprising both quantitative phenotypes and disease codes. We demonstrate that common genetic variation contributes to a broad array of quantitative traits and human diseases in the UK population, and identify phenotypes whose heritability is moderated by age (e.g., a majority of physical measures including height and body mass index), sex (e.g., blood pressure related traits) and socioeconomic status (education). Our study represents the first comprehensive phenome-wide heritability analysis in the UK Biobank, and underscores the importance of considering population characteristics in interpreting heritability.Entities:
Mesh:
Year: 2017 PMID: 28388634 PMCID: PMC5400281 DOI: 10.1371/journal.pgen.1006711
Source DB: PubMed Journal: PLoS Genet ISSN: 1553-7390 Impact factor: 5.917
Fig 1The number of traits in each of the 11 phenotypic domains that make up the 551 traits analyzed in the UK Biobank: cognitive function (5 traits), early life factors (7 traits), health and medical history (60 traits), hospital in-patient main diagnosis ICD-10 codes (194 traits), life style and environment (88 traits), physical measures (50 traits), psychosocial factors (40 traits), self-reported cancer codes (9 traits), self-reported non-cancer illness codes (79 traits), sex-specific factors (14 traits), and sociodemographics (5 traits).
The SNP heritability estimates (h2), standard error (SE) estimates, sample sizes (N), and prevalence (prev) in the sample (for binary traits) for the top heritable traits in each phenotypic domain.
All heritability estimates adjusted for genotyping array, UK Biobank assessment center, age at recruitment, sex (except male-specific traits indicated by *) and top 10 principal components of the genotype data as covariates. ♯: additionally adjusted for weight; §: additionally adjusted for negative experience. See S1 Table and S2 Table for the way we binarized categorical variables and other relevant information.
| Field ID | Field Name | Type | N | Prev | h2 | SE |
|---|---|---|---|---|---|---|
| 20016 | Fluid intelligence score | continuous | 34491 | NA | 0.233 | 0.011 |
| 1697 | Comparative height at age 10 | categorical | 106497 | 25.36% | 0.439 | 0.007 |
| 2976 | Age diabetes diagnosed | continuous | 5369 | NA | 1.000 | 0.074 |
| 6148 | Eye problems/disorders | categorical | 33750 | 2.04% | 0.722 | 0.097 |
| 2443 | Diabetes diagnosed by doctor | binary | 107935 | 5.25% | 0.414 | 0.016 |
| 6152 | Blood clot, DVT, bronchitis, emphysema, asthma, rhinitis, eczema, allergy diagnosed by doctor | categorical | 108022 | 12.37% | 0.344 | 0.010 |
| 41202 | C61 Malignant neoplasm of prostate | binary | 32780 | 2.24% | 0.426* | 0.093 |
| 41202 | C60-C63 Malignant neoplasms of male genital organs | binary | 32780 | 2.46% | 0.322* | 0.087 |
| 1747 | Hair colour (natural, before greying) | categorical | 107947 | 4.62% | 1.000 | 0.018 |
| 1717 | Skin colour | categorical | 106773 | 80.09% | 0.556 | 0.008 |
| 1727 | Ease of skin tanning | categorical | 105966 | 61.13% | 0.454 | 0.006 |
| 6144 | Never eat eggs, dairy, wheat, sugar | categorical | 107832 | 2.68% | 0.691 | 0.026 |
| 50 | Standing height | continuous | 107976 | NA | 0.685 | 0.004 |
| 23105 | Basal metabolic rate | continuous | 106311 | NA | 0.397 | 0.004 |
| 23102 | Whole body water mass | continuous | 106252 | NA | 0.331♯ | 0.004 |
| 23130 | Trunk predicted mass | continuous | 106157 | NA | 0.325♯ | 0.004 |
| 23106 | Impedance of whole body | continuous | 106304 | NA | 0.307 | 0.004 |
| 20151 | Forced vital capacity (FVC), best measure | continuous | 84301 | NA | 0.339 | 0.005 |
| 20150 | Forced expiratory volume in 1-second (FEV1), best measure | continuous | 84305 | NA | 0.324 | 0.005 |
| 3148 | Heel bone mineral density (BMD) | continuous | 62546 | NA | 0.327 | 0.006 |
| 20126 | Bipolar and major depression status | categorical | 25571 | 19.91% | 0.195§ | 0.032 |
| 20001 | Prostate cancer | binary | 50997 | 1.59% | 0.310* | 0.077 |
| 20002 | Psoriasis | binary | 108148 | 1.15% | 1.000 | 0.047 |
| 20002 | Thyroid problem (not cancer) | binary | 108148 | 5.77% | 0.876 | 0.015 |
| 20002 | Rheumatoid arthritis | binary | 108148 | 1.18% | 0.821 | 0.046 |
| 20002 | Hypothyroidism/Myxoedema | binary | 108148 | 4.83% | 0.814 | 0.017 |
| 20002 | Diabetes | binary | 108148 | 5.14% | 0.396 | 0.016 |
| 20002 | Dermatology | binary | 108148 | 4.27% | 0.368 | 0.019 |
| 20002 | Asthma | binary | 108148 | 12.44% | 0.340 | 0.010 |
| 2395 | Hair/Balding pattern | categorical | 50660 | 18.51% | 0.479* | 0.017 |
| 2375 | Relative age of first facial hair | categorical | 49384 | 6.64% | 0.303* | 0.030 |
| 6138 | Qualifications | categorical | 107158 | 30.16% | 0.294 | 0.007 |
A head-to-head comparison of SNP heritability estimates (h2) for the self-reported illness codes and ICD-10 codes that represent the same or closely matched diseases.
| Self-reported disease | h2 (self-reported) | ICD-10 code | h2 (ICD-10) |
|---|---|---|---|
| Breast cancer | 0.133 | C50 Malignant neoplasm of breast | 0.207 |
| Skin cancer | 0.061 | C43-C44 Melanoma and other malignant neoplasms of skin | 0.160 |
| Male genital tract cancer | 0.226 | C60-C63 Malignant neoplasms of male genital organs | 0.322 |
| Prostate cancer | 0.310 | C61 Malignant neoplasm of prostate | 0.426 |
| Cerebrovascular disease | 0.066 | I60-I69 Cerebrovascular diseases | 0.197 |
| Angina | 0.176 | I20 Angina pectoris | 0.136 |
| Liver/Biliary/Pancreas problem | 0.074 | K80-K87 Disorders of gallbladder, biliary tract and pancreas | 0.099 |
| Heart attack/Myocardial infarction | 0.184 | I21 Acute myocardial infarction | 0.193 |
| Gastro-oesophageal reflux (gord)/Gastric reflux | 0.099 | K21 Gastro-oesophageal reflux disease | 0.105 |
| Cholelithiasis/Gall stones | 0.123 | K80 Cholelithiasis | 0.143 |
| Uterine fibroids | 0.087 | D25 Leiomyoma of uterus | 0.054 |
| Enlarged prostate | 0.157 | N40 Hyperplasia of prostate | 0.127 |
| Pneumonia | 0.075 | J18 Pneumonia—organism unspecified | 0.160 |
| Diverticular disease/Diverticulitis | 0.195 | K57 Diverticular disease of intestine | 0.179 |
Fig 2SNP heritability estimates and their standard errors of the traits in the UK Biobank that show significantly different SNP heritability in females and males after multiple testing correction.
The heritability estimates of rheumatoid arthritis, endocrine/diabetes and wheat products intake reported here are based on genome-wide SNPs and will be markedly reduced when the major histocompatibility complex (MHC) region (chr6:25-35Mb) is excluded from analysis, and thus need to be interpreted with caution.
Fig 3(A) The age-varying SNP heritability estimates and their standard errors (shaded region) for the 12 traits whose heritability significantly decreases with age after accounting for multiple testing correction; (B) The stratified heritability estimates and standard errors (shaded region) of education (has college or university degree or not), on which the socioeconomic status (SES) measured by the Townsend deprivation index has a significant moderating effect after multiple testing correction.