| Literature DB >> 34140970 |
Seung-Soo Kim1, Adam D Hudgins1,2, Brenda Gonzalez2, Sofiya Milman3, Nir Barzilai3, Jan Vijg2, Zhidong Tu4, Yousin Suh1,5.
Abstract
The rich data from the genome-wide association studies (GWAS) and phenome-wide association studies (PheWAS) offer an unprecedented opportunity to identify the biological underpinnings of age-related disease (ARD) risk and multimorbidity. Surprisingly, however, a comprehensive list of ARDs remains unavailable due to the lack of a clear definition and selection criteria. We developed a method to identify ARDs and to provide a compendium of ARDs for genetic association studies. Querying 1,358 electronic medical record-derived traits, we first defined ARDs and age-related traits (ARTs) based on their prevalence profiles, requiring a unimodal distribution that shows an increasing prevalence after the age of 40 years, and which reaches a maximum peak at 60 years of age or later. As a result, we identified a list of 463 ARDs and ARTs in the GWAS and PheWAS catalogs. We next translated the ARDs and ARTs to their respective 276 Medical Subject Headings diseases and 45 anatomy terms. The most abundant disease categories are neoplasms (48 terms), cardiovascular diseases (44 terms), and nervous system diseases (27 terms). Employing data from a human symptoms-disease network, we found 6 symptom-shared disease groups, representing cancers, heart diseases, brain diseases, joint diseases, eye diseases, and mixed diseases. Lastly, by overlaying our ARD and ART list with genetic correlation data from the UK Biobank, we found 54 phenotypes in 2 clusters with high genetic correlations. Our compendium of ARD and ART is a highly useful resource, with broad applicability for studies of the genetics of aging, ARD, and multimorbidity.Entities:
Keywords: GWAS; age-related disease; age-related trait; aging; biomarker
Year: 2021 PMID: 34140970 PMCID: PMC8204079 DOI: 10.3389/fgene.2021.680560
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
FIGURE 1Identified age-related traits and diseases annotations for genetic resources. A schematic of the analysis flow of this study. From the PheWAS and GWAS catalogs, ARTs were identified, converted to Medical Subject Headings (MeSH) terms, and overlaid onto a human symptom-disease network (HSDN), top 100 leading causes of death, and UKBB genetic correlation data. *SN means subnetwork.
FIGURE 2Summary of 276 MeSH disease and anatomy terms. (A) Distribution of the ARD MeSH disease categories and their corresponding tissues. The colors indicate the corresponding tissue. (B) Distribution of the ARD MeSH anatomy categories and their corresponding MeSH disease categories of ARDs. The colors indicate the corresponding MeSH disease category.
FIGURE 3Disease network of 144 ARDs with 6 subnetworks. The ARD subnetworks of disease pairs with similarity scores > 0.55 are shown. There are 6 subnetworks; subnetwork 1 with 41 ARDs is composed of cancers; subnetwork 2 with 24 ARDs includes heart diseases; subnetwork 3 with 10 ARDs is composed of brain diseases; subnetwork 4 with 8 ARDs includes joint diseases; subnetwork 5 with 7 ARDs is composed of eye diseases; and subnetwork 6 with 32 ARDs contains mixed diseases. Node colors indicate different disease types. Node shapes represent different tissue/organs. Edge thickness indicates disease similarity score, with the thickest edge representing a disease similarity score = 1.
FIGURE 4A heatmap for genetic correlations (r) of 54 UKBB phenotypes. From the publicly available UKBB genetic correlation data (https://ukbb-rg.hail.is/rg_browser/), the genetic correlations (r) of 54 UKBB phenotypes are displayed. By hierarchical clustering with the dynamic tree cut algorithm, 2 clusters of phenotypes were identified; Cluster 1 is composed of 17 phenotypes with positive correlations; cluster 2 includes 26 phenotypes with both positive and negative correlations. Unique ID of the UKBB phenotypes is noted in parentheses. Colors represent the values of genetic correlations (r); dark red is r = 1, white is r = 0, and dark blue is r = −1. Significant genetic correlation pairs (FDR < 0.05) are indicated with black edged squares.