| Literature DB >> 31921313 |
Megan E Breitbach1,2, Susan Greenspan3, Neil M Resnick3,4, Subashan Perera3,5, Aditi U Gurkar3,4, Devin Absher1, Arthur S Levine6,7.
Abstract
Background: Recent studies investigating longevity have revealed very few convincing genetic associations with increased lifespan. This is, in part, due to the complexity of biological aging, as well as the limited power of genome-wide association studies, which assay common single nucleotide polymorphisms (SNPs) and require several thousand subjects to achieve statistical significance. To overcome such barriers, we performed comprehensive DNA sequencing of a panel of 20 genes previously associated with phenotypic aging in a cohort of 200 individuals, half of whom were clinically defined by an "early aging" phenotype, and half of whom were clinically defined by a "late aging" phenotype based on age (65-75 years) and the ability to walk up a flight of stairs or walk for 15 min without resting. A validation cohort of 511 late agers was used to verify our results.Entities:
Keywords: aging; bioinformatics; genetics; machine learning; sequencing
Year: 2019 PMID: 31921313 PMCID: PMC6931058 DOI: 10.3389/fgene.2019.01277
Source DB: PubMed Journal: Front Genet ISSN: 1664-8021 Impact factor: 4.599
Comparisons of variables between aging cohorts: mean ± standard deviation.
| Demographics | Early aged 65–75 (n = 100) | Late aged >75 (n = 100) | Early |
|---|---|---|---|
| Age (years) | 70.4 ± 3.0 | 83.2 ± 5.4 | <0.0001 |
| Sex (% female) | 63 (63.0) | 56 (56.0) | 0.3133 |
| Comorbidity index (of 14 conditions)∆ | 4.4 ± 1.8 | 2.5 ± 1.6 | <0.0001 |
| Arthritis | 68% | 45% | <0.001 |
| Gait Speed (m/s) # | 0.92 ± 0.24 | 1.08 ± 0.26 | <0.0001 |
| % used cane | 14% | 5% | <0.001 |
| % used other device | 5% | 1% | |
| BMI (mean; kg/m2) | 33.5 ± 8.3 | 27.2 ± 4.6 | <0.0001 |
| BMI ≥ 40 | 24% | 1% | <0.001 |
| Lean body mass (kg) | 53. ± 11.8 | 47.4 ± 9.9 | 0.0002 |
| Total mass (kg) | 91.6 ± 23.5 | 73.6 ± 15.3 | <0.0001 |
| % Fat body mass | 37.9 ± 8.6 | 32.2 ± 7.8 | <0.0001 |
| MOCA (1–30) # | 25.3 ± 2.8 | 24.3 ± 3.5 | 0.03 |
| DSST Score # | 42.2 ± 9.5 | 39.7 ± 10.7 | 0.0808 |
| Grip strength (kg) (dominant) | 26.7 ± 10.8 | 26.7 ± 10.6 | 0.9791 |
| Chair rise time (s)∆ | 14.7 ± 13.8 | 12.4 ± 11.8 | 0.0001 |
| SPPB total score # | 9.1± 2.5 | 10.2 ± 1.8 | 0.0005 |
| Balance score # | 3.4 ± 1.0 | 3.6 ± 0.7 | 0.1873 |
| Calories from all activity per week | 2320 ± 2186 | 3585 ± 3059 | 0.001 |
| Calories from moderate activity per week | 929 ± 1495 | 2018 ± 2322 | 0.0001 |
| Freq of all activity per week | 13.9 ± 9.8 | 19.7 ± 10.6 | <0.0001 |
| Freq of moderate activity per week | 4.3 ± 5.0 | 7.2 ± 6.4 | 0.0003 |
| Frail scale ∆ | 2.6 ± 1.3 | 0.6 ± 0.9 | <0.0001 |
| Physical function index # | 37.3 ± 19.1 | 77.2 ± 17.2 | <0.0001 |
| General health perception # | 52.8 ± 22.2 | 78.2 ± 14.3 | <0.0001 |
| Bodily pain # | 44.5 ± 22.4 | 76.3 ± 19.7 | <0.0001 |
| Social function # | 69.0 ± 24.5 | 92.1 ± 15.7 | <0.0001 |
| Mental health index # | 65.2 ± 14.3 | 75.8 ± 9.4 | < 0.0001 |
| Vitality # | 47.7 ± 14.1 | 66.5 ± 11.8 | <0.0001 |
*Computed using independent samples t-, Wilcoxon rank sum, or chi-square tests, as appropriate. ∆ Lower score better, # Higher score better.
Names, biological function, and literature references for aging association of the 20 genes sequenced.
| Gene | Function | Biological association with aging/age-related pathology | Literature reference for study inclusion |
|---|---|---|---|
| Combines with lipids to form lipoproteins, which package cholesterol and other fats for transfer through the bloodstream. | Polymorphisms in APOE are associated with human longevity. | ( | |
| Involved in DNA break repair and base excision repair. | Defects in aprataxin cause the autosomal recessive neurodegenerative disorder ataxia oculomotor apraxia 1 (AOA1). | ( | |
| ATP-dependent DNA helicase. Unwinds DNA in the 3'-5' direction. Involved in double-strand break repair. | Defects associated with segmental aging of the immune system together with an elevated risk of otitis media and pneumonia, an elevated risk of diabetes mellitus, reduced fertility, and higher cancer incidence. | ( | |
| Induces cell cycle arrest and acts as a tumor suppressor. | Mutations near CDKN2A were particularly associated with diseases of aging (e.g., cancer, atherosclerosis, type 2 diabetes, glaucoma). CDKN2A expression increases with age. Removal of p16+ cells in mouse models increases health span and lifespan. | ( | |
| Mediates cell-cell interactions and maintenance of immune cells in the resting state. | Mutations in CD33 are associated with AD risk. | ( | |
| Stabilization and maintenance of telomerase. | Mutations in DKC1 causes premature aging, bone marrow failure, and cancer. | ( | |
| Catalytic component of a DNA repair endonuclease responsible for 5' incision during DNA repair. | Loss of ERCC4 causes systemic accelerated aging (XPE) and neurodegeneration. | ( | |
| Endonuclease involved in single-strand DNA nucleotide excision repair at the 3' end. | Mutations in ERCC5 lead to Cockayne Syndrome (CS), which is characterized by premature aging. | ( | |
| DNA-binding protein involved in transcription-coupled nucleotide excision repair. | Defects in ERCC6 cause CS and age-related macular degeneration. | ( | |
| DNA repair protein involved in Interstrand Crosslink (ICL) repair. | Defects cause Fanconi anemia, a progeroid syndrome with symptoms common in premature aging (sarcopenia, hypersensitivity to infectious agents, endocrine abnormalities, etc.). | ( | |
| Component of the nuclear lamina. | LMNA mutations cause Hutchinson-Gilford syndrome (HGPS). | ( | |
| Mediates poly-ADP-ribosylation of proteins and plays a role in DNA repair, chromatin remodeling, telomere maintenance, and mediator of inflammation. | PARP1 activation increases with age in | ( | |
| DNA polymerase involved in base excision and repair. | ( | ||
| Involved in mitochondrial DNA replication. | Increased mitochondrial mutation load in mice is associated with premature aging. | ( | |
| NAD-dependent protein deacetylase. Involved in cell cycle regulation, response to DNA damage, metabolism, apoptosis, and autophagy. | SIRT1 overexpression extends lifespan in mice. Mutations are associated with age-related pathologies such as myocardial infarction (MI). | ( | |
| NAD-dependent protein deacetylase. Deacetylase activity toward histones H3K9Ac and H3K56Ac. Required for genomic stability. Deacetylates telomeric DNA. | SIRT6 overexpression extends lifespan. Long-lived animals have highly efficient SIRT6 function. | ( | |
| Destroys superoxide anion radicals produced in cells. | SOD2 mutations are associated with heart disease and increased risk of malignancies. | ( | |
| Ribonucleoprotein polymerase that maintains telomere ends by the addition of the telomere repeat TTAGGG. | Telomere attrition is highly associated with aging due to increased cellular senescence. | ( | |
| Component of the telosome that is involved in telomere length regulation and protection. | Mutations in TINF2 are linked to Revesz syndrome, a telomeropathy with symptoms characteristic of accelerated aging. | ( | |
| DNA helicase that is involved in maintenance of genomic stability, DNA repair, replication, transcription, and telomere maintenance. | Mutations in WRN lead to Werner syndrome with systemic aging phenotypes. | ( |
Figure 1Logistic regression and variant burden reveal lack of association with early aging. (A) Quantile-quantile plot of logistic regression p-values. (B) Box plot of total number of variants in the discovery early aged group (red), discovery late ager group (blue), and the validation late ager group (purple). (C) Diagram of predictive modeling analysis study design.
Figure 2Different subsets of variants defined as top predictive models using random forest and support vector machine (SVM) learning methods. (A) Boxplots of the random forest model area under the curve (AUCs) for the all variant, high Combined Annotation-Dependent Depletion (CADD) exon and control subsets of the variant data. P-values between groups determined by performing a Kruskal-Wallis test. **** = p < 0.0001, *** = p < 0.001. (B) Boxplots of the SVM model AUCs for the all variant, transcription factor binding site (TFBS), and control subsets of the variant data. P-values between groups determined by performing a Kruskal-Wallis test. (C) Receiver-operating characteristic (ROC) curve of the mean high CADD exon random forest model with confidence intervals. The red line represents the null AUC (0.5). (D) ROC curve of the mean TFBS SVM model with confidence intervals. The red line represents the null AUC (0.5).
Figure 3The random forest high Combined Annotation-Dependent Depletion (CADD) exon model is predictive of late aging status in the validation cohort and outperforms smoking as a predictor of aging. (A) Boxplots of the fraction of misclassified patient samples based on the random forest high CADD exon model (magenta) and the control random forest model (shuffled dataset) (teal). (B) Receiver-operating characteristic curve of the mean area under the curve (AUC) resulting from the random forest high CADD exon model (black) with confidence intervals in the discovery cohort and the AUC resulting from smoking status as a sole predictor of early versus late aging (green).
Figure 4Random forest high Combined Annotation-Dependent Depletion exon predictive variants are within 9 of the 20 genes and mostly non-synonymous. (A) Scatter plot of the Gini score for each of the predictive variants based on corresponding gene. (B) Bar plot of the variant consequence type within the predictors with corresponding empirical p-values.