| Literature DB >> 33102519 |
Matthew Greenig1, Andrew Melville2, Derek Huntley1, Mark Isalan1,3, Michal Mielcarek1,3.
Abstract
Cardiovascular disease accounts for millions of deaths each year and is currently the leading cause of mortality worldwide. The aging process is clearly linked to cardiovascular disease, however, the exact relationship between aging and heart function is not fully understood. Furthermore, a holistic view of cardiac aging, linking features of early life development to changes observed in old age, has not been synthesized. Here, we re-purpose RNA-sequencing data previously-collected by our group, investigating gene expression differences between wild-type mice of different age groups that represent key developmental milestones in the murine lifespan. DESeq2's generalized linear model was applied with two hypothesis testing approaches to identify differentially-expressed (DE) genes, both between pairs of age groups and across mice of all ages. Pairwise comparisons identified genes associated with specific age transitions, while comparisons across all age groups identified a large set of genes associated with the aging process more broadly. An unsupervised machine learning approach was then applied to extract common expression patterns from this set of age-associated genes. Sets of genes with both linear and non-linear expression trajectories were identified, suggesting that aging not only involves the activation of gene expression programs unique to different age groups, but also the re-activation of gene expression programs from earlier ages. Overall, we present a comprehensive transcriptomic analysis of cardiac gene expression patterns across the entirety of the murine lifespan.Entities:
Keywords: RNAseq; aging; cardiomyocyte; gene expression; genetics; heart; transcriptomics
Year: 2020 PMID: 33102519 PMCID: PMC7545256 DOI: 10.3389/fmolb.2020.565530
Source DB: PubMed Journal: Front Mol Biosci ISSN: 2296-889X
Figure 1Overview of entire RNA-Seq data analysis workflow and volcano plots for dynamically expressed genes. Plots were constructed using the EnhancedVolcano (Blighe, 2019) package in R. Horizontal lines are drawn at p = 0.001. Vertical lines are added at log2 fold change values of 1 and −1. DESeq2's shrunken log2 fold change metric was used. Genes with significant differential expression (p < 0.001 and LFC >1 or LFC < −1) are colored red. (A) Flowchart of steps used to analyse RNA sequencing data obtained from mice of different age groups. Data preparation steps are colored in light blue, differential expression analysis steps are colored in pink, and downstream analysis steps are colored in purple. (B) Results of the likelihood ratio test using a reduced model with sex as the sole explanatory variable; genes identified as significant (red) vary in expression with age. (C) Results of the Wald test comparing gene expression in the 4 week group to expression in the 15 week group. (D) Results of the Wald test comparing gene expression in the 15 week group to expression in the 8 month group. (E) Results of the Wald test comparing gene expression in the 8 month group to expression in the 22 month group. This comparison identified the largest number of transcripts with p < 0.05 of the three pairwise comparisons.
List of genes that exhibit dynamic expression throughout the aging process.
| 3.79 | −2.17 | 5.68 | 8.27·10−7 | |
| −8.66 | 1.78 | −8.12 | 1.25·10−6 | |
| −7.57 | −0.74 | 4.39 | 2.14·10−6 | |
| 3.61 | −2.51 | −0.96 | 2.52·10−6 | |
| 6.31 | −0.88 | 10.10 | 2.59·10−6 | |
| 10.33 | −6.27 | −2.66 | 8.77·10−6 | |
| −3.64 | 0.51 | 7.87 | 8.77·10−6 | |
| 1.66 | −6.94 | 7.54 | 8.77·10−6 | |
| −4.15 | 9.19 | −1.08 | 9.09·10−6 | |
| 3.97 | −7.95 | −1.45 | 9.09·10−6 | |
| −3.39 | −4.71 | 1.10 | 1.13·10−5 | |
| −5.46 | 2.83 | 0.26 | 1.27·10−5 | |
| −4.59 | 4.00 | 7.07 | 1.78·10−5 | |
| −1.78 | 0.25 | 14.34 | 1.78·10−5 | |
| 1.10 | 2.22 | 3.95 | 3.01·10−5 | |
| 0.71 | 2.60 | 2.02 | 3.49·10−5 | |
| 4.79 | 1.39 | 6.88 | 3.49·10−5 | |
| −8.19 | 1.39 | 2.56 | 3.87·10−5 | |
| −4.30 | 1.42 | −6.31 | 4.43·10−5 | |
| 5.23 | 0.62 | −6.41 | 4.43·10−5 |
Each table includes the top 20 genes with the lowest p-values calculated by the likelihood ratio test (LRT) for age, sorted by adjusted p-value. These p-values provide an indication of the age variable's explanatory power in the expression model for each gene. DESeq2's shrunken log2 fold change (LFC) for each pair of consecutive time points is also listed to provide information on each gene's specific expression path.
Fzd6, Frizzled class receptor 6; Hs6st1, Heparan Sulfate 6-O-Sulfotransferase 1; Slc35e3, Solute Carrier Family 35 Member E3; Ap5m1, Adaptor related protein complex 5 subunit mu 1; Selenok, Selenoprotein K; Mgat4a, Alpha-1,3-Manosyl-Glycoprotein 4-beta-N-acetylglucosaminyltransferase A; Tnni1, Troponin I1, slow skeletal type; Ube2k, Ubiquitin-conjugating enzyme E2 K; Atmin, ATM ineractor; Sox6, SRY-box transcription factor 6; Tsnax, Translin-associated factor X; Xpo7, Exportin-7; Hscb, HscB mitochondrial iron-sulfur cluster co-chaperone; Naa10, N-alpha-acetyltransferase 10 NatA catalytic subunit; Ascc1, Activating signal cointegrator 1 complex subunit 1; Rpl26-ps5, Ribosomal protein L26 pseudogene 5; Rpl3l, Ribosomal protein L3-like; Ddo, D-aspartate oxidase; Dusp19, dual-specificity phosphatase 19; Mturn, Maturin, neural progenitor differentiation regulator homolog.
Figure 2Scatter plots displaying sex-specific expression paths over time. The top 12 genes with canonical names (ranked by adjusted p-value) identified in the likelihood ratio test for age/sex interaction are displayed. For each gene, transcript count is displayed on the y-axis (with logarithmic scale) and age group on the x-axis. Samples from different sexes are colored differently. Expression of each of these genes is highly associated with an interaction between age and sex. Lines are added to display trajectories between mean expression levels for each sex at each time point. Lines for males (blue) and females (red) are colored differently.
Figure 3Clusters of differentially-expressed genes in the aging murine heart transcriptome. Clustering was performed using DEGreport's implementation of divisive hierarchical clustering (Pantano, 2019). Genes with adjusted p-value < 0.05 (calculated by the likelihood ratio test for age) were clustered according to log2 normalized read counts. Each cluster's number is provided in the title of each scaled expression graph, along with the number of genes in that cluster. Genes are plotted on the y-axis according to a scaled expression value (z-score), calculated as each gene's distance from the mean log transcript count of all genes in that cluster divided by the standard deviation of log transcript counts in that cluster. A box plot is constructed for each time point, with median expression level denoted by a horizontal line. Lines also connect the mean expression levels of consecutive time points, to display an expression path typical of that cluster. Genes that did not match the expression profiles of any of the 10 clusters were omitted. Pathway and ontology analysis was implemented on all 10 clusters, and gene sets with statistically-significant enrichment were identified in cluster 2 (red) and cluster 6 (blue). Cluster 2's genes exhibit high expression levels in both juvenile and elderly mice but are down-regulated in adolescent and adult mice, while the genes in cluster 6 exhibit a monotonic increase in expression as mice age past 15 weeks.
Figure 4Pathway graph displaying the “Regulation of actin cytoskeleton” pathway from the KEGG (Kanehisa and Goto, 2000) database (mmu04810). This pathway was significantly activated (adjusted p < 0.01) between the 4 and 15 week age groups. Between the 15 week and 8 month age groups, the pathway is non-significantly inhibited (adjusted p = 0.09), while between the 8 month and 22 month age groups, the pathway is significantly activated (adjusted p < 0.05). Colors of each node correspond to the sum of DESeq2's shrunken log2 fold changes (LFCs) for genes associated with that node. Shrunken LFCs are indicated by the color scale in the top right corner. Overall, the log2 fold changes of 126 constituent genes are annotated on the pathway graph. White nodes correspond to transcripts for which read count data was not available. Some nodes represent multiple proteins with similar or redundant functions.
Figure 5Re-evaluation of time-dependent transcriptional changes of selected transcripts. Specific genes in which conserved patterns were identified between our analysis and others are also annotated. (A) The experimental design used in our analysis. C57BL/6 + CBA hybrid mice of four different age groups—4 weeks, 15 weeks, 8 months, and 22 months—were analyzed to identify gene expression changes associated with aging. We identified that Gata4 and Serca2 were downregulated between the 15 week and 8 month groups (with LFC = −1.5 and LFC = −1.8, respectively). Furthermore, Gapdh was downregulated in the 22 month group compared to the 8 month group (LFC = −3.9), while C3 was identified as upregulated in the 22 month group compared to the 8 month group (LFC = 5.1). (B) Overview of the experimental design used in a 2002 study by Bodyak et al. (2002). Their group compared C57BL/6 mice of 4 months age with those of 20 months age. It was identified that Gata4, Serca2, and Gapdh were all downregulated in the 20 month group compared to the 4 month group. (C) Overview of the experimental design used in a 2019 study by Bartling et al. (2019). They compared C57BL/6 mice of 6 months age with those of 24 months age. They identified that C3's expression was significantly upregulated in the 24 month group compared to the 6 month group.