BACKGROUND: Neuropathological changes of Alzheimer's disease (AD) and Parkinson's disease (PD) can coexist in the same sample, suggesting possible common degenerative mechanisms. OBJECTIVE: The objective of this study was to use RNA-sequencing to compare gene expression in AD and PD vulnerable brain regions and search for co-expressed genes. METHODS: Total RNA was isolated from AD/CTL frontal cortex and PD/CTL ventral midbrain. Sequencing libraries were prepared, multiplex paired-end RNA sequencing was carried out, and bioinformatics analyses of gene expression used both publicly available (tophat2/bowtie2/Cufflinks) and commercial (Qlucore Omics Explorer) algorithms. RESULTS: Both AD (frontal cortex, n = 10) and PD (ventral midbrain, n = 14) samples showed extensive heterogeneity of gene expression. Hierarchical clustering of heatmaps revealed two gene populations (AD, 376 genes; PD, 351 genes) that separated AD or PD from control samples at false-discovery rates (q) of <5% and fold changes of at least 1.3 (AD) or 1.5 (PD). 10,124 genes were co-expressed in our AD and PD samples. A very small group of these genes (n = 23) showed both low variances (<150; variance = standard deviation squared) and reduced expressions (>1.5-fold under-expression) in both AD and PD. Ingenuity Pathways Analyses (IPA, Qiagen) revealed loss of NAD biosynthesis and salvage as the major canonical pathway significantly altered in both AD and PD. CONCLUSIONS: AD and PD in vulnerable brain regions appear to arise from and result in independent molecular genetic abnormalities, but we identified several under-expressed genes with potential to treat both diseases. NAD supplementation shows particular promise.
BACKGROUND: Neuropathological changes of Alzheimer's disease (AD) and Parkinson's disease (PD) can coexist in the same sample, suggesting possible common degenerative mechanisms. OBJECTIVE: The objective of this study was to use RNA-sequencing to compare gene expression in AD and PD vulnerable brain regions and search for co-expressed genes. METHODS: Total RNA was isolated from AD/CTL frontal cortex and PD/CTL ventral midbrain. Sequencing libraries were prepared, multiplex paired-end RNA sequencing was carried out, and bioinformatics analyses of gene expression used both publicly available (tophat2/bowtie2/Cufflinks) and commercial (Qlucore Omics Explorer) algorithms. RESULTS: Both AD (frontal cortex, n = 10) and PD (ventral midbrain, n = 14) samples showed extensive heterogeneity of gene expression. Hierarchical clustering of heatmaps revealed two gene populations (AD, 376 genes; PD, 351 genes) that separated AD or PD from control samples at false-discovery rates (q) of <5% and fold changes of at least 1.3 (AD) or 1.5 (PD). 10,124 genes were co-expressed in our AD and PD samples. A very small group of these genes (n = 23) showed both low variances (<150; variance = standard deviation squared) and reduced expressions (>1.5-fold under-expression) in both AD and PD. Ingenuity Pathways Analyses (IPA, Qiagen) revealed loss of NAD biosynthesis and salvage as the major canonical pathway significantly altered in both AD and PD. CONCLUSIONS: AD and PD in vulnerable brain regions appear to arise from and result in independent molecular genetic abnormalities, but we identified several under-expressed genes with potential to treat both diseases. NAD supplementation shows particular promise.
Alzheimer’s disease (AD) and Parkinson’s disease (PD) pathologies may co-exist in the same brain [1, 2]. This situation begs the questions (among many) of whether the two disease processes are in any way related pathogenically, which process (if either) “came first”, and whether both pathological processes and clinical symptoms could share a common disease-altering treatment.Answers to the above questions are of necessity speculative at the current level of knowledge. Insight might be gained by examination of “pure” cases of AD and PD to search for common molecular abnormalities, since clinical symptoms alone may not predict underlying pathologies accurately.We applied RNA sequencing (RNA-seq) of total (i.e., non mRNA-enriched) RNA that was rRNA-depleted to this question. RNA sequencing of total RNA allows the estimation of gene expression, which is the net sum of transcription minus degradation of pre- and messenger RNAs for each gene. In this study we tested the hypothesis that gene expression, assessed by the increasingly popular RNA-seq technology, could be used to suggest new treatments for disease, as opposed to “insights” about disease pathogenesis.When this approach is applied to brain tissue, one has an overall picture of how the cells in that tissue responded to stresses present at the time of death of the subject. In the case of neurodegenerative diseases, one will not observe pathogenic or adaptive gene expression changes that were present in neurons that have died and been removed and may instead observe changes in “survivors”. Pathogenic gene expression changes may or may not be present in surviving vulnerable neurons and supporting cells (such as astroglia that comprise ∼90% of tissue mass) that likely contribute to the majority of expression changes observed, excepting those specifically expressed by neurons.Bioinformatic approaches to analysis of gene sequencing data are likewise best viewed as estimates that reflect underlying assumptions based on allowable fidelities of alignment, sequencing accuracies and assembly of fragments into whole transcriptomes, most of which are subject in vivo to alternative splicing. Further sorting of individual genes into networks, a procedure still in development, offers the possibility of identifying “hub” genes that control expression of other genes.Humans are not genetically/epigenetically identical, further confounding the interpretation of findings [3, 4]. This fundamental heterogeneity likely contributes to variable responses to treatments, and approaches to embrace this heterogeneity may improve selection of specific therapies for individuals, so-called “personalized (precision) medicine”[5].In spite of these limitations, RNA-seq studies of postmortem brain tissue may offer insights into similarities and differences among conditions that have similar pathological or clinical presentations. For that reason, we undertook a study of gene expression in “pure” AD and PD samples available to us that we have collected.
METHODS
RNA-seq followed procedures described in our prior publications [6, 7]. Briefly, extraction of total RNA from frozen sections of frontal cortex or ventral midbrain was performed using Qiagen miRNeasy kits according to manufacturer’s instructions. Data regarding the ages and RNA quality of our samples are provided in Table 1. Many of our samples were obtained prior to staging systems developed by the Braak group and instead used CERAD criteria (Consortium to Establish a Registry for Alzheimer’s Disease) [7]. The two samples that did have Braak staging carried out indicated advanced AD. All postmortem samples were examined by a neuropathologist.
Table 1
Demographic and analytic details of AD and PD brain tissue samples
Braak
CERAD
Case
Neuropath Dx
Age
RIN
Case
Neuropath Dx
Age
RIN
Staging
Criteria
3
PD
73
6.5
138
AD
78
4.0
6
PD
N/A
4.8
156
AD
77
4.0
7
PD
91
5.3
178
AD
77
6.9
8
PD
60
5.5
190
AD
78
4.0
10
PD
74
5.0
196
AD
66
5.0
AD
18
PD
N/A
6.3
208
AD
92
5.1
AD
19
PD
82
5.0
212
AD
90
4.3
160
PD
70
5.3
215
AD
73
4.4
V/VI
AD
166
PD
52
5.2
223
AD
69
4.4
AD
171
PD
83
4.5
248
AD
81
4.8
V/VI
AD
174
PD
82
4.5
187
PD
73
6.1
63834
PD
61
7.1
66205
PD
68
5.7
mean
72.4
5.5
78.1
4.7
std dev
11.2
0.8
8.2
0.9
PD/AD versus CTL p (t-test)
0.33
0.62
0.19
0.97
11
CTL
82
5.5
137
CTL
63
5
N/A
N/A
143
CTL
48
4.4
144
CTL
83
4.9
N/A
N/A
145
CTL
59
4.8
147
CTL
53
4.9
N/A
N/A
151
CTL
53
6.0
151
CTL
88
4.7
N/A
N/A
161
CTL
83
5.1
161
CTL
87
4.6
N/A
N/A
164
CTL
53
6.9
164
CTL
70
4
N/A
N/A
179
CTL
72
5.0
191
CTL
71
4
N/A
N/A
216
CTL
87
5.1
150
CTL
53
5.3
N/A
N/A
228
CTL
64
5.1
216
CTL
71
N/A
N/A
mean
66.8
5.3
71
4.7
std dev
14.7
0.7
12.5
0.5
N/A, not available.
Demographic and analytic details of AD and PD brain tissue samplesN/A, not available.On-column DNAase treatments and extra buffer washes were routinely carried out. Multiplex Illumina® sequencing libraries, quantitation and sequencing were performed by CoFactor Genomics, using paired-end approach and ∼60 million reads/sample.Compressed (gz) sequencing files in fastq format were downloaded, examined with FastQC and Illumina® sequencing adapters removed with Trimmomatic®. Reads were aligned against the current (hg38) version of the human genome using Tophat2/Bowtie2. The resulting binary sequencing files (*.bam) were processed by either Cufflinks (using FPKM normalization) or Qlucore Omics Explorer® (QOE, www.qlucore.com) containing the NGS plug-in (FPKM or TMM normalization).
RESULTS
We chose to use gene expression variance (= standard deviation squared) as a quantitative measure of gene heterogeneity within each disease population. Figure 1 shows plots of variances for averaged (from Cufflinks) gene expression of the AD or PD samples on the y-axes against variances of the respective CTL samples on the x-axes. There is extensive scatter within both data sets and no clear relationships among variances of AD or PD samples and their respective CTLs’ variances.
Fig.1
Variances of gene expression, on a gene-by-gene basis, for AD samples (frontal cortex, top row) and PD samples (ventral midbrain, bottom row) are plotted against variances for CTL samples for AD (frontal cortex, top row) and PD (ventral midbrain, bottom row). Variances for gene expression were calculated from FPKM estimates of expressions derived from Cufflinks analyses, using Excel.
Variances of gene expression, on a gene-by-gene basis, for AD samples (frontal cortex, top row) and PD samples (ventral midbrain, bottom row) are plotted against variances for CTL samples for AD (frontal cortex, top row) and PD (ventral midbrain, bottom row). Variances for gene expression were calculated from FPKM estimates of expressions derived from Cufflinks analyses, using Excel.Figure 2 shows the relationships among variances in expression of the AD or PD samples on the y-axes compared to AD/CTL or PD/CTL expression ratios on the x-axes. The datasets suggest a Gaussian-type distribution, implying both that the data are normally distributed and that the genes with mean expression ratios closest to unity also can have the highest variances.
Fig.2
Variances of gene expression, on a gene-by-gene basis, for AD samples (frontal cortex, top row) and PD samples (ventral midbrain, bottom row) plotted against AD/CTL expression ratios (top row) or PD/CTL expression ratios (bottom row). Gene expressions were calculated from FPKM estimates of expressions derived from Cufflinks analyses.
Variances of gene expression, on a gene-by-gene basis, for AD samples (frontal cortex, top row) and PD samples (ventral midbrain, bottom row) plotted against AD/CTL expression ratios (top row) or PD/CTL expression ratios (bottom row). Gene expressions were calculated from FPKM estimates of expressions derived from Cufflinks analyses.Figure 3 shows a plot of averaged gene expression of PD/CTL samples on the y-axis compared to averaged gene expression of the AD/CTL samples on the x-axis on a gene-by-gene basis, for genes (n = 10,124) that are co-expressed in both PD and AD samples. There is no obvious single relationship among these 10,124 genes expressed in both PD and AD samples, but the majority of these co-expressed genes appear to have similar expressions in CTLs and AD or PD samples.
Fig.3
Plot of expression ratios (PD/CTL versus AD/CTL) in postmortem samples of 10,124 co-expressed genes. Note that most genes are near ratio values of 1.0. The blue rectangle denotes genes that are relatively over-expressed in PD, and the orange rectangle denotes genes relatively over-expressed in AD.
Plot of expression ratios (PD/CTL versus AD/CTL) in postmortem samples of 10,124 co-expressed genes. Note that most genes are near ratio values of 1.0. The blue rectangle denotes genes that are relatively over-expressed in PD, and the orange rectangle denotes genes relatively over-expressed in AD.Table 2 shows that among these 10,124 co-expressed genes in both PD and AD samples, there is a small number that have both small variances (<150) in PD and AD populations and are under-expressed in both PD and AD populations (Fold Change (FC) <0.67). For these genes, a metric is calculated, the “gene expression index” (GEI), that is the product of variance X expression ratio (AD/CTL, or PD/CTL). Genes with the smallest GEI would be predicted to influence the largest number of subjects with either condition (or both conditions). This would occur under at least two conditions: 1) The expression of the gene could be increased; and 2) The reduced expression is causal to the disease process and not solely secondary to the disease process.
Table 2
Genes expressed in AD samples (left group) or PD samples (right group) from among those co-expressed (n = 10,124) that had both low variances (<150) in their respective sample populations and under-expression (disease/CTL < 0.67). These genes are potential target for therapy development (see text for details)
Gene Expression
Gene Expression
Gene ID
AD variance
AD/CTL
GEI AD
Gene ID
PD variance
PD/CTL
GEI PD
NMNAT2
0.876
0.275
0.241
MACROD2
0.250
0.660
0.165
GAP43
0.491
0.545
0.267
MIR22HG
0.546
0.667
0.365
MIR22HG
0.493
0.574
0.283
PCSK1
1.093
0.363
0.396
PPFIA2
0.927
0.431
0.400
RAB27B
2.074
0.542
1.124
NR4A2
0.872
0.659
0.575
C2orf80
2.326
0.522
1.213
SLC8A1
1.198
0.499
0.598
LOC399715
1.999
0.658
1.315
LINC00467
0.999
0.658
0.657
NR4A2
3.394
0.526
1.787
LOC399715
1.173
0.627
0.736
SV2B
4.498
0.475
2.138
MACROD2
1.193
0.657
0.784
NRN1
3.552
0.651
2.314
RALYL
1.482
0.550
0.816
LINC00467
5.283
0.620
3.273
PBX1
1.575
0.665
1.047
NMNAT2
8.332
0.548
4.570
INSIG1
3.924
0.539
2.117
TRIM36
11.897
0.618
7.348
DYNC1I1
4.986
0.536
2.673
INSIG1
15.784
0.470
7.411
NAPB
7.199
0.407
2.932
RALYL
12.118
0.660
7.992
PCSK1
7.733
0.426
3.297
PPFIA2
14.267
0.645
9.197
SV2B
5.547
0.613
3.401
PPP1R2
14.085
0.658
9.274
SNCA
10.612
0.523
5.546
PBX1
14.381
0.668
9.605
TRIM36
13.149
0.654
8.596
SNCA
28.386
0.557
15.814
RAB27B
15.706
0.669
10.503
LOC101928307
38.921
0.669
26.056
PPP1R2
18.417
0.664
12.232
DYNC1I1
44.207
0.625
27.608
C2orf80
26.866
0.610
16.396
SLC8A1
62.610
0.597
37.392
NRN1
48.699
0.635
30.910
GAP43
131.473
0.536
70.426
LOC101928307
99.900
0.542
54.147
Genes expressed in AD samples (left group) or PD samples (right group) from among those co-expressed (n = 10,124) that had both low variances (<150) in their respective sample populations and under-expression (disease/CTL < 0.67). These genes are potential target for therapy development (see text for details)Figure 4 shows a more traditional heatmap of gene expression, hierarchically clustered, where false discovery rates (FDR, q) are <5% and fold-changes in expression are 1.3 for the AD and 1.5 for the PD samples. In both cases it is possible to define gene groups that allow separation of the samples into disease compared to CTL. There is no overlap among these gene groups (data not shown).
Fig.4
Heatmaps of genes hierarchically clustered from the AD population (A) (q < 5%, Fold Change > 1.3); and the PD population (B) (q < 5%, Fold Change > 1.5). Note that these analyses separated the AD and PD populations from their respective CTL samples. Both heatmaps were created in Qlucore using FPKM normalization.
Heatmaps of genes hierarchically clustered from the AD population (A) (q < 5%, Fold Change > 1.3); and the PD population (B) (q < 5%, Fold Change > 1.5). Note that these analyses separated the AD and PD populations from their respective CTL samples. Both heatmaps were created in Qlucore using FPKM normalization.
DISCUSSION
In this paper we have presented both a comparison of RNA-seq datasets between PD and AD samples and a novel approach to using this data to predict treatment paradigms to be tested in these populations. We found extensive heterogeneity within both PD and AD groups and a small group of co-expressed genes that demonstrated both low variation among subjects (manifested as low variance) and reduced expression in both conditions (manifested as averaged disease/CTL ratios of 0.67 or less).We propose that traditional analytic approaches to RNA-seq data reduction are of limited help in providing directions for therapeutic trials, likely as a result of the extensive heterogeneity in the samples. We found that developing gene lists based on false discovery rate estimation and hierarchical clustering, while separating the two populations (PD or AD versus CTL), did not provide insights into therapeutic development likely to help most subjects.We propose a new approach, based on the GEI, that is a mathematical product of gene expression variance (square of standard deviation) and gene expression ratio. Genes with the lowest GEI would be worthy of increased expression, either directly (e.g., by vector-mediated expression) or indirectly (e.g., by small molecule inducers or alteration of microRNA expression modifiers). Low GEI value genes would, by virtue of their low variance, be predicted to have effects on the greatest numbers of afflicted persons.This argument assumes that genes with low GEI values are causally related to the disease process and not simply adaptive changes to the disease process. This may require empirical testing of many different genes. Low GEI value genes could potentially also serve as biomarkers to be followed both as response metrics to therapies that alter these genes and as selection criteria to be used for genes to be altered in a given individual. Such a “personalized therapy” approach does not follow the usual single agent model of drug development that has been so successful for agents of certain classes, but which has failed so far to provide disease alteration in more complex conditions such as neurodegeneration.Based on our results and the apparent involvement of these genes in PD and/or AD, we propose increased expression in CNS of the following genes in PD and AD subjects, who might share both disease processes. Other genes from Table 1 could also be proposed:NR4A2: (GEIs PD = 1.79; AD = 0.58). NR4A2 (nuclear receptor subfamily 4 group A member 2), aka Nurr1, is a member of the steroid-thyroid hormone-retinoid receptor superfamily. It may also act as a transcription factor, and it has relevance to both PD and AD. In PD, NR4A2 gene (and the entire NR4A family) is expressed in lower levels in PD subjects’ peripheral blood mononuclear cells (PBMC) [8], and NR4A1 expression is reduced in AD PBMC [8]. NR4A2 exhibits polymorphisms, one of which (rs35479735) tracks with sporadic PD [9]. In AD, a deficit of hippocampal expression of miR-184, assayed by RT-qPCR, has been described [10]. A luciferase assay experiment confirmed that miR-184 can interact with the 3’-UTR of NR4A2 [10], which accumulates in brain regions that also accumulate Aβ plaques in the 5X FAD mouse model of AD [11]. Nurr1 protein has been reported to be induced in brain by an herbal combination product [12].PBX1: (GEIs PD = 9.61; AD = 1.05). PBX1 (PBX homeobox 1, pre B-cell leukemia transcription factor 1) is a transcription factor and one of four members of the PBX homeobox family. Its expression is impaired in dopaminergic neurons of PD subjects [13], and it is expressed in Nurr1 (+) neurons in substantia nigra as they mature into tyrosine hydroxylase (+) midbrain dopaminergic neurons [13]. PBX 1 is induced by retinoic acid [14, 15].GAP43: (GEIs PD = 70.43; AD = 0.27). GAP43 (growth associated protein 43) gene encodes a protein (Neuromodulin) that is expressed in high levels in neural growth cones and is reduced in brains from individuals with PD dementia compared to brains from individuals with AD [16]. GAP43 is reported to be induced by: a) the combination of Y27632 (a ROCKII inhibitor) and 4-benzyl-2-methyl-1, 2,4-thiadiazolidine-3,5-dione (TDZD-8; a GSK-3β inhibitor) in rats with spinal cord injury [17]; b) FK506, an immune modulator [18], and c) an essential oil [19].NRN1: (GEIs PD = 2.31; AD = 30.91). NRN1 (Neuritin 1) is expressed in postmitotic neurons of the developing nervous system and is believed to be important for neural plasticity. Increased expression of NRN1 promotes maturation of synapses in hippocampal neurons prepared from Tg2576 mice, and Neuritin infusion into Tg2576 mice normalizes synaptic plasticity in hippocampal slices [20]. NRN1 expression is increased by glial cell-derived neurotrophic factor (GDNF, [20]), and GDNF improves survival of nigral dopamine neurons in a chronic model of Parkinson’s disease (MitoPark mice, [21]).NMNAT2: (GEIs PD = 4.57; AD = 0.24). NMNAT2 (nicotinamide nucleotide adenylyltransferase 2) is a member of the NMNAT family that catalyzes an essential step in NAD/NADP biosynthesis. NMNAT2 is predominantly expressed in brain and its expression levels correlate positively with cognitive function and negatively with AD pathology [22]. Its mRNA protein levels are reduced in AD brain [22].We are particularly intrigued by the last gene discussed (NMNAT2) for two reasons. First, IPA analyses of our common gene lists of 23 genes with both low variances and under-expression in AD and PD samples found that the major canonical pathways significantly altered (p = (3.7–4.8)E-03) in both groups were NAD biosynthesis and salvage. Second, NAD supplementation has significant support from the literature and appears to be safe [23-29]. Recent successes in NAD supplementation support a trial of nicotinamides in AD and PD to increase NAD+ levels.
Authors: Erika Bereczki; Rui M Branca; Paul T Francis; Joana B Pereira; Jean-Ha Baek; Tibor Hortobágyi; Bengt Winblad; Clive Ballard; Janne Lehtiö; Dag Aarsland Journal: Brain Date: 2018-02-01 Impact factor: 13.501
Authors: Ryan W Dellinger; Santiago Roel Santos; Mark Morris; Mal Evans; Dan Alminana; Leonard Guarente; Eric Marcotulli Journal: NPJ Aging Mech Dis Date: 2017-11-24