Literature DB >> 21283692

Whole transcriptome sequencing reveals gene expression and splicing differences in brain regions affected by Alzheimer's disease.

Natalie A Twine1, Karolina Janitz, Marc R Wilkins, Michal Janitz.   

Abstract

Recent studies strongly indicate that aberrations in the control of gene expression might contribute to the initiation and progression of Alzheimer's disease (AD). In particular, alternative splicing has been suggested to play a role in spontaneous cases of AD. Previous transcriptome profiling of AD models and patient samples using microarrays delivered conflicting results. This study provides, for the first time, transcriptomic analysis for distinct regions of the AD brain using RNA-Seq next-generation sequencing technology. Illumina RNA-Seq analysis was used to survey transcriptome profiles from total brain, frontal and temporal lobe of healthy and AD post-mortem tissue. We quantified gene expression levels, splicing isoforms and alternative transcript start sites. Gene Ontology term enrichment analysis revealed an overrepresentation of genes associated with a neuron's cytological structure and synapse function in AD brain samples. Analysis of the temporal lobe with the Cufflinks tool revealed that transcriptional isoforms of the apolipoprotein E gene, APOE-001, -002 and -005, are under the control of different promoters in normal and AD brain tissue. We also observed differing expression levels of APOE-001 and -002 splice variants in the AD temporal lobe. Our results indicate that alternative splicing and promoter usage of the APOE gene in AD brain tissue might reflect the progression of neurodegeneration.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21283692      PMCID: PMC3025006          DOI: 10.1371/journal.pone.0016266

Source DB:  PubMed          Journal:  PLoS One        ISSN: 1932-6203            Impact factor:   3.240


Introduction

Alzheimer's disease (AD) is the most common cause of dementia in the human population; it mainly affects individuals over the age of 60, and one's risk of developing it increases steadily with age [1]. AD is characterized by a complex progression of neurodegeneration that results in memory impairment and loss of other cognitive processes as well as the presence of non-cognitive symptoms including delusions, agitation and changes in mood and personality. The pathogenesis of AD is complex and remains challenging to research efforts worldwide. The majority of AD cases show no familial or geographical clustering and are described as sporadic or idiopathic. The apolipoprotein E (APOE) genotype influences age at onset of AD. Compared to APOE e3 (Cys-112, Arg-158), which is considered neutral, the e4 allele (Arg-112, Arg-158) is associated with increased risk and earlier onset of AD in a dose-dependent manner. Conversely, the e2 allele (Cys-112, Cys-158) is protective against AD [2]. In the absence of greater understanding of AD pathogenesis, treatment strategies do not provide a cure but only treat symptoms or reduce the rate of onset [3], [4]. The transcriptome reflects cellular activity within a tissue at a given point in time. Genome-wide expression studies, which are not influenced by deductive assumptions, provide an unbiased approach for investigating the pathogenesis of complex diseases like AD. Transcriptome analyses have been performed using transgenic animals models of AD and patient-derived cell lines [5], [6]. In contrast to these approaches, post-mortem brain tissue is difficult to obtain, and some RNA quality concerns exist that might potentially influence transcriptome studies [7], [8]. Nevertheless, post-mortem brain tissue, being identical to the tissue affected by the disease, remains the gold standard against which all other model systems are evaluated. Transcriptome studies of AD utilizing brain tissue have however generated mostly discordant results. The recent development of next-generation sequencing provides a more comprehensive and accurate tool for transcriptome analysis of this invaluable resource [9], [10]. RNA-Seq analyzes complementary DNA (cDNA) by means of highly efficient, next-generation DNA sequencing methods and subsequent mapping of short sequence fragments (reads) onto the reference genome. That this new technology makes it possible to identify exons and introns, mapping their boundaries and the 5′ and 3′ ends of genes, in turn makes it possible to understand the complexity of eukaryotic transcriptomes comprehensively. Moreover, RNA-Seq enables identification of transcription initiation sites (TSSs) and new splicing variants, and it permits of a precise quantitative determination of exon and splicing isoform expression [11]. Some recent reports, which systematically compare microarrays and next-generation sequencing, have clearly proven the superiority of the latter, both with respect to low frequency of false positive signals and high reproducibility of the method [12], [13]. A recent report by van Bakel et al. concerning transcript analysis of intragenic regions unambiguously showed that hybridization signals from microarrays can lead to massively false positive signals from transcripts of low abundance [14]. In the present study, we performed a comparative gene expression analysis of normal human brain tissue and tissue affected by Alzheimer's disease, using the RNA-Seq technique. Along with samples from whole normal and AD brains, mRNA samples from two different brain regions, namely the frontal and temporal lobes, were analyzed. We found significant differences in gene isoform expression levels, alternated use of promoters and transcription start sites between normal and AD brain tissue.

Materials and Methods

Human brain RNA

Total RNA from post-mortem human brains was obtained from Ambion (Austin, USA) and Capital Biosciences (Rockville, USA). Table 1 provides detailed information regarding each sample used in this study. The quality of the total RNA was evaluated using the Agilent 2100 Bioanalyser RNA Nano Chip.
Table 1

Source of total RNA from brain tissue samples.

ConditionSampleGenderAge (years)Source
Normal Total brain13 male;10 female23–86 (x̃≈68.3)Ambion
Frontal lobe5 male22–29 (x̃≈26.4)Capital Biosciences
Temporal lobe5 male23–29 (x̃≈26.0)Capital Biosciences
Alzheimer's disease Total brain1 male87Capital Biosciences
Frontal lobe1 male87Capital Biosciences
Temporal lobe1 male80Capital Biosciences

Library preparation and sequencing

For the mRNA-Seq sample preparation, the Illumina standard kit was used according to the manufacturer's protocol. Briefly, 10 µg of each total RNA sample was used for polyA mRNA selection using streptavidin-coated magnetic beads, followed by thermal mRNA fragmentation. The fragmented mRNA was subjected to cDNA synthesis using reverse transcriptase (SuperScript II) and random primers. The cDNA was further converted into double stranded cDNA and, after an end repair process (Klenow fragment, T4 polynucleotide kinase and T4 polymerase), was finally ligated to Illumina paired end (PE) adaptors. Size selection was performed using a 2% agarose gel, generating cDNA libraries ranging in size from 200–250 bp. Finally, the libraries were enriched using 15 cycles of PCR and purified by the QIAquick PCR purification kit (Qiagen). The enriched libraries were diluted with Elution Buffer to a final concentration of 10 nM. Each library was run at a concentration of 7 pM on one Genome Analyzer (GAII) lane using 36 bp sequencing. Six samples were analyzed in this manner, taken from frontal, temporal and total brain tissue of both AD and healthy brains.

Primary processing of Illumina RNA-Seq reads

RNA-Seq reads were obtained using Bustard (Illumina Pipeline version 1.3). Reads were quality-filtered using the standard Illumina process, and a 0 (no) or 1 (yes) was used to define whether a read passed filtering or not. Six sequence files were generated in FASTQ format (sequence read plus quality information in Phred format); each file corresponded to the brain tissue from which the RNA originated. The median number of reads per sequence file (corresponding to one lane on the flow cell) was 14,974,824. The sequence data have been submitted to the NCBI Short Read Archive with accession number SRA027308.2.

Mapping of RNA-Seq reads using TopHat

Reads were then processed and aligned to the UCSC H. sapiens reference genome (build hg19) using TopHat v1.0.12 [15]. TopHat incorporates the Bowtie v0.11.3 algorithm to perform the alignment [16]. TopHat initially removes a portion of reads based on quality information accompanying each read, then maps reads to the reference genome. The pre-built H. sapiens UCSC hg19 index was downloaded from the TopHat homepage and used as the reference genome. TopHat allows multiple alignments per read (up to 40 by default) and a maximum of 2 mismatches when mapping reads to the reference. The mapping results were then used to identify “islands” of expression, which can be interpreted as potential exons. TopHat builds a database of potential splice junctions and confirms these by comparing the previously unmapped reads against the database of putative junctions. Default parameters for TopHat were used.

Transcript assembly and abundance estimation using Cufflinks

The aligned read files were processed by Cufflinks v0.8.0 [17]. Reads were assembled into transcripts, their abundance estimated and tests for differential expression and regulation between the tissue samples were performed. Cufflinks does not make use of existing gene annotations during assembly of transcripts, but rather constructs a minimum set of transcripts that bests describe the reads in the dataset. This approach allows Cufflinks to identify alternative transcription and splicing that are not described by pre-existing gene models [17]. Cufflinks uses the normalized RNA-Seq fragment counts to measure the relative abundances of transcripts. The unit of measurement is Fragments Per Kilobase of exon per Million fragments mapped (FPKM). Confidence intervals for FPKM estimates were calculated using a Bayesian inference method [18].

Comparison to reference annotation and differential expression testing using Cuffcompare and Cuffdiff

Once all short read sequences were assembled with Cufflinks, the output.GTF files were sent to Cuffcompare along with a reference.GTF annotation file downloaded from the Ensembl database (Homo_sapiens.GRCh37.55.gtf; [19]). This classified each transcript as known or novel. The classification also describes the nature of the match to the reference gene annotation by way of a code letter. These are useful for selecting novel isoforms from the analysis. Cuffcompare produces a combined.GTF file which is passed to Cuffdiff along with the original alignment (.SAM) files produced by TopHat. Cuffdiff then re-estimates the abundance of transcripts listed in the.GTF file using alignments from the.SAM file, and concurrently tests for differential expression. The expression testing is done at the level of transcripts, primary transcripts and genes. By tracking changes in the relative abundance of transcripts with a common transcription start site, Cuffdiff can identify changes in splicing. Relative promoter use within a single gene is also monitored by following the abundance changes of primary transcripts from that gene. We used Cuffdiff to perform three pairwise comparisons of expression, splicing and promoter use between normal and diseased samples from temporal, frontal and total brain regions.

Identification of APOE allele in AD samples

To identify which allele of APOE was present in the frontal, temporal lobe and total brain AD samples, the genotype of SNPs rs429358 and rs7412 were determined using the Integrated Genome Viewer.

Visualization of mapped reads

Mapping results were visualized using both the University of California, Santa Cruz (UCSC) genome browser [20] and a local copy of the Integrative Genomics Viewer software available at http://www.broadinstitute.org/igv/. Views of individual genes were generated by uploading coverage.wig files to the UCSC Genome browser as a custom track. Data files were restricted to the chromosome in question due to upload limits imposed by the genome browser. The same method was used to generate coverage plots for chromosome 1, except here the coverage values were logged (base 2) prior to uploading to the genome browser. This was done to visualize better the full dynamic range of the read coverage.

Functional analysis of gene lists using DAVID

The Database for Annotation, Visualization and Integrated Discovery (DAVID) v6.7 is a set of web-based functional annotation tools [21]. The functional clustering tool was used to look for functional enrichment for genes over- and under-expressed more than two-fold in Alzheimer's disease. A unique list of gene symbols was uploaded via the web interface, and the background was selected as Homo sapiens. Gene Ontology Biological Process was selected as the functional annotation category for this analysis.

Hardware specifications

TopHat and Bowtie were installed and run on a SGI Altix 4700 64-bit shared memory machine with 1 TB RAM, 128 Dual-Core CPUs of 1.6 GHz. Cufflinks was run on a desktop computer with 4 GB RAM.

Results

Analysis of RNA-Seq data

During the amplification step of sequence generation, the Illumina GAII produces clusters of identical sequence fragments. The number of these clusters is reported, as is the percentage that pass quality filtering by the Illumina image analysis software. Across all 6 samples, between 192,093 and 211,779 raw clusters were generated. Between 67.6% and 74.1% of these clusters passed filtering; these values are within the acceptable range recommended by Illumina. The total number of reads produced for each brain sample ranged from 13,442,077 to 15,772,947, with a median of 14,974,824 (Table 2). There was no significant difference in the number of reads from normal and Alzheimer's brain (Student's t-test, p = 0.9). To assess the quality of mapping reads to the reference genome, some key metrics were extracted from the TopHat output and log files, as shown in Table 2. Between 90% and 92% of reads aligned to the reference genome in a unique manner. A small percentage of reads (0.02% to 0.05%) were removed from the analysis prior to mapping to the reference, due to low quality.
Table 2

RNA-Seq sequence reads mapping to UCSC Human genome build 19 by TopHat v1.0.12.

Total brain Na Total brain ADb Temp lobe NTemp lobe ADFront lobe NFront lobe AD
Total reads 13,442,07714,720,81615,256,75214,227,70215,772,94715,228,832
Reads removed 0.05%0.04%0.02%0.04%0.03%0.04%
Unique hits to reference genome 91.85%92.42%92.40%90.41%91.46%90.96%

TopHat allows up to two mismatches when mapping reads to a reference genome. The number of reads removed due to poor quality and the number of reads mapping uniquely to the reference genome are both expressed as percentages of the total number of reads.

Normal brain samples.

Alzheimer's disease brain samples.

TopHat allows up to two mismatches when mapping reads to a reference genome. The number of reads removed due to poor quality and the number of reads mapping uniquely to the reference genome are both expressed as percentages of the total number of reads. Normal brain samples. Alzheimer's disease brain samples.

Sequence coverage distribution

To investigate the level and uniformity of the read coverage against the human genome, we plotted mapped reads of the normal temporal lobe sample along the human chromosome 1 (Fig. 1). We exemplified RNA-Seq coverage on chromosome 1 because this is the largest chromosome in the human karyotype, encoding over 13.6% of all human genes. The coverage values, measured along discrete intervals or bins of the genome, were log-transformed (base 2) to visualize better the full dynamic range of the data. Figure 1 shows the breadth of read coverage across chromosome 1. The read depth in the different bins ranged from 0 to 12,949 and revealed extensive transcriptional activity in the genome. As expected, no reads mapped to the centromere. The total numbers of reads that mapped to chromosome 1 in normal total brain as well as normal temporal and frontal lobes were 1,700,799, 2,062,880 and 2,048,959 respectively.
Figure 1

A transcription profile of normal temporal lobe of the brain for chromosome 1.

The RNASeq read density along the length of the chromosome is shown. The coverage values are measured along intervals of the genome. These intervals vary in size from 1 bp to 10 Mbp depending on how variable the read density is for a particular genomic location. Each bar represents log2 of the frequency reads plotted against chromosome coordinates.

A transcription profile of normal temporal lobe of the brain for chromosome 1.

The RNASeq read density along the length of the chromosome is shown. The coverage values are measured along intervals of the genome. These intervals vary in size from 1 bp to 10 Mbp depending on how variable the read density is for a particular genomic location. Each bar represents log2 of the frequency reads plotted against chromosome coordinates.

Differentially expressed genes

After mapping the RNA-Seq reads to the reference genome with TopHat, transcripts were assembled and their relative abundances calculated using Cufflinks. The summation of FPKM values for every transcript associated with a particular gene gives the expression (abundance) measurement for that gene, in FPKM. Cufflinks uses the Cuffdiff algorithm to calculate differential expression at both the gene and transcript levels. Differential gene expression (DGE) for total brain, frontal and temporal lobes was calculated using the ratio of AD versus normal FPKM values for every gene. The DGE ratios were tested for statistical significance as described recently [22]. The significance scores were corrected for multiple testing using the Benjamini-Hochberg correction. The range of DGE ratios observed was −26.20 to 26.24 for frontal lobe, −183 to 13.27 for temporal lobe and −350 to 36.63 for total brain. These three ranges for DGE ratios were all statistically significant. The expression ratios in AD versus normal were skewed towards down-regulation. This is potentially due to the lower overall levels of transcriptional activity present in AD vs. normal brain following significant loss of neuronal tissue in the former. The top 10 up- and down-regulated genes in total, frontal and temporal AD brain regions are listed in Tables 3, 4 and 5, respectively.
Table 4

Top ten up- and down-regulated genes in frontal lobe of AD brain.

GeneDescriptionChromosomeFPKM NFPKM ADFold changep-valueEnsembl Gene ID
PCK1 phosphoenolpyruvate carboxykinase 1 (soluble)chr200.1214413.18661926.240058965.87E-006ENSG00000124253
CD163 CD163 moleculechr120.2641394.43586916.793691960.000108665ENSG00000177575
AC012317.1 Bac clonechr160.2955063.91334713.242868170.0128902not annotated
NUPR1 nuclear protein, transcriptional regulator, 1chr167.9345894.48870911.908470140ENSG00000176046
GDPD3 glycerophosphodiester phosphodiesterase domain containing 3chr160.2629153.10451711.808063442.23E-006ENSG00000102886
STAB1 stabilin 1chr30.4075944.27812710.496049990.00152394ENSG00000010327
MOV10 Mov10, Moloney leukemia virus 10, homolog (mouse)chr10.5845175.5216059.4464403940.00022429ENSG00000155363
MLKL mixed lineage kinase domain-likechr160.2681342.532919.44643350.00258816ENSG00000168404
LY6G5C lymphocyte antigen 6 complex, locus G5Cchr60.2759592.4620048.9216296620.00341639ENSG00000111971
ITPR3 inositol 1,4,5-triphosphate receptor, type 3chr60.271732.2816688.3968203730.00455198ENSG00000096433
SLIT1 slit homolog 1 (Drosophila)chr109.631310.367602−26.200374325.71E-006ENSG00000187122
PTPRO protein tyrosine phosphatase, receptor type, Ochr128.777410.368512−23.81851881.10E-005ENSG00000151490
LPIN2 lipin 2chr187.507450.335313−22.389379481.67E-005ENSG00000101577
ATRN attractinchr207.29840.333062−21.913037211.92E-005ENSG00000088812
NAG (NBAS)neuroblastoma amplified sequencechr26.546590.327206−20.007548763.48E-005ENSG00000151779
GPR107 G protein-coupled receptor 107chr97.311490.383708−19.054828154.75E-005ENSG00000148358
ACOX1 acyl-CoA oxidase 1, palmitoylchr178.285870.442214−18.737240347.37E-007ENSG00000161533
EDEM3 ER degradation enhancer, mannosidase alpha-like 3chr15.644770.303834−18.578467195.57E-005ENSG00000116406
ATP8A1 ATPase, aminophospholipid transporter (APLT), class I, type 8A, member 1chr47.661870.412407−18.578418895.57E-005ENSG00000124406
VWF von Willebrand factorchr126.01290.32365−18.57840265.57E-005ENSG00000110799

Differential gene expression for frontal lobe was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here.

Table 5

Top ten up- and down-regulated genes in temporal lobe of AD brain.

GeneDescriptionChromosomeFPKM NFPKM ADFold changep-valueEnsembl ID
AC074289.1 Bac clone – not annotatedchr20.285933.79369813.267925720.0129943not annotated
MT1G metallothionein 1Gchr1615.1637148.1156499.7677775870ENSG00000125144
S100A4 S100 calcium binding protein A4chr13.019123.5521757.8010582624.44E-016ENSG00000196154
DES desminchr24.2377431.3444417.3964993130ENSG00000175084
C19orf42 UPF0608 protein C19orf42 Precursorchr190.6260874.1534456.6339741920.0132315ENSG00000214046
MTPAP mitochondrial poly(A) polymerasechr101.8718112.0035986.4128292940.000124237ENSG00000107951
NME3 non-metastatic cells 3, protein expressed inchr169.4028745.7765864.868363170ENSG00000103024
KIF1C kinesin family member 1Cchr1739.0482180.4834894.6220693660ENSG00000129250
MAP4K4 mitogen-activated protein kinase kinase kinase kinase 4chr25.6518424.0587354.2567969027.85E-009ENSG00000071054
TGFB3 transforming growth factor, beta 3chr144.6264217.9516683.8802503880ENSG00000119699
MICAL2 microtubule associatedmonoxygenase, calponin and LIM domain containing 2chr1143.99610.240419−182.99760ENSG00000133816
DYNC1I1 dynein, cytoplasmic 1, intermediate chain 1chr751.49850.292−176.3647262.96E-013ENSG00000158560
RPH3A rabphilin 3A homolog (mouse)chr1242.81480.271284−157.82279829.50E-013ENSG00000089169
RASGRF1 Ras protein-specific guanine nucleotide-releasing factor 1chr1529.11940.19051−152.84971921.32E-012ENSG00000058335
ATP2B1 ATPase, Ca++ transporting, plasma membrane 1chr1227.81050.195853−141.99680372.82E-012ENSG00000070961
ELMOD1 ELMO/CED-12 domain containing 1chr1125.71480.185023−138.98164010ENSG00000110675
NELL2 NEL-like 2 (chicken)chr1248.2560.356889−135.21290934.64E-012ENSG00000184613
PDE2A phosphodiesterase 2A, cGMP-stimulatedchr1133.24910.250937−132.49979085.69E-012ENSG00000186642
CAMKK2 calcium/calmodulin-dependent protein kinase kinase 2, betachr1244.6930.352967−126.62090228.97E-012ENSG00000110931
ICAM5 intercellular adhesion molecule 5, telencephalinchr1920.78340.170851−121.64634681.34E-011ENSG00000105376

Differential gene expression for temporal lobe was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here.

Table 6

Splice junctions in normal and Alzheimer's brains predicted by TopHat.

Total brain Na Total brain ADb Temp lobe NTemp lobe ADFront lobe NFront lobe AD
Total reads 13,442,07714,720,81615,256,75214,227,70215,772,94715,228,832
Total splice junctions 54,45829,01252,43817,26554,80838,647
Reads mapping to splice junctions (%) 2.14%0.94%2.10%0.47%2.20%1.28%

RNA-Seq data were mapped to the UCSC Human genome build 19. The number of splice junctions predicted by TopHat is shown, as well as the percentage of the total number of reads.

Normal brain samples.

Alzheimer's disease brain samples.

Differential gene expression for total brain was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here. Differential gene expression for frontal lobe was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here. Differential gene expression for temporal lobe was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here. When comparing the top 30 most over- and under-expressed genes in AD across the 3 brain samples (Tables S1, S2, S3), DHX58 (DEXH box polypeptide 58) and STAB1 (Stabilin 1) are up-regulated in both total brain (2.13 fold change (FC), p = 0.01 and 4.9 FC, p = 0.01, respectively) and frontal lobe (3.96 FC, p = 0.03 and 10.5, p<1×10−16, respectively), while TFR1 (transferrin receptor) is down-regulated in both regions (−50.92 FC, p = 3.8×10−8 and -17.15 FC, p = 9.2×10−5, respectively). SLIT1 (slit homolog 1) is down-regulated in both frontal and temporal lobes (−26.2 FC, p = 5.7×10−6 and −116.67, p = 2×10−11). TFR1, responsible for cellular uptake of iron, has been implicated in neurologic development in mice, and accumulation of iron in brain-specific regions has been implicated in AD [23], [24]. SLIT1 is widely reported to be involved in brain development and axon guidance [25]. In the top 30 over- and under-expressed genes in AD between the 3 brain samples, there are a number of genes without annotation, described either as putative or novel transcripts in the Ensembl database. RP11-552E20.3 and AC018730.1 are up-regulated in AD total brain (7.61 FC, p = 8.76×10−9 and 3.49 FC, p = 8.88×10−16, respectively), AC074289.4 is up-regulated in AD temporal lobe (13.27 FC, p = 0.01) and RP4-697K14.12 is up-regulated in AD frontal lobe (5.77 FC, p = 0.02). None of these putative or novel transcripts is described as protein coding by Ensembl. There is some concordance between gene expression differences found with RNA-Seq and those reported in previous microarray studies on Alzheimer's disease [9]. Genes in the AD temporal lobe detected as down-regulated by both approaches include dopamine receptor 2 (DRD2), AMPA1 receptor (GRIA1), glutamate receptor, ionotropic, N-methyl D-aspartate 1 (GRIN1), glutamate transporter EAAT3 (SLC1A1), a-synuclein (SCNA), high affinity BDNF/NT-3 receptor (TrkB), high affinity NT-3 receptor (TrkC), glutamic acid decarboxylase 1 (GAD1) and glutamic acid decarboxylase 2 (GAD2). There is also concordance in genes expressed in the frontal lobe, where DNM1 and SYN2 are down-regulated, in both our data and previous microarray studies. A comparison also highlights some contradicting results, however, between RNA-Seq and microarray techniques. PPP3CB is up-regulated in the temporal lobe in the microarray study [26] but down-regulated in our dataset. GRIA4 and GRIK1 are shown to be expressed in senile plaques (in temporal lobe) in microarray data [27] but are not identified as expressed in the AD temporal lobe in the present RNA-Seq dataset.

Gene Ontology term enrichment analysis of differentially expressed genes

The NCBI web-based functional annotation tool DAVID v 6.7 (Database for Annotation, Visualization and Integrated Discovery) was used to investigate functional associations of gene expression changes seen in AD brain [21]. Genes that were more than two-fold over- or under-expressed were analyzed by functional clustering. Gene Ontology Biological Process was selected as the annotation category for clustering. Once the tool has identified enriched ontologies for a particular gene list, it clusters those that have a statistically significant overlap in terms of their constituent genes. The gene lists used in this analysis contained 1416, 1071 and 944 genes for temporal, whole and frontal brain samples, respectively. There is a high degree of overlap between the top ten most enriched clusters (Tables S4, S5, S6). Protein localization is the most enriched cluster across all three regions, while vesicle mediated transport and phosphate metabolic processes are within the top five clusters and proteolysis and regulation of GTPase activity are within the top seven for all three tissue samples. The only brain-specific cluster present in the top ten across all three samples is neuronal development. This level of functional overlap between the samples is to be expected given that they all originate from the same tissue. Interestingly, the frontal lobe is different from the other samples in that it shows greater changes in genes associated with brain-specific biological processes. These are regulation of synaptic transmission (rank 9), neurotransmitter transport (11), response to metal ion (13), metal ion transport (15), regulation of synaptic plasticity (18), negative regulation of neuron apoptosis (19) and axon transport (20). By contrast, the brain-specific categories apparent in the temporal lobe are axon transport (rank 14) and neurotransmitter transport (18), and cerebellum development (12) is implicated for the total brain. Genes known to be involved in programmed cell death were enriched in the frontal lobe of AD brain (rank 10) and an induction of apoptosis is present in both frontal and temporal lobes (rank 16 and 12, respectively). An over-representation of apoptosis-related genes clearly indicates the ongoing process of neurodegeneration and associated cell loss. The top 20 DAVID functional clusters for total, frontal and temporal brain regions can be seen in Tables S4, S5 and S6, respectively.

Alternative splicing and transcript identification using RNA-Seq

A key feature of RNA Seq is its ability to identify alternative splicing of transcripts. It also has an advantage over microarray-based methods of detection in its ability to identify novel transcripts. Accordingly, we next investigated the splicing status of all genes and whether genes show differential splicing patterns between normal and diseased tissues. TopHat builds a database of potential splice junctions by identifying the splice donor and acceptor sites (GT-AG) for each region of a gene with high coverage of short mRNA reads. TopHat then compares the previously unmapped reads against this database of putative junctions. Regions of genes with a high coverage are also screened for internal junction sites. One of the advantages of identifying potential exons without using predefined annotation information is the capability to highlight splicing in unannotated regions of the genome. A range of 52,438 to 54,808 splice junctions was predicted for normal brain (Table 6). This corresponds to 2.1–2.2% of all reads. By contrast, AD brain samples showed a lower number of splice variants, ranging from 17,265 to 29,012 predicted junctions. This corresponds to 0.47–1.28% of all reads. This difference is statistically significant (Student's t-test, p = 0.043). RNA-Seq data were mapped to the UCSC Human genome build 19. The number of splice junctions predicted by TopHat is shown, as well as the percentage of the total number of reads. Normal brain samples. Alzheimer's disease brain samples. Using the Cuffdiff algorithm to calculate differential expression at the transcript level allowed discovery of which transcripts are common, differentially expressed or present/absent between normal and AD brain tissue. Frontal, temporal and total brain specimens showed a large proportion of transcripts at similar expression levels between normal and AD tissue (Fig. 2). Specifically, there were 56%, 48% and 59% of transcripts showing less than two-fold expression difference in the total brain, temporal and frontal lobes, respectively. The number of transcripts up-regulated in AD tissue as compared to normal brain ranged from 422 to 927, representing 0.2–0.5% of total transcripts. The number of transcripts up-regulated in normal tissues compared to AD brain was larger in each case, ranging from 3858 (1.98%) to 6385 (3.52%).
Figure 2

Venn diagram showing distributions of differentially expressed transcripts between healthy and AD brain.

Venn diagram showing the number of differentially expressed transcripts between AD and normal tissue samples across total brain, temporal and frontal lobe. The number of transcripts unique to AD and normal tissues is shown in universe area outside the circles. The numbers of transcripts up-regulated by more than two-fold in AD tissue are indicated in the dark grey circle, while the numbers up-regulated by more than two-fold in normal tissue are highlighted in the light grey circle. The intersection of the two circles refers to number of transcripts which are expressed in both AD and normal tissues but which are less than two-fold different in expression level.

Venn diagram showing distributions of differentially expressed transcripts between healthy and AD brain.

Venn diagram showing the number of differentially expressed transcripts between AD and normal tissue samples across total brain, temporal and frontal lobe. The number of transcripts unique to AD and normal tissues is shown in universe area outside the circles. The numbers of transcripts up-regulated by more than two-fold in AD tissue are indicated in the dark grey circle, while the numbers up-regulated by more than two-fold in normal tissue are highlighted in the light grey circle. The intersection of the two circles refers to number of transcripts which are expressed in both AD and normal tissues but which are less than two-fold different in expression level. Further analysis revealed a considerable portion of transcripts that were unique to either AD or normal brains. AD brain tissue showed between 19,578 and 28,407 (10.7–14.5%) unique transcripts compared to the corresponding normal tissue. Larger numbers of transcripts were seen to be unique to normal tissue, for which between 46,672 to 68,025 transcripts were observed (23.9% to 37.5%).

Transcriptional and post-transcriptional regulation between normal and AD brain tissue

To detect transcriptional regulation, RNA-Seq data can be analyzed with Cufflinks. This identifies how many transcription start sites (TSS) are used in each gene and groups transcripts from that gene by their TSS. Each TSS is thus associated with a primary transcript. Cufflinks compares ratios of grouped transcripts between normal and AD tissue to detect alternative promoter usage. Cufflinks also identifies post-transcriptional regulation by looking for changes in relative abundances of mRNAs spliced from the same primary transcript between normal and AD tissue, which it detects as alternative splicing. In this way, Cufflinks discriminates between transcriptional and post-transcriptional processing [17]. Cufflinks analysis of the transcriptome from total brain, temporal and frontal lobe samples revealed that numerous genes are controlled by different promoters in normal and AD tissue (Table 7). Comparative analysis of the total brain samples resulted in the identification of five genes (CANX, DNAJC5, MGEA5, TMEM66, WDR92) with statistically significant usage of alternative promoters in AD samples (p<0.05 and passing false discovery rate threshold). Using the same selection criteria, frontal and temporal lobe samples from the AD brain showed alternative promoter usage in eleven genes (ACAP3, ARGLU1, CHD3, KIF5A, LENG8, MAPK3, NR1D1, PDE1B, PIP5K2B, RPH3A, WDR47) and three genes (APOE, KIF5A, PP2R4), respectively.
Table 7

Genes showing alternative promoter usage.

GeneDescriptionp-value
Total brain
CANX calnexin0
DNAJC5 DnaJ (Hsp40) homolog, subfamily C, member 55.64E-006
MGEA5 meningioma expressed antigen 5 (hyaluronidase)0
TMEM66 transmembrane protein 661.16E-009
WDR92 WD repeat domain 920
Frontal lobe
ACAP3 ArfGAP with coiled-coil, ankyrin repeat and PH domains 32.24E-005
ARGLU1 arginine and glutamate rich 16.43E-007
CHD3 chromodomain helicase DNA binding protein 30
KIF5A kinesin family member 5A2.35E-013
LENG8 leukocyte receptor cluster (LRC) member 80
MAPK3 mitogen-activated protein kinase 30
NR1D1 nuclear receptor subfamily 1, group D, member 10
PDE1B phosphodiesterase 1B, calmodulin-dependent0
PIP5K2B phosphatidylinositol-5-phosphate 4-kinase, type II, beta2.22E-016
RPH3A rabphilin 3A homolog (mouse)0
WDR47 WD repeat domain 470
Temporal lobe
APOE apolipoprotein E1.92E-006
KIF5A kinesin family member 5A0
PPP2R4 protein phosphatase 2A activator, regulatory subunit 47.18E-007

Genes identified by Cufflinks as exhibiting statistically significant alternative promoter usage between normal and AD tissue. Results are shown for total brain, frontal and temporal lobe tissue.

Genes identified by Cufflinks as exhibiting statistically significant alternative promoter usage between normal and AD tissue. Results are shown for total brain, frontal and temporal lobe tissue. We also investigated whether splicing patterns for transcripts sharing the same transcription start site (TSS) differ between normal and AD brain tissue (Table 8). Statistically significant alternative splicing between normal and AD total brain was detected for the following four genes: CALM3, CANX, DNAJC5 and MGEA5. Moreover, alternative splicing was detected at a statistically significant level in frontal and temporal brain samples for fifteen and four genes, respectively. For the frontal lobe these include ACAP3, AP2B1, ATN1, B2M, CHD3, CTBP1, EFHD2, LENG8, MAPK3, NR1D1, NUDCD3, PDE1B, RHBDD2, SEPT5 and WDR47, and the genes APOE, KIF5A, PDZD4 and SPTBN1 in the temporal lobe.
Table 8

Genes showing alternative splicing.

GeneDescriptionp-value
Total brain
CALM3 calmodulin 3 (phosphorylase kinase, delta)1.11E-016
CANX calnexin0
DNAJC5 DnaJ (Hsp40) homolog, subfamily C, member 51.38E-008
MGEA5 meningioma expressed antigen 5 (hyaluronidase)0
Frontal lobe
ACAP3 ArfGAP with coiled-coil, ankyrin repeat and PH domains 30
AP2B1 adaptor-related protein complex 2, beta 1 subunit1.07E-010
ATN1 atrophin 12.34E-008
B2M beta-2-microglobulin6.68E-004
CHD3 chromodomain helicase DNA binding protein 33.16E-005
CTBP1 C-terminal binding protein 11.06E-009
EFHD2 EF-hand domain family, member D22.66E-007
LENG8 leukocyte receptor cluster (LRC) member 80
MAPK3 mitogen-activated protein kinase 30
NR1D1 nuclear receptor subfamily 1, group D, member 13.73E-007
NUDCD3 NudC domain containing 32.11E-004
PDE1B phosphodiesterase 1B, calmodulin-dependent3.42E-004
RHBDD2 rhomboid domain containing 20
SEPT5 septin 54.44E-016
WDR47 WD repeat domain 476.65E-009
Temporal lobe
APOE apolipoprotein E1.56E-010
KIF5A kinesin family member 5A2.22E-016
PDZD4 PDZ domain containing 49.39E-005
SPTBN1 spectrin, beta, non-erythrocytic 18.47E-007

Gene names for transcripts identified by Cufflinks as exhibiting statistically significant alternative splicing between normal and AD tissue. Results are shown for total brain, frontal and temporal lobe tissue. Alternative splicing is detected between transcripts, which share the same transcription start site (TSS).

Gene names for transcripts identified by Cufflinks as exhibiting statistically significant alternative splicing between normal and AD tissue. Results are shown for total brain, frontal and temporal lobe tissue. Alternative splicing is detected between transcripts, which share the same transcription start site (TSS).

Identification of alternative splicing and promoter usage for apolipoprotein E (APOE)

Apolipoprotein E gene (APOE) is of particular interest due to its relevance to AD molecular pathology [28]. The mapping of reads for all six samples to the reference genome shows differences in expression levels for individual APOE exons (Fig. 3). Cufflinks quantification of differential gene expression showed a 2.13-fold down-regulation of APOE in the AD temporal lobe (p = 4.19×10−7). It also highlighted the possibility of differential gene splicing. Detailed analysis of transcripts revealed three different APOE transcriptional isoforms, namely APOE-001 (ENST00000252486), APOE-002 (ENST00000446996) and APOE-005 (ENST00000425718), in both temporal lobe samples. The APOE-001 and -002 isoforms contain exon 1 whereas the -005 isoform is generated by an alternative promoter upstream of the second APOE exon. Two transcription start sites (TSS) were identified for the APOE gene in both temporal lobe samples, which will be referred to as TSS A and TSS B. Isoforms APOE-001 and -002 are transcribed from TSS A, while APOE-005 is transcribed from TSS B (Fig. 4a). Comparative analysis of TSS A and TSS B revealed a 26.5-fold up-regulation of the latter in AD temporal lobe (p<1×10−16) and 3.09-fold down-regulation of the former in AD temporal lobe (p = 5.11×10−15; Fig. 4b,c).
Figure 3

RNA-Seq read mapping to the reference for APOE.

RNA-Seq read mapping to the UCSC reference genome (hg19) of the gene APOE for all 6 samples in this study. The AD tracks are shown in green and normal samples in red. It is clear that the reads map to the 4 exons of the APOE gene as annotated in the UCSC database (APOE exons shown in blue). The absolute read counts for each sample are indicated on the y axis. A schematic representation of the 5 Ensembl transcripts for APOE is shown in brown at the bottom of the figure. N – normal brain samples; AD – Alzheimer's disease brain samples.

Figure 4

Alternative splicing and promoter usage for the APOE gene in temporal lobe tissue.

(a) Transcriptional isoforms APOE-001, APOE-002 and APOE-005 are detected in both normal and AD temporal lobes; APOE-001 and -002 have transcription start site (TSS) A and 005 is initiated at TSS B. Isoform 005 comprises exons 2, 3 and 4 while isoforms 001 and 002 contain all 4 exons. (b) Isoforms 001 and 002 show decreased expression in AD relative to normal temporal lobe, while isoform 005 shows a relative increase in the AD temporal lobe. (c) Relative changes in TSS abundance between normal and AD temporal lobes are indicated by the green/red pie charts, while changes in the two TSS A group isoforms (001 and 002) between normal and AD temporal lobes are shown by the blue/yellow pie charts.

RNA-Seq read mapping to the reference for APOE.

RNA-Seq read mapping to the UCSC reference genome (hg19) of the gene APOE for all 6 samples in this study. The AD tracks are shown in green and normal samples in red. It is clear that the reads map to the 4 exons of the APOE gene as annotated in the UCSC database (APOE exons shown in blue). The absolute read counts for each sample are indicated on the y axis. A schematic representation of the 5 Ensembl transcripts for APOE is shown in brown at the bottom of the figure. N – normal brain samples; ADAlzheimer's disease brain samples.

Alternative splicing and promoter usage for the APOE gene in temporal lobe tissue.

(a) Transcriptional isoforms APOE-001, APOE-002 and APOE-005 are detected in both normal and AD temporal lobes; APOE-001 and -002 have transcription start site (TSS) A and 005 is initiated at TSS B. Isoform 005 comprises exons 2, 3 and 4 while isoforms 001 and 002 contain all 4 exons. (b) Isoforms 001 and 002 show decreased expression in AD relative to normal temporal lobe, while isoform 005 shows a relative increase in the AD temporal lobe. (c) Relative changes in TSS abundance between normal and AD temporal lobes are indicated by the green/red pie charts, while changes in the two TSS A group isoforms (001 and 002) between normal and AD temporal lobes are shown by the blue/yellow pie charts. In addition to a switch in promoter usage in the normal and AD temporal lobe, significant alternative splicing between the two isoforms is seen under the control of TSS A (p = 1.46×10−10). The abundance of isoform APOE-002 is reduced in AD temporal lobe to an almost negligible level of 0.02 FPKM, compared with 45.83 in the normal counterpart. APOE-001 also shows a reduction in abundance of 2.81-fold in AD relative to normal temporal lobe, however it still remains the dominant isoform expressed in the AD temporal lobe at 159.43 FPKM. The APOE-005 isoform has a FPKM of 73.08 in the AD temporal lobe (Fig. 4c). A comparison of APOE splicing and promoter use in the frontal lobe and total brain did not reveal expression pattern differences as seen in the temporal lobe. Cufflinks does not detect the APOE-002 isoform in either frontal or total brain samples, and no alternative splicing or promoter usage was detected between the normal and AD samples. Focusing on APOE expression in temporal lobe clearly illustrates that, used together, RNA-Seq and Cufflinks can identify not only transcriptional regulation of a gene but also post-transcriptional regulation of primary transcripts via alternative splicing.

Identification of APOE alleles in the AD samples

To identify which allele of APOE was present in the temporal, frontal lobe and total brain AD samples, the genotype of SNPs rs429358 and rs7412 were determined. These two SNPs are associated with the amino acid changes at positions 112 and 158 in the ApoE isoforms. SNP rs429358 showed a T/T genotype for temporal, frontal lobe and total brain samples. This genotype translates to a Cys at position 112 of the protein. SNP rs7412 showed a C/C genotype in temporal lobe and total brain samples and a C/T genotype for the frontal lobe sample. The C allele translates to an Arg at position 158 of the protein, while a T allele translates to a Cys. The Cys112/Arg158 combination in the ApoE protein reflects the presence of the e3 allele, while the Cys112/Cys158 combination indicates presence of the APOE e2 allele. Thus, both temporal lobe and total brain AD samples exhibit the APOE e3 allele, while frontal lobe AD sample has an equal mix of the e2 and e3 allele.

Discussion

Our study provides the first comprehensive insight into the transcriptome of brain tissue affected by Alzheimer's disease. Using a whole transcriptome sequencing technique (RNA-Seq), we were able to identify the levels of differentially expressed genes and establish genes with alternative promoter usage and splicing patterns that changed in association with neurodegeneration. Moreover, comparative analysis of samples derived from different brain regions produced an increased molecular resolution for our analysis. This revealed that the frontal and temporal lobes of AD brains not only differed in the quantitative composition of the genes expressed but also showed lobe-specific alternations in transcript assembly. For whole transcriptome sequencing, we used an Illumina Genome Analyser II with 36 bp sequence reads length. We obtained ∼14×106 sequence reads per sample, which has been previously reported to deliver sufficient sequence coverage for transcriptome profiling [13]. Our rate of 90-92% of reads that map to the reference genome met quality standards of the RNA-Seq technique [29]. An estimation of the number of reads covering chromosome 1 (1,937,546 reads on average) was approximately 12.9% of all reads generated per transcriptome (14,974,824 reads on average). Human chromosome 1 comprises 8% of the human genome and contains 3,141 genes, or 13.6% of all annotated genes [30]. Hence, we conclude that our mRNA-Seq data provide good representation of expressed genes in the human genome. Cufflinks analysis of gene isoform expression levels, alternative splicing and alternative promoter usage revealed significant differences in transcriptome profiles between frontal and temporal lobe of the AD brain. These variations might reflect temporal and spatial differences in the progression of AD neuropathology across the aging brain. Widespread neuronal loss and a presence of the intraneuronal neurofibrillary tangles (NFTs) and the extracellular neuritic or senile plaques (NPs) are key features of the AD neuropathology. The main components of NPs are peptides of varying length collectively described as beta-amyloid whereas NFTs are mainly composed of paired helical filaments of a hyperphosphorylated form of the microtubule-associated protein tau (MAPT) [31], [32]. NFTs first arise in the entorhinal cortex of the medial temporal lobe and then spread toward the hippocampal CA1 region. NFTs formation then progresses to the temporal and frontal neocortices, and finally affects primary cortices [33]. Thus the temporal and frontal lobe samples used in this study might approximately represent brain regions at distinct stages of the neurodegeneration process, with the temporal lobe affected first, followed by the frontal lobe of the brain. The tissue-specific enrichment for gene ontology processes suggest region-specific, sequential progression of brain tissue neurodegeneration, with the temporal lobe being affected earlier than the frontal part of the cortex [33]. Consequently, neuronal activity in the frontal lobe may be more vigorous at the time of sample donation. This might count for over-representation of GO terms such as regulation of synaptic plasticity and negative regulation of neuronal apoptosis. In contrast, neurons of the temporal lobe might exist in a more advanced phase of functional deterioration. This in turn is reflected by the more non-neuronally specific transcriptome patterns seen in samples derived from the total brain in this study. We do observe an over-representation of genes related to apoptosis that is consistent with previous reports, however there was no evidence in our analysis for AD-associated changes in the immune response [34]. Many of the changes we observed in gene expression between normal and AD brains were similar to those reported previously. However, some differences were noted. This lack of concordance among our RNA-Seq transcriptome data set and previously reported gene expression profiles is likely to stem from inherent limitations in microarray systems. For example, background levels of hybridization (i.e. hybridization to a probe that occurs irrespective of the corresponding transcript's expression level) limit the accuracy of microarray expression measurements, particularly for transcripts present at low abundance. Furthermore, probes differ considerably in their hybridization properties [35]. Thus, although comparing hybridization results across arrays can identify gene expression differences among samples [36], hybridization results from a single sample may not provide a reliable measure of relative expression for different transcripts. By contrast, the Illumina sequencing data have been described as replicable with relatively little technical variation, thus for many purposes it may suffice to sequence each mRNA sample only once. The information gained from a single lane of Illumina flow cell, as done in the present study, provides a comprehensive analysis of transcripts and enables identification with confidence of differentially expressed genes [11], [37]. Moreover, validation techniques such as quantitative PCR (qPCR) [38], [39] and spike-in RNA [29] have demonstrated that RNA-Seq is extremely accurate. Accordingly, a false positive rate <2% has been demonstrated for this technique [40]. As recently reported by Marioni et al., qPCR results agreed more closely with Illumina sequencing results than with microarrays [11]. Regarding quantification of gene expression, Cufflinks analysis of RNA-Seq data allowed us to dissect expression of individual genes into quantification of particular mRNA isoforms contributing to the final cumulative value of gene expression. To our knowledge, this is the first report where quantitative information about particular splice variants at a genome-wide scale has been generated for different anatomical segments of normal and AD brains. Thus, our study creates a useful data set supplementing previous microarray-generated information, which lacked isoform-specific resolution of gene expression [9], [41]. Despite the magnitude of the APOE e4 risk effect and a possible mechanistic link with amyloid beta (Aβ) pathology [34], [42], [43], it is still far from clear how APOE e4 is involved in AD pathogenesis [44]. Interestingly, the APOE genotype in the case of AD samples used in this study was e3, which is considered to have no effect on AD onset. This suggests that the observed alternative promoter and TSS usage during APOE expression in the AD temporal lobe might be independent of the CysArg substitution at position 112. Following this line of reasoning, differential APOE expression patterns - as indicated in this report - might be independent of the amyloid beta aggregation pathway in the course of Alzheimer's disease. Indeed, previous observations of alternative splicing in AD brains for glutamate transporter [45], PIN1 [46], estrogen receptor alpha [47] and the APOE receptor [48] genes strongly suggest that alteration of transcriptional control for genes involved in neuronal physiology is a landmark of ongoing neurodegeneration. In light of our observations of alternative APOE expression, the previously reported AD-specific splicing pattern of the APOE receptor further suggests the functional relevance of lipid metabolism in the context of AD pathology [49]. Moreover, it has previously been proposed that synthesis of ApoE might play a role in regional vulnerability of neurons in AD [50]. How this might relate to the presence of different transcriptional variants of APOE remains a subject for future studies. Top 30 up and top 30 down regulated genes in AD total brain. Differential gene expression for total brain was calculated using the ratio of AD versus normal FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on this ratio (fold change), and those with the 30 highest and 30 lowest fold change values are shown here. (XLSX) Click here for additional data file. Top 30 up and top 30 down regulated genes in AD frontal lobe. Differential gene expression for frontal lobe was calculated using the ratio of AD versus normal FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on this ratio (fold change), and those with the 30 highest and 30 lowest fold change values are shown here. (XLSX) Click here for additional data file. Top 30 up and top 30 down regulated genes in AD temporal lobe. Differential gene expression for temporal lobe was calculated using the ratio of AD versus normal FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on this ratio (fold change), and those with the 30 highest and 30 lowest fold change values are shown here. (XLSX) Click here for additional data file. Top 20 Clusters from functional enrichment analysis using the DAVID tool for total brain. The NCBI tool, DAVID, was used to investigate functional associations of gene expression changes seen in AD total brain. There were 1071 genes that were more than two-fold over- or under-expressed in AD relative to normal total brain and these were analysed by the functional clustering tool. Gene Ontology Biological Process was selected as the annotation category for clustering. Once the tool has identified enriched ontologies for a particular gene list, it creates annotation clusters with those that have a statistically significant overlap in terms of their constituent genes. The top 20 annotation clusters are shown in this table. (XLSX) Click here for additional data file. Top 20 Clusters from functional enrichment analysis using the DAVID tool for frontal lobe. The NCBI tool, DAVID, was used to investigate functional associations of gene expression changes seen in AD frontal lobe. There were 944 genes that were more than two-fold over- or under-expressed in AD relative to normal frontal lobe and these were analysed by the functional clustering tool. Gene Ontology Biological Process was selected as the annotation category for clustering. Once the tool has identified enriched ontologies for a particular gene list, it creates annotation clusters with those that have a statistically significant overlap in terms of their constituent genes. The top 20 annotation clusters are shown in this table. (XLSX) Click here for additional data file. Top 20 Clusters from functional enrichment analysis using the DAVID tool for temporal lobe. The NCBI tool, DAVID, was used to investigate functional associations of gene expression changes seen in AD temporal lobe. There were 1416 genes that were more than two-fold over- or under-expressed in AD relative to normal temporal lobe and these were analysed by the functional clustering tool. Gene Ontology Biological Process was selected as the annotation category for clustering. Once the tool has identified enriched ontologies for a particular gene list, it creates annotation clusters with those that have a statistically significant overlap in terms of their constituent genes. The top 20 annotation clusters are shown in this table. (XLSX) Click here for additional data file.
Table 3

Top ten up- and down-regulated genes in AD total brain.

GeneDescriptionChromosomeFPKM NFPKM ADFold changep-valueEnsembl Gene ID
IGHA1 immunoglobulin heavy constant alpha 1chr140.2340925.27536422.535430510.00018499ENSG00000211895
RP11-552E20.3 not annotatedchr61.8753914.2721937.6102533348.76E-009not annotated
PCYT1A phosphate cytidylyltransferase 1, choline, alphachr30.4136373.0219567.3058164530.00801203ENSG00000161217
SLC7A9 solute carrier family 7 (cationic amino acid transporter, y+ system), member 9chr190.7058224.8343266.8492141080.0105864ENSG00000021488
RAD54L RAD54-like (S. cerevisiae)chr10.4364952.3917195.4793731890.0259394ENSG00000085999
OAS1 2′,5′-oligoadenylate synthetase 1, 40/46kDachr123.8277320.9735365.4793666224.89E-008ENSG00000089127
MTIF2 mitochondrial translational initiation factor 2chr23.7575316.1769994.3052215157.00E-007ENSG00000085760
STAB1 stabilin 1chr30.7296262.8873643.95732060.0317452ENSG00000010327
CD22 CD22 moleculechr199.8381836.8837423.7490411840ENSG00000012124
AC018730.1 not annotatedchr29.416132.9078953.4948540278.88E-016not annotated
RELN reelinchr719.44430.055404−350.95480472.22E-016ENSG00000189056
ANK1 ankyrin 1, erythrocyticchr813.72020.086115−159.32415968.88E-013ENSG00000029534
GRM4 glutamate receptor, metabotropic 4chr629.22030.392424−74.461042140ENSG00000124493
GRM1 glutamate receptor, metabotropic 1chr67.965430.142632−55.846023331.76E-008ENSG00000152822
TFRC transferrin receptor (p90, CD71)chr39.171080.180114−50.918196253.81E-008ENSG00000072274
DAO D-amino-acid oxidasechr1210.04590.20387−49.276009224.99E-008ENSG00000110887
ABLIM1 actin binding LIM protein 1chr1019.20580.39862−48.18072353.21E-011ENSG00000099204
KIAA0802 KIAA0802chr1814.42330.387405−37.230546844.61E-007ENSG00000168502
MED13L mediator complex subunit 13-likechr127.777480.210969−36.865511057.40E-010ENSG00000123066
ITGB8 integrin, beta 8chr77.389080.20143−36.683115725.17E-007ENSG00000105855

Differential gene expression for total brain was calculated using the ratio of AD versus normal (N) FPKM values for every gene identified as expressed by Cufflinks. The genes were ranked on their fold changes and the ten with the highest or lowest fold changes are shown here.

  49 in total

1.  Statistical inferences for isoform expression in RNA-Seq.

Authors:  Hui Jiang; Wing Hung Wong
Journal:  Bioinformatics       Date:  2009-02-25       Impact factor: 6.937

2.  Parkinson's disease patient-derived induced pluripotent stem cells free of viral reprogramming factors.

Authors:  Frank Soldner; Dirk Hockemeyer; Caroline Beard; Qing Gao; George W Bell; Elizabeth G Cook; Gunnar Hargus; Alexandra Blak; Oliver Cooper; Maisam Mitalipova; Ole Isacson; Rudolf Jaenisch
Journal:  Cell       Date:  2009-03-06       Impact factor: 41.582

3.  A global view of gene activity and alternative splicing by deep sequencing of the human transcriptome.

Authors:  Marc Sultan; Marcel H Schulz; Hugues Richard; Alon Magen; Andreas Klingenhoff; Matthias Scherf; Martin Seifert; Tatjana Borodina; Aleksey Soldatov; Dmitri Parkhomchuk; Dominic Schmidt; Sean O'Keeffe; Stefan Haas; Martin Vingron; Hans Lehrach; Marie-Laure Yaspo
Journal:  Science       Date:  2008-07-03       Impact factor: 47.728

Review 4.  UCSC genome browser tutorial.

Authors:  Ann S Zweig; Donna Karolchik; Robert M Kuhn; David Haussler; W James Kent
Journal:  Genomics       Date:  2008-06-02       Impact factor: 5.736

5.  Hippocampal estrogen receptor-alpha splice variant TADDI in the human brain in aging and Alzheimer's disease.

Authors:  T A Ishunina; D F Swaab
Journal:  Neuroendocrinology       Date:  2008-09-25       Impact factor: 4.914

6.  Ultrafast and memory-efficient alignment of short DNA sequences to the human genome.

Authors:  Ben Langmead; Cole Trapnell; Mihai Pop; Steven L Salzberg
Journal:  Genome Biol       Date:  2009-03-04       Impact factor: 13.583

7.  pH measurement as quality control on human post mortem brain tissue: a study of the BrainNet Europe consortium.

Authors:  C M Monoranu; M Apfelbacher; E Grünblatt; B Puppe; I Alafuzoff; I Ferrer; S Al-Saraj; K Keyvani; A Schmitt; P Falkai; J Schittenhelm; G Halliday; J Kril; C Harper; C McLean; P Riederer; W Roggendorf
Journal:  Neuropathol Appl Neurobiol       Date:  2009-06       Impact factor: 8.090

8.  Apolipoprotein E-dependent accumulation of Alzheimer disease-related lesions begins in middle age.

Authors:  Eloise Kok; Satu Haikonen; Teemu Luoto; Heini Huhtala; Sirkka Goebeler; Hannu Haapasalo; Pekka J Karhunen
Journal:  Ann Neurol       Date:  2009-06       Impact factor: 10.422

9.  Alternative isoform regulation in human tissue transcriptomes.

Authors:  Eric T Wang; Rickard Sandberg; Shujun Luo; Irina Khrebtukova; Lu Zhang; Christine Mayr; Stephen F Kingsmore; Gary P Schroth; Christopher B Burge
Journal:  Nature       Date:  2008-11-27       Impact factor: 49.962

10.  TopHat: discovering splice junctions with RNA-Seq.

Authors:  Cole Trapnell; Lior Pachter; Steven L Salzberg
Journal:  Bioinformatics       Date:  2009-03-16       Impact factor: 6.937

View more
  137 in total

1.  Naturally occurring genetic variability in expression of Gsta4 is associated with differential survival of axotomized rat motoneurons.

Authors:  Mikael Ström; Faiez Al Nimer; Rickard Lindblom; Jens Randel Nyengaard; Fredrik Piehl
Journal:  Neuromolecular Med       Date:  2011-12-08       Impact factor: 3.843

Review 2.  RNA-Seq and human complex diseases: recent accomplishments and future perspectives.

Authors:  Valerio Costa; Marianna Aprile; Roberta Esposito; Alfredo Ciccodicola
Journal:  Eur J Hum Genet       Date:  2012-06-27       Impact factor: 4.246

3.  Characterization of the role of metallothionein-3 in an animal model of Alzheimer's disease.

Authors:  Yasmina Manso; Javier Carrasco; Gemma Comes; Gabriele Meloni; Paul A Adlard; Ashley I Bush; Milan Vašák; Juan Hidalgo
Journal:  Cell Mol Life Sci       Date:  2012-06-22       Impact factor: 9.261

4.  Neuroepigenomics: Resources, Obstacles, and Opportunities.

Authors:  John S Satterlee; Andrea Beckel-Mitchener; Roger Little; Dena Procaccini; Joni L Rutter; Amy C Lossie
Journal:  Neuroepigenetics       Date:  2015-01-01

Review 5.  Single-cell and regional gene expression analysis in Alzheimer's disease.

Authors:  Ruby Kwong; Michelle K Lupton; Michal Janitz
Journal:  Cell Mol Neurobiol       Date:  2012-01-22       Impact factor: 5.046

6.  Nuclear receptor binding factor 2 (NRBF2) is required for learning and memory.

Authors:  Xiaosen Ouyang; Israr Ahmad; Michelle S Johnson; Matthew Redmann; Jason Craver; Willayat Y Wani; Gloria A Benavides; Balu Chacko; Peng Li; Martin Young; Anil G Jegga; Victor Darley-Usmar; Jianhua Zhang
Journal:  Lab Invest       Date:  2020-04-29       Impact factor: 5.662

7.  Increased O-GlcNAc levels correlate with decreased O-GlcNAcase levels in Alzheimer disease brain.

Authors:  Sarah Förster; Andrew S Welleford; Judy C Triplett; Rukhsana Sultana; Brigitte Schmitz; D Allan Butterfield
Journal:  Biochim Biophys Acta       Date:  2014-05-23

8.  A Novel Relationship for Schizophrenia, Bipolar, and Major Depressive Disorder. Part 8: a Hint from Chromosome 8 High Density Association Screen.

Authors:  Xing Chen; Feng Long; Bin Cai; Xiaohong Chen; Lizeng Qin; Gang Chen
Journal:  Mol Neurobiol       Date:  2016-09-22       Impact factor: 5.590

9.  HDAC inhibitor-dependent transcriptome and memory reinstatement in cognitive decline models.

Authors:  Eva Benito; Hendrik Urbanke; Binu Ramachandran; Jonas Barth; Rashi Halder; Ankit Awasthi; Gaurav Jain; Vincenzo Capece; Susanne Burkhardt; Magdalena Navarro-Sala; Sankari Nagarajan; Anna-Lena Schütz; Steven A Johnsen; Stefan Bonn; Reinhardt Lührmann; Camin Dean; André Fischer
Journal:  J Clin Invest       Date:  2015-08-17       Impact factor: 14.808

10.  Transcriptomic analysis of postmortem brain identifies dysregulated splicing events in novel candidate genes for schizophrenia.

Authors:  Ori S Cohen; Sarah Y Mccoy; Frank A Middleton; Sean Bialosuknia; Yanli Zhang-James; Lu Liu; Ming T Tsuang; Stephen V Faraone; Stephen J Glatt
Journal:  Schizophr Res       Date:  2012-10-09       Impact factor: 4.939

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.