| Literature DB >> 25501035 |
Joel E Kleinman1, Daniel R Weinberger1,2,3,4,5, Andrew E Jaffe1,6,7, Jooheon Shin1, Leonardo Collado-Torres1,6, Jeffrey T Leek6,2, Ran Tao1, Chao Li1, Yuan Gao1, Yankai Jia1, Brady J Maher1,3,4, Thomas M Hyde1,3,4,5,8.
Abstract
Transcriptome analysis of human brain provides fundamental insight into development and disease, but it largely relies on existing annotation. We sequenced transcriptomes of 72 prefrontal cortex samples across six life stages and identified 50,650 differentially expression regions (DERs) associated with developmental and aging, agnostic of annotation. While many DERs annotated to non-exonic sequence (41.1%), most were similarly regulated in cytosolic mRNA extracted from independent samples. The DERs were developmentally conserved across 16 brain regions and in the developing mouse cortex, and were expressed in diverse cell and tissue types. The DERs were further enriched for active chromatin marks and clinical risk for neurodevelopmental disorders such as schizophrenia. Lastly, we demonstrate quantitatively that these DERs associate with a changing neuronal phenotype related to differentiation and maturation. These data show conserved molecular signatures of transcriptional dynamics across brain development, have potential clinical relevance and highlight the incomplete annotation of the human brain transcriptome.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25501035 PMCID: PMC4281298 DOI: 10.1038/nn.3898
Source DB: PubMed Journal: Nat Neurosci ISSN: 1097-6256 Impact factor: 24.884
Figure 1Schematic design of the project. We performed RNA sequencing (RNA-seq) on 36 DLPFC samples from across the lifespan, and implemented the derfinder method to identify “differentially expressed regions” (DERs). These DERs were replicated in an independent DLPFC sample, and explored across other brain regions, in the developing mouse cortex, in diverse cell and tissue types, and in the context of disease-associated gene sets. An example of a DER is shown in the top right corner (see legend of Figure S1 for a detailed description). We additionally quantified the cell composition of these DLPFC samples and defined regions of expression across the genome by age group.
Figure 2Age-associated differentially expressed region (DER) expression patterns across multiple brain regions. Principal component analysis (PCA) was performed on normalized coverage estimates across all DERs using all BrainSpan samples. Each point is a sample colored by age (purple: prenatal and green: postnatal), where white corresponds to birth.
Correlation of fetal versus adult fold changes across brain regions within differentially expressed regions (DERs]. Spearman correlation coefficients were calculated between log2 fold changes comparing fetal versus postnatal expression levels within the DLPFC discovery dataset and each brain region in the BrainSpan database across the DERs [All], and within the DERs annotated to specific Ensembl features.
| BrainSpan Region | All (N=50,560) | Intragenic (N=4,221) | Intronic (N=16,616) | Exonic (N=29,813) |
|---|---|---|---|---|
|
| ||||
| DFC | 0.863 | 0.702 | 0.49 | 0.895 |
| VFC | 0.851 | 0.684 | 0.429 | 0.888 |
| MFC | 0.858 | 0.705 | 0.485 | 0.891 |
| OFC | 0.845 | 0.674 | 0.36 | 0.891 |
| M1C | 0.841 | 0.675 | 0.388 | 0.882 |
| S1C | 0.83 | 0.657 | 0.326 | 0.878 |
| IPC | 0.849 | 0.681 | 0.464 | 0.882 |
| A1C | 0.86 | 0.698 | 0.517 | 0.888 |
| STC | 0.871 | 0.72 | 0.576 | 0.894 |
| ITC | 0.852 | 0.694 | 0.473 | 0.881 |
| V1C | 0.867 | 0.701 | 0.534 | 0.894 |
|
| ||||
| HIP | 0.828 | 0.66 | 0.397 | 0.862 |
| AMY | 0.845 | 0.677 | 0.444 | 0.872 |
| STR | 0.788 | 0.607 | 0.428 | 0.816 |
| MD | 0.699 | 0.528 | 0.266 | 0.731 |
| CBC | 0.627 | 0.434 | 0.23 | 0.673 |
DFC: Dorsolateral prefrontal cortex; VFC: Ventrolateral prefrontal cortex; MFC: Anterior (rostal] cingulate (medial frontal cortex]; OFC: Orbital frontal cortex; MIC: Primary motor cortex (area M1, area 4]; S1C: Primary somatosensort cortex (area S1, areas 3,1,2]; IPC: Posteroinferior (ventral] parietal cortex; A1C: Primary auditory cortex (core]; STC: Posterior (caudal] superior temportal cortex (area Tac]; ITC: Inferolateral temportal cortex (area Tev, area 20]; V1C: Primary visual cortex (striate cortex, area V1/17]; HIP: Hippocampus (hippocampal formation]; AMY: Amygaloid complex; STR: Striatum; MD: Mediodorsal nucleus of thalamus; CBC: Cerebellar cortex.
Figure 3Cross-species comparison of differentially expressed regions (DERs). Significant DERs were lifted over to the mouse genome mm10 and RNA-seq coverage was extracted from the reprocessed Dillman et al 2013 study comparing E17 to adult C57BL/6 mice. Log2 fold changes comparing depth-adjusted mean differences between fetal and adult human samples are highly correlated with E17 versus adult mouse samples within each DER, stratified by human-annotated (A) exonic, (B) intronic, and (C) intergenic sequence, such that any DER with both exonic and intronic sequence was classified as exonic. Each point represents a single DER, where the size indicates the proportion of the DERs width that was successfully lifted over. ρ = Spearman correlation, κ = directionality concordance (e.g. higher or lower expression in fetal relative to adult in both species).
Figure 4Clustering analysis of differentially expressed regions (DERs). Principal component analysis (PCA) of (A) all significant DERs, (B) non-exonic sequence within the DERs and (C) gene counts from Ensembl annotation. PCA was performed on log2 adjusted coverage estimates across multiple datasets including our human brain samples along with publicly available differentiating stem cell and somatic tissue data. Colors and shapes for each point represent dataset and condition (see legend).
Enrichment of DERs among GWAS-positive regions. Shown are p-values assessing significant overlap between DERs and locations of GWAS-positive loci for schizophrenia, Alzheimer's disease, Parkinson's disease, and type 2 diabetes.
| Trait | All | Exon | Intron | Intergenic |
|---|---|---|---|---|
| Schizophrenia | 0.0013 | 0.0001 | 0.0003 | 0.0530 |
| Alzheimer's Disease | 0.0385 | 0.2778 | 0.0117 | 0.6016 |
| Parkinson's Disease | 0.0039 | 0.0100 | 0.0035 | 0.0882 |
| Type 2 Diabetes | 0.2500 | 0.1029 | 0.4307 | 0.1200 |
Expressed sequences/regions by age group defined by 5 or more adjusted reads across consecutive bases (adjusted for library size]. MB: megabases; exonic/intronic/intergenic: the percentages of the expressed regions overlapping annotated features; exons/introns: the converse, being the proportion of all Ensembl features (313,836 unique exons and 266,102 unique introns] covered by expressed sequences in– each age group. “108 PGC2 for SZ” – the number of PGC2 loci overlapping at least 1 expressed sequence in DLPFC. Lastly, we show the percent of expressed regions when defined using 10 or more adjusted reads, as a sensitivity analysis.
| Age Group | ||||||
|---|---|---|---|---|---|---|
| Fetal | Infant | Child | Teen | Adult | 50+ | |
|
| 459,426 | 481,029 | 413,202 | 365,903 | 437,935 | 420,294 |
|
| 46,813 | 37,618 | 33,958 | 31,818 | 32,849 | 31,563 |
|
| 121.8 | 107.5 | 97.1 | 90.5 | 92.9 | 91.4 |
|
| 4.1% | 3.6% | 3.2% | 3.0% | 3.1% | 3.0% |
|
| 44.0% | 46.8% | 54.0% | 58.8% | 53.1% | 54.1% |
|
| 77.1% | 72.8% | 71.1% | 70.2% | 69.9% | 68.9% |
|
| 11.9% | 13.3% | 12.9% | 12.5% | 12.9% | 13.4% |
|
| 55.2% | 56.8% | 56.9% | 55.3% | 56.5% | 55.8% |
|
| 57.6% | 58.1% | 57.7% | 55.4% | 57.2% | 56.0% |
|
| 83 | 84 | 83 | 82 | 83 | 88 |
|
| 73.2% | 65.6% | 64.6% | 64.4% | 63.7% | 62.4% |