Literature DB >> 32561870

Transcriptome data of temporal and cingulate cortex in the Rett syndrome brain.

Kimberly A Aldinger1,2, Andrew E Timms3, James W MacDonald4, Hanna K McNamara5, Jennifer S Herstein6, Theo K Bammler4, Oleg V Evgrafov7,6,8, James A Knowles7,6,8, Pat Levitt9.   

Abstract

Rett syndrome is an X-linked neurodevelopmental disorder caused by mutation in the methyl-CpG-binding protein 2 gene (MECP2) in the majority of cases. We describe an RNA sequencing dataset of postmortem brain tissue samples from four females clinically diagnosed with Rett syndrome and four age-matched female donors. The dataset contains 16 transcriptomes, including two brain regions, temporal and cingulate cortex, for each individual. We compared our dataset with published transcriptomic analyses of postmortem brain tissue from Rett syndrome and found consistent gene expression alterations among regions of the cerebral cortex. Our data provide a valuable resource to explore the biology of the human brain in Rett syndrome.

Entities:  

Mesh:

Year:  2020        PMID: 32561870      PMCID: PMC7305197          DOI: 10.1038/s41597-020-0527-2

Source DB:  PubMed          Journal:  Sci Data        ISSN: 2052-4463            Impact factor:   6.444


Background & Summary

Rett syndrome (RTT) is an X-linked neurodevelopmental disorder mostly caused by heterozygous de novo mutation in the methyl-CpG-binding protein 2 gene (MECP2) and predominantly affecting females[1]. MECP2 duplications have been identified in males with developmental encephalopathy, seizures, autistic features, and recurrent infection[2]. These clinical disorders illustrate the critical requirement for proper MECP2 expression in human brain development, though how MeCP2 dysfunction leads to the RTT phenotype is unclear. MeCP2 acts as a global transcriptional regulator by recruiting chromatin-remodeling complexes or regulating higher-order chromatin structures[3-8]. Thus, MeCP2 may be required for fine-tuning the gene expression for a network of protein-coding genes through both direct and indirect mechanisms. Consistent with this hypothesis, small magnitude changes in gene expression have been detected in brain tissue from either human postmortem RTT samples or mouse Mecp2-mutants[9-12]. However, most transcriptional studies of postmortem RTT brain have used microarray platforms with small numbers and a lack of age-matched control samples, which impact the sensitivity for detecting transcriptional changes. One study used both microarrays and RNA sequencing (RNA-seq) to examine frontal and temporal cortex from individuals with RTT compared to controls and identified over 200 differentially expressed genes after normalizing data for neuron versus glia composition of samples[13]. Another larger study used RNA-seq to examine motor cortex and cerebellum and identified over 2,000 differentially expressed genes with a global increase in expression[14]. We generated RNA-seq data using brain samples for two distinct brain regions, temporal cortex and cingulate cortex, from four female RTT and four age-matched female donors. Reduced volume and dendritic branching of neurons in the temporal cortex and reduced connectivity of the cingulate cortex have been reported in RTT, indicating the importance of these brain regions in the disorder[15-18]. We also compared our data with the transcriptomic profiles of RTT brain samples from published RNA-seq datasets[13,14]. The composite analysis will be useful to facilitate interpretation and further understanding of MECP2-mediated changes in human brain.

Methods

Brain samples

Postmortem brain tissue samples were obtained from the Harvard Brain Bank (http://hbtrc.mclean.harvard.edu/) and the National Institutes of Health (NIH) NeuroBioBank (https://neurobiobank.nih.gov), with approval from the coordinating foundation (https://www.rettsyndrome.org). Consent was obtained from next of kin and tissue was collected with approval from the Partners Human Research Committee for the Harvard Brain Bank and from The University of Maryland Institutional Review Board (IRB) and The Maryland Department of Health and Mental Hygiene IRB for the NeuroBioBank. Work was approved by the University of Southern California and is compliant with all ethical regulations. Frozen temporal (BA36/38) and cingulate cortex samples were obtained from four RTT and four control (CTL) brain donors that were matched in age (Fig. 1). The Harvard Brain Bank sequenced MECP2 coding exons and reported intragenic mutations in two of the four brains. Brain donor characteristics are described in Table 1.
Fig. 1

Overview of the experimental workflow.

Table 1

Brain Donor Characteristics.

BrainGroupGenderAgePMISourceMECP2
NM_004992.3
cDNAProtein
1038CTLF247NBBNANA
1614CTLF2718NBBNANA
4724CTLF1615NBBNANA
4725CTLF3217NBBNANA
6355RTTF1628HBTRCNRNR
7773RTTF2425HBTRCc.473 C > Tp.Thr158Met
7783RTTF2639HBTRCExon Del
B7992RTTF3130HBTRCNRNR

Abbreviations: CTL, control; F, female; HBTRC, Harvard Brain Tissue Resource Center; NA, not applicable; NBB, NIH NeuroBioBank; NR, no mutation in MECP2 reported; PMI, postmortem interval in hours; RTT, Rett syndrome.

Overview of the experimental workflow. Brain Donor Characteristics. Abbreviations: CTL, control; F, female; HBTRC, Harvard Brain Tissue Resource Center; NA, not applicable; NBB, NIH NeuroBioBank; NR, no mutation in MECP2 reported; PMI, postmortem interval in hours; RTT, Rett syndrome.

MECP2 variant confirmation

Genomic DNA was isolated from brain samples for 7773 and 7783 using the PureLink Genomic DNA Kit (LifeTechnologies) according to the manufacturer’s protocol. We performed Sanger sequencing of MECP2 to verify the reported variants (Table 1). Chromatograms were aligned to MECP2 (ENSG00000169057) using MAFFT v7[19]. No additional genes were screened.

RNA sample and library preparation

Total RNA was previously isolated using the Qiagen RNeasy Kit according to the manufacturer’s instructions[20]. Double stranded cDNA fragments were synthesized from mRNA, ligated with adapters, and size-selected for library construction according to the TruSeq Sample Preparation v2 protocol using 0.5–1.5 μg of total RNA (Table 2). ERCC RNA spike-in controls were not included in this experiment. Library quality was measured using an Agilent 2100 Bioanalyzer and concentration was assessed by PicoGreen incorporation. Barcoded libraries were pooled and sequenced in two lanes using an Illumina HiSeq 2000 sequencer.
Table 2

RNA Sample Characteristics.

BrainGroupRegionRINRNA (μg)GEOSRA
1038CTLCCTX9.81.50GSM3673208SRX5527579
1614CTLCCTX9.20.70GSM3673209SRX5527580
4724CTLCCTX9.10.60GSM3673210SRX5527581
4725CTLCCTX9.51.50GSM3673211SRX5527582
6355RTTCCTX8.60.60GSM3673212SRX5527583
7773RTTCCTX9.81.50GSM3673213SRX5527584
7783RTTCCTX8.60.70GSM3673214SRX5527585
B7992RTTCCTX101.50GSM3673215SRX5527586
1038CTLTCTX (BA38)101.50GSM3673216SRX5527587
1614CTLTCTX (BA36)8.30.51GSM3673217SRX5527588
4724CTLTCTX (BA38)9.71.28GSM3673218SRX5527589
4725CTLTCTX (BA38)9.31.00GSM3673219SRX5527590
6355RTTTCTX8.61.28GSM3673220SRX5527591
7773RTTTCTX9.71.50GSM3673221SRX5527592
7783RTTTCTX8.70.51GSM3673222SRX5527593
B7992RTTTCTX9.61.00GSM3673223SRX5527594

Abbreviations: BA, Brodmann area; CCTX, cingulate cortex; CTL, control; RIN, RNA integrity number; RNA, quantity of total RNA used as input for library preparation; RTT, Rett syndrome; TCTX, temporal cortex.

RNA Sample Characteristics. Abbreviations: BA, Brodmann area; CCTX, cingulate cortex; CTL, control; RIN, RNA integrity number; RNA, quantity of total RNA used as input for library preparation; RTT, Rett syndrome; TCTX, temporal cortex.

RNA-Seq data analysis

Single-end reads (100 bp) were aligned to the Human reference genome (NCBI build 37/hg19) using STAR v2.5.3a[21] (see Code Availability 1). Aligned reads mapping to the exons of a gene were summarized into gene counts using featureCounts v1.6[22] (see Code Availability 2). Picard CollecteRnaSeqMetrics was used to measure the 3′ bias of genes in the RNA-seq data (see Code Availability 3). Gene-level differential expression was analyzed using DESeq2[23] specifying ~ region + group + bias as the experimental design (see Code Availability 4). Aligned reads mapping to MECP2 isoforms were also summarized using featureCounts v1.6[22] (see Code availability 2) by substituting isoforms for gene name.

Data Records

Count matrix and normalized count matrix were submitted to the NCBI Gene Expression Omnibus (GEO) under accession number GSE128380[24]. The raw FASTQ files can be downloaded from the Sequence Read Archive (SRA) under accession number SRP188555[25].

Technical Validation

We verified the presence of the MECP2 c.473 C > T (p.Thr158Met) intragenic variant using DNA isolated from brain 7773 (Supplemental Fig. 1). No MECP2 variants were detected in exons 2–4 of brain 7783. Since we were unable to amplify exon 1 in 7783, we infer exon 1 is likely to be the deleted exon. We also examined RNA-seq data for presence of MECP2 variants (Supplemental Fig. 2). The MECP2 c.473 C > T (p.Thr158Met) intragenic variant was also detected in RNA-seq data from CCTX and TCTX for brain 7773. MECP2 variants were not detected in RNA-seq data for other RTT brain samples, possibly due to low sequencing read depth of MECP2 (Supplemental Fig. 3), or because causal variants are present in another gene[26,27].

RNA and data quality

RNA quality was determined using the Agilent 2100 Bioanalyzer and the RNA 6000 Pico Kit and high-quality RNA was obtained from all samples (RNA integrity number [RIN] > 8.0; median RIN = 9.4 [Table 2]). At the time the experiment was performed, the TruSeq RNA Sample Prep v2 protocol (Part # 15026495 Rev.C, May 2012) was optimized for 0.1–4 μg of total RNA. Although the quantity of RNA input varied among the samples in our experiment, it was equivalent within each age- and tissue- matched case-control sample pair, and all samples were within the optimized range. On average, RNA-seq generated 21.9 million high-quality reads per sample, 70.3% of which mapped uniquely to the Human reference genome (NCBI build 37/hg19) (Table 3). RIN and RNA quantity were each correlated with the number of uniquely mapped reads (Fig. 2). Cook’s distance was calculated to test for outliers, with none detected (Fig. 3a). The first principal component explained over 50% of the variance (Fig. 3b). A correlation matrix based on the gene expression data indicated that samples mostly cluster by individual and diagnostic group, but also by 3′ bias (Fig. 3c).
Table 3

RNA-seq Data Mapping Statistics.

BrainGroupRegionTotal Reads% Bases ≥ Q30Mean Base QualityUniquely Mapped ReadsMapping RateMedian 5′ to 3′ Bias
1038CTLCCTX22,666,18577.0931.7118,508,34681.662.16
1614CTLCCTX20,406,02876.1831.4213,966,75168.440.76
4724CTLCCTX17,775,77876.2131.4512,903,74872.590.51
4725CTLCCTX17,152,40076.5831.5713,787,31380.381.36
6355RTTCCTX16,799,77476.3831.4911,754,02769.960.71
7773RTTCCTX22,119,41677.0131.7017,627,39179.691.36
7783RTTCCTX32,207,23977.1631.7222,511,24169.890.43
B7992RTTCCTX21,882,35976.5831.5617,048,61777.911.09
1038CTLTCTX (BA38)23,689,67877.7231.8816,671,04270.371.34
1614CTLTCTX (BA36)17,569,73878.0131.9711,367,08564.700.44
4724CTLTCTX (BA38)22,429,52777.8431.9315,519,59669.190.98
4725CTLTCTX (BA38)24,262,54578.0331.9716,322,63567.281.04
6355RTTTCTX25,330,54476.9531.6414,294,79956.430.36
7773RTTTCTX22,557,49177.8531.9215,330,59167.960.97
7783RTTTCTX17,060,38177.8231.9211,301,79066.250.82
B7992RTTTCTX26,211,05777.1131.7016,434,10362.701.01
Fig. 2

RNA quality or RNA quantity versus number of uniquely mapped reads.

Fig. 3

RNA-seq data quality assessment. (a) Boxplots showing Cook’s distance calculated for each sample. (b) Principal component analysis with samples colored by diagnostic group (CTL, RTT), brain region (CCTX, TCTX), or brain donor. (c) Heatmap of the sample distance matrix. Presence (black) or absence (grey) of 3′ bias in RNA-seq data is indicated for each sample.

RNA-seq Data Mapping Statistics. RNA quality or RNA quantity versus number of uniquely mapped reads. RNA-seq data quality assessment. (a) Boxplots showing Cook’s distance calculated for each sample. (b) Principal component analysis with samples colored by diagnostic group (CTL, RTT), brain region (CCTX, TCTX), or brain donor. (c) Heatmap of the sample distance matrix. Presence (black) or absence (grey) of 3′ bias in RNA-seq data is indicated for each sample.

MECP2 and MET differential expression

We previously used quantitative reverse transcription PCR to compare expression of MECP2_e1 (NM_004992.3), MECP2_e2 (NM_001110792.1), and MET (NM_000245.3) in the temporal cortex between RTT and CTL brains[28]. Consistent with our previous results, the RNA-seq data showed no significant difference in MECP2 expression between RTT and CTL brains (FDR adjusted p-value = 0.16 and 0.59, respectively), while MET expression was significantly reduced in RTT brains (FDR = 1.07 × 10−05; Fig. 4).
Fig. 4

Boxplots showing the expression of MECP2_e1, MECP2_e2, and MET in RTT and CTL brain. Expression values are shown as normalized counts.

Boxplots showing the expression of MECP2_e1, MECP2_e2, and MET in RTT and CTL brain. Expression values are shown as normalized counts.

Compatibility with published transcriptional profiles

Two RNA-seq datasets of postmortem brain from females with RTT compared to controls have been published[13,14] (Table 4). The first dataset examined pooled frontal and temporal cortex (FTTX) for each of three individuals with RTT compared to three CTL and is available from the Sequence Read Archive under accession number PRJNA302685[29]. The second larger dataset examined motor cortex (Motor) and cerebellum (Cblm) for nine females and six females with RTT, respectively, compared to eight CTL of each tissue, but the primary data were not accessible[14]. We downloaded the FASTQ files for the available dataset, aligned reads using salmon[30] (see Code Availability 6), summarized the aligned reads into gene counts using tximport v1.12.1[31] (see Code Availability 7), and retained genes with ≥10 counts in ≥3 samples. Count data were converted to logCPM to adjust for the total counts per sample using limma v3.40.2[32] (see Code Availability 8), then observation-level and sample-level weights were estimated using voom[32,33]. We also reanalyzed our data using this workflow. Analysis of variance models were fit separately for each of three brain regions (CCTX, FCTX, FTTX), then combined in a random effects meta-analysis using GeneMeta v1.56.0[34] (see Code Availability 9). Our meta-analysis identified 1,455 genes that were significantly differentially expressed (FDR < 0.05) between brain samples from control individuals and those with RTT.
Table 4

Published RTT Brain RNA-seq datasets.

DatasetBrain Region# Cases# ControlsSRAPMID
LinFTTX23PRJNA30268527267200
TCTX1
GogliottiMotor9829523700
CBLM68

Abbreviations: CBLM, cerebellum; FTTX, pooled frontal cortex and temporal cortex from the same individual; Motor, motor cortex; TCTX, temporal cortex; SRA, sequence read archive; PMID, PubMed unique identifier for published reference article.

Published RTT Brain RNA-seq datasets. Abbreviations: CBLM, cerebellum; FTTX, pooled frontal cortex and temporal cortex from the same individual; Motor, motor cortex; TCTX, temporal cortex; SRA, sequence read archive; PMID, PubMed unique identifier for published reference article. To verify these results, we compared the results from our meta-analysis with differential gene expression results from previous RTT RNA-seq analyses[13,14] (Fig. 5). We compared the Z-score for each of the significantly differentially expressed genes from our meta-analysis with the log2 fold change from our previous analysis (GEO DESeq2) and from each of the three published RNA-seq datasets (Lin et al., Gogliotti et al. Motor, and Gogliotti et al. Cblm; Fig. 5a). We found strong concordance among RTT transcriptional profiles from regions within the cerebral cortex, while RTT transcriptional profiles from the cerebellum were least correlated with the regions from the cerebral cortex (Fig. 5b). We aggregated the gene-wise correlation coefficients among datasets and found an overall positive correlation for 63% of the comparisons among datasets, indicating an overall agreement among the differential gene expression per dataset (Fig. 5c). Not only do our data represent an independent technical and biological replication of molecular alterations in RTT brain, but our meta-analysis demonstrates the power of combining datasets to maximize detectable results among several smaller studies.
Fig. 5

Replication of differential gene expression between RTT and CTL brain. (a) Meta-analysis Z-Score compared to log2 fold change (FC) between RTT and CTL from our initial analysis (GEO), Lin et al. combined frontal and temporal cortex (from Table S5)[13], Gogliotti et al. motor cortex (from Table S2)[14], and Gogliotti et al. cerebellum (from Table S3)[14]. Genes with significant differential expression (False Discovery Rate [FDR] < 0.05) in the dataset represented on the X-axis are in red. (b) Spearman’s correlation between meta-analysis Z-score and logFC for each of the other datasets. Color intensity and circle size are proportional to the correlation coefficients with values displayed below the diagonal. (c) Density of gene-wise correlation coefficients among datasets in (b).

Replication of differential gene expression between RTT and CTL brain. (a) Meta-analysis Z-Score compared to log2 fold change (FC) between RTT and CTL from our initial analysis (GEO), Lin et al. combined frontal and temporal cortex (from Table S5)[13], Gogliotti et al. motor cortex (from Table S2)[14], and Gogliotti et al. cerebellum (from Table S3)[14]. Genes with significant differential expression (False Discovery Rate [FDR] < 0.05) in the dataset represented on the X-axis are in red. (b) Spearman’s correlation between meta-analysis Z-score and logFC for each of the other datasets. Color intensity and circle size are proportional to the correlation coefficients with values displayed below the diagonal. (c) Density of gene-wise correlation coefficients among datasets in (b). Supplementary information
Measurement(s)transcriptome • RNA
Technology Type(s)RNA sequencing
Factor Type(s)Rett syndrome brain versus control brain • brain regions (temporal cortex and cingulate cortex)
Sample Characteristic - OrganismHomo sapiens
  31 in total

1.  Combining multiple microarray studies and modeling interstudy variation.

Authors:  Jung Kyoon Choi; Ungsik Yu; Sangsoo Kim; Ook Joon Yoo
Journal:  Bioinformatics       Date:  2003       Impact factor: 6.937

2.  Transcriptional repression by the methyl-CpG-binding protein MeCP2 involves a histone deacetylase complex.

Authors:  X Nan; H H Ng; C A Johnson; C D Laherty; B M Turner; R N Eisenman; A Bird
Journal:  Nature       Date:  1998-05-28       Impact factor: 49.962

3.  STAR: ultrafast universal RNA-seq aligner.

Authors:  Alexander Dobin; Carrie A Davis; Felix Schlesinger; Jorg Drenkow; Chris Zaleski; Sonali Jha; Philippe Batut; Mark Chaisson; Thomas R Gingeras
Journal:  Bioinformatics       Date:  2012-10-25       Impact factor: 6.937

4.  Total RNA Sequencing of Rett Syndrome Autopsy Samples Identifies the M4 Muscarinic Receptor as a Novel Therapeutic Target.

Authors:  Rocco G Gogliotti; Nicole M Fisher; Branden J Stansley; Carrie K Jones; Craig W Lindsley; P Jeffrey Conn; Colleen M Niswender
Journal:  J Pharmacol Exp Ther       Date:  2018-03-09       Impact factor: 4.030

5.  Magnetic resonance spectroscopy and analysis of MECP2 in Rett syndrome.

Authors:  Pek Lan Khong; Ching Wan Lam; Clara G C Ooi; Chun Hung Ko; Virginia C N Wong
Journal:  Pediatr Neurol       Date:  2002-03       Impact factor: 3.372

6.  FXYD1 is an MeCP2 target gene overexpressed in the brains of Rett syndrome patients and Mecp2-null mice.

Authors:  Vivianne Deng; Valerie Matagne; Fatima Banine; Matthew Frerking; Patricia Ohliger; Sarojini Budden; Jonathan Pevsner; Gregory A Dissen; Larry S Sherman; Sergio R Ojeda
Journal:  Hum Mol Genet       Date:  2007-02-19       Impact factor: 6.150

7.  Decreased dendritic branching in frontal, motor and limbic cortex in Rett syndrome compared with trisomy 21.

Authors:  D D Armstrong; K Dunn; B Antalffy
Journal:  J Neuropathol Exp Neurol       Date:  1998-11       Impact factor: 3.685

8.  voom: Precision weights unlock linear model analysis tools for RNA-seq read counts.

Authors:  Charity W Law; Yunshun Chen; Wei Shi; Gordon K Smyth
Journal:  Genome Biol       Date:  2014-02-03       Impact factor: 13.583

9.  Rett and Rett-like syndrome: Expanding the genetic spectrum to KIF1A and GRIN1 gene.

Authors:  Jiaping Wang; Qingping Zhang; Yan Chen; Shujie Yu; Xiru Wu; Xinhua Bao
Journal:  Mol Genet Genomic Med       Date:  2019-09-11       Impact factor: 2.183

10.  Comparative DNA methylation among females with neurodevelopmental disorders and seizures identifies TAC1 as a MeCP2 target gene.

Authors:  Kimberly A Aldinger; Jasmine T Plummer; Pat Levitt
Journal:  J Neurodev Disord       Date:  2013-06-11       Impact factor: 4.025

View more
  1 in total

1.  Disrupted Timing of MET Signaling Derails the Developmental Maturation of Cortical Circuits and Leads to Altered Behavior in Mice.

Authors:  Xiaokuang Ma; Jing Wei; Yuehua Cui; Baomei Xia; Le Zhang; Antoine Nehme; Yi Zuo; Deveroux Ferguson; Pat Levitt; Shenfeng Qiu
Journal:  Cereb Cortex       Date:  2022-04-05       Impact factor: 4.861

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.