| Literature DB >> 29090078 |
Natasha L Pacheco1, Michael R Heaven2, Leanne M Holt1,3, David K Crossman4, Kristin J Boggio5, Scott A Shaffer5, Daniel L Flint6, Michelle L Olsen1,3.
Abstract
BACKGROUND: Rett syndrome (RTT) is an X-linked neurodevelopmental disorder caused by mutations in the transcriptional regulator MeCP2. Much of our understanding of MeCP2 function is derived from transcriptomic studies with the general assumption that alterations in the transcriptome correlate with proteomic changes. Advances in mass spectrometry-based proteomics have facilitated recent interest in the examination of global protein expression to better understand the biology between transcriptional and translational regulation.Entities:
Keywords: Multi-cellular deficits; Proteome; Rett syndrome; Transcriptome
Mesh:
Substances:
Year: 2017 PMID: 29090078 PMCID: PMC5655833 DOI: 10.1186/s13229-017-0174-4
Source DB: PubMed Journal: Mol Autism Impact factor: 7.509
Fig. 1Transcriptome-wide expression in Mecp2 cortex. a. Heat map of 391 significant, differentially expressed (DE) genes. Each genotype has 4 biological replicates, where each column represents 1 biological replicate and each row represents the Log10-transformed FPKM of a significant DE gene. Biological replicates are listed in the order of how they cluster, which is indicated by the cluster dendrogram above the heat map. Genes with a false discovery rate (q-value or FDR) of < 0.05 were considered to be significantly, differentially expressed. b. Volcano plot of all the detected genes’ expression (Log2 fold change) in the Mecp2 whole cortex transcriptome. Significant DE genes previously identified as RTT hits are highlighted in red crosses and arrows. Due to space constraints, additional genes identified in supplemental material from Chahrour et al. [4] and Veeraragavan et al. [34] were not highlighted in this volcano plot; for information on these genes, see Table 1. Dotted line indicates a q-value of 0.05, where anything above the line indicates a significant DE gene. c. Venn diagram comparing our transcriptome data to previously published microarray studies (Urdinguio et al. [24] and Tudor et al. [20]) on Mecp2 mouse cortex. Note that in the Urdinguio study, the fold change expression was not differentiated between cortex, midbrain, and cerebellum due to their finding that there were no differences in gene expression between the 3 brain regions [24]; rather, fold change values represent combined tissue expression. Six genes were shared between the Mecp2 transcriptome data and the Urdinguio et al. study, while 5 genes from the Tudor et al. study were shared in common with the transcriptome data. One of the targets (Fabp7) from the Tudor et al. study was also overlapped with the Urdinguio et al. study
Significant DE genes overlapping with previously identified RTT hits
| Gene targets: this study | UniProt: function keywords | Fold change | RTT gene hits: direction of expression (with references) |
|---|---|---|---|
|
| • Molecular function: ligase | − 0.79 | Decrease4 |
|
| • Molecular function: oxidoreductase | − 0.67 | Decrease4 |
|
| • |
|
|
|
| N/A | 1.40 | Increase7 |
|
| • Biological process: complement pathways; immunity; innate immunity | − 0.76 | Decrease7 |
|
| • |
|
|
|
| • |
|
|
|
| • Molecular function: chaperone | − 0.80 | Decrease8 |
|
| • Molecular function: kinase; transferase | 1.42 | Increase4, 8 |
|
| • Molecular function: hydrolase | − 0.71 | Decrease4 |
|
| • |
|
|
|
| • |
|
|
|
| • Molecular function: chloride channel; ion channel; ligand-gated ion channel; receptor | − 0.81 | Decrease6 |
|
| • Molecular function: receptor | 1.26 | Increase4, 8 |
|
| • Molecular function: G-protein coupled receptor; receptor; transducer | − 0.63 | Decrease4 |
|
| • Ligand: calcium; metal-binding | − 0.69 | Decrease4, 8 |
|
| • Biological process: stress response | − 0.80 | Decrease3 |
|
|
|
|
|
|
| • Molecular function: DNA-binding; repressor | − 0.66 | Decrease1, 4 |
|
| • |
|
|
|
| • Molecular function: actin-binding; calmodulin-binding; motor protein; myosin | 1.32 | Increase4, 8 |
|
| • Molecular function: hydrolase; protease; serine protease | − 0.74 | Decrease6 |
|
| • Molecular function: isomerase | − 0.71 | Decrease4, 8 |
|
| • |
|
|
|
| • Molecular function: kinase; serine/threonine-protein kinase; transferase | − 0.82 | Decrease8 |
|
| • |
|
|
|
| • Molecular function: glycosyltransferase; transferase | 1.29 | Increase6 |
|
| • Molecular function: oxidoreductase | − 0.74 | Decrease4 |
|
| • Molecular function: aminopeptidase; hydrolase; metalloprotease; protease | − 0.78 | Decrease4 |
|
| • |
|
|
|
| • Biological process: antiport; calcium transport; ion transport; olfaction; potassium transport; sensory transduction; sodium transport; symport; transport | 1.58 | Increase4 |
|
| • Biological process: Wnt signaling pathway | − 0.74 | Decrease8 |
|
| • Biological process: meiosis | 1.42 | Increase8 |
|
| • Molecular function: nucleotidyltransferase; transferase | − 0.73 | Decrease4 |
|
| • Molecular function: DNA-binding | 1.5 | Increase8 |
First column represents the significant DE gene identified in this study, along with the UniProt function keyword(s) for each gene (second column) and the Log2 fold change expression of the gene in our data set (third column). DE genes with an asterisk (*) represent genes identified in supplemental material from Chahrour et al. [4] and Veeraragavan et al. [34]. All function keywords were found from the UniProtKB/Swiss-Prot database [131] per respective DE gene. DE genes without a published function keyword in the UniProtKB/Swiss-Prot database are represented as “N/A.” The last column represents the previously identified RTT hits’ fold change expression direction, with each superscript representing that gene’s respective references. Superscript references are as follows: (1) Amir et al. [2], (2) Tudor et al. [20], (3) Nuber et al. [21], (4) Chahrour et al. [4], (5) Urdinguio et al. [24], (6) Ben-Shachar et al. [25], (7) Lin et al. [33], and (8) Veeraragavan et al. [34]. Rows in bold font represent previously identified RTT hits that were found to be specifically DE in the cortex
Fig. 2Cell type-specific gene expression correlation. a. Mecp2 cortex significant DE genes compared to top 500 CNS cell type-specific genes based on the Zhang et al. study [69], which is also provided as a public database. Each pie slice lists the number of DE genes associated with each CNS cell type. b. Fold change expression and gene size correlation in the Mecp2 transcriptome. Significant DE genes were plotted by Log2 fold change (FC) expression (y-axis) and gene size (x-axis; in units of kilobase (kb)) according to their respective CNS cell type distribution (based on part A). Expression and gene size correlations were also examined as a whole (bottom scatter plot in black). This relationship is also represented to the right of each scatter plot based on the number of short (defined as being less than 100 kb; gray) and long (defined as being greater than 100 kb; turquoise) genes that are either repressed (i.e., decreased expression) or activated (i.e., increased expression) in the Mecp2 cortex
Fig. 3Proteome-wide expression in Mecp2 cortex. a. Heat map of 460 significant, abundantly expressed proteins. Each column represents pooled biological replicates per genotype (n = 4), and each row represents the relative abundance fold change of an individual protein (with or without a PTM). Proteins with a p-value of < 0.1 were considered differentially abundant. b. Volcano plot of all the detected proteins’ expression (Log2 fold change) in the Mecp2 whole cortex proteome. Dotted line indicates a p-value of 0.1, where anything above the line indicates a significant protein. Previously identified RTT hits are highlighted in red, blue, or purple filled circles. Red circles denote that the significant protein was identified from a transcriptome-based gene expression study, blue circles denote identification from a proteomics-based study, and purple circles denote identification from a non-omics-based study. Due to space constraints, only the selected RTT protein hits identified from proteomics and non-omics-based studies were highlighted in this volcano plot. For a comprehensive list of all the identified RTT protein hits, refer to Additional file 8. c. Pie chart of significant proteins compared to top 500 cell type-specific genes based on the Zhang et al. study [69]. All significant proteins were included in the analysis. Each pie slice lists the number of significant proteins associated with each CNS cell type
Differentially abundant proteins associated with RNA metabolism and proteostasis
| Gene name | Protein description | FC |
| General putative function |
|---|---|---|---|---|
|
| ||||
|
|
|
|
|
|
|
| Cell division cycle 5-like protein | 2.2 | 0.013 | mRNA splicing |
|
| CD2 antigen cytoplasmic tail-binding protein 2 | 1.8 | 0.072 | mRNA splicing |
|
| Heterogeneous nuclear ribonucleoprotein (hnRNP) A0 | 2.0 | 0.076 | RNA-binding protein |
|
| Heterogeneous nuclear ribonucleoprotein (hnRNP) D0 | 1.4 | 0.029 | RNA-binding protein |
|
| Heterogeneous nuclear ribonucleoprotein (hnRNP) D-like | 2.1 | 0.024 | RNA-binding protein |
|
| Heterogeneous nuclear ribonucleoprotein (hnRNP) H2 | 1.5 | 0.034 | RNA-binding protein |
|
| Heterogeneous nuclear ribonucleoprotein (hnRNP) L | 1.8 | 0.077 | RNA-binding protein |
|
|
|
|
|
|
|
| Nucleolar RNA helicase 2 | 2.9 | 0.009 | rRNA transcription, processing, and modification |
|
| Heterochromatin protein 1-binding protein 3 | 2.1 | 0.057 | Transcription regulation |
|
| ||||
|
| Protein IMPACT | 1.8 | 0.002 | Translational activation |
|
| Ubiquitin carboxyl-terminal hydrolase isozyme L5 | 1.6 | 0.037 | Deubiquitylation |
|
| Proteasomal ubiquitin receptor ADRM1 | 3.0 | 0.007 | Enhances Uchl5 |
|
| Ubiquitin-like domain-containing CTD phosphatase 1 | 3.5 | 0.068 | Nuclear proteasome activity |
|
| Alpha-crystallin B chain | −2.1 | 0.003 | Chaperone-like activity |
|
| Heat shock protein beta-1 (HspB1) | −1.7 | 0.020 | Chaperone for protein folding maintenance |
|
|
|
|
|
|
|
| Heat shock protein HSP 90-alpha (HSP90α) | −1.5 | 0.058 | ATP-dependent chaperone |
|
| Heat shock protein HSP 90-beta (HSP90β) | −1.4 | 0.056 | ATP-dependent chaperone |
|
| Heat shock 70 kDa protein 4 | −1.5 | 0.067 | Molecular chaperone |
|
| Activator of 90 kDa heat shock protein ATPase homolog 1 | −1.3 | 0.059 | Chaperone binding |
Significant proteins (p < 0.1) that are associated with either RNA metabolism (top half of table) or proteostasis (bottom half of table) are listed by gene name (first column), followed by the full protein name (second column). For each protein, the fold change (“FC”) and p-value is provided (third and fourth columns, respectively), along with a brief description regarding the putative function of that protein (“General Putative Function”). Information regarding the putative function of each protein was obtained from the UniProtKB/Swiss-Prot database [131]. Gene names with an asterisk (“*”) indicates that the corresponding gene was also identified as significantly, differentially expressed (DE) in the transcriptome by a q < 0.05. Bold proteins indicate that the respective protein has been previously identified as a RTT hit and/or MeCP2 interacting protein (see text for references)
Differentially abundant proteins associated with metabolism and S-adenosylmethionine-dependent methylation
| Gene name | Protein description | FC |
| General putative function |
|---|---|---|---|---|
|
| ||||
|
| Lanosterol synthase | 8.1 | 0.016 | Cholesterol synthesis |
|
|
|
|
|
|
|
| Very long-chain specific acyl-CoA dehydrogenase, mitochondrial | 1.6 | 0.045 | Fatty acid oxidation |
|
|
|
|
|
|
|
| Cytosolic acyl coenzyme A thioester hydrolase | − 1.3 | 0.087 | Fatty Acyl-CoA biosynthesis |
|
| ||||
|
|
|
|
|
|
|
| Sepiapterin reductase | − 1.5 | 0.083 | Tetrahydrobiopterin synthesis |
|
| Amine oxidase [flavin-containing] A | 1.4 | 0.096 | Degradation of monoamines |
|
| ||||
|
|
|
|
|
|
|
| Spermidine synthase (SPDSY) | − 1.5 | 0.024 | Polyamine synthesis |
|
| Monofunctional C1-tetrahydrofolate synthase, mitochondrial | 1.7 | 0.054 | Folate synthesis |
Significant proteins (p < 0.1) are listed by gene name (first column), followed by the full protein name (second column). For each protein, the fold change (“FC”) and p-value is provided (third and fourth columns, respectively), along with a brief description regarding the putative function of that protein (“General Putative Function”). Information regarding the putative function of each protein was obtained from the UniProtKB/Swiss-Prot database [131]. Gene names with an asterisk (“*”) indicates that the corresponding gene was also identified as significantly, differentially expressed (DE) in the transcriptome by a q < 0.05, while genes with a pound sign (“#”) indicate that gene was identified as DE in the transcriptome by a q < 0.01. Bold proteins indicate that the respective protein has been previously identified as a RTT hit and/or MeCP2 interacting protein
Fig. 4Transcriptome-proteome expression correlation in Mecp2 cortex. a. Overall gene-protein expression correlation. Detected genes (7026) from the RNA-Seq data set were matched against detected proteins (4789) from the proteomics data set, resulting in a total of 3780 gene-protein matches. Each individual gene-protein match is plotted by gene fold change expression (x-axis, Mecp2 /WT) and its corresponding protein fold change expression (y-axis, Mecp2 /WT). Pearson’s R reports a correlation of 0.12. b. Significant gene and significant protein expression correlation. Out of the 3780 detected gene-protein matches, only 35 have both a significant gene (q < 0.05) and corresponding significant protein (p < 0.1) match. Each match is plotted by gene fold change expression (x-axis, Mecp2 /WT) and its corresponding protein fold change expression (y-axis, Mecp2 /WT). Pearson’s R reports a correlation of 0.74
List of significant genes with a significant protein
| Gene/protein | UniProt accession | Gene fold change | Protein fold change | RTT hit? | Putative function |
|---|---|---|---|---|---|
|
| Q9Z2D6 | − 0.66 | − 10.6 | Yes1, 2 | DNA binding protein; transcriptional regulator |
|
| Q8VCT3 | − 0.78 | − 3.47 | Yes2 | Peptide catabolic process |
|
| P08003 | − 0.71 | − 2.22 | Yes2,3 | ER protein processing |
|
| Q61699 | − 0.80 | − 2.13 | Yes4 | Protein chaperone |
|
| Q8VCT3 | − 0.78 | − 2.02 | Yes2 | Peptide catabolic process |
|
| P24549 | − 0.67 | − 1.95 | Yes2 | Retinal dehydrogenase |
|
| P70441 | − 0.74 | − 1.79 | Yes3 | Scaffold protein |
|
| Q64676 | − 0.79 | − 1.76 | --- | Sphingolipid metabolism |
|
| Q8BVI4 | − 0.74 | − 1.68 | Yes2 | Monoamine metabolism (tetrahydrobiopterin biosynthesis) |
|
| Q91ZJ5 | − 0.73 | − 1.67 | Yes2 | Glycogen synthesis |
|
| P00920 | − 0.72 | − 1.61 | --- | Reversible hydration of carbon dioxide |
|
| Q9QYS2 | − 0.63 | − 1.6 | Yes2 | Protein coupled glutamate receptor |
|
| P34914 | − 0.71 | − 1.59 | Yes2 | Cholesterol synthesis |
|
| Q8BGZ1 | − 0.69 | − 1.51 | Yes2,3 | Calcium binding |
|
| Q64105 | − 0.77 | − 1.47 | --- | Monoamine metabolism (tetrahydrobiopterin metabolic process) |
|
| P59808 | − 0.82 | − 1.44 | --- | p38 MAPK and NIK/NF-kappaB signaling |
|
| P14211 | − 0.80 | − 1.43 | Yes3 | ER calcium-binding protein |
|
| Q9D2R0 | − 0.79 | − 1.37 | Yes2 | Fatty acid and lipid metabolism |
|
| O08530 | − 0.78 | − 1.28 | --- | Cell-cell adhesion |
|
| P70441 | − 0.74 | − 1.24 | Yes3 | Scaffold protein |
|
| Q9JKN6 | 1.26 | 1.28 | --- | RNA/mRNA-binding protein |
|
| Q8CGQ8 | 1.58 | 1.3 | Yes3 | K+-dependent Na+/Ca2+ exchanger |
|
| P97382 | 1.27 | 1.43 | --- | Voltage gated K+ channel |
|
| Q8BMF3 | 1.28 | 1.46 | --- | Mitochondrial NADP(+)-dependent malic enzyme |
|
| Q62351 | 1.32 | 1.48 | --- | Cell surface receptor required for iron uptake |
|
| Q91WG7 | 1.42 | 1.51 | Yes2,3 | Lipid metabolism |
|
| P63318 | − 0.82 | 1.66 | Yes3 | Protein kinase; LTP |
|
| P27671 | 1.32 | 1.68 | --- | Stimulates the dissociation of GDP from RAS protein |
|
| Q8BZ94 | 1.50 | 1.88 | Yes3 | DNA/RNA and p53 binding |
|
| P46735 | 1.32 | 2.02 | Yes2,3 | Actin binding; calmodulin binding |
|
| Q64378 | 1.81 | 2.02 | Yes2–6 | Immunoregulation; protein trafficking |
|
| P0C7L0 | 1.23 | 2.26 | --- | Cytoskeleton organization |
|
| Q61704 | 1.52 | 2.37 | --- | Anchor protein between hyaluronan and matrix proteins |
|
| P97785 | 1.26 | 2.64 | Yes2,3 | RET and RAF/MAP kinase signaling cascade |
|
| Q8BJS4 | 1.42 | 2.86 | Yes3 | Nuclear envelope protein |
Significant genes were defined as having a q < 0.05, and significant proteins having a p < 0.1. Gene and protein names are provided along with the UniProt accession number, gene and protein fold change expression values (respectively), whether the gene-protein match has been previously identified as a RTT hit, and the respective putative function according to the UniProtKB/Swiss-Prot database [131]. DE gene-protein matches with an asterisk (*) represent genes/proteins identified in supplemental material from Chahrour et al. [4] and Veeraragavan et al. [34]. Gene-protein matches with “(Ac)” denote that the corresponding protein had a significant acetylation PTM, and a “(P)” denotes a significant phosphorylation PTM. For the RTT hit column, a “---” indicates the match has not been identified as a RTT hit prior to this study. Superscripts in the RTT hit column denote references for matches identified as a RTT hit and are as follows: (1) Amir et al. [2], (2) Chahrour et al. [4], (3) Veeraragavan et al. [34], (4) Nuber et al. [21], (5) Urdinguio et al. [24], and (6) Lin et al. [33]
Fig. 5Transcriptome and proteome general cellular and molecular pathways in Mecp2 cortex. Selected biological pathways that are both shared and unique to the transcriptome (R) and proteome (P) data sets are grouped by broad categories associated with general cellular/molecular function: (a) cell cycle, (b) cellular components/structure/general function, and (c) lipids and metabolism. Respective pathways are plotted by −Log10 p-value, where a value of 1.3 or greater represents a p-value of at least p < 0.05; non-significant (NS) pathways are denoted in white. The direction of gene (gray bars) and/or protein (black bars) expression changes associated with each respective pathway are represented as a percent expression to the right of the heat map, where a value greater than 0 indicates increased expression and a value less than 0 indicates decreased expression. The percent expression was calculated by taking the number of genes/proteins with significant increased or decreased expression divided by the total sum of significant genes/proteins assigned to the respective pathway. Note in part B, the abbreviation “AC” in the “G-protein signaling, AC inhibiting pathway” stands for “adenylate cyclase.” For a comprehensive list of all pathways and associated significant genes/proteins identified in both the transcriptome and proteome data sets, see Additional file 10
Fig. 6Transcriptome and proteome cell type-specific pathways in Mecp2 cortex. Selected biological pathways that are both shared and unique to the transcriptome (R) and proteome (P) data sets are grouped by broad categories associated with CNS and general cell type specific function: (a) neuronal functions, (b) glial functions, (c) immunological/inflammation functions, and (d) blood/blood vessel/vasculature. Respective pathways are plotted by −Log10 p-value, where a value of 1.3 or greater represents a p-value of at least p < 0.05; non-significant (NS) pathways are denoted in white. The direction of gene (gray bars) and/or protein (black bars) expression changes associated with each respective pathway are represented as a percent expression to the right of the heat map, where a value greater than 0 indicates increased expression and a value less than 0 indicates decreased expression. The percent expression was calculated as described in Fig. 5. Note in part B, the abbreviation “EAE” stands for “experimental autoimmune encephalomyelitis”; in part C, the abbreviation “APCs” in the pathways “Immune response of APCs” and “Cell movement of APCs” stands for “antigen presenting cells.” For a comprehensive list of all pathways and associated significant genes/proteins identified in both the transcriptome and proteome data sets, see Additional file 10