| Literature DB >> 33282601 |
Heba Yasin1, Robert Stowe2, Chi Kin Wong3, Puthen Veettil Jithesh1, Farah R Zahir3.
Abstract
In 2019, we confirmed that the haploinsufficiency of CHD8 does indeed cause the novel syndromic neurodevelopmental disease we first discovered a dozen years before. Here, we report the first whole transcriptome RNAseq gene expression profiling for a patient with this new syndrome, as a preliminary exploration of potential pathophysiological mechanisms. We compared our patient transcriptome profile with that of all publicly available RNAseq datasets from human cellular models including neuronal progenitor cells, neurons and organoids. We compared differential gene expression profiles overall and conducted phenotype-informed data filtration based on the characteristic syndrome presentation. We found that concordance among differential gene expression profiles was poor across all datasets. Nevertheless, remarkably, we show that the patient blood differential gene expression profile most resembled that of the neuronal cell model, a finding that encourages further transcriptome profiling using patient blood samples. In addition, our custom phenotype-informed analyses yielded important, differentially expressed syndrome pathophysiology target genes. Finally, we note that genes dysregulated due to CHD8 heterozygous deletion are linked to known neurological as well as oncological pathways.Entities:
Keywords: autism spectrum disorder; chd8; intellectual disability; neurodevelopmental disorders; rnaseq
Year: 2020 PMID: 33282601 PMCID: PMC7710346 DOI: 10.7759/cureus.11571
Source DB: PubMed Journal: Cureus ISSN: 2168-8184
Figure 1Whole transcriptome RNAseq heatmap for differential expression between patient and control samples
Figure 2A – log2 fold change scatter plot showing differential gene expression between patient and control across all 5,388 detected transcripts. B – 23 genes that showed a log2 fold change ± 3 between patient and control
DEGs in patients that are expressed with a log2 fold change of either <= -3 or <=3
DEGs: differentially expressed genes
| GENE | LOG2 (FOLD_CHANGE) | P_VALUE | Q_VALUE | REGULATION IN PATIENT | OTHER DATASETS GENE IS A DEG IN |
| USP32P2 | -6.312 | 5.00E-05 | 0.000316967 | down | None |
| ADAMTS2 | -4.49852 | 0.00025 | 0.00129193 | down | Wang_Neurons |
| HIST1H1E | -4.31541 | 5.00E-05 | 0.000316967 | down | None |
| THBS1 | -4.04863 | 5.00E-05 | 0.000316967 | down | Wilkinson, Wang_NPC and Wang_Neurons |
| SULT1A2 | -3.87789 | 0.00655 | 0.019417 | down | None |
| HIST1H1D | -3.83365 | 0.00015 | 0.000841735 | down | Wilkinson |
| RMRP | -3.57236 | 5.00E-05 | 0.000316967 | down | None |
| RPPH1 | -3.5139 | 5.00E-05 | 0.000316967 | down | None |
| RAVER2 | -3.42626 | 0.0004 | 0.00193051 | down | None |
| ANKRD34B | -3.41265 | 0.00035 | 0.00172049 | down | None |
| WBSCR27 | -3.16821 | 0.0077 | 0.0223024 | down | None |
| SCARNA2 | -3.09253 | 5.00E-05 | 0.000316967 | down | None |
| PRR29 | 3.06369 | 0.00145 | 0.00555477 | up | None |
| TFCP2L1 | 3.26474 | 5.00E-05 | 0.000316967 | up | None |
| VMO1 | 3.3094 | 5.00E-05 | 0.000316967 | up | None |
| VSIG2 | 3.55333 | 0.0196 | 0.0477048 | up | None |
| HERC2P3 | 3.63294 | 5.00E-05 | 0.000316967 | up | None |
| SH3RF2 | 3.70726 | 0.00385 | 0.0124584 | up | None |
| GRIK4 | 3.95099 | 0.0002 | 0.00107189 | up | None |
| TM4SF19 | 4.23945 | 0.00115 | 0.00461064 | up | None |
| TACSTD2 | 4.99907 | 0.0026 | 0.00902676 | up | None |
| CYP4F29P | 5.07494 | 0.0001 | 0.000592077 | up | None |
| USP32P1 | 5.78815 | 5.00E-05 | 0.000316967 | up | None |
Top 10 differentially expressed pathways in the patient versus normal control
| Ingenuity Canonical Pathways | -log (p-value) |
| Protein Ubiquitination Pathway | 9.31 |
| mTOR Signaling | 8.58 |
| NF-κB Signaling | 7.94 |
| TNFR1 Signaling | 7.63 |
| Role of PKR in Interferon Induction and Antiviral Response | 7.4 |
| TNFR2 Signaling | 7.19 |
| TWEAK Signaling | 7.03 |
| Germ Cell-Sertoli Cell Junction Signaling | 6.99 |
| IL-8 Signaling | 6.48 |
| EIF2 Signaling | 6.39 |
Patient DEG overlap with PSG details
DEG: differentially expressed gene; PSG: phenotype seed gene; DD: developmental delay; ID: intellectual disability; ASD: autism spectrum disorder; GI: gastrointestinal
| Phenotype | Patient DEGs that are PSGs | Total PSG# | Percentage PSGs that are patient DEGs | Percentage of total patient DEGs (5388) that are PSGs |
| Facial dysmorphisms | 212 | 832 | 25.50% | 3.90% |
| DD | 240 | 833 | 28.80% | 4.50% |
| ID | 327 | 1236 | 26.50% | 6.10% |
| ASD | 113 | 482 | 23.40% | 2.10% |
| Speech delay | 124 | 494 | 25.10% | 2.30% |
| Macrocephaly | 103 | 378 | 27.20% | 1.90% |
| Pes planus | 62 | 181 | 34.30% | 1.20% |
| Hypotonia | 258 | 937 | 27.50% | 4.80% |
| GI | 206 | 740 | 27.80% | 3.80% |
| Sleep | 90 | 352 | 25.60% | 1.70% |
| Anxiety | 101 | 372 | 27.20% | 1.90% |
| Dental | 235 | 824 | 28.50% | 4.40% |
Figure 3Bar plot showing the percentage of DEGs that appear as PSGs in each study, by phenotype
DEGs: differentially expressed genes; PSGs: phenotype seed genes
Patient DEGs that are frequent PSGs
DEGs: differentially expressed genes; PSGs: phenotype seed genes
| Patient DEGs | Total times found out of the 12 phenotypes | Log2 (fold change) | Regulation |
| CDKN1C | 11 | 1.10845 | up |
| MECP2 | 11 | 0.37637 | up |
| NFIX | 10 | 0.413196 | up |
| LIMK1 | 9 | 0.563392 | up |
| FLNA | 9 | 0.535682 | up |
| BCL7B | 9 | 0.419455 | up |
| WBSCR16 | 9 | 0.30959 | up |
| FLII | 9 | 0.23419 | up |
| COG6 | 9 | -0.334932 | down |
| NRAS | 9 | -0.34015 | down |
| TCF4 | 9 | -0.351494 | down |
| ATRX | 9 | -0.377614 | down |
| KRAS | 9 | -0.427387 | down |
| PTEN | 9 | -0.524188 | down |
| DHFR | 9 | -0.565876 | down |
| WBSCR27 | 9 | -3.16821 | down |
Total DEG counts
DEG: differentially expressed gene
| Model | Number of DEGs |
| Patient blood | 5388 |
| Neurons_ Wang 2015 | 3289 |
| NPC_ Wilkinson | 1812 |
| NPC_ Wang 2015 | 1248 |
| Organoids_ Wang 2017 | 838 |
| NPC_ Sugathan | 369 |
| DEGs Average | 2157 |
| DEGs Median | 1530 |
DEGs that are PSGs and found in at least five of the six samples, arranged according to phenotype
DEGs: differentially expressed genes; PSGs: phenotype seed genes; DD: developmental delay; ID: intellectual disability; ASD: autism spectrum disorder; GI: gastrointestinal
| Macro | Face | DD | ID | ASD | Speech | Anxiety | GI | Dental | PP | Sleep | Hypotonia | Non-phenotype | |
| Patient | TCF4 | TCF4 | TCF4 | TCF4 RGMB | TCF4 | NFIA | TCF4 NFIA | TCF4 | TCF4 | TCF4 NFIA | CELF2, EML6, FBXL19, PODXL, RIMS3 | ||
| Wang Organoid | TCF4 | TCF4 | TCF4 | TCF4 RGMB | TCF4 | NFIA | TCF4 NFIA | TCF4 | TCF4 | TCF4 NFIA | CELF2, EML6, FBXL19, PODXL | ||
| Wang Neurons | TCF4 | TCF4 | TCF4 | TCF4 RGMB | TCF4 | NFIA | TCF4 NFIA | TCF4 | TCF4 | TCF4 NFIA | CELF2, EML6, FBXL19, PODXL, RIMS3 | ||
| Wang NPC | TCF4 | TCF4 | TCF4 | TCF4 RGMB | TCF4 | NFIA | TCF4 NFIA | TCF4 | TCF4 | TCF4 NFIA | CELF2, EML6, FBXL19, PODXL, RIMS3 | ||
| Wilkinson | TCF4 | TCF4 | TCF4 | TCF4 RGMB | TCF4 | NFIA | TCF4 NFIA | TCF4 | TCF4 | TCF4 NFIA | CELF2, EML6, FBXL19, PODXL, RIMS3 | ||
| Sugathan | EML6, FBXL19, RIMS3 |
Figure 4Six-way Venn diagram showing an overlap between all DEGs across the six studies
DEGs: differentially expressed genes
An overview of laboratory methods and bioinformatics pipelines used by each dataset presented in this paper
| Dataset | Patient | Sugathan NPCs | Wilkinson NPCs | Wang NPCs and Neurons | Wang Organoids |
| GEO accession no. | GSE61492 | GSE71594 | GSE85417 | ||
| Cell type | PBMCs | Induced pluripotent stem cells (iPSCs) derived neural progenitor cells (NPCs) from human control (GM8330) | Commercial (SK-N-SH) human NPCs from neuroblastoma cells | iPSCs and neurons developed from human control fibroblasts obtained from a skin biopsy. | Differentiated cerebral organoids derived from developed iPSC-derived NPCs from Wang et al., 2015. |
| Samples | 3 samples 3 controls | 14 samples 8 controls | 4 samples 4 controls | NPCs: 2 samples, 2 controls. Neurons: 2 samples, 2 controls. | 4 samples 2 controls |
| Gene knockdown method | Native | Transduction by HIV-based lentiviral vectors carrying shRNAs targeting the CHD8 gene. | Transfection by siRNA silencer selective negative control and siRNA targeting the CHD8 gene | Transduction with CRISPR/Cas9 vectors containing two separate CRISPR sgRNA sequences to target the N-terminal of CHD8 protein to generate truncated mutations | |
| NGS | Total RNAseq | Strand-specific dUTP RNAseq. 40.6 million reads per line at a final depth ~45 million for total reads. | RNAseq focusing on long noncoding RNAs. Average of 45 million per knockdown replicate and an average of 40 million reads per control replicate. | Total RNAseq | Total RNAseq |
| Number of DEGs | 5388 | 369 | 1812 | NPCs: 1248, Neurons: 3289 | 838 |
| Mapping/ alignment: | Gsnap (GrCH37) ensemble (71) | transcriptome :UCSC Hg19,Ensemble GrCH37 (75),Tophat2 | Tophat (hg19) Gencode database, Cufflink used to get FPKM | Kallisto using GENCODE. TPM > 1 in WT samples | |
| DE analysis | DESeq (2 factor model) 1.12.1 | Cuffdiff + GTF file of the transcriptome using default settings | DESeq2 | DESeq2 | |
| Functional analysis | DAVID, ToppGene | DAVID, Disease association protein-protein link evaluator | DAVID, IPA, ToppGene | DAVID, IPA | |