| Literature DB >> 26981392 |
Ruijie Liu1, Kelan Chen2, Natasha Jansz2, Marnie E Blewitt2, Matthew E Ritchie3.
Abstract
Smchd1 is an epigenetic repressor with important functions in healthy cellular processes and disease. To elucidate its role in transcriptional regulation, we performed two independent genome-wide RNA-sequencing studies comparing wild-type and Smchd1 null samples in neural stem cells and lymphoma cell lines. Using an R-based analysis pipeline that accommodates observational and sample-specific weights in the linear modeling, we identify key genes dysregulated by Smchd1 deletion such as clustered protocadherins in the neural stem cells and imprinted genes in both experiments. Here we provide a detailed description of this analysis, from quality control to read mapping and differential expression analysis. These data sets are publicly available from the Gene Expression Omnibus database (accession numbers GSE64099 and GSE65747).Entities:
Keywords: Epigenetics; RNA-sequencing; Sample variability; voom
Year: 2015 PMID: 26981392 PMCID: PMC4778621 DOI: 10.1016/j.gdata.2015.12.027
Source DB: PubMed Journal: Genom Data ISSN: 2213-5960
Fig. 1Quality assessment at the read level. Boxplots of base-calling Phred scores at different base positions across all the reads in representative libraries from NSC RNA-seq (A) and Lymphoma cell line RNA-seq (B) experiments generated by FastQC. The box represents 25% and 75% quantiles of the scores with median score marked by the red line. Whiskers mark the 10% and 90% quantiles and blue lines show the mean quality score.
Fig. 2Quality assessment at the sample level. Multi-dimensional scaling (MDS) plots of the NSC (A) and Lymphoma (B) data sets, with samples numbered and color coded by genotype. Distances correspond to the mean log2fold-change for the top 500 genes that best discriminate each pair of samples. In both experiments, one or more samples cluster poorly with replicates of the same genotype, motivating the use sample weights (D) in the regression modeling to detect differential expression. Panel C shows a scatterplot of the mean–variance relationship in abundance estimated from biological replicates from the NSC data set using the voom method. Panel D shows the sample weights estimated for the NSC data set that are combined with voom's abundance-related weights in the voomWithQualityWeights function and used in the linear model analysis to detect differentially expressed genes.
Fig. 3Summary of the RNA-seq results. Volcano plot representation of differential expression analysis of genes in the Smchd1 wild-type versus Smchd1 null comparison for the NSC (A) and Lymphoma RNA-seq (B) data sets. Red and blue points mark the genes with significantly increased or decreased expression respectively in Smchd1 wild-type compared to Smchd1 null samples (FDR < 0.01). The x-axis shows log2fold-changes in expression and the y-axis the log odds of a gene being differentially expressed. In both data sets, Smchd1 is the top ranked gene.
| Specifications | |
|---|---|
| Organism/cell line/tissue | |
| Sex | Male. |
| Sequencer or array type | NSC data: Libraries prepared with the Illumina TruSeq Total Stranded RNA kit and sequenced on an Illumina HiSeq 2000 with Illumina TruSeq SBS Kit v3-HS reagents as 100 bp paired-end reads. |
| Data format | Raw (fastq) and summarized counts. |
| Experimental factors | RNA was obtained from Smchd1 null and wild-type samples. |
| Experimental features | Neural stem cells were derived from E14.5 male embryos. |
| Consent | All animal experiments were carried out in accordance with the Walter and Eliza Hall Institute of Medical Research Animal Ethics Committee guidelines (AEC 2011.027). |
| Sample source location | Melbourne, Australia. |