| Literature DB >> 26564201 |
Florian Mertes1,2, Björn Lichtner3, Heiner Kuhl4, Mirjam Blattner5, Jörg Otte6, Wasco Wruck7, Bernd Timmermann8, Hans Lehrach9, James Adjaye10,11.
Abstract
BACKGROUND: Next Generation Sequencing has proven to be an exceptionally powerful tool in the field of genomics and transcriptomics. With recent development it is nowadays possible to analyze ultra-low input sample material down to single cells. Nevertheless, investigating such sample material often limits the analysis to either the genome or transcriptome. We describe here a combined analysis of both types of nucleic acids from the same sample material.Entities:
Mesh:
Substances:
Year: 2015 PMID: 26564201 PMCID: PMC4643517 DOI: 10.1186/s12864-015-2025-z
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1Method overview for combined sequencing of mRNA and whole genome DNA. a Schematics for sequencing of ultra-low input DNA and mRNA from a single cell-colony sample. b OCT3/4, NANOG and DAPI staining of cultured human embryonic stem cells. Rectangular cuts indicating size of sample applied (150–200 cells each). Scale bar =200 μm
Fig. 2Comparison of RNAseq data in terms of transcript identification. a Read coverage across transcripts with coverage for transcripts of sizes 0 kb-1 kb, 1 kb-2 kb, 2 kb-3 kb, 3 kb-4 kb, 4 kb-5 kb and 5 kb-15 kb. Transcripts were divided into 40 equally sized bins and coverage was averaged over all transcripts of the dedicated size interval. Mean values of mRNA sample1 and sample2 are displayed, error bars denoting standard deviation of samples. b Boxplot of total length of sequenced Refseq genes for RNAseq sample 1 and sample 2. c Correlation plot for RNAseq sample 1 and sample 2 in terms of FPKM values for expressed Refseq genes
Fig. 3Overlap between microarray and sequencing measurements. a.1 Venn diagram displaying overlap of significant genes found in the Illumina microarray and next-generation sequencing (NGS) experiments. Genes expressed with Illumina detection p-value < 0.05 in the microarray experiment or FPKM > 0.5 in the NGS experiments were considered significant. Genes were compared via HGNC gene symbol annotation. a.2 Venn diagram displaying overlap of significant CPDB categories found in the Illumina microarray and deep sequencing experiments. Significant genes from microarray and NGS experiments were analysed with the CPDB functional annotation tool and categories with a q-value <0.05 were considered significant. b Pluripotency associated genes from microarray experiment and NGS experiments of sample1 and sample2 were compared. Logarithmic values (base 2) of Illumina average signals and FPKM values from expressed genes (detection p-value < 0.05, FPKM > 0.5) were quantile normalized and subjected to cluster analysis via R heatmap2 function using Euclidean distance as distance measure
Fig. 4Read coverage of genome sequencing and RNA-seq data. a Read coverage of genomic DNA is visualized in a Manhattan plot showing coverage of 10 kb regions over chromosomes. b Example of mapped reads for selected stem cell marker genes (NANOG, POU5F1, SOX2,) and housekeeping gene (ACTB) for both RNA-seq samples (light grey: RNA-seq sample1 and medium grey: RNA-seq sample2) and WGA DNA sample (dark grey). The aligned RNA-seq reads resemble the exon structure depicted below
Detected isoforms of selected human embryonic stem cell marker genes
| Gene | Nearest ref ID | Locus | Length | Sample1 FPKM | Sample2 FPKM | Number of isoforms |
|---|---|---|---|---|---|---|
| DNMT3B | 212.59 | 132.05 | 4/7 (13) | |||
| NM_006892 | chr20:31350190-31397167 | 789 | 9.71 | 4.87 | ||
| NM_006892 | chr20:31350190-31397167 | 721 | 0.00 | 3.00 | ||
| NM_006892 | chr20:31350190-31397167 | 3463 | 18.81 | 0.00 | ||
| NM_006892 | chr20:31350190-31397167 | 4131 | 0.00 | 21.60 | ||
| NM_006892 | chr20:31350190-31397167 | 4203 | 175.74 | 0.54 | ||
| NM_006892 | chr20:31350190-31397167 | 4336 | 8.33 | 65.00 | ||
| NM_175848 | chr20:31350190-31397167 | 4276 | 0.00 | 29.39 | ||
| NM_175849 | chr20:31350190-31397167 | 4087 | 0.00 | 7.65 | ||
| NANOG | 3.91 | 6.62 | 1/1 (1) | |||
| NM_024865 | chr12:7941994-7948655 | 2089 | 3.91 | 6.62 | ||
| POU5F1 | 507.35 | 484.58 | 3/2 (21) | |||
| NM_001173531 | chr6:31132113-31138451 | 1247 | 156.01 | 0.00 | ||
| NM_002701 | chr6:31132113-31138451 | 1401 | 348.96 | 483.12 | ||
| NM_203289.6 | chr6:31132113-31138451 | 1733 | 2.38 | 1.46 |
Gene expression is shown in FPKM values including detected isoforms. Number of detected isoforms for RNA-seq sample1 and sample2 are separated by dash; total number of known isoforms in brackets