| Literature DB >> 23705625 |
James L Chitwood1, Gonzalo Rincon, German G Kaiser, Juan F Medrano, Pablo J Ross.
Abstract
BACKGROUND: Use of RNA-Seq presents unique benefits in terms of gene expression analysis because of its wide dynamic range and ability to identify functional sequence variants. This technology provides the opportunity to assay the developing embryo, but the paucity of biological material available from individual embryos has made this a challenging prospect.Entities:
Mesh:
Year: 2013 PMID: 23705625 PMCID: PMC3668197 DOI: 10.1186/1471-2164-14-350
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Workflow of bovine blastocyst sequencing preparation. RNA from Day 7 blastocysts was extracted, converted to cDNA and amplified before being processed with the Nugen Encore Library System. Electropherograms on right depict quality controls performed at each of the steps.
Figure 2Galaxy QC outputs for a representative sample. Read quality across the sequence was acceptable on average but G/C content was abnormally high at the first 9 bp of the 5′ end (lower left box). Trimming of sequences (two boxes on right) removed this bias.
Effect of end trimming on read alignment to genome
| 67.9 | 70.1 | 70.0 | 67.7 | 70.0 | 69.1 ± 0.5 | ||
| 80.7 | 82.8 | 82.5 | 81.7 | 82.5 | 82.0 ± 0.4 | ||
| 69.7 | 72.6 | 71.9 | 69.7 | 72.0 | 71.2 ± 0.6 | ||
| 72.0 | 74.6 | 73.9 | 71.8 | 74.0 | 73.3 ± 0.6 | ||
| 83.4 | 84.2 | 85.0 | 84.3 | 84.9 | 84.4 ± 0.3 | ||
| 88.0 | 89.4 | 89.6 | 88.7 | 89.3 | 89.0 ± 0.3 | ||
Number and proportion of reads mapped to different genome annotations and different genome builds
| Number of genes in annotation | 24,359 | 25,577 | 25,670 | 24,616 |
| Number of transcripts in annotation | 19,757 | 20,681 | 26,977 | 22,118 |
| Mean number of reads processed per embryo (Million reads ± SEM) | 37.6 ± 1.1 | 37.6 ± 1.1 | 37.6 ± 1.1 | 37.6 ± 1.1 |
| Total number of reads mapped to annotated genes (Million reads ± SEM) | 22.9 ± 0.6 | 24.6 ± 0.6 | 27.0 ± 0.7 | 28.9 ± 0.8 |
| % total reads mapped | 60.9% | 65.6% | 71.9% | 76.9% |
| Number of uniquely mapped reads (Million reads ± SEM) | 18.0 ± 0.5 | 18.2 ± 0.6 | 23.8 ± 0.7 | 25.5 ± 0.7 |
| % reads uniquely mapped | 48.1% | 48.4% | 63.2% | 67.7% |
Figure 3Distribution of mapped reads to different transcript types and gene regions. Top graph indicates the proportion of transcripts belonging to different RNA species. Numbers within pie chart indicate number of reads while numbers in parenthesis indicate number of transcripts. Bottom graph shows the read distribution within protein coding genes.
Figure 4The heat map shows the segregation of samples by gene expression. Blastocyst (BL1 to BL5) and milk somatic cell (SC1 and SC2) transcriptomes are distinguished clearly from one another while grouping highly within their own sample types. Average correlation of RPKM for all annotated genes reflected this within BL (r > 0.97) and between sample groups (r < .21) and plotting of log-transformed RPKM values showed a much less linear relationship between milk somatic cells and blastocyst samples.
Summary of GO enriched clusters in groups of genes sorted by expression level in blastocysts
| 1 | Ribosome | Lumen | Lumen | Lumen | DNA repair/Stress response |
| 2 | Oxidative Phosphorylation | Cytoskeleton | Chromosome structure | Zinc finger PHD | Zinc/ion binding |
| 3 | Ubiquitination | Protein catabolism | Protein complexes | Transcriptional regulation | Protein catabolism |
| 4 | Spliceosome | Chromosome structure | Mitochondrial subunits | Mitochondrial matrix | Protein transport |
| 5 | RNA binding | Nucleic acid binding | Nucleic acid binding/Phospho-rylation | Ankyrin repeats | tRNA aminoacylation |
| 6 | Electron transport | Ubiquitination | Endoplasmic reticulum | Protein/nucleotide interactions | Zinc finger CH2 |
| 7 | Protein localization | Protein complex synthesis | Cell cycle | Microtubules | Cell death |
| 8 | Protein biosynthesis | Cell cycle | Cellular membranes | Mitochondrial membranes | DNA repair/Polymerase |
| 9 | Ribosomal subunits | Mitochondrial components | Ankyrin repeats | Phosphorylation | Suppression of differentiation |
| 10 | Protein folding | RNA splicing | tRNA | Zinc finger CH2 | Nucleic acid binding |
GO biological process representative of top ten overrepresented functional annotation clusters among genes overexpressed in embryos versus somatic cells at P < 0.001 and 10 FC
| Chordate embryonic development | 27 | 4.6E-6 | |
| Blastocyst development | 6 | 0.007 | |
| Pattern specification process | 16 | 0.002 | |
| Negative regulation of gene expression | 21 | 0.001 | |
| Placenta development | 9 | 8.5E-4 | |
| Cellular component morphogenesis | 23 | 2.1E-5 | |
| Stem cell maintenance | 5 | 0.001 | |
| Cell adhesion | 24 | 0.005 | |
| Establishment of organelle localization | 5 | 0.007 | |
| Regulation of glucose metabolic process | 4 | 0.003 |
GO molecular functions representative of top five overrepresented functional annotation clusters among genes overexpressed in embryos versus somatic cells at P < 0.001 and 10 FC
| Transcription factor activity | 33 | 0.002 | |
| ATPase activity, coupled | 14 | 0.003 | |
| Transmembrane receptor protein tyrosine kinase activity | 7 | 0.002 | |
| Folic acid binding | 3 | 0.028 | |
| Helicase activity | 8 | 0.037 |
Figure 5Allelic expression analysis in single bovine embryos. a) Number of SNP detected in each sample. Left bar indicates the total number of SNP detected. Right bar, represents the number of SNP that matched dbSNP variants by position. More than 99% of those matches agreed on nucleotide identity. b) Allelic bias in heterozygous SNP expression. Number of SNP expressed at different major allelic frequencies. Average number of validated (left bar) and putative (right bar) SNPs are presented separate. The number of SNP with statistically significant imbalance (FDR < 0.01 by X2 test) is indicated within their respective group (shaded area). The 85-95% frequency only includes validated SNPs.
List of known imprinted genes with expressed SNP
| Bovine | ENSBTAG00000003212 | 3 | 82, 89*, 86* | |
| Bovine | ENSBTAG00000010128 | 1 | 59 | |
| Bovine | ENSBTAG00000045928 | 1 | 82 | |
| Murine | ENSBTAG00000002487 | 1 | 69 | |
| Murine | ENSBTAG00000004875 | 1 | 91* | |
| Human | ENSBTAG00000002813 | 5 | 64, 71, 75, 78, 67 | |
| Human | ENSBTAG00000003209 | 2 | 89*, 86* | |
| Human | ENSBTAG00000017245 | 2 | 67, 68 | |
| Human | ENSBTAG00000018185 | 1 | 84* |
* Chi-squared p < 0.01 difference from 50:50 expression of both alleles.