| Literature DB >> 21915259 |
Mingyan Lin1, Erika Pedrosa, Abhishek Shah, Anastasia Hrabovsky, Shahina Maqbool, Deyou Zheng, Herbert M Lachman.
Abstract
Genome-wide expression analysis using next generation sequencing (RNA-Seq) provides an opportunity for in-depth molecular profiling of fundamental biological processes, such as cellular differentiation and malignant transformation. Differentiating human neurons derived from induced pluripotent stem cells (iPSCs) provide an ideal system for RNA-Seq since defective neurogenesis caused by abnormalities in transcription factors, DNA methylation, and chromatin modifiers lie at the heart of some neuropsychiatric disorders. As a preliminary step towards applying next generation sequencing using neurons derived from patient-specific iPSCs, we have carried out an RNA-Seq analysis on control human neurons. Dramatic changes in the expression of coding genes, long non-coding RNAs (lncRNAs), pseudogenes, and splice isoforms were seen during the transition from pluripotent stem cells to early differentiating neurons. A number of genes that undergo radical changes in expression during this transition include candidates for schizophrenia (SZ), bipolar disorder (BD) and autism spectrum disorders (ASD) that function as transcription factors and chromatin modifiers, such as POU3F2 and ZNF804A, and genes coding for cell adhesion proteins implicated in these conditions including NRXN1 and NLGN1. In addition, a number of novel lncRNAs were found to undergo dramatic changes in expression, one of which is HOTAIRM1, a regulator of several HOXA genes during myelopoiesis. The increase we observed in differentiating neurons suggests a role in neurogenesis as well. Finally, several lncRNAs that map near SNPs associated with SZ in genome wide association studies also increase during neuronal differentiation, suggesting that these novel transcripts may be abnormally regulated in a subgroup of patients.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21915259 PMCID: PMC3168439 DOI: 10.1371/journal.pone.0023356
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
RNA-Seq Summary.
| Down-regulated from day 0 to 10 | Up-regulated from day 0 to 10 | Total number of genes | |
| Total | 5953 | 3055 | 50047 |
| lincRNA | 220 | 228 | 1443 |
| miRNA | 12 | 6 | 1809 |
| Misc. RNA | 18 | 30 | 1190 |
| Proc. transcript | 535 | 639 | 9062 |
| Protein coding | 4060 | 1808 | 21598 |
| pseudogene | 1052 | 319 | 10923 |
| snoRNA | 11 | 5 | 1523 |
| snRNA | 6 | 3 | 1951 |
| rRNA | N/A | 1 | 535 |
| rRNA p. gene | 22 | 2 | 179 |
| scRNA p. gene | 17 | 14 | 787 |
Abbreviations: lincRNA (long intergenic non-coding RNA); miRNA (microRNA); proc. (processed) (processed transcripts also known as lncRNA – long non-coding RNA); Misc. RNA (miscellaneous RNA, also includes mitochondrial RNA, mitochondrial transfer RNA, mitochondrial ribosomal RNA, polymorphic pseudogene); p. gene (pseudogene); snoRNA (small nucleolar RNA); snRNA (small nuclear RNA); rRNA (ribosomal RNA); scRNA (small cytoplasmic RNA).
Number (N) of RNA-Seq reads: Day 0: N of reads [pairs] 269,672,486 [134,836,243]; N of paired-match reads 121,620,928 [60,810,464]; N of single-match reads 39,276,866: Day 10: N of reads [pairs], 222,127,542 [111,063,771]; N of paired-match reads 115,280,074 [57,640,037], N of single-match reads 33,721,936.
Gene Ontology (GO) terms for differentially expressed genes.
| GO TERMS FOR DOWN-REGULATED GENES | ||
| p-value | Benjamani | |
| RNA processing | 6.7E-56 | 3.1E-52 |
| Translation | 1.4E-44 | 3.2E-41 |
| mRNA metabolic process | 9.7E-36 | 1.5E-32 |
| cell cycle | 6.0E-35 | 6.9E-32 |
| RNA Splicing | 1.7E-33 | 1.6E-30 |
| mRNA processing | 7.2E-33 | 5.5E-30 |
| RNP complex biogenesis | 3.0E-31 | 2.0E-28 |
| ribosomal biogenesis | 3.8E-30 | 2.2E-27 |
| mitotic cell cycle | 4.6E-29 | 2.4E-26 |
| cell cycle phase | 2.6E-27 | 1.2E-24 |
| DNA metabolic process | 5.3E-27 | 2.2E-24 |
| Cell cycle phase | 1.2E-23 | 4.6E-21 |
| translational elongation | 1.7E-23 | 6.0E-21 |
| RNA splicing via transester. | 3.1E-23 | 1.0E-20 |
| Nuclear mRNA splicing | 3.1E-23 | 1.0E-20 |
| rRNA metabolic process | 7.3E-23 | 2.2E-20 |
| ncRNA metabolic process | 9.7E-23 | 2.8E-20 |
Figure 1Comparison of fold-changes (log2-transformed) of gene expression between iPSCs and day 10 neurons as measured by RNA-Seq (x-axis) and Affymetrix microarray (y-axis).
Figure 2Validation by RT-PCR and qPCR in unamplified RNAs.
Panel A, top band in each of the 4 sets is a beta-actin control; bottom band, specific transcripts. Lane 1, iPSCs; lane 2, day 14 neurons; lane 3, day 27 neurons; lanes 4 and 5, fetal brain (9 and 20 weeks, respectively); lanes 6 and 7, adult prefrontal cortex; lane 8, no RT (negative control). Panel B, same as A, but no 9 week fetal sample (lane 4 is 20 week fetal brain and lanes 5 and 6 are adult prefrontal cortex. Panel C, qPCR for MIAT and CRNDE in iPSCs (white bar) and day 14 neurons (black bar). Relative changes in gene expression were calculated using the 2−ΔΔCt method with β2-microglobulin (β2M) as a reference gene. Mean increase in transcripts in neurons was statistically significant (p<0.05). Error bar is SD based on 3 or 4 replicates, each done in triplicate.
Figure 3Splice variants in NRXN1 (panel A) and NRG1 (panel B).
Asterisk in A shows splicing across exons 19, 20 and 21 (exons 20 and 21 are only ∼2 kb apart and appear superimposed in figure). Asterisk in B covers exons 10 and 11 in the NRG1 isoform HRG-β2b. Panel C shows qPCR and relative changes in gene expression calculated using the 2−ΔΔCt method with β2-microglobulin (β2M) as a reference gene. Mean increase in transcripts in neurons was statistically significant (p<0.05). Error bar is SD based on 3 or 4 replicates, each done in triplicate.