| Literature DB >> 34970540 |
Meltem Kuruş1, Soheil Akbari2, Doğa Eskier2,3, Ahmet Bursalı2, Kemal Ergin4, Esra Erdal2,5, Gökhan Karakülah2,3.
Abstract
The generation and use of induced pluripotent stem cells (iPSCs) in order to obtain all differentiated adult cell morphologies without requiring embryonic stem cells is one of the most important discoveries in molecular biology. Among the uses of iPSCs is the generation of neuron cells and organoids to study the biological cues underlying neuronal and brain development, in addition to neurological diseases. These iPSC-derived neuronal differentiation models allow us to examine the gene regulatory factors involved in such processes. Among these regulatory factors are long non-coding RNAs (lncRNAs), genes that are transcribed from the genome and have key biological functions in establishing phenotypes, but are frequently not included in studies focusing on protein coding genes. Here, we provide a comprehensive analysis and overview of the coding and non-coding transcriptome during multiple stages of the iPSC-derived neuronal differentiation process using RNA-seq. We identify previously unannotated lncRNAs via genome-guided de novo transcriptome assembly, and the distinct characteristics of the transcriptome during each stage, including differentially expressed and stage specific genes. We further identify key genes of the human neuronal differentiation network, representing novel candidates likely to have critical roles in neurogenesis using coexpression network analysis. Our findings provide a valuable resource for future studies on neuronal differentiation.Entities:
Keywords: WGCNA; coexpression; iPSC-derived neuronal differentiation; lncRNAs; transcriptome profiling
Year: 2021 PMID: 34970540 PMCID: PMC8712770 DOI: 10.3389/fcell.2021.727747
Source DB: PubMed Journal: Front Cell Dev Biol ISSN: 2296-634X
FIGURE 1Generation and characterization of human iPSCs-derived neurons. (A) A schematic representation of the in vitro culture system used for stepwise differentiation of human iPSCs into neurons. Timeline and representative bright-field images of cell morphology during stages of differentiation from day 0 to day 45. (B) The samples for RNA-seq analysis were collected on day 0 (pluripotent), days 9–12 (induced progenitor), days 25–28 (expanded progenitor), days 32–35 (differentiated/neuronal precursor) and days 45–47 (mature neuron). Confocal images of the cells showing expression of the pluripotency marker (OCT3/4), neural progenitor markers (PAX6, NESTIN) and neural markers (GFAP, β-TUBIII). Nuclei were visualized with DAPI.
FIGURE 2Comparison of the coding and non-coding transcriptome profiles of the cells during the neural differentiation process. I1 and I2 are iPSC samples, P1 and P2 are cells during neural induction, G1 and G2 are neural progenitor cells, D1 and D2 are cells undergoing neural differentiation, and M1 and M2 are mature neural cells. All transcriptome profiles are filtered for genes which display an FPKM value of at least 1.0 in both replicates of at least one biological condition. (A) Pearson correlation heatmap of samples clustered using hierarchical clustering. Cell colors indicate Pearson correlation values of the samples indicated in the row and column. Darker cells indicate higher correlation. (B) Stacked bar graph of long noncoding transcriptome profiles of the samples divided by percentage of RNA-seq counts sequenced per type. (C) Violin plot of protein coding and lncRNA expression values of the samples. (D) Density graph of genes by transcript length per gene type. (E) Histogram of genes by exon count in canonical transcript per gene type.
FIGURE 3Genes showing differential or stage-specific expression during neural differentiation from iPSC cells are implicated in biological processes. All transcriptome profiles are filtered for genes which display an FPKM value of at least 1.0 in both replicates of at least one biological condition. Differentially expressed genes are defined as those with an FDR of ≤0.05 and an absolute log2 fold change value of ≥0.6. (A,B) Percentage (A) and absolute count (B) of genes displaying differential expression between pairs of neural differentiation stages, divided into protein coding and lncRNA. Percentage values are calculated using the size of the transcript category as the denominator. (C) Dot plot of GO terms enriched in differentially expressed gene sets. Top 10 sets are selected in order of Benjamini-Hochberg adjusted p-values in each condition pair. Size of the dots indicate the number of differentially expressed genes associated with the GO term. (D) Scaled heatmap of the expression values of transcripts showing stage-specific expression according to the ROKU tissue specificity index. (E) Dot plot of GO terms enriched in stage specific gene sets. Top 10 sets are selected in order of Benjamini-Hochberg adjusted p-values in each condition pair. Size of the dots indicate the number of differentially expressed genes associated with the GO term.
FIGURE 4lncRNAs expressed during different stages of neural differentiation from iPSCs are associated with protein coding genes implicated in neural development biological processes. (A) WGCNA dendrogram and module affiliation graph of the transcriptome and association of gene expressions with biological conditions during neural differentiation. Each branch of the dendrogram represents a single gene expressed during neural differentiation. The colored bar under the dendrogram indicate the module the gene belongs in, with each color indicating a single module. The heatmap underneath the colored bar shows stage-module correlation levels, with red cells indicating positive correlation, blue cells indicating negative correlation, and darker colors indicating stronger correlation levels. (B) Scaled heatmaps of genes in modules showing strong association with single maturation stage (Pearson correlation coefficient ≥0.7). (C) Bar graphs indicating lncRNA module membership in clusters of interest categorized into annotated and novel lncRNAs. (D) Dot plot of GO terms enriched in gene clusters of interest. Up to 10 sets are selected in order of Benjamini-Hochberg adjusted p-values in each module. Size of the dots indicate the number of genes in the module associated with the GO term.