| Literature DB >> 25326687 |
Beatriz Roson-Burgo, Fermin Sanchez-Guijo, Consuelo Del Cañizo, Javier De Las Rivas1.
Abstract
BACKGROUND: Human Mesenchymal Stromal/Stem Cells (MSCs) are adult multipotent cells that behave in a highly plastic manner, inhabiting the stroma of several tissues. The potential utility of MSCs is nowadays strongly investigated in the field of regenerative medicine and cell therapy, although many questions about their molecular identity remain uncertain.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25326687 PMCID: PMC4287589 DOI: 10.1186/1471-2164-15-910
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1Characterization of MSCs primary cultures: microscope and flow cytometry. Microscope photographs of human MSCs in culture isolated from PL (A) and BM (B): phase contrast micrographs of passage three cultures seen at two amplifications. Flow cytometry histograms of standard immunophenotype markers (CD34, CD73, CD45, CD90, CD166, HLADR, CD105) tested in isolated PL (C) and BM (D) MSCs.
Figure 2MSCs RNA-Seq reads mapping on the loci of 8 specific genes. A superimposed view piling-up the uniquely aligned RNA-Seq reads over the DNA regions of 8 specific gene loci, showing the exonic and intronic transcription outcomes. The expressed regions appear as black densities representing the raw number of reads on each specific region of the locus (transformed to log scale). The locus corresponding to each gene is indicated by the chromosome and the nucleotide number position, including scales bars in kb. For each gene locus the signal of BM-MSCs and PL-MSCs are placed one on top of the other, and the corresponding structure of the locus from RefSeqGene (NCBI) is painted below in blue. The genes are divided in (A) positive markers, (B) negative markers and (C) other markers.
Figure 3Global expression: RNA-Seq expression data of human MSCs. (A) Scatter plot presenting the values of log2 (FPKM ) for each gene in the BM-MSC samples (X-axis) versus the PL-MSC samples (Y-axis). Insert (B): Scatter plot including the log2 (FPKM ) of the protein-coding genes in the BM- and PL-MSC samples (showing with a green shade the region that includes 95% of the expressed data distribution). Insert (C): Table indicating the log2 (FPKM ) values correspoding to 8 marker genes in the BM- and PL-MSC samples.
Figure 4Mapping the expression of several gene-sets on RNA-Seq data of MSCs. Scatter plots of log2 (FPKM ) in BM- against PL-MSC samples marking in colors different groups of genes: (A) set of human transcription factors (TF) including 740 genes (purple dots), with135 genes found in the MSC expressed region (hypergeometric p-value > 0.95, not significant); (B) set of 299 stem cell (SC) related genes (orange dots), including 139 found in the MSC expressed region (hypergeometric p-value = 5.96x10-17, show significant enrichment); (C) set of 158 highly conserved housekeeping (HK) genes (green dots), including 104 found in the MSC expressed region (hypergeometric p-value = 1.31x10-28, show significant enrichment). (D) Boxplot of the log2(FPKM ) expression distributions of the 3 gene sets (HK, SC, TF) corresponding to PL-MSCs (blue) or BM-MSCs (red).
Functional enrichment on KEGG signaling pathways of MSCs expressed coding genes
| KEGG pathway | Gene set size | Observed hits | Adjusted p-value* | Hits (Gene_Symbols) |
|---|---|---|---|---|
| mTOR signaling pathway (hsa04150) | 52 | 27 | 0.0000505 | AKT1, AKT3, CAB39, CAB39L, EIF4E2, EIF4EBP1, MAPK1M MAPK3, MLST8, MTOR, PIK3CB, PIK3R2, PRKAA1, PRKAA1, RHEB, RPS6, RPS6KA2, RPS6KA3, RPS6KB1, RPS6KB2, STK11, TSC1, TSC2, ULK1, ULK3, VEGFA, VEGFB, VEGFC |
| ErbB signaling pathway (hsa04012) | 87 | 35 | 0.0019942 | ABL1, AKT1, AKT3, BAD, CDKN1A, CDKN1B, CRK, CRKL, EGFR, EIF4EBP1, ELK1, ERBB2, GRN2, GSK3B, HBEGF, HRAS, KRAS, MAP2K1, MAP2K2, MAP2K4, MAP2K7, MAPK1, MAPK3, MAPK8, MAPK9, MTOR, MYC, NCK2, NRAS, PAK1, PAK2, PIK3CB, PIK3RS, RPS6KB1, RPS6KB2 |
| TGF-beta signaling pathway (hsa04330) | 83 | 33 | 0.0032629 | BMPR1A, BMPR2, CDKN2B, CUL1, E2F4, ID1, ID3, MAPK1, MAPK3, MYC, PPP2CB, PPP2R1A, PPP2R1B, RBL2, RBX1, ROCK1, ROCK2, RPS6KB1, RPS6KB2, SKP1, SMADA, SMAD4, SMAD7, SMURF2, SP1, TFDP1, TGFB2, TGFBR1, TGFBR2, THBS2, ZFYVE16, ZFYVE9 |
| Notch signaling pathway (hsa04330) | 46 | 20 | 0.0056174 | ADAM17, APH1A, CIR1, CTBP1, CTBP1, CTBP2, DTX3, DTX3, DTX3L, DVL1, DVL2, HDAC1, HDAC1, HDAC2, HES1, NCOR2, NOTCH2, NOTCH3, NUMB, NUMBL, PSEN2, RBPJ, RFNG |
| Osteociast differentiation (hsa04380) | 127 | 45 | 0.0088651 | AKT1, AKT3, CHUK, CYBA, FHL2, FOS, FOSB, FOSB, FOSL1, FOSL2, GRB2, IFNAR1, IFNAR2, IFNGR1, IFNGR2, IFNGR2, IKBKB, IRF9, JAK1, JUNB, JUND, MAP2K1, MAP2K7, MAP2K7, MAP3K7, MAPK1, MAPK12, MAPK14, MAPK3, MAPK3, MAPK8, MAPK9, NFKBIA, PIK3CB, PIK3R2, PP3CA, PPP3CC, RAC1, RELA, SIRPA, SOCS3, SQSTM1, STAT2, TAB2, TGFB2, TGFBRI, TGFBR2, TNFRSF11B, TYK2 |
| Wnt signaling pathway (hsa04310) | 150 | 51 | 0.0134525 | CCND1, CSNK2A1, CSNK2A2, CSNK2B, CTBP1, CTBP2, CTNNB1, CTNNBIP1, CUL1, DVL1, DVL2, FOSL1, FZD2, FZD4, FZD6, FZD7, FZD8, GSK3B, LRP5, LRP6, MAP3K7, MAPK8, MAPK9, MYC, PLCB3, PPP2R5C, PPP2RSE, PPP3CA, PPP3CC, PPP3R1, PRICKLE1, SENP2, SFRP4, SIAH1, SKP1, SMAD2, SMAD4, TCF7L1, WNT5B |
| Apoptosis (hsa04210) | 87 | 32 | 0.0136314 | AIFM1, AKT1, AKT3, BAD, BAX, BCL2L1, BID, BIRC2, CAPN1, CAPN2, CASP6, CASP8, CAPSP9, CFLAR, CHUK, ENDOG, FADD, IKBKB, IRAK1, NFKBIA, PIK3CB, PIK3R2, PPP3CA, PPP3CC, PPP3R1, PRKACA, PRKAR1A, RELA, RIPK1, TNFRSF10D, TRADD, XDIAP |
| VEGF signaling pathway (hsa04370) | 75 | 28 | 0.0160140 | AKT1, AKT3, BAD, CASP9, CDC42, HRAS, HSPB1, KRAS, |
| Adipocytokine signaling pathway (hsa04920) | 67 | 25 | 0.0207573 | ACSL1, ACSL3, ADIPOR1, ADIPOR2, AKT1, AKT3, CHUK, CPT1A, IKBKB, MAPK8, MAPK9, MAPK9, MTOR, NFKBIA, NFKBIB, PCK2, PRKAA1, PRKAB1, PRLAB2, PRKAG1, PTPN11, RELA, SOCS3, STAT3, STK11, TRADD |
*Hypergeometric test (p-values adjusted using Benjamini and Hockberg method; done with HTSanalyze R-pakage).
Figure 5Transcription factors within the MSC expression footprint. (A) Table of 11 meta-regulators found to be enriched on the TFBS of 135 TFs detected in the MSC expression footprint. (B) DNA methylation distributions –densities versus Beta values– corresponding to 135 expressed MSC-TFs (red) and to other sets of 135 randomly selected TFs (black). The plot shows higher accumulation of methylation measurements around lower values of Beta for MSC-TFs. (C) Table of expression values of log2 (FPKM ) corresponding to five TFs related to pluripotency (KLF4, c-MYC, POU5F1, SOX2 and NANOG); and RNA-Seq raw profiles of KLF4 and MYC genes in BM- and PL-MSC samples. (D) DNA methylation distributions –Beta values– corresponding to the CpGs associated to 4 TFs. Wilcoxon tests proving significant differences in these analyses gave the following p-values: KLF4 vs POU5F1, p-value = 2.40×10e-4; KLF4 vs SOX2, p-value = 2.63×10e-2; MYC vs POU5F1, p-value = 1.89×10e-6; MYC vs SOX2, p-value = 0.782×10e-6 (all parameters of the statistical tests presented in this figure are included in Additional file 12: Table S8) (E) Protein interaction network including the MSC-TFs found. Edge-thickness (blue lines) and number represents the number of experimental evidences that support a given protein-protein interaction (PPI). Shaded groups represent structural families. The 17 TFs that were found to regulate the MSCs gene expression footprint are labeled with red names and enclosed by a square (instead of a circle like the rest of the nodes). Within these 17 TFs, the nodes corresponding to the 11 TF meta-regulators (detailed in 5A) are also labeled with red names but larger squares. Nodes with border in blue (11 genes) are not linked in the network, but they are structural paralogs of some of the linked nodes placed aside.
Figure 6Analysis of differential expression on the RNA-Seq data of BM- and PL-MSCs. Scatter plots marking in red the significant genes found with two different methods: (A) Cuffdiff and (C) DEseq. Volcano plots of the expression data analyses done (B) using Cuffdiff and (E) using DEseq. Setting the q-value threshold at < 0.05 (blue line) for both methods, they detect 232 and 2627 significant genes, respectively. Since DEseq method is much less stringent than Cuffdiff and to avoid false positives, a second cut-off at q-value < 0.001 was set up for DEseq differential expression, selecting in this way 1388 significant genes. (D) Figure showing a proportional Venn diagram to illustrate the overlap of the genes selected by Cuffdiff (232) and DEseq (1388). The overlap includes 203 genes that undergo significant differential expression changes (i.e. common genes in red, in the scatter plots 6A and 6C).