| Literature DB >> 35269711 |
Thorben Sauer1, Giulia Facchinetti1, Michael Kohl1, Justyna M Kowal2, Svitlana Rozanova1, Julia Horn1, Hagen Schmal3,4, Ivo Kwee5, Arndt-Peter Schulz6,7, Sonja Hartwig8,9, Moustapha Kassem2, Jens K Habermann1,10, Timo Gemoll1.
Abstract
Mesenchymal stem cells (MSCs) gain an increasing focus in the field of regenerative medicine due to their differentiation abilities into chondrocytes, adipocytes, and osteoblastic cells. However, it is apparent that the transformation processes are extremely complex and cause cellular heterogeneity. The study aimed to characterize differences between MSCs and cells after adipogenic (AD) or osteoblastic (OB) differentiation at the proteome level. Comparative proteomic profiling was performed using tandem mass spectrometry in data-independent acquisition mode. Proteins were quantified by deep neural networks in library-free mode and correlated to the Molecular Signature Database (MSigDB) hallmark gene set collections for functional annotation. We analyzed 4108 proteins across all samples, which revealed a distinct clustering between MSCs and cell differentiation states. Protein expression profiling identified activation of the Peroxisome proliferator-activated receptors (PPARs) signaling pathway after AD. In addition, two distinct protein marker panels could be defined for osteoblastic and adipocytic cell lineages. Hereby, overexpression of AEBP1 and MCM4 for OB as well as of FABP4 for AD was detected as the most promising molecular markers. Combination of deep neural network and machine-learning algorithms with data-independent mass spectrometry distinguish MSCs and cell lineages after adipogenic or osteoblastic differentiation. We identified specific proteins as the molecular basis for bone formation, which could be used for regenerative medicine in the future.Entities:
Keywords: SWATH; data-independent acquisition mass spectrometry; differentiation markers; human stromal/mesenchymal stem cells; machine learning; protein profiling
Mesh:
Year: 2022 PMID: 35269711 PMCID: PMC8910760 DOI: 10.3390/ijms23052568
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Figure 1Unsupervised principal component analysis plots derived from mass spectrometry data ((a), 4108 proteins) and two-dimensional gel electrophoresis data ((b), 1517 protein spots). The three cell lineages MSCs (green), OB (blue), AD (pink) were run in quintuples. P11, p13, p15, p17, and p18 indicate the patient number of the corresponding sample. X- and y-axes show the first and second principal components, respectively.
Enriched gene sets of the MSigDB Hallmark collection for OB versus AD, OB versus MSC, and AD versus MSC. (Log2FC, logarithm of fold change; Meta-q, meta q-value used statistical methods; Avg. Expr., average expression).
| Gene set | Log2FC OB vs. AD | Meta-q | Avg. Expr. OB | Avg. Expr. AD |
|---|---|---|---|---|
| MYC_TARGETS_V1 | 0.575 | 0.0005 | 6.103 | 5.528 |
| E2F_TARGETS | 0.410 | 0.0005 | 7.049 | 6.638 |
| G2M_CHECKPOINT | 0.278 | 0.0005 | 5.411 | 5.133 |
| PEROXISOME | −0.223 | 0.0366 | 6.925 | 7.148 |
| BILE_ACID_METABOLISM | −0.317 | 0.0006 | 6.762 | 7.079 |
| FATTY_ACID_METABOLISM | −0.350 | 0.0006 | 6.535 | 6.885 |
| CHOLESTEROL_HOMEOSTASIS | −0.377 | 0.0315 | 6.318 | 6.695 |
| OXIDATIVE_PHOSPHORYLATION | −0.438 | 0.0006 | 5.028 | 5.466 |
| ADIPOGENESIS | −0.491 | 0.0006 | 5.782 | 6.273 |
|
|
|
|
|
|
| MYC_TARGETS_V2 | 0.279 | 0.0435 | 5.745 | 5.466 |
| E2F_TARGETS | 0.274 | 0.0008 | 7.034 | 6.76 |
| MYC_TARGETS_V1 | 0.241 | 0.0098 | 6.076 | 5.835 |
|
|
|
|
|
|
| OXIDATIVE_PHOSPHORYLATION | 0.659 | 0.0006 | 5.590 | 4.931 |
| CHOLESTEROL_HOMEOSTASIS | 0.533 | 0.0006 | 6.654 | 6.121 |
| ADIPOGENESIS | 0.529 | 0.0006 | 6.206 | 5.677 |
| FATTY_ACID_METABOLISM | 0.478 | 0.0006 | 6.909 | 6.431 |
| BILE_ACID_METABOLISM | 0.411 | 0.0006 | 7.096 | 6.685 |
| PEROXISOME | 0.337 | 0.0028 | 7.282 | 6.945 |
| XENOBIOTIC_METABOLISM | 0.271 | 0.0011 | 5.857 | 5.585 |
| PI3K_AKT_MTOR_SIGNALING | −0.203 | 0.0359 | 7.127 | 7.331 |
| MYC_TARGETS_V1 | −0.334 | 0.0105 | 5.605 | 5.939 |
| MITOTIC_SPINDLE | −0.335 | 0.0006 | 6.635 | 6.97 |
Figure 2Volcano plots of differentially expressed proteins between MSCs, osteoblasts, and adipocytes (a–c) as well as a Venn diagram indicating the overlap of differentially abundant proteins between all three group comparisons (d). Volcano plots are presented with the fold-change of the corresponding comparison in logarithmic scale (x-axis) against the q-value of the Tukey’s post hoc test (y-axis). Significance thresholds (q-value < 0.05 and |log2FC| threshold of >2) are indicated by dashed lines. Proteins passing these cut-offs are considered significant and colored in pink. Proteins passing the ANOVA and post-hoc q-value but not the log2FC threshold are colored in green. Proteins that were not significant in the ANOVA but in Tukey’s post hoc are indicated in blue. (a) OB vs. MSC comparison, 50 proteins are identified as significantly differentially expressed. (b) OB vs. AD comparison, 66 proteins are identified as significantly differentially expressed. (c) AD vs. MSC comparison, 86 proteins are identified as significantly differentially expressed. (d) Venn diagram indicating the overlap of differentially abundant proteins (ANOVA) of all three defined comparisons.
Figure 3Boxplots of nine proteins for (a) osteoblastic differentiation and (b) adipocytic differentiation using ANOVA and machine learning algorithms. Individual data points are shown with median expression in log2 of the measured intensity. Whiskers are plotted according to the Tukey method indicating 1.5 * interquartile range. The osteoblastic protein (a) COL3A1 and the adipocytic protein (b) HSD11B1 show significant differential expressions for all three comparisons (indicated by asterisks), however, the comparison between the AD and MSC group and OB and MSC did not exceed the net log2FC threshold of 2, respectively. Asterisks indicate the Tukey’s post hoc q-value results: * =≤ 0.05, ** =≤ 0.01, *** =≤ 0.001.
Figure 4Heatmap of 22 marker proteins for osteoblastic and adipocytic differentiation. Expression values are shown in log2 of the measured intensity.
Protein expression values of CD markers that have been defined by the International Society of Cellular Therapy for MSC classification. (+) defined high protein expression in MSCs; (-) defined low expression in MSCs. (CV, coefficients of variation; q-value, adjusted post-hoc p-value; Log2FC, logarithm of fold change).
| Protein | Gene Name | CV | q-Value | Log2FC | Log2FC | Log2FC |
|---|---|---|---|---|---|---|
| CD105+ | ENG | 0.104 | 0.25 | 0.121 | −0.884 | −0.763 |
| CD73+ | NT5E | 0.064 | 0.43 | 0.426 | 0.157 | 0.583 |
| CD90+ | THY1 | 0.121 | 0.41 | 0.355 | −1.046 | −0.691 |
| CD45- | PTPRC | Not Identified | ||||
| CD34- | CD34 | Not Identified | ||||
| CD14- | CD14 | Not Identified | ||||
| CD11b- | ITGAM | Not Identified | ||||
| CD79a- | CD79a | Not Identified | ||||
| CD19- | CD19 | Not Identified | ||||
| HLA-DR- | HLA-DR | Not Identified | ||||