| Literature DB >> 35956764 |
Juliana Pereira Lopes Gonçalves1, Christine Bollwein1, Anna Melissa Schlitter1,2, Mark Kriegsmann3, Anne Jacob1, Wilko Weichert1,2, Kristina Schwamborn1.
Abstract
Cancer-related deaths are very commonly attributed to complications from metastases to neighboring as well as distant organs. Dissociate response in the treatment of pancreatic adenocarcinoma is one of the main causes of low treatment success and low survival rates. This behavior could not be explained by transcriptomics or genomics; however, differences in the composition at the protein level could be observed. We have characterized the proteomic composition of primary pancreatic adenocarcinoma and distant metastasis directly in human tissue samples, utilizing mass spectrometry imaging. The mass spectrometry data was used to train and validate machine learning models that could distinguish both tissue entities with an accuracy above 90%. Model validation on samples from another collection yielded a correct classification of both entities. Tentative identification of the discriminative molecular features showed that collagen fragments (COL1A1, COL1A2, and COL3A1) play a fundamental role in tumor development. From the analysis of the receiver operating characteristic, we could further advance some potential targets, such as histone and histone variations, that could provide a better understanding of tumor development, and consequently, more effective treatments.Entities:
Keywords: mass spectrometry imaging; metastasis; pancreatic ductal adenocarcinoma; prognosis; proteomics; tumor development
Mesh:
Substances:
Year: 2022 PMID: 35956764 PMCID: PMC9369872 DOI: 10.3390/molecules27154811
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.927
Figure 1Pancreatic ductal adenocarcinoma (PDAC) and distant metastases. PDAC located in the pancreas (represented by the green dot) presents a high metastatic rate. Distant PDAC metastases are most commonly found in the liver, peritoneum, lungs, adrenal gland, and skin (represented by the red dots). The development of distant metastases is often associated with a poor prognosis for this ailment.
Figure 2Overview of the average mass spectrum of the primary (green) and metastatic (orange) tissues.
Patients’ characteristics of the mixed TMA.
| Patients | |
|---|---|
| Age (average ± deviation, years) | 68 ± 26 |
| Gender | |
| Female | 6 |
| Male | 7 |
| T stage | |
| N/A | 4 |
| pT3 | 8 |
| pT4 | 1 |
| Nodal involvement | |
| N/A | 4 |
| pN 1 | 8 |
| pN 2 | 1 |
| CTx | 5 |
| Tumor grading | |
| N/A | 4 |
| 2 | 4 |
| 3 | 5 |
| Metastasis location | |
| Liver | 9 |
| Peritoneum | 4 |
N/A—not available; CTx—neoadjuvant chemotherapy treatment.
Classification accuracy of the supervised classification models.
| Classification Outcome | RF | SVM | LDA |
|---|---|---|---|
| Accuracy | 0.9319 | 0.9368 | 0.8972 |
| Sensitivity | 0.9783 | 0.9567 | 0.9783 |
| Specificity | 0.8208 | 0.8892 | 0.7028 |
RF—random forest; SVM—support vector machine; LDA—linear discriminant analysis.
Figure 3Overview of the classification outcomes obtained for the external validation data set. The annotated regions of the primary (in red) and the annotated regions of the metastases (in yellow) were subjected to the built classification models. The classification outcomes that used the linear discriminant analysis (LDA) model were plotted and overlaid with the measurement regions. Classifications of primary tumors are represented with red dots, while classifications of distant metastases are represented by yellow dots.
Top 10 features, calculated by AUC-ROC.
| AUC-ROC | Tentative ID | Sequence | Modifications | MASCOT | |
|---|---|---|---|---|---|
| 1336.639 | 0.66484 | Core histone macro-H2A.1 | LEAIITPPPAKK | Acetyl (N-term) | 37 |
| 1198.711 | 0.66414 | Actin * | AVFPSIVGRPR | 73 | |
| 976.426 | 0.65948 | Actin | AGFAGDDAPR | # [ | |
| 632.313 | 0.65927 | Histone PARylation factor 1 | VGGGGKR | 31 | |
| 1220.666 | 0.65995 | Collagen alpha-3 (VI) chain | LGAPGTPGLPGPR | 2 Oxidation (P) | 40 |
| 1493.741 | 0.64999 | Collagen alpha-2(1) chain precursor | GLHGEFGLPGPAGPR | 2 Oxidation (P) | 40 |
| 1039.518 | 0.6448 | Tubulin beta-2C chain | # [ | ||
| 805.419 | 0.6441 | Collagen alpha-3 (VI) chain | # [ | ||
| 901.485 | 0.6415 | Histone H2B | LAHYNKR | 43 | |
| 911.438 | 0.6391 |
* Possible underlying isoforms: ACTA1 or ACTA2, ACTAB, ACTG1, ACTAG2, POTEI, POTEKP, POTEF or POTEE. # Tentative identification based on literature search.