Literature DB >> 35184396

The synergism of spatial metabolomics and morphometry improves machine learning-based renal tumour subtype classification.

Verena M Prade1, Na Sun1, Jian Shen1, Annette Feuchtinger1, Thomas Kunzke1, Achim Buck1, Peter Schraml2, Holger Moch2, Kristina Schwamborn3, Michael Autenrieth4, Jürgen E Gschwend4, Franziska Erlmeier5,6, Arndt Hartmann5,6, Axel Walch1.   

Abstract

Entities:  

Keywords:  machine learning; mass spectrometry imaging; metabolomics; morphometry; renal cell carcinoma; tumour of the kidney; tumour subtyping

Mesh:

Year:  2022        PMID: 35184396      PMCID: PMC8858620          DOI: 10.1002/ctm2.666

Source DB:  PubMed          Journal:  Clin Transl Med        ISSN: 2001-1326


× No keyword cloud information.
Dear Editor, Although mass spectrometry imaging generates both morphometric and metabolomics data, they have never been combined to improve machine learning‐based tumour typing. We demonstrate that the synergy of spatial metabolomics and morphometric data improves the classification of tumours of the kidney and thus has the potential to improve artificial intelligence (AI)‐based diagnostics. Tumours of the kidney are a heterogeneous group of various types of cancer with characteristic histologic or genetic features that require tumour type‐specific therapies. Chromophobe renal cell carcinomas (chRCC) and renal oncocytomas – two tumour types that can sometimes be difficult to distinguish based on morphology alone – are associated with different prognosis, and the former has the potential to progress and metastasize. , Both immunoncological and targeted therapies are investigated; however, immunotyping and genotyping are laborious, fall short of standardization, and immunohistochemical markers have been shown to be unreliable. , We used matrix‐assisted laser desorption/ionization (MALDI) mass spectrometry imaging (MSI) because one of its greatest strengths is the ability to combine in situ mass spectrometric data with conventional histology or immunohistochemistry, making it a powerful and very useful tool for multiparametric high‐dimensional multi‐omics analyses. , , , , This potential has also been successfully applied for biomarker discovery and machine learning‐based renal tumour subtyping using unique molecular data. , , , , Our study was performed on a large patient cohort (n = 853, Table 1) and on clinically relevant FFPE tissue samples to distinguish clear cell renal cell carcinomas (ccRCC, n = 552), papillary renal cell carcinomas (pRCC, n = 122), chRCC (n = 108) and renal oncocytomas (RO, n = 71). For details about the MALDI imaging and the morphometrics analysis of H&E stained tissue sections, see the supporting information.
TABLE 1

Patient characteristics

Patient characteristics n = 853
Age range (median) (years) 27–88 (65)
Gender
Male299 (40.4%)
Female441 (59.6%)
ISUP Grade
Grade 125 (4.1%)
Grade 2241 (39.4%)
Grade 3191 (31.3%)
Grade 4154 (25.2%)
Pathological stage
pT1418 (55.8%)
pT290 (12.0%)
pT3231 (30.9%)
pT410 (1.3%)
pN+26 (15.0%)
pM+4 (36.4%)
Subtype
RO71 (8.3%)
chRCC108 (12.7%)
ccRCC552 (64.7%)
pRCC122 (14.3%)
Survival (dead/alive)214 (31.5%)/465 (68.5%)
Overall survival median (months)36

Abbreviations: ccRCC, clear cell renal cell carcinomas; chRCC, chromophobe renal cell carcinomas; pRCC, papillary renal cell carcinoma; RO, renal oncocytomas.

Patient characteristics Abbreviations: ccRCC, clear cell renal cell carcinomas; chRCC, chromophobe renal cell carcinomas; pRCC, papillary renal cell carcinoma; RO, renal oncocytomas. Morphometric features (n = 110) describing tissue or cell compartment colour, shape or size (Table S1) and untargeted metabolomic features (n = 2,111) were used for classifier training (Figure 1). Patients were randomly split into training (2/3) and independent validation set (1/3). The data were normalized for training and validation set, separately. Feature selection was done by calculating a Kruskal–Wallis test (p < 0.01) on the training sets. Details about the classifier training are in the supporting information.
FIGURE 1

Workflow to analyze the synergistic effect of morphometric and molecular data on classifier performance. Eight hundred fifty‐three patient tissues were analyzed with MALDI mass spectrometry imaging (MSI), followed by hematoxylin & eosin staining and morphometric image analysis. Morphometric and molecular data of tumour regions were used both separately and in synergy to train three random forest classifiers. Patients were split into training and independent validation sets, followed by data normalization, feature selection and classifier training and validation

Workflow to analyze the synergistic effect of morphometric and molecular data on classifier performance. Eight hundred fifty‐three patient tissues were analyzed with MALDI mass spectrometry imaging (MSI), followed by hematoxylin & eosin staining and morphometric image analysis. Morphometric and molecular data of tumour regions were used both separately and in synergy to train three random forest classifiers. Patients were split into training and independent validation sets, followed by data normalization, feature selection and classifier training and validation Using morphometric data alone, the classifier reached a mean accuracy of 77.81% (Figure 2A). It performed best for ccRCC (F1‐score: 86.22%) and worst for chRCC (F1‐score: 52.17%).
FIGURE 2

Synergistic effect of morphometric and molecular data on classifier performance on renal cell carcinoma subtypes. (A and B) Classifier performance of the random forests revealing a synergistic effect of morphological and molecular data. In (B), the best performing classifier is visualized with a larger marker. The synergistic effect is best seen for papillary renal cell carcinoma (pRCC), where the performance is improved by up to 10 per cent. (C‐E) Feature importance of the top 50 features for the random forest trained on morphometric data (C), metabolite data (D) and the synergy of both (E). In the latter, the top 50 comprise a mixture of both types of features

Synergistic effect of morphometric and molecular data on classifier performance on renal cell carcinoma subtypes. (A and B) Classifier performance of the random forests revealing a synergistic effect of morphological and molecular data. In (B), the best performing classifier is visualized with a larger marker. The synergistic effect is best seen for papillary renal cell carcinoma (pRCC), where the performance is improved by up to 10 per cent. (C‐E) Feature importance of the top 50 features for the random forest trained on morphometric data (C), metabolite data (D) and the synergy of both (E). In the latter, the top 50 comprise a mixture of both types of features Trained on metabolomics data, the classifier reached a higher mean accuracy (85.13%) and performed even better on ccRCC (F1‐score: 91.06%), chRCC (F1‐score: 76.46%) and RO (F1‐score: 83.04%) (Figure 2A). Only on pRCC, this classifier did not perform as well as the classifier trained on morphometric data (F1‐score: 63.66%). The third classifier was trained on the synergy of both data sets – morphometric data and molecular data – and outperformed the two previous classifiers for each tumour subtype (Figure 2A). It reached a mean accuracy of 88.04% and F1‐scores of 92.54% (ccRCC), 76.73% (pRCC), 77.15% (chRCC) and 84.65% (RO). When comparing each statistical measure, the synergistic classifier trained on both data sets almost consistently outperforms the other two (Figure 2B). The synergy of morphometric and metabolite data not only improves general performance, but also seems to compensate for weaknesses of the two individual classifiers trained on either morphometric or metabolite data. For instance, the metabolomic classifier performed better compared to the morphometric classifier for ccRCC (F1‐scores: 91.06% vs. 86.22%), chRCC (F1‐scores: 76.46% vs. 52.17%) and RO (F1‐scores: 83.04% vs. 64.75%), while the morphometric classifier performed better for pRCC (F1‐scores: 63.66% vs. 67.56%). The classifier trained on the synergy of the two data sets outperforms either of the two (F1ccRCC: 92.54%, F1pRCC: 76.73%, F1chRCC: 77.15%, F1RO: 84.65%). The Gini importance of each feature was calculated for the three classifiers, and the top 50 features were compared (Figure 2C‐E). The top 50 features from the classifier trained on both data sets represent an even mixture of morphometric and metabolomic features (Figure 2E). Example metabolite ion images of the top features in the classifier are illustrated in Figure S1. Interestingly, the four most important features are metabolites, and while half of the top features are morphometric, the importance of the best metabolite is twice as high as that of the top morphometric feature. This further illustrates the synergistic impact both data sets have on the classifier's performance. Up to this point, the classifiers were trained to distinguish four different tumour subtypes, which is a much more difficult task than separating only two tumour subtypes. However, since ccRCC and pRCC are relatively easy to recognize by a pathologist using histology alone, we also tested our approach exclusively on the two remaining tumour subtypes – chRCC and RO (Figure 3A).
FIGURE 3

Synergistic effect of morphometric and molecular data on classifier performance of chromophobe renal cell carcinoma (ccRCC) and renal oncocytoma (RO). (A) Hematoxylin & eosin staining of chRCC and RO, illustrating their similar histology. (B and C) Classifier performance of the random forests revealing a synergistic effect of morphological and molecular data. In (C), the best performing classifier is visualized with a larger marker. (D‐F) Feature importance of the top 50 features for the random forest trained on morphometric data (D), metabolite data (E) and the synergy of both (F). In the latter, the top 50 comprise a mixture of both types of features. Scale bar: 200 μm

Synergistic effect of morphometric and molecular data on classifier performance of chromophobe renal cell carcinoma (ccRCC) and renal oncocytoma (RO). (A) Hematoxylin & eosin staining of chRCC and RO, illustrating their similar histology. (B and C) Classifier performance of the random forests revealing a synergistic effect of morphological and molecular data. In (C), the best performing classifier is visualized with a larger marker. (D‐F) Feature importance of the top 50 features for the random forest trained on morphometric data (D), metabolite data (E) and the synergy of both (F). In the latter, the top 50 comprise a mixture of both types of features. Scale bar: 200 μm The same synergistic effect can be observed for the three classifiers (Figure 3B). On morphometric data, a mean accuracy of 81.27% is achieved, while on metabolomics data, the mean accuracy is higher with 89.49%. The synergy of the data further increases the accuracy to 91.0% with an F1‐score of 92.70% for chRCC and 88.07% for RO. As these two subtypes can be histologically similar, the morphometry plays a minor role, while metabolite data are of higher importance for classification. Hence, fewer morphometric features are ranked among the top 50 (40%), but they are still beneficial for tumour subtype classification. The synergistic effect is reflected by the high ranking of morphometric features within the classifier (Figure 3F). Even though morphometric data are readily available in any MSI experiment, it has not been exploited to improve the predictive quality of molecular classifiers so far. This study provides evidence that the synergy of morphometric and molecular data improves renal carcinoma subtyping. Our study was performed on a large patient cohort (n = 853) and on clinically relevant FFPE tissue samples using metabolite data. The classifier trained on the combined data set or morphometric and metabolite data outperformed the classifiers trained on the individual data sets for each tumour subtype and reached an accuracy of 88.04%. Finally, the classifier was trained on chRCC and RO – two tumour subtypes that are sometimes difficult to distinguish based on histology alone – and was able to distinguish the subtypes with high accuracy (91%). In conclusion, we propose to utilize the so far unrecognized potential and synergy of computer‐aided image analysis and spatial metabolomics – both types of data available in all MSI experiments – to improve AI‐based diagnostics and tumour subtyping in general.

CONFLICT OF INTEREST

The authors declare no conflict of interest. Supporting Information Click here for additional data file.
  14 in total

1.  Expression of kidney-specific cadherin distinguishes chromophobe renal cell carcinoma from renal oncocytoma.

Authors:  Peter R Mazal; Markus Exner; Andrea Haitel; Sigurd Krieger; R Brent Thomson; Peter S Aronson; Martin Susani
Journal:  Hum Pathol       Date:  2005-01       Impact factor: 3.466

2.  Mass Spectrometry Imaging Enables Discrimination of Renal Oncocytoma from Renal Cell Cancer Subtypes and Normal Kidney Tissues.

Authors:  Jialing Zhang; Shirley Q Li; John Q Lin; Wendong Yu; Livia S Eberlin
Journal:  Cancer Res       Date:  2019-12-16       Impact factor: 12.701

3.  Multimodal analysis of formalin-fixed and paraffin-embedded tissue by MALDI imaging and fluorescence in situ hybridization for combined genetic and metabolic analysis.

Authors:  Katharina Huber; Thomas Kunzke; Achim Buck; Rupert Langer; Birgit Luber; Annette Feuchtinger; Axel Walch
Journal:  Lab Invest       Date:  2019-05-31       Impact factor: 5.662

Review 4.  MALDI imaging mass spectrometry in cancer research: combining proteomic profiling and histological evaluation.

Authors:  Cédrik Schöne; Heinz Höfler; Axel Walch
Journal:  Clin Biochem       Date:  2013-02-04       Impact factor: 3.281

5.  Imaging Mass Spectrometry Is an Accurate Tool in Differentiating Clear Cell Renal Cell Carcinoma and Chromophobe Renal Cell Carcinoma: A Proof-of-concept Study.

Authors:  Hsiang-Chih Lu; Nathan Heath Patterson; Audra M Judd; Michelle L Reyzer; Jennifer K Sehn
Journal:  J Histochem Cytochem       Date:  2020-05-28       Impact factor: 2.479

6.  Discovery of lipid biomarkers correlated with disease progression in clear cell renal cell carcinoma using desorption electrospray ionization imaging mass spectrometry.

Authors:  Keita Tamura; Makoto Horikawa; Shumpei Sato; Hideaki Miyake; Mitsutoshi Setou
Journal:  Oncotarget       Date:  2019-03-01

7.  De novo discovery of metabolic heterogeneity with immunophenotype-guided imaging mass spectrometry.

Authors:  Verena M Prade; Thomas Kunzke; Annette Feuchtinger; Maria Rohm; Birgit Luber; Florian Lordick; Achim Buck; Axel Walch
Journal:  Mol Metab       Date:  2020-02-14       Impact factor: 7.422

Review 8.  Pathology and systemic therapy of non-clear cell renal cell carcinoma: an overview.

Authors:  Lothar Bergmann; Sarah Weber; Arndt Hartmann; Marit Ahrens
Journal:  Expert Rev Anticancer Ther       Date:  2021-08-06       Impact factor: 4.512

9.  Morphometric Cell Classification for Single-Cell MALDI-Mass Spectrometry Imaging.

Authors:  Klára Ščupáková; Frédéric Dewez; Axel K Walch; Ron M A Heeren; Benjamin Balluff
Journal:  Angew Chem Int Ed Engl       Date:  2020-08-17       Impact factor: 16.823

View more
  1 in total

1.  MALDI Mass Spectrometry Imaging-Prognostic Pathways and Metabolites for Renal Cell Carcinomas.

Authors:  Franziska Erlmeier; Na Sun; Jian Shen; Annette Feuchtinger; Achim Buck; Verena M Prade; Thomas Kunzke; Peter Schraml; Holger Moch; Michael Autenrieth; Wilko Weichert; Arndt Hartmann; Axel Walch
Journal:  Cancers (Basel)       Date:  2022-03-30       Impact factor: 6.639

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.