Literature DB >> 34734969

spatialTIME and iTIME: R package and Shiny application for visualization and analysis of immunofluorescence data.

Jordan H Creed1, Christopher M Wilson1, Alex C Soupir1,2, Christelle M Colin-Leitzinger3, Gregory J Kimmel4, Oscar E Ospina1, Nicholas H Chakiryan5, Joseph Markowitz6, Lauren C Peres3, Anna Coghill3, Brooke L Fridley1.   

Abstract

SUMMARY: Multiplex immunofluorescence (mIF) staining combined with quantitative digital image analysis is a novel and increasingly used technique that allows for the characterization of the tumor immune microenvironment (TIME). Generally, mIF data is used to examine the abundance of immune cells in the TIME; however, this does not capture spatial patterns of immune cells throughout the TIME, a metric increasingly recognized as important for prognosis. To address this gap, we developed an R package spatialTIME that enables spatial analysis of mIF data, as well as the iTIME web application that provides a robust but simplified user interface for describing both abundance and spatial architecture of the TIME. The spatialTIME package calculates univariate and bivariate spatial statistics (e.g., Ripley's K, Besag's L, Macron's M, and G or nearest neighbor distance) and creates publication quality plots for spatial organization of the cells in each tissue sample. The iTIME web application allows users to statistically compare the abundance measures with patient clinical features along with visualization of the TIME for one tissue sample at a time.
AVAILABILITY AND IMPLEMENTATION: spatialTIME is implemented in R and can be downloaded from GitHub (https://github.com/FridleyLab/spatialTIME) or CRAN. An extensive vignette for using spatialTIME can also be found at https://cran.r-project.org/web/packages/spatialTIME/index.html. iTIME is implemented within a R Shiny application and can be accessed online (http://itime.moffitt.org/), with code available on GitHub (https://github.com/FridleyLab/iTIME). SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2021. Published by Oxford University Press.

Entities:  

Year:  2021        PMID: 34734969      PMCID: PMC8652029          DOI: 10.1093/bioinformatics/btab757

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.931


1 Introduction

Multiplex immunofluorescence (mIF) staining combined with quantitative digital image analysis is a novel and increasingly used technique that allows for the assessment and visualization of distinct immune cell populations in the tumor immune microenvironment (TIME), as well as discrimination between tumor and stroma compartments. mIF has been recently applied to the study of many cancer types, including oropharyngeal squamous cell carcinoma (Tsakiroglou ), gastric cancer (Huang ) and pancreatic cancer (Vayrynen ). An article was also recently published outlining the opportunities and challenges in the analysis of mIF data, including approaches for spatial analysis of mIF (Wilson et al., 2021b). Generally, mIF data has been used to examine the presence and abundance of immune cells in the TIME; however, this aggregate measure assumes uniform spatial patterns of immune cells throughout the tissue sample and overlooks potential spatial heterogeneity of immune marker-positive cell populations. mIF technologies also provide position data for immune cells in the TIME, thus allowing for the assessment of the spatial architecture of the TIME. Therefore, to facilitate the spatial analysis and visualization of mIF data following the preprocessing of the image data, we have developed an R package, spatialTIME and a Shiny application, iTIME. One commonly used technology for mIF data generation is the Vectra 3.0TM/PolarisTM system. Images of immune marker-stained tissues are processed within InForm (Gorris ; Mezheyeuski ; Mori ; Shakya ) followed by analysis with the HALO Image Analysis Platform (Indica Labs, NM). In HALO, a supervised classifier is trained to classify tissue as tumor, stroma and glass (no tissue) regions (Amancio ; Breiman, 2001; Horai ). Cell segmentation and marker quantitation is performed by compartmental examination of fluorescent intensity thresholds, with each immune marker assigned a distinct fluorescence value (Mezheyeuski ; Mori ). The output from this preprocessing of the mIF data is a file for each tissue sample with the locations of all detected cells and information on whether the cell is positive or negative for each of the assayed markers. In addition to the sample specific file, a summary file is also produced with the descriptive summary for each tissue sample, including the number of cells detected, the number of positive cells for each marker, percentage positive for each marker and density estimates. These summary measures can also be computed across all cells and by tumor/stroma if a tumor-associated marker is included in the assayed panel of markers. The file inputs for the spatialTIME and iTIME package follow the formats of the HALO output; however, the required file formats are general in format (i.e. cell location, indicator variable for positivity for each of the markers or prespecified combinations or ‘phenotypes’) with output from other commonly used technologies able to be easily reformatted into this general format for input into the software tools. Future updates to the package will allow for inputs directly from a variety of other technologies used in mIF studies. In spatialTIME, we have implemented a number of spatial measures of clustering including Ripley’s (Ripley, 1976), Besag’s con’s M (Marcon ) and G statistic (nearest neighbor distance) using spatstat R package (Baddeley ). To address a unique challenge presented in mIF data generated on tissue microarrays (TMAs) whereby cores often have areas where measurements are not able to be acquired (i.e. areas with tears, folded tissue or fibrous tissue) we have implemented a permutation approach for estimating the null distribution of no clustering for which the observed clustering statistic is compared (i.e. complete spatial randomness or CSR) (Wilson et al., 2021a). That is, the measure to be use in downstream visualization and statistical association with clinical outcome is the degree of spatial clustering as defined as the observed value of measure—measure under assumption of CSR. Additional information on the various spatial measures can be found in Supplemental Materials.

2 Implementation

spatialTIME is available as an R package from either GitHub (https://github.com/FridleyLab/spatialTIME) or CRAN (CRAN.R-project.org) and made publicly available under the MIT license. The package functionality is built upon the tidyverse principals and several of its packages along with the spatstat package. mIF experiments return two main datasets—a file with the summary-level information for the markers and individual files for each tissue sample with cell-level staining information. To combat potential confusion due to multiple samples per person, functions in spatialTIME use a custom input object called ‘mif.’ This object can be created by calling the create_mif function and requires at a minimum for users to input the summary and clinical datasets. The resulting mif object is the basis for all other functions within the package. spatialTIME provides functions for plotting mIF data for individual tissue samples and calculating univariate and bivariate measures of degree of spatial clustering and co-localization of marker-positive cells. For spatial measures of clustering, we have implemented a number of spatial measures of clustering including Ripley’s (Ripley, 1976), Besag’s (Besag, 1977), Marcon’s M (Marcon ) and G statistic (nearest neighbor distance) using spatstat R package (Baddeley ). To address a unique challenge presented in mIF data generated on tissue microarrays (TMAs) whereby cores often have areas where measurements are not able to be acquired (i.e. areas with tears, folded tissue or fibrous tissue), we have implemented a permutation approach for estimating the null distribution of no clustering for which the observed clustering statistic is compared (i.e. complete spatial randomness or CSR) (Wilson et al., 2021a). That is, the measure to be use in downstream visualization and statistical association with clinical outcome is the degree of spatial clustering as defined as the observed value of measure—measure under assumption of CSR. The clustering functions return data frames containing spatial clustering estimates for a range of neighborhood sizes, denoted by , for each sample. Users also have the option of obtaining either the full empirical distribution under CSR or simply the mean of this empirical null distribution. The choice of the radius (r) to use for estimating the spatial measures (i.e. size of neighborhood) should be based on the scale of clustering of interest (i.e. a small value of r will have clustering assess for small neighborhoods while a larger value of r would determine the level of clustering based on large-sized neighbors). It is recommended that the association analyses of spatial measure with endpoint of interested be conducted using a few different r values (i.e. sensitivity analysis). Future work is planned for conducting the association of the endpoint with the entire spatial curve computed at several r values using functional data analysis approaches. Finally, the package allows for the creation of publication quality plots (one for each spatial sample provided) using ggplot2 which can be directly exported as a pdf. Detailed information on how to use spatialTIME can be found in the R package documentation and the vignette. The iTIME Shiny application is available at http://itime.moffitt.org/ and is a point and click version with much of the functionality found in the spatialTIME package. On the ‘Importing Data’ page, users upload clinical, summary- and cell-level data in csv files and specify the variable to use for the data merge (i.e. variable that links the data across the various files). The ‘Univariate Summary’ page provides a high-level overview of the summary and clinical data files, including plots and hypothesis testing based on a beta-binomial model to compare summary marker data by clinical variable of interest. Users can customize clinical variables, color schemes and plot type for effective data visualization, including a scatter plot, boxplot, violin plot or histogram. The ‘Multivariate Summary’ page provides a heatmap and principal component plot for selected immune cell populations and provides the option to perform a clustering analysis to understand if immune cell composition of the TIME is related to clinical features. The ‘Spatial’ page displays an interactive visual of the imported cell-level data for a sample, with tumor cells displayed as triangles and stromal cells as circles, and a separate color to represent positivity for each distinct immune marker. When hovering over each cell, a text window will display the nearest neighbor distance. A plot of the measures of marker-positive immune cell spatial clustering, such as Ripley’s , Besag’s Marcon’s and nearest neighbor distance distribution G, over various values is also generated and can be calculated specific to each immune marker. These calculations adjust for edge effects with the isotropic or translational edge () and reduced sample or Hanisch () correction methods (Baddeley ), with a simulation-based envelope presented around the theoretical estimate under CSR. Currently, the iTIME application provides the theoretical distribution under the CSR assumption; however, future updates aim to also include the empirical distribution. To demonstrate the functionality of spatialTIME and iTIME, we examined two prostate tumor samples with differing TIME architecture, as presented in Figure  1.
Fig. 1.

mIF data generated from a TMA study of prostate cancer. (A) mIF image data for CD8 Positive cells measured on a prostate cancer tumor sample. (B) Plots showing CD8+ cells and locations, with illustrating a tissue sample with ‘holes’ or regions of unmeasured cells. (C) Empirical distribution of the estimate of Ripley’s K under CSR (N permutations = 500), with lines indicating the permuted (mean) and theoretical estimates under CSR. (D) Distribution of observed K, permuted estimate of K under CSR, and difference in these measurements (i.e. ‘degree of spatial clustering’) for the 10 core tissue samples included in spatialTIME. The degree of spatial clustering can be used in downstream association with the phenotype of interest. (E) Representation of cell locations, using the iTIME application with information in the box showing the nearest cell for each marker type. (F) Violin plot of the percent of CD8+ cells by a clinical endpoint (group A versus B) using the iTIME application

mIF data generated from a TMA study of prostate cancer. (A) mIF image data for CD8 Positive cells measured on a prostate cancer tumor sample. (B) Plots showing CD8+ cells and locations, with illustrating a tissue sample with ‘holes’ or regions of unmeasured cells. (C) Empirical distribution of the estimate of Ripley’s K under CSR (N permutations = 500), with lines indicating the permuted (mean) and theoretical estimates under CSR. (D) Distribution of observed K, permuted estimate of K under CSR, and difference in these measurements (i.e. ‘degree of spatial clustering’) for the 10 core tissue samples included in spatialTIME. The degree of spatial clustering can be used in downstream association with the phenotype of interest. (E) Representation of cell locations, using the iTIME application with information in the box showing the nearest cell for each marker type. (F) Violin plot of the percent of CD8+ cells by a clinical endpoint (group A versus B) using the iTIME application

3 Conclusions

The novel technique of mIF provides a cost-effective approach for studying the TIME in a large number of tissue samples simultaneously. This technology also provides the ability to measure not only the abundance but also the spatial location of multiple cell types within a tissue sample. The development of methods and approaches, such as those implemented in spatialTIME, that can deeply characterize both marker abundance and the spatial heterogeneity of marker-positive cells in the TIME can inform treatment approaches (e.g. immunotherapy) by providing a clearer picture of therapeutic targets in tumor tissue. Financial Support: This work has been supported in part by the Biostatistics and Bioinformatics Shared Resource at the H. Lee Moffitt Cancer Center & Research Institute, an NCI designated Comprehensive Cancer Center (P30-CA076292). Conflict of Interest: Authors have no conflict of interest to declare. Click here for additional data file.
  10 in total

1.  Eight-Color Multiplex Immunohistochemistry for Simultaneous Detection of Multiple Immune Checkpoint Molecules within the Tumor Microenvironment.

Authors:  Mark A J Gorris; Altuna Halilovic; Katrin Rabold; Anne van Duffelen; Iresha N Wickramasinghe; Dagmar Verweij; Inge M N Wortel; Johannes C Textor; I Jolanda M de Vries; Carl G Figdor
Journal:  J Immunol       Date:  2017-11-15       Impact factor: 5.422

2.  Multispectral imaging for quantitative and compartment-specific immune infiltrates reveals distinct immune profiles that classify lung cancer patients.

Authors:  Artur Mezheyeuski; Christian Holst Bergsland; Max Backman; Dijana Djureinovic; Tobias Sjöblom; Jarle Bruun; Patrick Micke
Journal:  J Pathol       Date:  2018-02-05       Impact factor: 7.996

3.  Composition, Spatial Characteristics, and Prognostic Significance of Myeloid Cell Infiltration in Pancreatic Cancer.

Authors:  Jonathan A Nowak; Brian M Wolpin; Sara A Väyrynen; Jinming Zhang; Chen Yuan; Juha P Väyrynen; Andressa Dias Costa; Hannah Williams; Vicente Morales-Oyarvide; Mai Chan Lau; Douglas A Rubinson; Richard F Dunne; Margaret M Kozak; Wenjia Wang; Diana Agostini-Vulaj; Michael G Drage; Lauren Brais; Emma Reilly; Osama Rahma; Thomas Clancy; Jiping Wang; David C Linehan; Andrew J Aguirre; Charles S Fuchs; Lisa M Coussens; Daniel T Chang; Albert C Koong; Aram F Hezel; Shuji Ogino
Journal:  Clin Cancer Res       Date:  2020-12-01       Impact factor: 12.531

4.  A systematic comparison of supervised classifiers.

Authors:  Diego Raphael Amancio; Cesar Henrique Comin; Dalcimar Casanova; Gonzalo Travieso; Odemir Martinez Bruno; Francisco Aparecido Rodrigues; Luciano da Fontoura Costa
Journal:  PLoS One       Date:  2014-04-24       Impact factor: 3.240

5.  Macrophage spatial heterogeneity in gastric cancer defined by multiplex immunohistochemistry.

Authors:  Yu-Kuan Huang; Minyu Wang; Yu Sun; Natasha Di Costanzo; Catherine Mitchell; Adrian Achuthan; John A Hamilton; Rita A Busuttil; Alex Boussioutas
Journal:  Nat Commun       Date:  2019-09-02       Impact factor: 14.919

6.  Spatial proximity between T and PD-L1 expressing cells as a prognostic biomarker for oropharyngeal squamous cell carcinoma.

Authors:  Anna Maria Tsakiroglou; Martin Fergie; Ken Oguejiofor; Kim Linton; David Thomson; Peter L Stern; Susan Astley; Richard Byers; Catharine M L West
Journal:  Br J Cancer       Date:  2019-12-06       Impact factor: 7.640

7.  Characterizing the Tumor Immune Microenvironment with Tyramide-Based Multiplex Immunofluorescence.

Authors:  Hidetoshi Mori; Jennifer Bolen; Louis Schuetter; Pierre Massion; Clifford C Hoyt; Scott VandenBerg; Laura Esserman; Alexander D Borowsky; Michael J Campbell
Journal:  J Mammary Gland Biol Neoplasia       Date:  2021-02-15       Impact factor: 2.673

Review 8.  Challenges and Opportunities in the Statistical Analysis of Multiplex Immunofluorescence Data.

Authors:  Christopher M Wilson; Oscar E Ospina; Mary K Townsend; Jonathan Nguyen; Carlos Moran Segura; Joellen M Schildkraut; Shelley S Tworoger; Lauren C Peres; Brooke L Fridley
Journal:  Cancers (Basel)       Date:  2021-06-17       Impact factor: 6.575

9.  Quantification of histopathological findings using a novel image analysis platform.

Authors:  Yasushi Horai; Mao Mizukawa; Hironobu Nishina; Satomi Nishikawa; Yuko Ono; Kana Takemoto; Nobuyuki Baba
Journal:  J Toxicol Pathol       Date:  2019-08-11       Impact factor: 1.628

  10 in total
  3 in total

1.  SPF: A spatial and functional data analytic approach to cell imaging data.

Authors:  Thao Vu; Julia Wrobel; Benjamin G Bitler; Erin L Schenk; Kimberly R Jordan; Debashis Ghosh
Journal:  PLoS Comput Biol       Date:  2022-06-15       Impact factor: 4.779

2.  Tumor immune cell clustering and its association with survival in African American women with ovarian cancer.

Authors:  Christopher Wilson; Alex C Soupir; Ram Thapa; Jordan Creed; Jonathan Nguyen; Carlos Moran Segura; Travis Gerke; Joellen M Schildkraut; Lauren C Peres; Brooke L Fridley
Journal:  PLoS Comput Biol       Date:  2022-03-02       Impact factor: 4.475

3.  mxnorm: An R Package to Normalize Multiplexed Imaging Data.

Authors:  Coleman Harris; Julia Wrobel; Simon Vandekar
Journal:  J Open Source Softw       Date:  2022-03-30
  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.