| Literature DB >> 23959844 |
Sonal Kothari1, John H Phan, Todd H Stokes, May D Wang.
Abstract
OBJECTIVES: With the objective of bringing clinical decision support systems to reality, this article reviews histopathological whole-slide imaging informatics methods, associated challenges, and future research opportunities. TARGET AUDIENCE: This review targets pathologists and informaticians who have a limited understanding of the key aspects of whole-slide image (WSI) analysis and/or a limited knowledge of state-of-the-art technologies and analysis methods. SCOPE: First, we discuss the importance of imaging informatics in pathology and highlight the challenges posed by histopathological WSI. Next, we provide a thorough review of current methods for: quality control of histopathological images; feature extraction that captures image properties at the pixel, object, and semantic levels; predictive modeling that utilizes image features for diagnostic or prognostic applications; and data and information visualization that explores WSI for de novo discovery. In addition, we highlight future research directions and discuss the impact of large public repositories of histopathological data, such as the Cancer Genome Atlas, on the field of pathology informatics. Following the review, we present a case study to illustrate a clinical decision support system that begins with quality control and ends with predictive modeling for several cancer endpoints. Currently, state-of-the-art software tools only provide limited image processing capabilities instead of complete data analysis for clinical decision-making. We aim to inspire researchers to conduct more research in pathology imaging informatics so that clinical decision support can become a reality.Entities:
Keywords: cancer prediction; computer-aided diagnosis; decision support systems; pathology imaging informatics; whole-slide images
Mesh:
Year: 2013 PMID: 23959844 PMCID: PMC3822114 DOI: 10.1136/amiajnl-2012-001540
Source DB: PubMed Journal: J Am Med Inform Assoc ISSN: 1067-5027 Impact factor: 4.497
Figure 1An example clinical decision support system for quantitative analysis of whole-slide images (WSI) of tissue biopsy samples. This system has the following key components: quality control to ensure only high-quality data are processed, image description to convert WSI into quantitative features, prediction modeling to develop quantitative diagnostic models, and exploratory analysis to interpret the image feature space. We include two case studies as examples of predictive modeling and exploratory analysis. ROI, region of interest.
Figure 2Eliminating tissue-fold artifacts and pen marks in a whole-slide image of a NIH Cancer Genome Atlas ovarian serous carcinoma biopsy.
Figure 3Normalization of color batch effects in ovarian samples provided by the NIH Cancer Genome Atlas.
Figure 4Representation of a (A) NIH Cancer Genome Atlas whole-slide image (WSI) of a kidney renal clear cell carcinoma biopsy using various quantitative features extracted from (B) a single image tile. Quantitative features include pixel-level features, ie, (C) color histogram and (D) Gabor filter response; object-level features, ie, (E) segmented shapes and (F) graph-based topology; and semantic-level features, ie, (G) percentage of high-level clinical properties.
Prediction performance for whole-slide image-based binary endpoints
| Class 1 | Class 2 | AUC for outer cross-validation | |||||
|---|---|---|---|---|---|---|---|
| Endpoint | Description | No. of patients | Description | No. of patients | Tissue (incl. non-tumor tiles) | Tumor | p Value |
| Histological grade | Grade 1 or 2 | 204 | Grade 3 or 4 | 239 | 0.66±0.01 | 0.69±0.01 | 0.0000 |
| Metastasis | No spread to other organs | 381 | Spread to other organs | 68 | 0.61±0.01 | 0.64±0.01 | 0.0001 |
| Stage | Stage I or II | 267 | Stage III or IV | 182 | 0.60±0.02 | 0.61±0.03 | 0.0562 |
| Five-year survival | <5 years | 126 | ≥5 years | 101 | 0.54±0.02 | 0.57±0.02 | 0.0151 |
| Lymphnode spread | No spread to nearby lymph nodes | 210 | Spread to nearby lymph nodes | 17 | 0.56±0.06 | 0.54±0.02 | 0.5148 |
AUC, area under the curve.
Statistically overrepresented feature subsets in models based on tumor tiles
| Endpoint | Average feature size | Statistically overrepresented feature subsets (Fishers test, p value=0.05) |
|---|---|---|
| Histological grade | 74 | Nuclear shape (0.013) |
| Metastasis | 28 | Nuclear shape (0.000) |
| Stage | 54 | Nuclear shape (0.013) and Basophilic-object shape (0.002) |
| Five year survival | 37 | Basophilic-region texture (0.000) |
| Lymph node spread | 34 | Nuclear shape (0.000) |
Figure 5Role of region of interest (ROI) selection on the performance of whole-slide image (WSI)-based prediction models. (A) An example WSI. (B) Tiles in the tumor region (ROI) of the WSI highlighted with green boxes. Scatter plots between the prediction performance (area under the curve; AUC) of inner and outer loop of nested cross-validation for (C) models based on features from tissue tiles, including tumor and non-tumor (normal, necrosis, and stroma) regions; and for (D) tiles in the tumor region only.
Summary of key methods in each component of a WSI-based clinical decision support system
| Section | Subsection | Key methods |
|---|---|---|
| Quality control | Image artifacts | Tissue folds, |
| Batch effects | Color normalization, | |
| Image description | Pixel-level features | Color, |
| Object-level features | Shape | |
| Semantic-level features | Bag-of-features | |
| Prediction modeling | ROI selection and tile-based WSI representation | ROI selection: supervised |
| Informative feature selection and reduction | Feature selection: filter, | |
| Classification | Multiple classifiers, | |
| Visualization and exploratory analysis | Unsupervised clustering and high-dimensional feature patterns | Hierarchal clustering, |
| Virtual microscope and spatial patterns | Image compression, |
MDS, ; PCA, ; ROI, region of interest; WSI, whole-slide image.