| Literature DB >> 26955503 |
Pekka Ruusuvuori1, Mira Valkonen2, Matti Nykter2, Tapio Visakorpi3, Leena Latonen3.
Abstract
This paper describes work presented at the Nordic Symposium on Digital Pathology 2015, in Linköping, Sweden. Prostatic intraepithelial neoplasia (PIN) represents premalignant tissue involving epithelial growth confined in the lumen of prostatic acini. In the attempts to understand oncogenesis in the human prostate, early neoplastic changes can be modeled in the mouse with genetic manipulation of certain tumor suppressor genes or oncogenes. As with many early pathological changes, the PIN lesions in the mouse prostate are macroscopically small, but microscopically spanning areas often larger than single high magnification focus fields in microscopy. This poses a challenge to utilize full potential of the data acquired in histological specimens. We use whole prostates fixed in molecular fixative PAXgene™, embedded in paraffin, sectioned through and stained with H&E. To visualize and analyze the microscopic information spanning whole mouse PIN (mPIN) lesions, we utilize automated whole slide scanning and stacked sections through the tissue. The region of interests is masked, and the masked areas are processed using a cascade of automated image analysis steps. The images are normalized in color space, after which exclusion of secretion areas and feature extraction is performed. Machine learning is utilized to build a model of early PIN lesions for determining the probability for histological changes based on the calculated features. We performed a feature-based analysis to mPIN lesions. First, a quantitative representation of over 100 features was built, including several features representing pathological changes in PIN, especially describing the spatial growth pattern of lesions in the prostate tissue. Furthermore, we built a classification model, which is able to align PIN lesions corresponding to grading by visual inspection to more advanced and mild lesions. The classifier allowed both determining the probability of early histological changes for uncategorized tissue samples and interpretation of the model parameters. Here, we develop quantitative image analysis pipeline to describe morphological changes in histological images. Even subtle changes in mPIN lesion characteristics can be described with feature analysis and machine learning. Constructing and using multidimensional feature data to represent histological changes enables richer analysis and interpretation of early pathological lesions.Entities:
Keywords: Histopathological image analysis; machine learning; prostatic intraepithelial neoplasia
Year: 2016 PMID: 26955503 PMCID: PMC4763506 DOI: 10.4103/2153-3539.175378
Source DB: PubMed Journal: J Pathol Inform
Feature categories
Figure 1Processing of mouse prostate material for feature-based analysis. Whole mouse prostates were sectioned through with 5 μm sections. Sections were H&E stained, and the whole slide scanned to obtain high-resolution images. H&E-stained histological image every 50 μm apart was processed and used to mark mouse prostatic intraepithelial neoplasia lesions as a region of interests and subjected to image processing. All region of interests in a particular lesion obtained from a stack of histological images in z-direction were included and subjected to feature analysis
Figure 2Image processing steps. An example of a PIN lesion area in a H&E-stained image (a) and its selection as a region of interest (b). Exclusion of secretion-filled (c) and empty (d) areas is performed to obtain effective tissue area within region of interest (e) excluded areas shown in turquoise
Figure 3Feature analysis of mouse prostatic intraepithelial neoplasia lesions. (a) Separation of individual lesions (red dots, n = 72) according to principal component analysis and the relative weights of features (blue lines). The values for each lesion are calculated across all region of interests of that particular lesion. (b) Scatter plot of mouse prostatic intraepithelial neoplasia lesions with average solidity and sum of the area across lesion (i.e., the sum of areas of all lesion region of interests in all sections in z). (c) Examples of the type of lesion region of interest masks representing the morphological groups separated by lesion solidity and size (e.g., lesion area or volume). Colors in (b) indicate the same lesions of which an individual region of interest mask is shown in (c). Region masks in (c) are in the same scale
Figure 4Mouse prostatic intraepithelial neoplasia lesion classifier model based on features obtained by machine learning. (a) Lesions sorted according to positive class conditional probabilities obtained during the 1000 repetitions of the classifier design by random hold-out of a training sample. Training samples are marked with colors (red, advanced phenotype; green, mild phenotype). A probability threshold of 0.5 is marked with a dashed line. (b) Examples of lesion region of interests representing the two phenotypes in the classification model. Examples of both training samples and classified samples are shown. It must be noted that the lesions are not in the same scale. (c) The histogram bins showing the number of times the features have been selected in the classifier model