| Literature DB >> 29136101 |
Jun Cheng1, Xiaokui Mo2, Xusheng Wang3, Anil Parwani4, Qianjin Feng1, Kun Huang3,5,6.
Abstract
Motivation: As a highly heterogeneous disease, the progression of tumor is not only achieved by unlimited growth of the tumor cells, but also supported, stimulated, and nurtured by the microenvironment around it. However, traditional qualitative and/or semi-quantitative parameters obtained by pathologist's visual examination have very limited capability to capture this interaction between tumor and its microenvironment. With the advent of digital pathology, computerized image analysis may provide a better tumor characterization and give new insights into this problem.Entities:
Mesh:
Year: 2018 PMID: 29136101 PMCID: PMC7263397 DOI: 10.1093/bioinformatics/btx723
Source DB: PubMed Journal: Bioinformatics ISSN: 1367-4803 Impact factor: 6.937
Patient demographics and tumor characteristics
| Characteristics | Summary |
|---|---|
| Patient no. | 190 |
| Age (year) | |
| Median | 60.5 |
| Range | 28–85 |
| Gender | |
| Female | 51 |
| Male | 139 |
| Follow-up (month) | |
| Median | 16.3 |
| Range | 1–185.3 |
| Number of Death | 27 |
| Subtype | |
| Type 1 | 46 |
| Type 2 | 60 |
| Not available | 84 |
| TNM stage | |
| I | 110 |
| II | 10 |
| III | 39 |
| IV | 12 |
| Not available | 19 |
Fig. 1.Overview of our workflow. (A) Learning nucleus patterns in an unsupervised manner. (B) Generating bag of edge histogram features and identifying survival-related edge patterns (Color version of this figure is available atBioinformatics online.)
Fig. 2.Illustration of the three main steps involved in our feature extraction workflow. (A) Nucleus segmentation. (B) Nucleus pattern learning using stacked sparse autoencoder to learn high-level features followed by clustering. Nucleus patterns are indicated by different colors. There are eight nucleus patterns. (C) Delaunay triangle edge patterns showed in different colors. Edge patterns are defined in terms of their end nodes. There are 36 edge patterns since we have eight nucleus patterns. The H&E image is converted to a grayscale image to highlight colors (Color version of this figure is available atBioinformatics online.)
Univariate survival analysis results using log-rank test
| Feature | |
|---|---|
| TNM stage (I versus II, III) | 0.073 |
| Subtype (type 1 versus type 2) | 0.009 |
| Skewness of length of major axis | 0.044 |
| Kurtosis of length of minor axis | 0.034 |
| Edge(14, 58) | 0.005 |
| Edge(58, 62) | 0.007 |
| Edge(16, 56) | 0.008 |
| Edge(21, 58) | 0.009 |
| Edge(15, 23) | 0.010 |
Note: For morphological and intensity features only the significant features are listed, and for the proposed BOEH features only the top five features with smallestP values are listed. The number of nucleus patterns is set to 64. Edge (14, 58) means the edge type with the 14th and 58th nucleus patterns as its end nodes, and the other pairs are listed in the same fashion.
Fig. 3.The proposed BOEH features provide better prognosis prediction than clinical variables. (A–C) Kaplan–Meier curves stratified by tumor stage, tumor subtype, and predicted risk index of lasso-Cox model built on BOEH features, respectively. (D) ROC curves that predict the binary outcome of 5-year survival using predicted risk index of lasso-Cox model built on BOEH features, tumor stage, and tumor subtype, respectively. For extracting BOEH features, the number of nucleus patterns is set to 64 (Color version of this figure is available atBioinformatics online.)
Fig. 4.Examples of the learned nucleus patterns forming edge types that are strongly associated with survival. The number of nucleus clusters is set to 64. The number in the upper-left corner of each image is the cluster index. Each image consists of 10 × 10 nucleus patches from the same cluster (Color version of this figure is available atBioinformatics online.)