| Literature DB >> 32021884 |
Tabassum Yesmin Rahman1, Lipi B Mahanta2, Anup K Das3, Jagannath D Sarma4.
Abstract
The repository is composed of 1224 images divided into two sets of images with two different resolutions. First set consists of 89 histopathological images with the normal epithelium of the oral cavity and 439 images of Oral Squamous Cell Carcinoma (OSCC) in 100x magnification. The second set consists of 201 images with the normal epithelium of the oral cavity and 495 histopathological images of OSCC in 400x magnification. The images were captured using a Leica ICC50 HD microscope from Hematoxyline and Eosin (H&E) stained tissue slides collected, prepared and catalogued by medical experts from 230 patients. A subset of 269 images from the second data set was used to detect OSCC based on textural features [1]. Histopathology plays a very important role in diagnosing a disease. It is the investigation of biological tissues to detect the presence of diseased cells in microscopic detail. It usually involves a biopsy. Till date biopsy is the gold-standard test to diagnose cancer. The biopsy slides are examined based on various cytological criteria under a microscope. Therefore, there is a high possibility of not retaining uniformity and ensuring reproducibility in outcomes [2, 3]. Computational diagnostic tools, on the other hand, facilitate objective judgments by making the use of the quantitative measure. This dataset can be utilized in establishing automated diagnostic tool using Artificial Intelligence approaches.Entities:
Keywords: 100x; 400x; Biopsy slides; Histopathology; OSCC; Oral cancer
Year: 2020 PMID: 32021884 PMCID: PMC6994517 DOI: 10.1016/j.dib.2020.105114
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Some images from the first set with (a) normal cells (b) malignant cells.
Image details in terms of type, quantity and application scope.
| Type | Category | Quantity | Application Scope |
|---|---|---|---|
| 100x | Normal | 89 | 1. In architectural level or tissue level analysis |
| 2. In feature extraction, segmentation and classification purposes | |||
| 3. For establishing an automated decision support system | |||
| 400x | Normal | 201 | 1. In both cell level (for both cell and nucleus) and tissue level analysis |
| 2. In feature extraction, segmentation and classification | |||
| 3. For automated decision support system set up | |||
| Total images | 1224 |
Fig. 2Some images from the second set with (a) normal cells (b) malignant cells.
Specifications Table
| Subject | Computer Science, Computer Vision and Pattern Recognition |
| Specific subject area | Medical Image Processing, Oral Biopsy Images, Cell segmentation, Cell classification |
| Type of data | Images |
| How data were acquired | Images were captured using a Leica DM 750 microscope with camera model ICC50 HD, in 100x (10x objective lens × 10x eyepiece) and 400x (40x objective lens × 10x eyepiece) magnifications (size 2048× 1536 pixels). |
| Data format | Raw |
| Parameters for data collection | Images were captured in 100x (10x objective lens × 10x eyepiece) and 400x (40x objective lens × 10x eyepiece) magnifications. The size of the images is 2048 × 1536 pixels. |
| Description of data collection | Biopsy slides were collected from two reputed healthcare service institutions, Ayursundra Healthcare Pvt. Ltd and Dr B. Borooah Cancer Institute from 230 patients recommended for Oral Biopsy test. The collection period was from October 2016 to November 2017. Images were captured using a Leica DM 750 microscope, model ICC50 HD connected to the camera and a high-configured computer and software. Images were captured in 100× and 400× magnifications. |
| Data source location | 1. Ayursundra Healthcare Pvt. Ltd, Guwahati, Assam, India |
| Data accessibility | Rahman, Tabassum Yesmin (2019), “A histopathological image repository of the normal epithelium of Oral Cavity and Oral Squamous Cell Carcinoma”, Mendeley Data, v1. |
| Related research article | Rahman T. Y., Mahanta L. B., Chakraborty C., Das A. K., Sarma J. D., “ |
This is the first dataset containing histopathological images of the normal epithelium of the oral cavity and OSCC. These data can be used as a gold standard for histopathological analysis of OSCC. Researchers can use these data for extracting cytological as well as tissue level features, in image segmentation and also for classification purposes, and aid in establishing an automated diagnostic tool using Artificial Intelligence approaches. Classification applying deep learning or semantic segmentation tasks can also be implemented by adding/augmenting images in the dataset. This dataset can be used for a comparative evaluation of one's experimental findings in future when more dataset of such kind is available. |