| Literature DB >> 29876376 |
Fereshteh S Bashiri1,2, Eric LaRose2, Peggy Peissig2, Ahmad P Tafti2.
Abstract
A fully-labeled image dataset provides a unique resource for reproducible research inquiries and data analyses in several computational fields, such as computer vision, machine learning and deep learning machine intelligence. With the present contribution, a large-scale fully-labeled image dataset is provided, and made publicly and freely available to the research community. The current dataset entitled MCIndoor20000 includes more than 20,000 digital images from three different indoor object categories, including doors, stairs, and hospital signs. To make a comprehensive dataset addressing current challenges that exist in indoor objects modeling, we cover a multiple set of variations in images, such as rotation, intra-class variation plus various noise models. The current dataset is freely and publicly available at https://github.com/bircatmcri/MCIndoor20000.Entities:
Keywords: Deep learning; Image classification; Image dataset; Indoor objects; Large-scale dataset; Supervised learning
Year: 2018 PMID: 29876376 PMCID: PMC5988436 DOI: 10.1016/j.dib.2017.12.047
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Sample images from the categories covered by the MCIndoor20000.
Fig. 2The MCIndoor20000 dataset carries several objects model variations.
An experimental validation performed on the dataset. The MCIndoor20000 dataset includes the original images along with all variations discussed in the Data section. The MCIndoor2000 (Original Images) includes only the original images without any variations. “ACC” stands for accuracy, while “TPR” and “PPV” stands for true positive rate and positive predictive value, respectively.
| MCIndoor20000 | 20% Train, 80% Test | 99.8% | 99.8% | 99.8% |
| MCIndoor20000 | 10% Train, 90% Test | 99.6% | 99.6% | 99.6% |
| MCIndoor2000 (Original Images) | 20% Train, 80% Test | 90.4% | 89.8% | 92.7% |
| MCIndoor2000 (Original Images) | 10% Train, 90% Test | 64.4% | 63.3% | 75.0% |
| Subject area | Machine learning, computer vision, deep learning, machine intelligence. |
| More specific subject area | Object classification, object detection, object recognition. |
| Type of data | 2D-RGB digital images (.JPEG, .PNG). |
| How data was acquired | Original images were collected in Marshfield Clinic by capturing photos from remarkable landmark objects, including clinic signs, doors and stairs. Images are manually cropped to eliminate the effect of surrounding objects in the learning process. To cover multiple variations in the objects model, we systematically rotated and augmented diverse noises to the original images. |
| Data format | Digital images, in raw and processed formats. |
| Data source location | Marshfield Clinic, Marshfield, Wisconsin, USA. |
| Data accessibility | The dataset is accessible at |