| Literature DB >> 30152811 |
Rossella Aversa1, Mohammad Hadi Modarres2, Stefano Cozzini1,3, Regina Ciancio4, Alberto Chiusole3.
Abstract
In this paper, we present the first publicly available human-annotated dataset of images obtained by the Scanning Electron Microscopy (SEM). A total of roughly 26,000 SEM images at the nanoscale are classified into 10 categories to form 4 labeled training sets, suited for image recognition tasks. The selected categories span the range of 0D objects such as particles, 1D nanowires and fibres, 2D films and coated surfaces as well as patterned surfaces, and 3D structures such as microelectromechanical system (MEMS) devices and pillars. Additional categories such as tips and biological are also included to expand the spectrum of possible images. A preliminary degree of hierarchy is introduced, by creating a subtree structure for the categories and populating them with the available images, wherever possible.Entities:
Year: 2018 PMID: 30152811 PMCID: PMC6111892 DOI: 10.1038/sdata.2018.172
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1Schematic view of nanostructure classification based on dimensionality from 0D to 3D.
(a) Particles (b) Clusters (c) Nanowires (d) Fibres (e) Films (f) Patterned surfaces (g) Pillars (h) MEMS devices.
Figure 2Representative images for each of the categories chosen for the SEM dataset.
(a) Tips (b) Particles (c) Patterned surfaces (d) MEMS devices and electrodes (e) Nanowires (f) Porous sponge (g) Biological (h) Powder (i) Films and coated surfaces (j) Fibres. The dimensionality of nanoscience objects provided the basis for the choice. Other categories, such as Biological and Tips, were added, as they were common images found in the SEM database.
The original SEM annotated dataset.
| The breakdown of the number of images in each category, and the total number of images composing the dataset are reported. | |
|---|---|
| Porous Sponge | 171 |
| Patterned surface | 3,310 |
| Particles | 3,412 |
| Films and Coated Surface | 308 |
| Powder | 895 |
| Tips | 1,561 |
| Nanowires | 3,656 |
| Biological | 953 |
| MEMS devices and electrodes | 4,158 |
| Fibres | 153 |
| TOTAL | 18,577 |
Preliminary hierarchical structure established.
| Columns 1, 2, and 3 indicate the root of the subtree, the first, and the second level of nodes, respectively. The last column reports the number of images in the last node. | |||
|---|---|---|---|
| Tips | Zoom out | 40 | |
| Tips | Zoom in | 40 | |
| Tips | Tip on cantilever | 40 | |
| Nanowires | Entangled nanowires | 40 | |
| Nanowires | Few | 35 | |
| Nanowires | Individual | 30 | |
| Nanowires | Forest | Parallel aligned | 30 |
| Nanowires | Forest | Crust | 25 |
| Fibres | 40 | ||
| Biological | 53 | ||
| MEMS devices electrodes | Electrode | 30 | |
| MEMS devices electrodes | Close up line | 30 | |
| MEMS devices electrodes | 3d edges | 18 | |
| MEMS devices electrodes | Waveguide | 8 | |
| MEMS devices electrodes | Microfluidic | 10 | |
| Patterned surface | Line array | 40 | |
| Patterned surface | Square array | 20 | |
| Patterned surface | Circle array | 60 | |
| Patterned surface | 3d edge patterned surface | 28 | |
| Patterned surface | 3d line | 50 | |
| Patterned surface | Triangle array | 8 | |
| Patterned surface | Ring spiral | 5 | |
| Patterned surface | Zigzag | 5 | |
| Patterned surface | Pillars | 3d array | 20 |
| Patterned surface | Pillars | Zoom in pillar | 10 |
| Patterned surface | Pillars | Individual pillar | 15 |
| Patterned surface | Pillars | Triangular pillar | 10 |
| Particles | Dispersed | 40 | |
| Particles | Small clusters | 25 | |
| Particles | Individual particle | 15 | |
| Particles | Other shape | 20 | |
| Powder | Zoom out | 40 | |
| Powder | Zoom in | 30 | |
| Films and coated surfaces | Smooth film | 60 | |
| Films and coated surfaces | Particle film | 35 | |
| Films and coated surfaces | Other film | 40 | |
| Porous Sponge | 13 |
Summary of the datasets presented in this paper.
| For each dataset, named in column 1, the total number of categories (or subcategories), the total number of images, and a short description are reported in columns 2, 3, and 4, respectively. | |||
|---|---|---|---|
| SEM | 10 | 18,577 | Dataset used in ref. |
| Hierarchical | 37 | 1,038 | Hierarchical, preliminary subset |
| Majority | 10 | 25,537 | Augmented, validated applying majority criterion |
| 100% | 10 | 25,430 | Augmented, validated applying 100% criterion |
Validation results, according to both majority and 100% criteria.
| human [%] | machine [%] | human [%] | machine [%] | |
|---|---|---|---|---|
| The numbers of validated images are expressed in percentage with respect to the category size (20 images for each of them). The total numbers of images are reported in the last two rows. | ||||
| Porous Sponge | 95 | 50 | 75 | 35 |
| Patterned surface | 55 | 80 | 35 | 65 |
| Particles | 85 | 60 | 30 | 15 |
| Films and Coated Surface | 85 | 75 | 40 | 35 |
| Powder | 65 | 80 | 35 | 15 |
| Tips | 100 | 55 | 95 | 45 |
| Nanowires | 75 | 95 | 60 | 85 |
| Biological | 100 | 80 | 75 | 65 |
| MEMS devices and electrodes | 95 | 80 | 75 | 65 |
| Fibres | 85 | 60 | 55 | 30 |
| PARTIAL TOTAL | 168 | 143 | 115 | 89 |
| TOTAL PER CRITERION | 311 | 204 | ||