| Literature DB >> 34926737 |
Mei-Ling Huang1, Yi-Xuan Xu1, Yu-Chieh Liao1.
Abstract
Tree blossoms have been widely used on the prevention and treatment of a variety of diseases in traditional Chinese medicine for thousand years [1,2]. The growth of flowers is not only for their ornamental value, but also for nutritional, medicinal, cooking, cosmetic and aromatic properties. They are a rich source of many compounds, which play an important role in various metabolic processes of the human body [3]. Edible flowers can promote the global demand for more attractive and delicious food, and can improve the nutritional content of gourmet food [4]. Flowers are beneficial for anti-anxiety, anti-cancer, anti-inflammatory, antioxidant, diuretic and immune-modulator, etc. It is very important to identify edible flowers correctly, because only a few are edible [5]. The shapes or colors of different flowers may be very similar. Visual evaluation is one of the classification methods, but it is error-prone and time-consuming [6]. Flowers are divided into flowers from herbaceous plants (flower) and flower trees (blossom). Now there is a public herbaceous flower dataset [7], but lack of dataset for Chinese medicinal blossoms. This article presents and establishes the dataset for twelve most commonly and economically valuable blossoms used in traditional Chinese medicine. The dataset provide a collection of blossom images on traditional Chinese herbs help Chinese pharmacist to classify the categories of Chinese herbs. In addition, the dataset can serve as a resource for researchers who use different algorithms of machine learning or deep learning for image segmentation and image classification.Entities:
Keywords: Chinese medicinal blossom; Classification; Data augmentation; Deep learning
Year: 2021 PMID: 34926737 PMCID: PMC8648792 DOI: 10.1016/j.dib.2021.107655
Source DB: PubMed Journal: Data Brief ISSN: 2352-3409
Fig. 1Examples of Chinese medicine blossom categories.
Fig. 2Data processing steps.
Fig. 3Example of data augmentation.
Number of images before and after data augmentation.
| Original | After Data Augmentation | ||||||||
|---|---|---|---|---|---|---|---|---|---|
| ID | Name | Train | Val | Test | Total | Train | Val | Test | Total |
| 1 | Syringa | 153 | 19 | 19 | 191 | 1224 | 152 | 19 | 1395 |
| 2 | Bombax malabarica | 138 | 17 | 17 | 172 | 1104 | 136 | 17 | 1257 |
| 3 | Michelia alba | 98 | 12 | 12 | 122 | 784 | 96 | 12 | 892 |
| 4 | Armeniaca mume | 188 | 24 | 24 | 236 | 1504 | 192 | 24 | 1720 |
| 5 | Albizia julibrissin | 178 | 22 | 22 | 222 | 1424 | 176 | 22 | 1622 |
| 6 | Pinus massoniana | 70 | 9 | 8 | 87 | 560 | 72 | 8 | 640 |
| 7 | Eriobotrya japonica | 92 | 11 | 12 | 115 | 736 | 88 | 12 | 836 |
| 8 | Prunus persica | 171 | 21 | 21 | 213 | 1368 | 168 | 21 | 1557 |
| 9 | Firmiana simplex | 72 | 9 | 8 | 89 | 576 | 72 | 8 | 656 |
| 10 | Ficus religiosa | 60 | 7 | 8 | 75 | 480 | 56 | 8 | 544 |
| 11 | Styphnolobium japonicum | 101 | 13 | 12 | 126 | 808 | 104 | 12 | 924 |
| 12 | Areca catechu | 55 | 6 | 7 | 68 | 440 | 48 | 7 | 495 |
| Total | 1376 | 170 | 170 | 1716 | 11008 | 1360 | 170 | 12538 | |
Fig. 4Architecture diagram of dataset.
Before data augmentation.
| Accuracy | Precision | Recall | F1-score | Time | |
|---|---|---|---|---|---|
| AlexNet | 93.57% | 92.98% | 94.52% | 93.62% | 00:01:17 |
| InceptionV3 | 89.18% | 88.21% | 90.06% | 88.79% | 00:08:14 |
After data augmentation.
| Accuracy | Precision | Recall | F1-score | Time | |
|---|---|---|---|---|---|
| AlexNet | 98.53% | 98.41% | 98.50% | 98.45% | 00:09:26 |
| InceptionV3 | 98.61% | 98.61% | 98.55% | 98.58% | 01:05:51 |
Fig. 5Training curves.
| Subject | Agricultural Sciences, Computer Science |
| Specific subject area | Image processing, Image identification, Image classification, computer vision |
| Type of data | Images |
| How data were acquired | Blossom images were captured by Google search. |
| Data format | Raw digital image (JPG format) |
| Parameters for data collection | Both close-up photography and telephoto images for each category were collected. Blurred images were deleted. |
| Description of data collection | Images of Chinese medicinal blossoms were collected and classified into twelve categories. |
| Data source location | Institution: National Chin-Yi University of Technology |
| Data accessibility | Repository name: Chinese medicinal blossom-dataset |