Takuya Itaki, Yosuke Taira, Naoki Kuwamori, Hitoshi Saito, Minoru Ikehara, Tatsuhiko Hoshino.
Abstract
Microfossils are a powerful tool in earth sciences, and they have been widely used for the determination of geological age and in paleoenvironmental studies. However, the identification of fossil species requires considerable time and labor by experts with extensive knowledge and experience. In this study, we successfully automated the acquisition of microfossil data using an artificial intelligence system that employs a computer-controlled microscope and deep learning methods. The system was used to calculate changes in the relative abundance (%) of Cycladophora davisiana, a siliceous microfossil species (Radiolaria) that is widely used as a stratigraphic tool in studies on Pleistocene sediments in the Southern Ocean. The estimates obtained using this system were consistent with the results obtained by a human expert (within ±3.2%). In terms of efficiency, the developed system was capable of performing the classification tasks approximately three times faster than a human expert performing the same task.
Year: 2020 PMID: 33273507 PMCID: PMC7713231 DOI: 10.1038/s41598-020-77812-6
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1. Schematic diagram of the miCRAD system used for automated image collection and classification of microfossil species.
Figure 2. Subsets of randomly collected training data for five particle categories: C. davisiana [Cdv], C. bicornis [Cbc], all other radiolarians [Rad], diatoms [dtm], and all other particles [oth].
Figure 3. A screenshot of a scanning image showing the five particle categories in red.
Number of images of individual objects used as training data for each category of models Cdv%v2 and Cdv%v6R.
| Category | Cdv%v2 | Cdv%v6R |
|---|---|---|
| C. davisiana [Cdv] | 2992 | 19,007 |
| C. bicornis [Cbc] | 295 | 1,040 |
| Other radiolarians [Rad] | 4410 | 38,874 |
| Diatoms [dtm] | 1639 | 5,411 |
| Other particles [oth] | 922 | 2,006 |
| Total | 10,258 | 66,338 |
Example confidence values obtained for each category from slide code #37. Bold cells indicate the highest value among the five categories. Averaged confidence values were estimated from the results for all 500 objects (the whole dataset is shown in Supplementary Table 1). The numbers of values greater than the confidence thresholds 0.60 and 0.95 are also shown.
| Object No | C. davisiana | C. bicornis | Other rads | Diatoms | Other particles |
|---|---|---|---|---|---|
| 1 | 0.08 | 0.03 | 0.00 | 0.01 | |
| 2 | 0.20 | 0.00 | 0.00 | 0.01 | |
| 3 | 0.00 | 0.00 | 0.00 | 0.00 | |
| 4 | 0.00 | 0.00 | 0.01 | 0.01 | |
| 5 | 0.00 | 0.00 | 0.07 | 0.00 | |
| : | |||||
| 500 | 0.01 | 0.03 | 0.09 | 0.28 | |
| Average | 0.16 | 0.02 | 0.44 | 0.33 | 0.05 |
| # of > 0.60 | 70 | 8 | 204 | 154 | 17 |
| # of > 0.95 | 31 | 0 | 117 | 111 | 3 |
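The thresholding summarized in the last two rows can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes each object receives one confidence value per category, that the object is assigned to its highest-scoring category only when that value exceeds the threshold, and that it is otherwise left unclassified. The function names and the three confidence vectors are hypothetical and are not taken from the dataset.

```python
# Category order as listed in the Figure 2 caption.
CATEGORIES = ["Cdv", "Cbc", "Rad", "dtm", "oth"]


def classify(confidences, threshold):
    """Return the top-scoring category, or None if it does not exceed the threshold."""
    best = max(range(len(confidences)), key=lambda i: confidences[i])
    if confidences[best] > threshold:
        return CATEGORIES[best]
    return None  # treated as an unclassified image


def count_above(rows, threshold):
    """Count classified objects per category and the number left unclassified."""
    counts = {name: 0 for name in CATEGORIES}
    unclassified = 0
    for row in rows:
        label = classify(row, threshold)
        if label is None:
            unclassified += 1
        else:
            counts[label] += 1
    return counts, unclassified


# Hypothetical confidence vectors (illustrative only, not from the paper).
rows = [
    [0.97, 0.01, 0.01, 0.01, 0.00],
    [0.10, 0.05, 0.70, 0.10, 0.05],
    [0.30, 0.20, 0.25, 0.15, 0.10],
]

counts_060, unclassified_060 = count_above(rows, 0.60)
counts_095, unclassified_095 = count_above(rows, 0.95)
print(counts_060, unclassified_060)
print(counts_095, unclassified_095)
```

If each object is counted at most once, the summary rows above imply that 453 of the 500 objects exceed 0.60 (70 + 8 + 204 + 154 + 17) and 262 exceed 0.95 (31 + 0 + 117 + 111 + 3), so raising the threshold leaves far more objects unclassified.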
Classification results for five test samples with model Cdv%v2 at a confidence threshold of 0.60 (a) and for eight test samples with both models Cdv%v2 and Cdv%v6R at a confidence threshold of 0.95 (b). Numbers (#) indicate the counts detected. Percentages (%) show the ratio of unclassified images to the total and the proportion of objects that were classified correctly in each category. The average percentage and standard deviation (S.D.) are also shown.
| Model and confidence threshold | Slide | Core depth (cm) | Total (#) | Unclassified images (%) | Cdv (#) | Cdv (correct%) | Cbc (#) | Cbc (correct%) | Rad (#) | Rad (correct%) | dtm (#) | dtm (correct%) | oth (#) | oth (correct%) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Cdv%v2, threshold 0.60 | 1 | 0.6 | 1228 | 9.6 | 60 | 26.7 | 76 | 44.7 | 627 | 89.8 | 52 | 57.7 | 295 | 96.6 |
| | 13 | 14.5 | 1474 | 7.1 | 66 | 31.8 | 70 | 72.9 | 896 | 93.5 | 68 | 80.9 | 270 | 96.3 |
| | 21 | 23.6 | 2787 | 8.4 | 192 | 60.4 | 93 | 24.7 | 1523 | 79.5 | 311 | 95.5 | 433 | 92.6 |
| | 25 | 28.2 | 4005 | 8.0 | 371 | 77.6 | 78 | 28.2 | 1694 | 82.7 | 656 | 97.7 | 885 | 93.4 |
| | 29 | 32.7 | 4527 | 7.9 | 418 | 78.7 | 74 | 23.0 | 1566 | 83.8 | 1348 | 97 | 763 | 91.5 |
| | Average % | | | 8.2 | | 55.1 | | 38.7 | | 85.9 | | 85.8 | | 94.1 |
| | S.D. | | | 0.8 | | 22.1 | | 18.7 | | 5.1 | | 15.3 | | 2.0 |
| Cdv%v2, threshold 0.95 | 1 | 0.6 | 1228 | 51.1 | 7 | 71.4 | 38 | 81.6 | 332 | 97.3 | 23 | | 200 | |
| | 13 | 14.5 | 1474 | 45.5 | 11 | 90.9 | 42 | 69.0 | 536 | 95.7 | 41 | | 173 | |
| | 21 | 23.6 | 2787 | 48.2 | 46 | 87.0 | 40 | 35.0 | 889 | 85.4 | 231 | | 237 | |
| | 25 | 28.2 | 4005 | 47.9 | 117 | 93.2 | 34 | 44.1 | 858 | 90.7 | 515 | | 562 | |
| | 29 | 32.7 | 4527 | 47.8 | 104 | 98.1 | 34 | 44.1 | 689 | 84.5 | 1078 | | 459 | |
| | 33 | 37.3 | 4387 | 46.6 | 120 | 97.5 | 23 | 0.0 | 521 | 91.7 | 1529 | | 149 | |
| | 37 | 41.8 | 4866 | 44.9 | 207 | 98.6 | 42 | 4.8 | 765 | 81.0 | 1462 | | 167 | |
| | 41 | 46.3 | 3100 | 41.3 | 118 | 99.2 | 12 | 8.3 | 570 | 83.5 | 847 | | 270 | |
| | Average % | | | 46.7 | | 92.0 | | 35.9 | | 88.7 | | | | |
| | S.D. | | | 2.7 | | 8.7 | | 28.2 | | 5.6 | | | | |
| Cdv%v6R, threshold 0.95 | 1 | 0.6 | 2174 | 47.0 | 14 | 78.6 | 34 | 82.4 | 340 | 85.3 | 76 | | 688 | |
| | 13 | 14.5 | 2387 | 44.5 | 26 | 88.5 | 52 | 59.6 | 756 | 90.2 | 80 | | 412 | |
| | 21 | 23.6 | 5581 | 47.9 | 182 | 88.5 | 91 | 48.4 | 1428 | 83.0 | 508 | | 697 | |
| | 25 | 28.2 | 4458 | 48.3 | 253 | 96.1 | 61 | 32.8 | 1070 | 82.2 | 396 | | 527 | |
| | 29 | 32.7 | 6602 | 49.0 | 331 | 96.4 | 89 | 13.5 | 1596 | 62.4 | 901 | | 452 | |
| | 33 | 37.3 | 6030 | 48.8 | 357 | 97.8 | 125 | 3.2 | 1446 | 72.9 | 798 | | 364 | |
| | 37 | 41.8 | 6281 | 47.2 | 464 | 97.0 | 87 | 2.3 | 1466 | 78.9 | 835 | | 466 | |
| | 41 | 46.3 | 4539 | 48.1 | 223 | 96.9 | 42 | 2.4 | 984 | 72.6 | 634 | | 474 | |
| | Average % | | | 47.6 | | 92.4 | | 30.6 | | 78.4 | | | | |
| | S.D. | | | 1.3 | | 6.3 | | 28.5 | | 8.2 | | | | |
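The Average % and S.D. rows can be checked against the per-slide values. The sketch below uses the five correct% values for the first category at threshold 0.60 (26.7, 31.8, 60.4, 77.6, 78.7) and Python's standard `statistics` module. Using the population standard deviation (`pstdev`) is an assumption: the source does not state which formula was applied, and the variable names are illustrative.

```python
from statistics import mean, pstdev, stdev

# Correct% values for the first category, model Cdv%v2, threshold 0.60,
# copied from the five slide rows of the table above.
correct_pct = [26.7, 31.8, 60.4, 77.6, 78.7]

average = mean(correct_pct)
sd_population = pstdev(correct_pct)  # divides by n (assumed to match the table)
sd_sample = stdev(correct_pct)       # divides by n - 1, shown for comparison

print(f"average = {average:.2f}")
print(f"population S.D. = {sd_population:.2f}")
print(f"sample S.D. = {sd_sample:.2f}")
```

With these inputs, the population form comes out close to the reported 22.1, while the sample form is noticeably larger, which suggests (but does not prove) that the table uses the population formula. The small difference between the recomputed average and the reported 55.1 is consistent with the per-slide values having been rounded before tabulation.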
Figure 4. Plots of the C. davisiana% for the manual count vs the model count.
Figure 5. The C. davisiana% curves for the manual count of the upper 46 cm of the core (green) and those estimated from model data to a depth of 216 cm (blue and red). The age model of core DCR-1PC is from Crosta et al.[3] (pink arrows in the lower graph panel). The correlation with results from sites PS2821/ODP 1089 by Cortese and Abelmann[2] is also shown (dashed pink lines). The C. davisiana zones a, b, c…e of Hays et al.[4] are indicated in the upper graph panel and correlated with gray bands.
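The C. davisiana% values plotted in Figures 4 and 5 are relative abundances, and the arithmetic can be sketched from the classification counts. The denominator used here (all radiolarians, i.e. Cdv + Cbc + Rad) is an assumption, since the record above does not spell out the formula, and `cdv_percent` is a hypothetical helper name. The example counts are those of slide 37 for model Cdv%v2 at threshold 0.95.

```python
def cdv_percent(cdv, cbc, rad):
    """Relative abundance (%) of C. davisiana among all radiolarians.

    Assumes the denominator is the sum of the three radiolarian
    categories (Cdv, Cbc, Rad); diatoms and other particles are excluded.
    """
    total_radiolarians = cdv + cbc + rad
    if total_radiolarians == 0:
        raise ValueError("no radiolarians counted")
    return 100.0 * cdv / total_radiolarians


# Counts for slide 37, model Cdv%v2, confidence threshold 0.95.
abundance = cdv_percent(cdv=207, cbc=42, rad=765)
print(f"C. davisiana% = {abundance:.1f}")
```

Under this assumption the model count for slide 37 gives roughly one fifth of the radiolarians as C. davisiana. Any correction for misclassified objects would change the result, so the figure is illustrative rather than a reproduction of the published curve.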