| Literature DB >> 35660753 |
Christian Marzahl1,2, Jenny Hill3, Jason Stayt3, Dorothee Bienzle4, Lutz Welker5, Frauke Wilm6, Jörn Voigt7, Marc Aubreville8, Andreas Maier6, Robert Klopfleisch9, Katharina Breininger6,10, Christof A Bertram9,11.
Abstract
Pulmonary hemorrhage (P-Hem) occurs among multiple species and can have various causes. Cytology of bronchoalveolar lavage fluid (BALF) using a 5-tier scoring system of alveolar macrophages based on their hemosiderin content is considered the most sensitive diagnostic method. We introduce a novel, fully annotated multi-species P-Hem dataset, which consists of 74 cytology whole slide images (WSIs) with equine, feline and human samples. To create this high-quality and high-quantity dataset, we developed an annotation pipeline combining human expertise with deep learning and data visualisation techniques. We applied a deep learning-based object detection approach trained on 17 expertly annotated equine WSIs, to the remaining 39 equine, 12 human and 7 feline WSIs. The resulting annotations were semi-automatically screened for errors on multiple types of specialised annotation maps and finally reviewed by a trained pathologist. Our dataset contains a total of 297,383 hemosiderophages classified into five grades. It is one of the largest publicly available WSIs datasets with respect to the number of annotations, the scanned area and the number of species covered.Entities:
Mesh:
Substances:
Year: 2022 PMID: 35660753 PMCID: PMC9166691 DOI: 10.1038/s41597-022-01389-0
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 8.501
Fig. 1Overview of the macrophage annotation and validation pipeline: The publicly available RetinaNet object-detection model trained on equine slides[4] is used to perform inference on the unannotated slides, followed by a semi-automatic clustering step which clusters cells by size. Error-prone cells are highlighted and can then be efficiently deleted by a human expert. Afterwards, a human expert screens all WSI to increase the dataset consistency. Finally, a regression-based clustering system is applied to support experts searching for misclassifications of the hemosiderin grade.
Fig. 2Left: Statistics on the density map at the EXACT user interface. Right: Visualisation of a density map which was screened by an expert for mislabelled cells (especially regarding the label class). Subfigure a) displays six manually deleted annotations. Visualisations b) to e) show the border region between two grades, with the first two columns representing the lower grade and the last two the upper grade, which were sometimes corrected by the expert as visualized by the different color of the bounding box.
Overview of the dataset meta-data, including the species, the dataset name, the number of slides, the version of the post-processing refinement step (Inference, Cluster, Screening, DensityMap) and the number of labels per hemosiderin score.
| species | dataset | slides | Version | Total Cells | Score | Count of Cells by Grade | ||||
|---|---|---|---|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | ||||||
| Equine | MELE | 16 | MELE | 77,004 | 102 | 29,017 | 26,810 | 13,178 | 6,577 | 1,422 |
| S | 59,954 | 112 | 19,733 | 21,545 | 11,442 | 5,963 | 1,271 | |||
| D | 58,956 | 109 | 19,246 | 21,595 | 11,829 | 5,552 | 734 | |||
| EALE | 39 | I | 245,397 | 95 | 97,904 | 80,715 | 47,789 | 17,437 | 1,552 | |
| S | 168,333 | 108 | 54,432 | 60,189 | 39,316 | 13,404 | 992 | |||
| D | 164,365 | 101 | 51,797 | 67,798 | 36,339 | 7,810 | 621 | |||
| Human | EALH | 12 | I | 168,411 | 133 | 31,035 | 64,833 | 58,320 | 12,776 | 1,447 |
| C | 128,012 | 133 | 21,532 | 53,704 | 42,553 | 8,932 | 1,291 | |||
| S | 54,580 | 156 | 47,26 | 20,688 | 23,323 | 5,090 | 753 | |||
| D | 53,864 | 156 | 43,84 | 18,357 | 26,563 | 4,433 | 127 | |||
| Feline | EALF | 7 | I | 94,788 | 38 | 58,879 | 35,659 | 122 | 8 | 120 |
| C | 88,848 | 38 | 54,867 | 33,868 | 103 | 5 | 5 | |||
| S | 20,422 | 33 | 13,631 | 6,747 | 41 | 2 | 1 | |||
| D | 20,198 | 45 | 11,124 | 9,039 | 35 | 0 | 0 | |||
| Total | SDATA | 74 | I | 585,600 | 98 | 216,835 | 208,017 | 119,409 | 36,798 | 4,541 |
| S | 303,289 | 113 | 92,522 | 109,169 | 74,122 | 24,459 | 3,017 | |||
| D | 297,383 | 109 | 86,551 | 116,789 | 74,766 | 17,795 | 1,482 | |||
The filenames of the five training and two validation slides (below the double line) per species used for the ablation and inter species cross-validation study.
| File | Species | Count of Cells by Grade | ||||
|---|---|---|---|---|---|---|
| 0 | 1 | 2 | 3 | 4 | ||
| 15.svs | Equine | 2884 | 3124 | 2138 | 733 | 12 |
| 22.svs | Equine | 2147 | 2441 | 2025 | 899 | 69 |
| 30.svs | Equine | 2815 | 2464 | 1523 | 606 | 26 |
| 19.svs | Equine | 1197 | 1012 | 316 | 36 | 0 |
| 02.svs | Equine | 2094 | 2560 | 1291 | 364 | 2 |
| 2707.svs | Human | 1434 | 2954 | 1034 | 122 | 0 |
| 11480.svs | Human | 391 | 1221 | 674 | 20 | 0 |
| 10080.svs | Human | 284 | 3015 | 2375 | 342 | 9 |
| 10052.svs | Human | 120 | 1739 | 3744 | 739 | 9 |
| 10120.svs | Human | 112 | 2225 | 4656 | 1502 | 105 |
| 1.svs | Feline | 1200 | 2287 | 22 | 1 | 0 |
| 6.svs | Feline | 3009 | 895 | 0 | 0 | 0 |
| 14.svs | Feline | 2393 | 495 | 2 | 0 | 0 |
| 13.svs | Feline | 4663 | 810 | 2 | 0 | 0 |
| 2.svs | Feline | 57 | 502 | 10 | 0 | 0 |
| 27.svs | Equine | 1392 | 1359 | 364 | 93 | 1 |
| 17.svs | Equine | 2725 | 2625 | 715 | 166 | 14 |
| 10227.svs | Human | 1265 | 1943 | 324 | 5 | 0 |
| 2702.svs | Human | 963 | 523 | 413 | 67 | 2 |
| 10.svs | Feline | 412 | 371 | 4 | 1 | 0 |
| 12.svs | Feline | 1897 | 1387 | 1 | 0 | 0 |
The slides have been selected and ordered according to their ratio of grade zero and one cells to represent a balanced sub-dataset.
Fig. 3Each of the nine figures show on the left the species source training domain and on the top the species target domain with the obtained mAP. Green bounding boxes represent grade zero hemosiderophages while red show grade one.
Fig. 4Results of the ablation study using our customised RetinaNet object detector on an increasing number of humane, equine and feline training patches of size 1024 × 1024 pixel from one WSI or up to five complete WSIs. The boxes represent the total number of hemosiderophages used for training in combination with the mAP graphs for each species.
| Measurement(s) | Hemosiderin-Laden Macrophage • Hemosiderin-Laden Macrophage |
| Technology Type(s) | machine learning • visual observation method |
| Sample Characteristic - Organism | Homo sapiens • Felinae • Equus caballus |