| Literature DB >> 29860392 |
Geert Litjens1, Peter Bandi2, Babak Ehteshami Bejnordi1, Oscar Geessink1, Maschenka Balkenhol1, Peter Bult1, Altuna Halilovic1, Meyke Hermsen1, Rob van de Loo1, Rob Vogels1, Quirine F Manson2, Nikolas Stathonikos2, Alexi Baidoshvili3, Paul van Diest2, Carla Wauters4, Marcory van Dijk5, Jeroen van der Laak1.
Abstract
Background: The presence of lymph node metastases is one of the most important factors in breast cancer prognosis. The most common way to assess regional lymph node status is the sentinel lymph node procedure. The sentinel lymph node is the most likely lymph node to contain metastasized cancer cells and is excised, histopathologically processed, and examined by a pathologist. This tedious examination process is time-consuming and can lead to small metastases being missed. However, recent advances in whole-slide imaging and machine learning have opened an avenue for analysis of digitized lymph node sections with computer algorithms. For example, convolutional neural networks, a type of machine-learning algorithm, can be used to automatically detect cancer metastases in lymph nodes with high accuracy. To train machine-learning models, large, well-curated datasets are needed.Entities:
Mesh:
Year: 2018 PMID: 29860392 PMCID: PMC6007545 DOI: 10.1093/gigascience/giy065
Source DB: PubMed Journal: Gigascience ISSN: 2047-217X Impact factor: 6.524
Figure 1:Low-resolution example of a WSI from each of the five centers contributing data.
Rules for assigning clusters of metastasized tumor cells to a metastasis category
| Category | Size |
|---|---|
| Macro-metastasis | Larger than 2 mm |
| Micro-metastasis | Larger than 0.2 mm and/or containing more than 200 cells, but not larger than 2 mm |
| Isolated tumor cells | Single tumor cells or a cluster of tumor cells not larger than 0.2 mm or less than 200 cells |
Selection of N-stages for staging of breast cancer based on the 7th edition of the TNM staging criteria
| Stage | Description |
|---|---|
| N0 | Cancer has not spread to nearby lymph nodes |
| N0(i+) | Lymph nodes only contain ITCs |
| N1mi | Micro-metastases in 1 to 3 lymph nodes axillary |
| N1a | Cancer has spread to 1 to 3 lymph nodes axillary, with at least 1 macro-metastasis |
| N1b | Cancer has spread to internal mammary lymph nodes, but this spread could only be found on sentinel lymph node biopsy |
| N1c | Both N1a and N1b apply |
| N2a | Cancer has spread to 4 to 9 lymph nodes under the arm, with at least 1 macro-metastasis |
| N2b | Metastases in clinically detected internal mammary lymph nodes in the absence of axillary lymph node metastases |
Figure 2:Representative samples of the different sizes of breast cancer metastases in sentinel lymph nodes.
WSI-level characteristics for the CAMELYON16 part of the dataset
| Metastases | ||||
|---|---|---|---|---|
| Center | Total WSIs | None | Macro | Micro |
| RUMC | 249 | 150 | 48 | 51 |
| UMCU | 150 | 90 | 34 | 26 |
WSI-level characteristics for the CAMELYON17 part of the dataset
| Center | Total WSIs | Metastases (Train) | ||||
|---|---|---|---|---|---|---|
| Train | Test | None | Macro | Micro | ITC | |
| CWZ | 100 | 100 | 64 | 15 | 10 | 11 |
| LPON | 100 | 100 | 64 | 25 | 4 | 7 |
| RST | 100 | 100 | 60 | 11 | 22 | 7 |
| RUMC | 100 | 100 | 60 | 19 | 13 | 8 |
| UMCU | 100 | 100 | 75 | 15 | 8 | 2 |
| Total | 500 | 500 | 323 | 85 | 57 | 35 |
Basic descriptors for the TIFF used in the CAMELYON dataset
| Format | Tiled TIFF (bigTIFF) |
| Tile size | 512 pixels |
| Pixel resolution | 0.23 μm to 0.25 μm |
| Channels per pixel | 3 (red, green, blue) |
| Bits per channel | 8 |
| Data type | Unsigned char |
| Compression | JPEG |
Figure 3:H&E-stained tissue section and a consecutive section immunohistochemically stained for cytokeratin. The top row shows the low-resolution images and the bottom row a high-resolution image, centered at a metastasis. The metastasis is difficult to see in H&E but easy to identify in the immunohistochemically stained slide. A yellow bounding box indicates the metastasis location in the images in the top row.
Patient-level characteristics for the CAMELYON17 part of the dataset
| Center | Total patients | Stages (Train) | |||||
|---|---|---|---|---|---|---|---|
| Train | Test | pN0 | pN0i + | pN1mi | pN1 | pN2 | |
| CWZ | 20 | 20 | 4 | 3 | 5 | 7 | 1 |
| LPON | 20 | 20 | 6 | 2 | 2 | 7 | 3 |
| RST | 20 | 20 | 4 | 2 | 6 | 5 | 3 |
| RUMC | 20 | 20 | 3 | 2 | 4 | 8 | 3 |
| UMCU | 20 | 20 | 8 | 2 | 4 | 3 | 3 |
| Total | 100 | 100 | 25 | 11 | 21 | 30 | 13 |
Figure 4:Interface of the ASAP viewer interface. Visible items are the annotations tools in toolbar, the viewport showing the WSI, and the plugin panel on the left.