Literature DB >> 33665339

Deep learning shows no morphological abnormalities in neutrophils in Alzheimer's disease.

Floris Chabrun^1,2, Xavier Dieu^1,2, Nicolas Doudeau¹, Jennifer Gautier³, Damien Luque-Paz^2,4, Franck Geneviève⁴, Marc Ferré², Delphine Mirebeau-Prunier^1,2, Cédric Annweiler^2,3, Pascal Reynier^1,2.

Abstract

INTRODUCTION: Several studies have provided evidence of the key role of neutrophils in the pathophysiology of Alzheimer's disease (AD). Yet, no study to date has investigated the potential link between AD and morphologically abnormal neutrophils on blood smears.
METHODS: Due to the complexity and subjectivity of the task by human analysis, deep learning models were trained to predict AD from neutrophil images. Control models were trained for a known feasible task (leukocyte subtype classification) and for detecting potential biases of overfitting (patient prediction).
RESULTS: Deep learning models achieved state-of-the-art results for leukocyte subtype classification but could not accurately predict AD. DISCUSSION: We found no evidence of morphological abnormalities of neutrophils in AD. Our results show that a solid deep learning pipeline with positive and bias control models with visualization techniques are helpful to support deep learning model results.

Entities: Chemical

Keywords: Alzheimer's disease; artificial intelligence; deep learning; machine learning; neutrophils

Year: 2021 PMID： 33665339 PMCID： PMC7896639 DOI： 10.1002/dad2.12146

Source DB: PubMed Journal: Alzheimers Dement (Amst) ISSN： 2352-8729

INTRODUCTION

Over the past decade, inflammation has emerged as a prominent feature of Alzheimer's disease (AD) pathophysiology. Several studies have provided evidence of a key role played by neutrophils in AD‐related inflammatory processes, mediated by intracellular granules, surface expression of inflammatory proteins, higher extravasation, and vascular NETosis. , , It is known that, on a blood smear, neutrophils exhibit abnormalities or morphologic particularities during infections and inflammatory processes. , , However, to our knowledge, no studies have investigated such potential morphological particularities of AD neutrophils, secondary to their functional alteration. The objective of the present study is to investigate the presence of such abnormalities on peripheral blood smears and their potential as a biomarker of diagnosis of AD. Detecting unknown abnormalities on a blood smear is a complex and time‐consuming task. Deep learning has emerged during the past decade as a powerful tool for image analysis, capable of achieving expert‐level accuracy and even discovering unknown image patterns of clinical importance. , , To explore the link between neutrophil morphology and AD, we trained state‐of‐the‐art artificial intelligence architectures based on machine learning methods, so‐called deep learning, to predict AD based on images of blood smears.

MATERIALS AND METHODS

Cohort construction and data collection

Patients from the Department of Geriatrics of the University Hospital of Angers were recruited into two cohorts: the AD group for patients with AD and the subjective memory complaint (SMC) group for patients without dementia and without mild cognitive impairment (MCI) who had normal neuropsychological and functional performance. Details concerning the recruitment of patients are listed in File S1, section 1: Cohort construction in supporting information. Leukocyte images for patients were retrieved from routine a health‐care database of analyses run in the Department of Hematology of the University Hospital of Angers. Those images were partitioned in training, validation, and test sets according to rules described in File S1, section 2: Data partitioning. This study was performed using anonymized data collected during the care and treatment of patients. According to the approval of the Ethics Committee of the University Hospital of Angers (no 2020/118), due to the tacit consent of patients for the use of their anonymized data collected during routine care, written consent was not required and replaced by verification of the absence of patient's opposition. Data were collected and processed after declaration to the University Hospital of Angers’ Data Protection Officer according to the General Data Protection Regulation (declaration ar19‐0046v0).

Prediction tasks

Two pre‐trained state‐of‐the‐art architectures (VGG‐16 and Inception v3) were trained after transfer learning for three learning prediction tasks, including two control tasks, to ensure the veracity of the results obtained for AD patients. The first task, referred to as “AD prediction,” aimed at classifying patients with AD versus patients with SMC from neutrophil images from peripheral blood smears. The same deep learning architectures were used for a simpler task: the prediction of already known imaging features, namely leukocyte subtype classification into four classes: neutrophils (NE), eosinophils (EO), monocytes (MO), and lymphocytes (LY). Basophils were excluded because of an insufficient number of samples. Finally, to explore biases potentially responsible for overfitting, by learning by heart for example, we also trained the same architectures to predict the patient to whom each image belonged. This task is referred to as “patient identification.” Details concerning the choice of those deep learning architectures and the training are presented in File S1, sections 3‐7; see Figure S1 in supporting information.

HIGHLIGHTS

There is no argument in favor of morphologically abnormal neutrophils in Alzheimer's disease. Study is based on data allowing state‐of‐the‐art leukocyte deep learning classification. A pipeline allows its robustness to be checked regardless of sample size.

RESEARCH IN CONTEXT

Systematic review: We reviewed literature using published and pre‐print sources (eg, PubMed, arXiv). Several studies have recently provided evidence of a major role played by neutrophils in the pathophysiology of Alzheimer's disease. To our knowledge, however, no study has addressed the presence of morphological alterations among neutrophils on blood smears, despite the well‐known links between inflammation processes and neutrophilic morphological abnormalities. Interpretation: Our results confirm the absence of neutrophilic morphological abnormalities on blood smears. This is supported by deep learning control models achieving state‐of‐the‐art accuracy for a known feasible task on the same data and visualization techniques exploring the behavior of those models. Future directions: As neutrophils did not show any morphological alterations on blood smears carried out on routine blood formula counts, it may be interesting to analyze higher resolution images to search for finer alterations or focus on non‐cell elements not routinely analyzed such as neutrophil extracellular traps.

RESULTS

Cohort description

The cohort is described in Table 1. Two patients had Waldenström's disease: one in the AD group and one in the SMC group. Because of the balance between both groups, those patients were kept in the analyses. No other patients in the AD group had any form of malignant hemopathy. In the SMC group, one patient had chronic lymphocytic leukemia, and one had a follicular lymphoma. Those patients were kept in our study, because our partitioning pipeline prevents models from being tested on images of patients they were trained with.

TABLE 1

Cohort description

Group	AD	SMC	Total/p‐value^, ²
Patients
Number of patients	8	10	Total: 18
Number of reports	12	13	Total: 25
Age: mean ± SD	85.3 ± 5.4 years	84.8 ± 6.6 years	N.S. ¹
Biology
Total leukocytes count	10.8 G/L	8.0 G/L	N.S. ¹
PMN count	8.6 G/L	6.6 G/L	N.S. ¹
PME count	0.2 G/L	0.1 G/L	* ¹
Monocyte count	0.7 G/L	0.5 G/L	N.S. ¹
Lymphocyte count	1.2 G/L	0.6 G/L	* ¹
Hemoglobin	11.2 g/dL	11.8 g/dL	N.S. ¹
Hematocrit	33.0%	35.2%	N.S. ¹
Mean globular volume	92.5 fL	96.3 fL	N.S. ¹
Mean hemoglobin concentration	33.8	33.7	N.S. ¹
Platelet count	211	152	N.S. ¹
CRP ≥ 5 mg/L/total	7/9 (3 unknown)	7/8 (5 unknown)	N.S. ²
Images
PMN images	1455	1468	Total: 2923
PME images	54	28	Total: 82
Monocyte images	117	118	Total: 235
Lymphocyte images	347	200	Total: 547

Notes: P values were computed using Mann‐Whitney tests. N.S.: P > 0.05.

P < 0.05.

P values were determined with Mann‐Whitney tests.

P value was determined with a Fisher's exact test.

Abbreviations: CRP, C‐reactive protein; PME, polymorphonuclear eosinophil; PMN, polymorphonuclear neutrophil; SD, standard deviation.

Cohort description Notes: P values were computed using Mann‐Whitney tests. N.S.: P > 0.05. P < 0.05. P values were determined with Mann‐Whitney tests. P value was determined with a Fisher's exact test. Abbreviations: CRP, C‐reactive protein; PME, polymorphonuclear eosinophil; PMN, polymorphonuclear neutrophil; SD, standard deviation.

Leukocyte subtype classification

The top model (VGG‐16) achieved an accuracy on the test set of 97.5% (94.3% with random translation/rotation applied to the image). For all models, median accuracy was 84.6% (96.1% and 82.2% median accuracy for models trained with soft and strong image augmentation, respectively).

Alzheimer's disease prediction

The top model (VGG‐16) achieved an area under the curve for receiver operating characteristic curve (AUC‐ROC) on the test set of 0.68 (0.62 with random translation/rotation applied to the image). For all models, the median AUC‐ROC was 0.5.

Patient identification

The top model (Inception v3) achieved an accuracy on the test set of 95.3% (80.3% with random translation/rotation applied to the image). For all models, the median accuracy was 17.3% (73.8% and 12.2% median accuracy for models trained with soft and strong image augmentation, respectively).

Gradient‐weighted class activation mapping (Grad‐CAM) visualization

The heatmaps presented in Figure 1A–C depict the attention of neural networks on the image for the associated output prediction. This allows assessment of (1) which regions are useful for the predictions and whether those regions are consistent with the expectations of human hematologists, namely for leukocyte subtype prediction; (2) the number and size of those regions; and (3) the stability and robustness of neural networks when transformations such as translation or rotation are applied to the image.

FIGURE 1

Gradient‐weighted class activation mapping (Grad‐CAM) visualization of best models for blood cell type classification (A), Alzheimer's disease prediction (B), and patient prediction (C). For each image, the model was inputted with the raw image (left) and the same image after random translation and/or rotation (right). Predicted class and associated confidence are plotted on the heatmap, while ground truth is plotted on the input image

DISCUSSION

With 18 patients and 3787 pictures, we trained a model to classify cells according to the four main leukocyte types, with an overall state‐of‐the‐art accuracy of 97.3%. , , The consistency of those results was confirmed by (1) Grad‐CAM, showing that the attention of the model was focused on the area an expert hematologist would interpret, that is, cells’ nuclei and cytoplasm, and this attention follows the elements when translation/rotation is applied to the image; and (2) the results during grid search, with most models achieving an overall accuracy on the validation set higher than 85%, even with strong image augmentation. We also showed that the deep learning models used (mainly Inception v3) could achieve a high accuracy when predicting to which patient a cell image belongs. Grad‐CAM showed that this ability is attained by focusing on more minor details, such as red blood cell density and spreading, or particularities on cells, like granulation density. Although this highlights the high sensitivity of those small details and thus their possible performance, this also highlights how those models are prone to overfitting. This is demonstrated by predictions that neural networks tend to change when translation/rotation is applied to the image. The best model trained for AD prediction showed an interesting AUC of 0.70 on the training set, confirmed on a test set of new images (AUC = 0.68). However, Grad‐CAM visualization supports a tendency to overfit, showing a high versatility concerning the details on which the model focuses and a high instability when applying translation or rotation to the images. Furthermore, a grid search showed that most models could not achieve an AUC on a test set higher than 0.5, thus supporting this tendency. Our results draw attention to the overfitting risks of deep learning architectures. We hereby show that state‐of‐the‐art deep learning architectures can efficiently learn to memorize to which patient an image belongs, based on a variety of details, some of which are minute. In our study, we carefully partitioned our samples to make sure that a model trained on images from a patient would not be tested on images of the same patient, to avoid any bias leading to an overestimation of its performance. However, to date, a number of articles recently applying deep learning to blood cell classification or prediction have not specified the stratification during partitioning. , , This omission may lead to the overestimation of both the results of those models and particularly their ability to generalize to new patients.

Limits

Our study was restricted on retrospective data analyzed during the routine care of patients seen in the University Hospital of Angers. For further confirmation of these results, multicentric prospective data, including blood smear images obtained with other coloration methods and other digital microscopes will be needed. Our data are accessible for further analysis upon reasonable request to the corresponding author.

CONCLUSIONS

We used a solid prediction‐based pipeline: (1) a positive control predictive model, trained to verify the feasibility of a known task on our dataset, that is, leukocyte subtype classification; (2) a bias control predictive model, trained to determine whether models could learn features allowing them to overfit; and (3) Grad‐CAM visualization to check that the attention given by the model fit with our expectations. Thanks to this pipeline, we have demonstrated here that there is no evidence of a link between blood smear neutrophil morphology and AD. We have also highlighted the importance of control predictive models, heavy visualization techniques, grid search, and transfer learning, enabling us to verify the consistency of models and prevent falsely accurate results from being trusted, despite a complex task based on a relatively low number of samples.

AUTHORS’ CONTRIBUTION

FC: design of the study and the deep learning methodology, data processing, statistical and deep learning analyses, literature review, and redaction of the manuscript; XD: design of the deep learning methodology, review of statistical and deep learning analyses; ND: literature review concerning the deep learning methodology, data processing, and deep learning analysis; JG: collection and pre‐processing of clinical data; DL: collection of biological data and review of hematology‐related sections and literature; FG: collection of biological data and review of hematology‐related sections and literature; MF: collection of biological data and review of the deep learning methodology; DM: global review of the methodology and proofreading of the manuscript; CA: design of the study, collection and structuring of clinical data, review of Alzheimer's disease‐related sections and literature, global review of the methodology, and proofreading of the manuscript; PR: design of the study, global review of the methodology, and proofreading of the manuscript.

CONFLICTS OF INTEREST

The authors declare that there are no conflicts of interest. Supplementary File 1: Material and methods details. Click here for additional data file. figureS1 Click here for additional data file. figureS2 Click here for additional data file. figureS2 Click here for additional data file.

11 in total

1. Recognition of peripheral blood cell images using convolutional neural networks.

Authors: Andrea Acevedo; Santiago Alférez; Anna Merino; Laura Puigví; José Rodellar
Journal: Comput Methods Programs Biomed Date: 2019-08-09 Impact factor: 5.428

2. Fine-grained leukocyte classification with deep residual learning for microscopic images.

Authors: Feiwei Qin; Nannan Gao; Yong Peng; Zizhao Wu; Shuying Shen; Artur Grudtsin
Journal: Comput Methods Programs Biomed Date: 2018-05-22 Impact factor: 5.428

Review 3. Medical Image Analysis using Convolutional Neural Networks: A Review.

Authors: Syed Muhammad Anwar; Muhammad Majid; Adnan Qayyum; Muhammad Awais; Majdi Alnowami; Muhammad Khurram Khan
Journal: J Med Syst Date: 2018-10-08 Impact factor: 4.460

Review 4. A prospective study of hospitalized patients with leukemoid reaction; causes, prognosis and value of manual peripheral smear review.

Authors: Stamatis Karakonstantis; Mina Koulouridi; Kyriakos Pitsillos; Eirini Kalokyri; Anna Kozyri; Galateia Gourniezaki; Charalampos Lydakis
Journal: Rom J Intern Med Date: 2019-09-01

Deep learning shows no morphological abnormalities in neutrophils in Alzheimer's disease.

INTRODUCTION

MATERIALS AND METHODS

Cohort construction and data collection

Prediction tasks

HIGHLIGHTS

RESEARCH IN CONTEXT

RESULTS

Cohort description

Leukocyte subtype classification

Alzheimer's disease prediction

Patient identification

Gradient‐weighted class activation mapping (Grad‐CAM) visualization

DISCUSSION

Limits

CONCLUSIONS

AUTHORS’ CONTRIBUTION

CONFLICTS OF INTEREST

1. Recognition of peripheral blood cell images using convolutional neural networks.

2. Fine-grained leukocyte classification with deep residual learning for microscopic images.

Review 3. Medical Image Analysis using Convolutional Neural Networks: A Review.

Review 4. A prospective study of hospitalized patients with leukemoid reaction; causes, prognosis and value of manual peripheral smear review.

5. Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning.

Review 6. Imaging of Leukocyte Trafficking in Alzheimer's Disease.

7. Morphological anomalies of circulating blood cells in COVID-19.

Review 8. Inflammation as a central mechanism in Alzheimer's disease.

9. Deep learning approach to peripheral leukocyte recognition.

Review 10. The role of neutrophil granule proteins in neuroinflammation and Alzheimer's disease.