Literature DB >> 33665339

Deep learning shows no morphological abnormalities in neutrophils in Alzheimer's disease.

Floris Chabrun1,2, Xavier Dieu1,2, Nicolas Doudeau1, Jennifer Gautier3, Damien Luque-Paz2,4, Franck Geneviève4, Marc Ferré2, Delphine Mirebeau-Prunier1,2, Cédric Annweiler2,3, Pascal Reynier1,2.   

Abstract

INTRODUCTION: Several studies have provided evidence of the key role of neutrophils in the pathophysiology of Alzheimer's disease (AD). Yet, no study to date has investigated the potential link between AD and morphologically abnormal neutrophils on blood smears.
METHODS: Due to the complexity and subjectivity of the task by human analysis, deep learning models were trained to predict AD from neutrophil images. Control models were trained for a known feasible task (leukocyte subtype classification) and for detecting potential biases of overfitting (patient prediction).
RESULTS: Deep learning models achieved state-of-the-art results for leukocyte subtype classification but could not accurately predict AD. DISCUSSION: We found no evidence of morphological abnormalities of neutrophils in AD. Our results show that a solid deep learning pipeline with positive and bias control models with visualization techniques are helpful to support deep learning model results.
© 2021 The Authors. Alzheimer's & Dementia: Diagnosis, Assessment & Disease Monitoring published by Wiley Periodicals, LLC on behalf of Alzheimer's Association.

Entities:  

Keywords:  Alzheimer's disease; artificial intelligence; deep learning; machine learning; neutrophils

Year:  2021        PMID: 33665339      PMCID: PMC7896639          DOI: 10.1002/dad2.12146

Source DB:  PubMed          Journal:  Alzheimers Dement (Amst)        ISSN: 2352-8729


INTRODUCTION

Over the past decade, inflammation has emerged as a prominent feature of Alzheimer's disease (AD) pathophysiology. Several studies have provided evidence of a key role played by neutrophils in AD‐related inflammatory processes, mediated by intracellular granules, surface expression of inflammatory proteins, higher extravasation, and vascular NETosis. , , It is known that, on a blood smear, neutrophils exhibit abnormalities or morphologic particularities during infections and inflammatory processes. , , However, to our knowledge, no studies have investigated such potential morphological particularities of AD neutrophils, secondary to their functional alteration. The objective of the present study is to investigate the presence of such abnormalities on peripheral blood smears and their potential as a biomarker of diagnosis of AD. Detecting unknown abnormalities on a blood smear is a complex and time‐consuming task. Deep learning has emerged during the past decade as a powerful tool for image analysis, capable of achieving expert‐level accuracy and even discovering unknown image patterns of clinical importance. , , To explore the link between neutrophil morphology and AD, we trained state‐of‐the‐art artificial intelligence architectures based on machine learning methods, so‐called deep learning, to predict AD based on images of blood smears.

MATERIALS AND METHODS

Cohort construction and data collection

Patients from the Department of Geriatrics of the University Hospital of Angers were recruited into two cohorts: the AD group for patients with AD and the subjective memory complaint (SMC) group for patients without dementia and without mild cognitive impairment (MCI) who had normal neuropsychological and functional performance. Details concerning the recruitment of patients are listed in File S1, section 1: Cohort construction in supporting information. Leukocyte images for patients were retrieved from routine a health‐care database of analyses run in the Department of Hematology of the University Hospital of Angers. Those images were partitioned in training, validation, and test sets according to rules described in File S1, section 2: Data partitioning. This study was performed using anonymized data collected during the care and treatment of patients. According to the approval of the Ethics Committee of the University Hospital of Angers (no 2020/118), due to the tacit consent of patients for the use of their anonymized data collected during routine care, written consent was not required and replaced by verification of the absence of patient's opposition. Data were collected and processed after declaration to the University Hospital of Angers’ Data Protection Officer according to the General Data Protection Regulation (declaration ar19‐0046v0).

Prediction tasks

Two pre‐trained state‐of‐the‐art architectures (VGG‐16 and Inception v3) were trained after transfer learning for three learning prediction tasks, including two control tasks, to ensure the veracity of the results obtained for AD patients. The first task, referred to as “AD prediction,” aimed at classifying patients with AD versus patients with SMC from neutrophil images from peripheral blood smears. The same deep learning architectures were used for a simpler task: the prediction of already known imaging features, namely leukocyte subtype classification into four classes: neutrophils (NE), eosinophils (EO), monocytes (MO), and lymphocytes (LY). Basophils were excluded because of an insufficient number of samples. Finally, to explore biases potentially responsible for overfitting, by learning by heart for example, we also trained the same architectures to predict the patient to whom each image belonged. This task is referred to as “patient identification.” Details concerning the choice of those deep learning architectures and the training are presented in File S1, sections 3‐7; see Figure S1 in supporting information.

HIGHLIGHTS

There is no argument in favor of morphologically abnormal neutrophils in Alzheimer's disease. Study is based on data allowing state‐of‐the‐art leukocyte deep learning classification. A pipeline allows its robustness to be checked regardless of sample size.

RESEARCH IN CONTEXT

Systematic review: We reviewed literature using published and pre‐print sources (eg, PubMed, arXiv). Several studies have recently provided evidence of a major role played by neutrophils in the pathophysiology of Alzheimer's disease. To our knowledge, however, no study has addressed the presence of morphological alterations among neutrophils on blood smears, despite the well‐known links between inflammation processes and neutrophilic morphological abnormalities. Interpretation: Our results confirm the absence of neutrophilic morphological abnormalities on blood smears. This is supported by deep learning control models achieving state‐of‐the‐art accuracy for a known feasible task on the same data and visualization techniques exploring the behavior of those models. Future directions: As neutrophils did not show any morphological alterations on blood smears carried out on routine blood formula counts, it may be interesting to analyze higher resolution images to search for finer alterations or focus on non‐cell elements not routinely analyzed such as neutrophil extracellular traps.

RESULTS

Cohort description

The cohort is described in Table 1. Two patients had Waldenström's disease: one in the AD group and one in the SMC group. Because of the balance between both groups, those patients were kept in the analyses. No other patients in the AD group had any form of malignant hemopathy. In the SMC group, one patient had chronic lymphocytic leukemia, and one had a follicular lymphoma. Those patients were kept in our study, because our partitioning pipeline prevents models from being tested on images of patients they were trained with.
TABLE 1

Cohort description

GroupADSMCTotal/p‐value, 2
Patients
Number of patients810Total: 18
Number of reports1213Total: 25
Age: mean ± SD85.3 ± 5.4 years84.8 ± 6.6 yearsN.S. 1
Biology
Total leukocytes count10.8 G/L8.0 G/LN.S. 1
PMN count8.6 G/L6.6 G/LN.S. 1
PME count0.2 G/L0.1 G/L * 1
Monocyte count0.7 G/L0.5 G/LN.S. 1
Lymphocyte count1.2 G/L0.6 G/L * 1
Hemoglobin11.2 g/dL11.8 g/dLN.S. 1
Hematocrit33.0%35.2%N.S. 1
Mean globular volume92.5 fL96.3 fLN.S. 1
Mean hemoglobin concentration33.833.7N.S. 1
Platelet count211152N.S. 1
CRP ≥ 5 mg/L/total7/9 (3 unknown)7/8 (5 unknown)N.S. 2
Images
PMN images14551468Total: 2923
PME images5428Total: 82
Monocyte images117118Total: 235
Lymphocyte images347200Total: 547

Notes: P values were computed using Mann‐Whitney tests. N.S.: P > 0.05.

P < 0.05.

P values were determined with Mann‐Whitney tests.

P value was determined with a Fisher's exact test.

Abbreviations: CRP, C‐reactive protein; PME, polymorphonuclear eosinophil; PMN, polymorphonuclear neutrophil; SD, standard deviation.

Cohort description Notes: P values were computed using Mann‐Whitney tests. N.S.: P > 0.05. P < 0.05. P values were determined with Mann‐Whitney tests. P value was determined with a Fisher's exact test. Abbreviations: CRP, C‐reactive protein; PME, polymorphonuclear eosinophil; PMN, polymorphonuclear neutrophil; SD, standard deviation.

Leukocyte subtype classification

The top model (VGG‐16) achieved an accuracy on the test set of 97.5% (94.3% with random translation/rotation applied to the image). For all models, median accuracy was 84.6% (96.1% and 82.2% median accuracy for models trained with soft and strong image augmentation, respectively).

Alzheimer's disease prediction

The top model (VGG‐16) achieved an area under the curve for receiver operating characteristic curve (AUC‐ROC) on the test set of 0.68 (0.62 with random translation/rotation applied to the image). For all models, the median AUC‐ROC was 0.5.

Patient identification

The top model (Inception v3) achieved an accuracy on the test set of 95.3% (80.3% with random translation/rotation applied to the image). For all models, the median accuracy was 17.3% (73.8% and 12.2% median accuracy for models trained with soft and strong image augmentation, respectively).

Gradient‐weighted class activation mapping (Grad‐CAM) visualization

The heatmaps presented in Figure 1A–C depict the attention of neural networks on the image for the associated output prediction. This allows assessment of (1) which regions are useful for the predictions and whether those regions are consistent with the expectations of human hematologists, namely for leukocyte subtype prediction; (2) the number and size of those regions; and (3) the stability and robustness of neural networks when transformations such as translation or rotation are applied to the image.
FIGURE 1

Gradient‐weighted class activation mapping (Grad‐CAM) visualization of best models for blood cell type classification (A), Alzheimer's disease prediction (B), and patient prediction (C). For each image, the model was inputted with the raw image (left) and the same image after random translation and/or rotation (right). Predicted class and associated confidence are plotted on the heatmap, while ground truth is plotted on the input image

Gradient‐weighted class activation mapping (Grad‐CAM) visualization of best models for blood cell type classification (A), Alzheimer's disease prediction (B), and patient prediction (C). For each image, the model was inputted with the raw image (left) and the same image after random translation and/or rotation (right). Predicted class and associated confidence are plotted on the heatmap, while ground truth is plotted on the input image

DISCUSSION

With 18 patients and 3787 pictures, we trained a model to classify cells according to the four main leukocyte types, with an overall state‐of‐the‐art accuracy of 97.3%. , , The consistency of those results was confirmed by (1) Grad‐CAM, showing that the attention of the model was focused on the area an expert hematologist would interpret, that is, cells’ nuclei and cytoplasm, and this attention follows the elements when translation/rotation is applied to the image; and (2) the results during grid search, with most models achieving an overall accuracy on the validation set higher than 85%, even with strong image augmentation. We also showed that the deep learning models used (mainly Inception v3) could achieve a high accuracy when predicting to which patient a cell image belongs. Grad‐CAM showed that this ability is attained by focusing on more minor details, such as red blood cell density and spreading, or particularities on cells, like granulation density. Although this highlights the high sensitivity of those small details and thus their possible performance, this also highlights how those models are prone to overfitting. This is demonstrated by predictions that neural networks tend to change when translation/rotation is applied to the image. The best model trained for AD prediction showed an interesting AUC of 0.70 on the training set, confirmed on a test set of new images (AUC = 0.68). However, Grad‐CAM visualization supports a tendency to overfit, showing a high versatility concerning the details on which the model focuses and a high instability when applying translation or rotation to the images. Furthermore, a grid search showed that most models could not achieve an AUC on a test set higher than 0.5, thus supporting this tendency. Our results draw attention to the overfitting risks of deep learning architectures. We hereby show that state‐of‐the‐art deep learning architectures can efficiently learn to memorize to which patient an image belongs, based on a variety of details, some of which are minute. In our study, we carefully partitioned our samples to make sure that a model trained on images from a patient would not be tested on images of the same patient, to avoid any bias leading to an overestimation of its performance. However, to date, a number of articles recently applying deep learning to blood cell classification or prediction have not specified the stratification during partitioning. , , This omission may lead to the overestimation of both the results of those models and particularly their ability to generalize to new patients.

Limits

Our study was restricted on retrospective data analyzed during the routine care of patients seen in the University Hospital of Angers. For further confirmation of these results, multicentric prospective data, including blood smear images obtained with other coloration methods and other digital microscopes will be needed. Our data are accessible for further analysis upon reasonable request to the corresponding author.

CONCLUSIONS

We used a solid prediction‐based pipeline: (1) a positive control predictive model, trained to verify the feasibility of a known task on our dataset, that is, leukocyte subtype classification; (2) a bias control predictive model, trained to determine whether models could learn features allowing them to overfit; and (3) Grad‐CAM visualization to check that the attention given by the model fit with our expectations. Thanks to this pipeline, we have demonstrated here that there is no evidence of a link between blood smear neutrophil morphology and AD. We have also highlighted the importance of control predictive models, heavy visualization techniques, grid search, and transfer learning, enabling us to verify the consistency of models and prevent falsely accurate results from being trusted, despite a complex task based on a relatively low number of samples.

AUTHORS’ CONTRIBUTION

FC: design of the study and the deep learning methodology, data processing, statistical and deep learning analyses, literature review, and redaction of the manuscript; XD: design of the deep learning methodology, review of statistical and deep learning analyses; ND: literature review concerning the deep learning methodology, data processing, and deep learning analysis; JG: collection and pre‐processing of clinical data; DL: collection of biological data and review of hematology‐related sections and literature; FG: collection of biological data and review of hematology‐related sections and literature; MF: collection of biological data and review of the deep learning methodology; DM: global review of the methodology and proofreading of the manuscript; CA: design of the study, collection and structuring of clinical data, review of Alzheimer's disease‐related sections and literature, global review of the methodology, and proofreading of the manuscript; PR: design of the study, global review of the methodology, and proofreading of the manuscript.

CONFLICTS OF INTEREST

The authors declare that there are no conflicts of interest. Supplementary File 1: Material and methods details. Click here for additional data file. figureS1 Click here for additional data file. figureS2 Click here for additional data file. figureS2 Click here for additional data file.
  11 in total

1.  Recognition of peripheral blood cell images using convolutional neural networks.

Authors:  Andrea Acevedo; Santiago Alférez; Anna Merino; Laura Puigví; José Rodellar
Journal:  Comput Methods Programs Biomed       Date:  2019-08-09       Impact factor: 5.428

2.  Fine-grained leukocyte classification with deep residual learning for microscopic images.

Authors:  Feiwei Qin; Nannan Gao; Yong Peng; Zizhao Wu; Shuying Shen; Artur Grudtsin
Journal:  Comput Methods Programs Biomed       Date:  2018-05-22       Impact factor: 5.428

Review 3.  Medical Image Analysis using Convolutional Neural Networks: A Review.

Authors:  Syed Muhammad Anwar; Muhammad Majid; Adnan Qayyum; Muhammad Awais; Majdi Alnowami; Muhammad Khurram Khan
Journal:  J Med Syst       Date:  2018-10-08       Impact factor: 4.460

Review 4.  A prospective study of hospitalized patients with leukemoid reaction; causes, prognosis and value of manual peripheral smear review.

Authors:  Stamatis Karakonstantis; Mina Koulouridi; Kyriakos Pitsillos; Eirini Kalokyri; Anna Kozyri; Galateia Gourniezaki; Charalampos Lydakis
Journal:  Rom J Intern Med       Date:  2019-09-01

5.  Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning.

Authors:  Ryan Poplin; Avinash V Varadarajan; Katy Blumer; Yun Liu; Michael V McConnell; Greg S Corrado; Lily Peng; Dale R Webster
Journal:  Nat Biomed Eng       Date:  2018-02-19       Impact factor: 25.671

Review 6.  Imaging of Leukocyte Trafficking in Alzheimer's Disease.

Authors:  Enrica Pietronigro; Elena Zenaro; Gabriela Constantin
Journal:  Front Immunol       Date:  2016-02-15       Impact factor: 7.561

7.  Morphological anomalies of circulating blood cells in COVID-19.

Authors:  Gina Zini; Silvia Bellesi; Francesco Ramundo; Giuseppe d'Onofrio
Journal:  Am J Hematol       Date:  2020-04-22       Impact factor: 10.047

Review 8.  Inflammation as a central mechanism in Alzheimer's disease.

Authors:  Jefferson W Kinney; Shane M Bemiller; Andrew S Murtishaw; Amanda M Leisgang; Arnold M Salazar; Bruce T Lamb
Journal:  Alzheimers Dement (N Y)       Date:  2018-09-06

9.  Deep learning approach to peripheral leukocyte recognition.

Authors:  Qiwei Wang; Shusheng Bi; Minglei Sun; Yuliang Wang; Di Wang; Shaobao Yang
Journal:  PLoS One       Date:  2019-06-25       Impact factor: 3.240

Review 10.  The role of neutrophil granule proteins in neuroinflammation and Alzheimer's disease.

Authors:  Amanda J Stock; Anne Kasus-Jacobi; H Anne Pereira
Journal:  J Neuroinflammation       Date:  2018-08-27       Impact factor: 8.322

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.