Literature DB >> 30981204

Machine learning based hierarchical classification of frontotemporal dementia and Alzheimer's disease.

Jun Pyo Kim¹, Jeonghun Kim², Yu Hyun Park¹, Seong Beom Park¹, Jin San Lee³, Sole Yoo⁴, Eun-Joo Kim⁵, Hee Jin Kim¹, Duk L Na¹, Jesse A Brown⁶, Samuel N Lockhart⁷, Sang Won Seo⁸, Joon-Kyung Seong⁹.

Abstract

BACKGROUND: In a clinical setting, an individual subject classification model rather than a group analysis would be more informative. Specifically, the subtlety of cortical atrophy in some frontotemporal dementia (FTD) patients and overlapping patterns of atrophy among three FTD clinical syndromes including behavioral variant FTD (bvFTD), non-fluent/agrammatic variant primary progressive aphasia (nfvPPA), and semantic variant PPA (svPPA) give rise to the need for classification models at the individual level. In this study, we aimed to classify each individual subject into one of the diagnostic categories in a hierarchical manner by employing a machine learning-based classification method.
METHODS: We recruited 143 patients with FTD, 50 patients with Alzheimer's disease (AD) dementia, and 146 cognitively normal subjects. All subjects underwent a three-dimensional volumetric brain magnetic resonance imaging (MRI) scan, and cortical thickness was measured using FreeSurfer. We applied the Laplace Beltrami operator to reduce noise in the cortical thickness data and to reduce the dimension of the feature vector. Classifiers were constructed by applying both principal component analysis and linear discriminant analysis to the cortical thickness data. For the hierarchical classification, we trained four classifiers using different pairs of groups: Step 1 - CN vs. FTD + AD, Step 2 - FTD vs. AD, Step 3 - bvFTD vs. PPA, Step 4 - svPPA vs. nfvPPA. To evaluate the classification performance for each step, we used a10-fold cross-validation approach, performed 1000 times for reliability.
RESULTS: The classification accuracy of the entire hierarchical classification tree was 75.8%, which was higher than that of the non-hierarchical classifier (73.0%). The classification accuracies of steps 1-4 were 86.1%, 90.8%, 86.9%, and 92.1%, respectively. Changes in the right frontotemporal area were critical for discriminating behavioral variant FTD from PPA. The left frontal lobe discriminated nfvPPA from svPPA, while the bilateral anterior temporal regions were critical for identifying svPPA.
CONCLUSIONS: In the present study, our automated classifier successfully classified FTD clinical subtypes with good to excellent accuracy. Our classifier may help clinicians diagnose FTD subtypes with subtle cortical atrophy and facilitate appropriate specific interventions.

Entities: Chemical Disease Gene Species

Keywords: Classification model; Frontotemporal dementia; Machine learning

Year: 2019 PMID： 30981204 PMCID： PMC6458431 DOI： 10.1016/j.nicl.2019.101811

Source DB: PubMed Journal: Neuroimage Clin ISSN： 2213-1582 Impact factor: 4.881

Background

Frontotemporal dementia (FTD) is one of the leading causes of early-onset degenerative dementia (Vieira et al., 2013). The clinically defined syndromes within the FTD spectrum include three variants: the behavioral variant FTD (bvFTD), which is associated with early behavioral and executive deficits; semantic variant primary progressive aphasia (svPPA), which is associated with semantic anomia and impaired comprehension; and non-fluent/agrammatic variant primary progressive aphasia (nfvPPA), which is a progressive disorder of speech, grammar and word output.(Bang et al., 2015) Interpretation of magnetic resonance imaging (MRI) scans largely relies on the intuition and experience of clinicians, though MRI scans help clinicians diagnose FTD as an auxiliary tool. With the rapid development of neuroimaging analysis, we can automatically analyze cortical atrophy. In this regard, a previous study suggested that FTD syndromes are characterized by cortical atrophy in the frontal, anterior temporal and frontoinsular regions (Rosen et al., 2002). The relative involvement patterns of frontotemporal structures in FTD also vary among clinical syndromes. That is, patients with bvFTD had cortical atrophy in the anterior cingulate and frontal insular cortices, most prominently early in the course of the disease (Rosen et al., 2002; Davies et al., 2009). Conversely, studies in patients with nfvPPA indicate cortical atrophy in the left frontal area, especially the inferior frontal gyrus, pars triangularis, Rolandic operculum and precentral gyrus with left predominance. (Pereira et al., 2009; Gorno-Tempini et al., 2006) Semantic variant PPA is known to have the most distinct atrophic pattern among FTD clinical syndromes, which is most prominent in the left anterior temporal lobe.(Davies et al., 2009; Hodges and Patterson, 2007; Brambati et al., 2015) In clinical settings, an individual subject classification model rather than a group analysis would be more informative. The subtlety of cortical atrophy in early-stage FTD patients and overlapping characteristics of atrophy patterns among the FTD clinical syndromes, Alzheimer's disease (AD) and normal aging give rise to the need for an automated image analysis procedure which can be used at the individual level. Specifically, considering that different forms of dementia correlate with different underlying neuropathologies,(Seelaar et al., 2011) distinguishing between different causes of dementia will become more important with the emergence of targeted therapies. In this study, we therefore aimed to classify each individual subject into one of the diagnostic categories in a hierarchical manner by employing a machine learning-based classification method using surface-based cortical thickness data. The hierarchical scheme of our classification algorithm was designed based on the clinical decision process. In clinical practice, after noticing that a patient has abnormal findings that cannot be explained by normal aging, a clinician has to rule out AD first since it is the most common cause of degenerative dementia. If the patient has behavioral or language problems that suggest FTD, the clinician will determine which clinical syndrome it is. To emulate this process, first, we discriminated the dementia group (FTD and AD groups combined) from the cognitively normal (CN) group. Subsequently, the dementia group was classified into FTD and AD groups. Afterwards, subjects from the FTD group were classified into bvFTD and PPA groups. Finally, the PPA group was further classified into nfvPPA and svPPA groups. Our algorithm used a Laplace Beltrami operator to reduce noise, followed by linear discriminant analysis (LDA) in combination with principal component analysis (PCA).

Methods

Participants

We consecutively recruited 143 patients with FTD who visited the dementia clinic of Samsung Medical Center (Seoul, Korea) from September 2007 to March 2017. All FTD patients who were enrolled in this study met the diagnostic criteria for FTD clinical subtypes proposed by Rascovsky et al. (Rascovsky et al., 2011) (for bvFTD) and Gorno-Tempini et al.(Gorno-Tempini et al., 2011) (for nfvPPA and svPPA). All patients were evaluated by comprehensive interviews, neurological examinations, and neuropsychological assessment. In brief, caregivers were interviewed in depth by neurologists and neuropsychologists. Blood tests to exclude secondary causes of dementia included a complete blood count, blood chemistry tests, vitamin B12/folate, syphilis serology, and thyroid function tests. Conventional brain MRI scans confirmed the absence of structural lesions such as tumors, traumatic brain injuries, hydrocephalus, and severe white matter hyperintensities. Thirty-four out of 143 FTD patients underwent 18F-florbetaben or 18F-flutemetamol amyloid positron emission tomography (PET) scanning and four of them had significant amyloid deposition. A committee that included 5–10 dementia specialists held a quarterly meeting to review the clinical histories and brain imaging results of all cases enrolled in this study, and to reach a consensus regarding clinical diagnosis. We also recruited 50 age-matched AD dementia patients and 146 CN subjects from an in-house registry of individuals who underwent amyloid PET scanning (18F-florbetaben or 18F-flutemetamol) from August 2015 to July 2017 and performed the same clinical assessments and imaging studies. All AD dementia patients met the criteria for probable AD dementia with evidence of the AD pathophysiological process proposed by the National Institute on Aging-Alzheimer's Association(McKhann et al., 2011) based on clinical assessments and Aβ positivity shown in the amyloid PET. The CN group consisted of cognitively normal subjects without amyloid deposition on 18F-florbetaben PET.

Ethics statement

The institutional review boards at all participating centers approved this study, and informed consent was obtained from the patients and caregivers.

PET image acquisition and analysis

We used 18F-florbetaben PET or 18F-flutemetamol PET to detect amyloid in the brain. PET images were dichotomized as either amyloid positive or negative using visual reads. We defined 18F-florbetaben PET as positive when a score of 2 or 3 was assigned during visual assessment on the brain Aß plaque load (BAPL) scoring system.(Barthel et al., 2011) Visual interpretation of 18F-flutemetamol PET images relied upon a systematic review of five brain regions (frontal, parietal, posterior cingulate and precuneus, striatum and lateral temporal lobes). If any one of the brain regions systematically reviewed for 18F-flutemetamol PET was positive in either hemisphere, the scan was considered positive.(Farrar, 2017) In this study, PET images were used to confirm the diagnostic label of subjects in AD and CN groups.

MR image acquisition

All subjects underwent a three-dimensional (3D) volumetric brain MRI scan. An Achieva 3.0-Tesla MRI scanner (Philips, Best, the Netherlands) was used to acquire 3D T1 Turbo Field Echo (TFE) MRI data using the following imaging parameters: sagittal slice thickness, 1.0 mm with 50% overlap; no gap; repetition time of 9.9 ms; echo time of 4.6 ms; flip angle of 8°; and matrix size of 240 × 240 pixels reconstructed to 480 × 480 over a field of view of 240 mm.

Image preprocessing

For automated surface modeling and measurement of each subject's cortical thickness, we applied the FreeSurfer (version 5.1) pipeline to the T1 weighted MR image (http://surfer.nmr.mgh.harvard.edu/). Fig. 1A shows an overview of the proposed method. The first step segments the T1 weighted image based on signal intensity, which includes motion noise correction, space transformation, normalization and skull stripping. Afterwards, we employed the CIVET pipeline for additional correction of the skull-stripped image. Subsequently, the cortical surfaces were constructed for both white and gray matter boundaries. The gray and white matter surfaces were then used for calculating cortical thickness. To achieve correspondence between subjects, the mesh vertices were resampled to have the same number (40,962) of vertices for each hemisphere. Finally, the cortical thicknesses were defined at every vertex as the distance between two corresponding vertices of the gray and white matter surfaces. Throughout the whole image preprocessing pipeline, a neuroanatomist visually checked images, and corrected image processing errors manually. In particular, the image segmentation was carefully examined and corrected manually in subjects with atrophy: for example, svPPA patients often have anterior temporal lobe atrophy to such a degree that FreeSurfer processing may initially fail to detect the gray matter.

Fig. 1

Overview of the proposed cortical atrophy pattern-based classification method. (A) Image preprocessing and cortical thickness extraction. (B) Noise removal based on the Laplace Beltrami operator. (C) Cortical atrophy pattern-based classification including a training step and a testing step. Of the initial 339 subjects, eight subjects were excluded due to errors in preprocessing: FreeSurfer failed to produce results in seven subjects, and one subject had an overestimation error which could not be corrected manually. Therefore, MRI scans of 331 subjects (48 AD, 48 bvFTD, 50 svPPA, 39 nfvPPA and 146 CN subjects) were analyzed in this study.

Hierarchical classification based on cortical atrophy

For hierarchical classification, we used four different pairs of groups for each classifier step. Fig. 2A shows a schematic view of the hierarchical classification. First, we trained a classifier using the CN and Dementia (FTD + AD) groups (Step 1). Subsequently, another classifier was trained using the FTD and AD groups (Step 2). Next, the FTD classifier was trained using the bvFTD and PPA groups (Step 3). Finally, the PPA classifier was trained to distinguish between svPPA and nfvPPA (Step 4). The hierarchical classification was performed with these four classifiers. Specifically, a single subject was classified using the Step 1 classifier. If the subject was classified as a patient, the subject was then tested using the Step 2 classifier. Again, if the subject was classified as an FTD patient, then Step 3 classifier was applied. This hierarchical process was performed consecutively through the entire tree until the subject was finally classified into one of the final clinical labels. Using the cortical thickness data of each subject, we applied the Laplace Beltrami operator for noise removal (Fig. 1B). This scheme transforms the cortical thickness data from the geometrical domain into frequency space, and represents the original data using oscillations of alternating thin and thick cortices across the cortical surface.(Qiu et al., 2006; Vallet and Lévy, 2008) In the frequency domain, high frequency components were considered as noise, and thus the lower frequency components were used as features in classification. For each subject, the original cortical thickness data was sampled at 81,924 vertices, which was then transformed to about 250-dimensional frequency domain. The detailed process of noise removal was described in our previous work.(Cho et al., 2012) The classifier was then constructed by applying both PCA and LDA to the cortical thickness data(Belhumeur et al., 1997; San Lee et al., 2018) (Fig. 1C). PCA was applied for the purpose of dimension reduction, which transforms the 250-dimensional feature data to much lower dimensional space. The detailed information on the feature transformation was described in Supplementary Table S1. Finally, the individual cortical thickness data that was not included in the training data set was tested to obtain a prediction label (Fig. 1C).

Fig. 2

Schematic view of (A)hierarchical and (B)non-hierarchical classification.

For comparison purposes, two additional experiments were performed. First, classification using a single, five-label LDA classifier employing the same learning procedure (Fig. 2B) was performed to demonstrate how the hierarchical scheme improved the classification performance. Additionally, pairwise classifications without the use of the Laplace-Beltrami operator were conducted to evaluate how much this noise-removal step contributed to our classification performances. Schematic view of (A)hierarchical and (B)non-hierarchical classification. In order to evaluate the classification performance, we used a k-fold cross-validation (CV) approach (Fig. 1C). For each classification step in the hierarchical tree, the set was randomly split into k = 10 independent subsets. Nine subsets were used for training, and the remaining subset was used for testing. We performed the cross-validation 1000 times for reliability and calculated the mean accuracy, sensitivity and specificity.

Discriminative region analysis

We extracted the discriminative regions for our classifiers, which provided topographic patterns representing the contribution of each brain region to the discriminability between the two groups being compared. We obtained the discriminative regions for each classification by visualizing the weight vector of the classifier similarly to the method of Haufe et al.(Haufe et al., 2014) Generally, the discriminative regions are the brain regions with relative importance in classification and are obtained as:(w: weight vector, ∑x = XX is the covariance of the feature vector) Since, in our approach, we used additional steps for noise removal and PCA dimension reduction, the weight vector was defined by multiplication between the PCA matrix and the LDA matrix (w = M × M). Additionally, X is the filtered feature in the frequency domain. We then obtained the discriminate pattern (D) on frequency space as: This discriminative pattern on frequency domain was projected back into the surface domain by shifting back D using the center (PCA) and MHT (M). Finally, the discriminated region (D) on the surface was obtained as: For visualization, we colored D on the template surface, and the warm/cool color represents the importance of the feature for each group, with darker colors indicating greater importance.

Statistics

We compared the demographic and clinical data among groups using one-way analysis of variance (ANOVA) tests. Continuous variables were expressed as mean ± standard deviation (SD). Statistical analyses were performed using R version 3.5.0.

Results

Clinical characteristics

As shown in Table 1, of the 331 subjects, 153 (46.2%) were men. The mean age of subjects in the FTD group was 65.5 ± 11.8 years (bvFTD: 62.4 ± 9.4, svPPA: 65.6 ± 7.9, and nfvPPA: 68.9 ± 8.6). The interval between the onset of symptoms and MRI acquisition was 3.4 ± 2.4 years. There were no differences in age, years of education, and time from symptom onset to MRI among groups. There was no difference in mini-mental status examination (MMSE) score among FTD subtypes and the AD group.

Table 1

Clinical characteristics of participants.

	Total	FTD			AD	CN
	Total	bvFTD	svPPA	nfvPPA	AD	CN
Number	331	48	50	39	48	146
Age, years	65.4 ± 11.8	62.4 ± 9.4	65.6 ± 7.9	68.9 ± 8.6	65.7 ± 7.6	65.5 ± 15.0
Gender (M/F)	153/178	26/22	29/21	17/22	25/23	56/90
Education, years	10.8 ± 5.3	12.5 ± 5.2	11.0 ± 4.8	11.5 ± 5.1	10.9 ± 2.7	10.0 ± 6.0
K-MMSE score	23.2 ± 7.4	19.6 ± 6.7	18.8 ± 8.8	19.5 ± 7.9	17.7 ± 5.8	28.6 ± 1.7†
Years from first symptom	3.4 ± 2.4	3.2 ± 2.4	3.4 ± 2.5	3.0 ± 2.0	4.1 ± 2.5

Abbreviations: N = number, FTD = frontotemporal dementia, bvFTD = behavioral variant frontotemporal dementia, svPPA = semantic variant primary progressive aphasia, nfvPPA = non-fluent/agrammatic variant primary progressive aphasia, AD = Alzheimer's disease, CN = cognitively normal, K-MMSE = Korean mini-mental state examination.

p < 0.05.

Clinical characteristics of participants. Abbreviations: N = number, FTD = frontotemporal dementia, bvFTD = behavioral variant frontotemporal dementia, svPPA = semantic variant primary progressive aphasia, nfvPPA = non-fluent/agrammatic variant primary progressive aphasia, AD = Alzheimer's disease, CN = cognitively normal, K-MMSE = Korean mini-mental state examination. p < 0.05.

Classification performance

The results from classification using the entire hierarchical tree showed that each subject was classified into one of the five clinical labels with 75.8% accuracy. (Tables 2A-1) Supplementary Fig. S1A shows a confusion matrix of the hierarchical classification approach. Within the confusion matrix, the numbers in the boxes located diagonally from the top-left corner to the bottom-right corner indicate the cumulative accuracies per diagnostic subgroup (CN 89.3%, AD 73.1%, bvFTD 57.2%, nfvPPA 51.9%, and svPPA 75.6%).

Table 2

Classification performances.

	Accuracy	Sensitivity	Specificity	AUC
A-1. The entire hierarchical tree approach
	75.8%	69.4%	93.2%

A-2. Performances of the pairwise classifiers from four steps
Step 1 (CN vs Dementia)	86.1% (85.9–86.3%)	87.0%	85.4%	0.917
Step 2 (AD vs FTD)	90.8% (90.5–91.1%)	87.5%	92.0%	0.955
Step 3 (bvFTD vs PPA)	86.9% (86.2–87.5%)	92.1%	77.1%	0.865
Step 4 (nfvPPA vs svPPA)	92.1% (91.6–92.7%)	97.4%	88.0%	0.955

B. A single, multi-label classification performance
	73.0%	67.1%	92.6%

Accuracies for pairwise classifiers were shown with 95% confidence intervals in brackets.

Abbreviations: AUC = Area under receiver operating characteristic curve, FTD = frontotemporal dementia, bvFTD = behavioral variant frontotemporal dementia, PPA = primary progressive aphasia, svPPA = semantic variant primary progressive aphasia, nfvPPA = non-fluent/agrammatic variant primary progressive aphasia, AD = Alzheimer's disease, CN = cognitively normal.

Classification performances. Accuracies for pairwise classifiers were shown with 95% confidence intervals in brackets. Abbreviations: AUC = Area under receiver operating characteristic curve, FTD = frontotemporal dementia, bvFTD = behavioral variant frontotemporal dementia, PPA = primary progressive aphasia, svPPA = semantic variant primary progressive aphasia, nfvPPA = non-fluent/agrammatic variant primary progressive aphasia, AD = Alzheimer's disease, CN = cognitively normal. Table 2A-2 shows the performance of each classification step. In step 1, the accuracy in discriminating between CN subjects and Dementia (FTD + AD) patients was 86.1%. In step 2, AD and FTD patients were classified with 90.8% accuracy. Within the FTD group, the classifier discriminated between bvFTD and PPA patients with 86.9% accuracy (step 3). The accuracy in discriminating between PPA clinical syndromes was 92.1% in step 4. The receiver operating characteristic (ROC) curves for steps 1 to 4 are shown in Supplementary Fig. S2. The areas under the ROC curves for steps 1 to 4 were 0.917, 0.955, 0.865, and 0,955, respectively.

Supplementary Fig. S2

The receiver operating characteristic (ROC) curves for steps 1 to 4.

For comparison with the hierarchical classifier, the classification performance of a single, multi-label classifier which does not use the hierarchical scheme is shown in Table 2B. This classifier demonstrated an accuracy of 73.0%. The confusion matrix of this multi-label classifier is shown in Supplementary Fig. S1B. Supplementary Table S2 further shows the classification performance without the application of the Laplace Beltrami operator to cortical thickness data. In the overall hierarchical steps, we obtained 3–4% improvements in accuracies by using the operator to reduce noise components in the cortical thickness data. Fig. 3 depicts the discriminative regions on the atlas surface for our classifiers.

Fig. 3

Discriminative regions for each step. Each discriminative area corresponds to the group written in the same color.

For classification between Dementia and CN groups, the left fronto-parieto-temporal areas, right anterior temporal area, and right superior frontal gyrus distinguished demented subjects from cognitively normal subjects. In step 2, the bilateral precuneus and lateral parietal, right posterior temporal and lateral occipital, and left frontal regions distinguished AD from FTD, while the bilateral anterior temporal, anterior cingulate and right frontal regions distinguished FTD from AD. The left frontotemporal region and left inferior parietal lobule distinguished PPA from bvFTD, while the right frontotemporal regions significant influenced the in discrimination of bvFTD from nfvPPA (Step 3). For distinguishing svPPA from nfvPPA, the bilateral anterior temporal regions and left anterior cingulate cortex were of significantly influential in identifying svPPA, whereas the left frontal lobar regions were significant for identifying nfvPPA (Step 4).Abbreviations: FTD = frontotemporal dementia, bvFTD = behavioral variant frontotemporal dementia, PPA = primary progressive aphasia, svPPA = semantic variant primary progressive aphasia, nfvPPA = non-fluent/agrammatic variant primary progressive aphasia, AD = Alzheimer's disease, CN = cognitively normal Discriminative regions for each step. Each discriminative area corresponds to the group written in the same color. Examples of misclassified subjects. (A) The MRI scan shows no definite atrophy. (B) The MRI scan shows significant atrophy in the bilateral frontotemporal areas. The atrophy is slightly worse on the left side, which might have led to the misclassification.

Post-hoc assessment for misclassified subjects

We further performed a post-hoc assessment for misclassified subjects. A total of 89 out of 742 pairwise classifications did not match the clinical diagnosis. In a visual review of these scans, 22 of them only had subtle atrophy which were not suggestive of a single clinical diagnosis (Fig. 4A). Nine scans showed significant atrophy but the spatial pattern of cortical atrophy was shared by more than one clinical diagnosis (Fig. 4B). For the remainder of the misclassified subjects, it was not obvious from visual review why misclassification occurred. There were subjects who were misclassified in multiple steps. Fourteen subjects were misclassified in two steps, and one subject was misclassified in three steps (Supplementary Table S3).

Fig. 4

Examples of misclassified subjects. (A) The MRI scan shows no definite atrophy. (B) The MRI scan shows significant atrophy in the bilateral frontotemporal areas. The atrophy is slightly worse on the left side, which might have led to the misclassification.

Discussion

We developed a machine learning-based automated classifier for differential diagnosis of FTD clinical syndromes. We included carefully phenotyped FTD patients, for whom precise clinical diagnoses were made through a consensus decision, for the development of our classifier. This classifier was successful in discriminating among CN, AD, and FTD subtypes using MRI-based cortical thickness data. Since AD is the most prevalent cause of dementia and the most important etiology to be considered for patients with cognitive decline, we also included AD patients in our classification model. Methodologically, the proposed method based on the Laplace operator removed high-frequency components of cortical thickness data as noise, which made the classification more sensitive. This was possible because we were able to overcome the spatial variance due to noise while maintaining significant differences of shape especially in FTD and AD groups. Thus, we believe that one advance of our study was the application of a Laplace Beltrami operator to the cortical thickness data, which allowed us to reduce the contribution of noise to the classification. Moreover, visualization of discriminative regions was possible by transforming the discriminative patterns in the frequency domain to that in the surface domain. Therefore, our study clearly shows that the classifier models discriminate each patient group with relative importance weights distributed across multiple brain regions. Furthermore, automated classifiers are expected to help in the clinical diagnosis of patients with subtle cortical atrophy. While a clinician might be biased to look for only a few well-known structural changes in structural MRI, our automated classifier can identify minute changes in co-varying regions. For example, in svPPA, clinicians may only pay attention to the anterior temporal lobe, while our study demonstrates that the anterior cingulate cortex also has significance in discriminating svPPA from nfvPPA (Fig. 3). Compared with previous classifier studies, we used more diverse diagnostic categories of neurodegenerative dementia. Previous studies introduced classifiers discriminating AD patients from cognitively normal subjects, (Wee et al., 2013; Westman et al., 2013) as well as those distinguishing between AD, FTD, and CN groups(Davatzikos et al., 2008; Klöppel et al., 2008; Raamana et al., 2014; Moller et al., 2016; Bron et al., 2017; Bouts et al., 2018; Kloppel et al., 2008; Kloppel et al., 2015). Other studies demonstrated classifiers for individual level classification of PPA subtypes. (Wilson et al., 2009; Bisenius et al., 2017) In contrast, we built a more comprehensive classifier, which not only discriminates FTD, AD, and CN from each other, but also further classifies three clinical syndromes of FTD. We achieved good to excellent accuracies for classification between groups, especially between dementia groups. In discriminating FTD from AD, our classifier had an accuracy of 90.8%, demonstrating similar performance compared to literature reporting 72% to 89.2% accuracies. (Davatzikos et al., 2008; Klöppel et al., 2008; Raamana et al., 2014; Moller et al., 2016; Bron et al., 2017; Bouts et al., 2018; Kloppel et al., 2008; Kloppel et al., 2015) Our model classified subjects with nfvPPA and svPPA with 92.1% accuracy, which was similar to or higher than the 78% to 89% accuracies reported in previous studies. (Wilson et al., 2009; Bisenius et al., 2017; Agosta et al., 2015) We believe that both the surface-based feature extraction and Laplace-Beltrami operator-based noise removal have contributed to the improvement in performance. The patterns of discriminative regions for our classifiers are similar to previously known cortical atrophic patterns in each clinical syndrome. Although the discriminative regions of a certain group can vary depending on the combinations of groups compared, they generally reflect the structural changes occurring in that group. Compared with PPAs, the bvFTD group's discriminative regions showed right frontal predominance. This may have been influenced by the cortical atrophy pattern of bvFTD, which is known from past studies. (Rosen et al., 2002; Seeley et al., 2008) The discriminative regions of svPPA were most prominent in the left anterior temporal area, which is also consistent with results from previous studies. (Galton et al., 2001) The left frontal region was crucial in discriminating nfvPPA from other FTD clinical syndromes, also consistent with the previously known left frontal dominance in atrophy pattern in this condition. (Gorno-Tempini et al., 2006) Thus, our classifiers reflect the known cortical atrophy pattern for each clinical subtype. The four steps of our hierarchical classifier emulate the clinical decision process for diagnosing FTD patients. First, it is important to determine whether the patient's complaints are due to normal aging. (step 1) When the patient seems to have behavioral or language symptoms that cannot be considered as changes of normal aging, the physician still has to rule out AD (step 2) since the prevalence of AD is much higher than FTD, and AD patients can show similar symptoms. If the patient's symptoms and signs suggest FTD, the clinician tries to determine which clinical subtype is the most likely (steps 3–4). The classification accuracy of step 4 was highest among the four steps, probably due to the distinctiveness of the cortical atrophy pattern in svPPA patients. (Davies et al., 2009; Hodges and Patterson, 2007; Brambati et al., 2015) Although our main goal was to develop a hierarchical classifier to emulate the clinical diagnostic process, the classifiers for each step can also be utilized in clinical practice. For example, it is sometimes difficult to determine whether a patient's behavioral symptoms suggest bvFTD or AD with prominent frontal dysfunction. Additionally, it is often hard to tell which type of PPA a patient with mild language dysfunction has. In these cases, individual classifiers can be used selectively. We found that about 12% of classifications did not match the clinical diagnosis. This might be related to subtle atrophy or diffuse severe atrophy at the individual level. In our experiments, many of the misclassified patients (about 23%) had subtle cortical atrophy, which made it difficult for the classifier to capture the characteristic atrophy pattern. Indeed, such subtle changes in the cerebral cortex were barely detectable even in visual assessment performed by an expert neurologist. Another main reason for misclassification stems from the shared spatial pattern of cortical atrophy across multiple clinical diagnoses. As the supervised learning proposed in this study tries to detect different cortical atrophy patterns in two clinical diagnoses, similar atrophy patterns between them could lead to a misclassification. This issue is a well-known overfitting problem in supervised learning, and has been a major obstacle to the application of computer-aided diagnosis to medical images due to limited numbers of patients. Once we have more data, we believe this overfitting problem could be resolved, resulting in an improved performance for classification. This study has several limitations. First, we did not have pathological diagnoses for most FTD patients, although we included carefully phenotyped patients. It is important for a subject's clinical syndrome and underlying disease to be distinguished, as not only could patients with clinical AD dementia potentially have underlying frontotemporal lobar degeneration, but patients with FTD clinical syndromes could also have AD pathology. The purpose of this study was to predict clinical syndromes rather than underlying diseases. However, to enhance the homogeneity within the groups, we used the results of amyloid PET scans for inclusion of AD and CN subjects. Second, four FTD patients were amyloid (+) on PET, which leaves open the possibility that frontal variant AD might be included in the FTD group. However, since we diagnosed FTD patients through a consensus decision committee, amyloid (+) in these patients might be incidental findings. In fact, the prevalence of amyloid (+) in the FTD group seemed to be similar to that of cognitively normal individuals.(Engler et al., 2008) Third, the discriminative regions depicted in our figure provide information on the statistically significant regions used in discriminating one group from another, but do not clearly demonstrate whether the cortical regions are thicker or thinner. Fourth, because we developed the classifier using carefully phenotyped subjects, performance may not be as high when applied to patients in the early stages of these diseases whose clinical phenotypes are not conclusive yet. In future studies, it would be meaningful to conduct similar analysis using MRI scans acquired in the early stage of the clinical course from subjects who were later carefully phenotyped. Finally, there may be a concern for overfitting when training an LDA classifier with a relatively small number of data samples. We indeed compared the classification performance by incorporating additional regularization terms for the LDA classifier, which unfortunately could not improve the 10-fold CV performances. Our future studies will focus on developing computationally more generalizable methods with comparable or better classification accuracy using an external dataset for validation if possible.

Conclusion

With our fully automated classifier, cortical thickness data alone could classify FTD clinical subtypes and AD with good to excellent accuracy. Our classifier may help clinicians to diagnose FTD subtypes with subtle cortical atrophy and facilitate the selection of appropriate interventions. The following are the supplementary data related to this article. Confusion matrices of (A) hierarchical and (B) single, multi-label classification approaches. The receiver operating characteristic (ROC) curves for steps 1 to 4.

Supplementary Table S1

The original cortical thickness data was transformed into the frequency domain of about 250 dimensions, and the dimension was further reduced by PCA.

Supplementary Table S2

Accuracies for the classifiers were shown with 95% confidence intervals in brackets.

Supplementary Table S3

Numbers of subjects who were misclassified at multiple steps are shown according to the steps in which the subjects were misclassified.

Funding

This work was supported by the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) [No. NRF-2017R1A2B2005081]; Research of Korea Centers for Disease Control and Prevention [2018-ER6203-01]; and the National Research Foundation of Korea (NRF) grant funded by the Korea government (MSIP) [No. 2016R1A2B4014398].

33 in total

1. Anatomical correlates of early mutism in progressive nonfluent aphasia.

Authors: M L Gorno-Tempini; J M Ogar; S M Brambati; P Wang; J H Jeong; K P Rankin; N F Dronkers; B L Miller
Journal: Neurology Date: 2006-08-23 Impact factor: 9.910

2. Alzheimer Disease and Behavioral Variant Frontotemporal Dementia: Automatic Classification Based on Cortical Atrophy for Single-Subject Diagnosis.

Authors: Christiane Möller; Yolande A L Pijnenburg; Wiesje M van der Flier; Adriaan Versteeg; Betty Tijms; Jan C de Munck; Anne Hafkemeijer; Serge A R B Rombouts; Jeroen van der Grond; John van Swieten; Elise Dopper; Philip Scheltens; Frederik Barkhof; Hugo Vrenken; Alle Meije Wink
Journal: Radiology Date: 2015-12-11 Impact factor: 11.105

3. Cerebral amyloid-β PET with florbetaben (18F) in patients with Alzheimer's disease and healthy controls: a multicentre phase 2 diagnostic study.

Authors: Henryk Barthel; Hermann-Josef Gertz; Stefan Dresel; Oliver Peters; Peter Bartenstein; Katharina Buerger; Florian Hiemeyer; Sabine M Wittemer-Rump; John Seibyl; Cornelia Reininger; Osama Sabri
Journal: Lancet Neurol Date: 2011-04-08 Impact factor: 44.182

4. Regional magnetic resonance imaging measures for multivariate analysis in Alzheimer's disease and mild cognitive impairment.

Authors: Eric Westman; Carlos Aguilar; J-Sebastian Muehlboeck; Andrew Simmons
Journal: Brain Topogr Date: 2012-08-14 Impact factor: 3.020

5. Prediction of Alzheimer's disease and mild cognitive impairment using cortical morphological patterns.

Authors: Chong-Yaw Wee; Pew-Thian Yap; Dinggang Shen
Journal: Hum Brain Mapp Date: 2012-08-28 Impact factor: 5.038

6. Atrophy patterns in histologic vs clinical groupings of frontotemporal lobar degeneration.

Authors: J M S Pereira; G B Williams; J Acosta-Cabronero; G Pengas; M G Spillantini; J H Xuereb; J R Hodges; P J Nestor
Journal: Neurology Date: 2009-05-12 Impact factor: 9.910

7. Frontal paralimbic network atrophy in very mild behavioral variant frontotemporal dementia.

Authors: William W Seeley; Richard Crawford; Katya Rascovsky; Joel H Kramer; Michael Weiner; Bruce L Miller; Maria Luisa Gorno-Tempini
Journal: Arch Neurol Date: 2008-02

8. Three-Class Differential Diagnosis among Alzheimer Disease, Frontotemporal Dementia, and Controls.

Authors: Pradeep Reddy Raamana; Howard Rosen; Bruce Miller; Michael W Weiner; Lei Wang; Mirza Faisal Beg
Journal: Front Neurol Date: 2014-05-12 Impact factor: 4.003

9. Multiparametric computer-aided differential diagnosis of Alzheimer's disease and frontotemporal dementia using structural and advanced MRI.

Authors: Esther E Bron; Marion Smits; Janne M Papma; Rebecca M E Steketee; Rozanna Meijboom; Marius de Groot; John C van Swieten; Wiro J Niessen; Stefan Klein
Journal: Eur Radiol Date: 2016-12-16 Impact factor: 5.315

10. Automatic classification of MR scans in Alzheimer's disease.

Authors: Stefan Klöppel; Cynthia M Stonnington; Carlton Chu; Bogdan Draganski; Rachael I Scahill; Jonathan D Rohrer; Nick C Fox; Clifford R Jack; John Ashburner; Richard S J Frackowiak
Journal: Brain Date: 2008-01-17 Impact factor: 13.501

14 in total

Review 1. Machine Learning in Neuro-Oncology, Epilepsy, Alzheimer's Disease, and Schizophrenia.

Authors: Mason English; Chitra Kumar; Bonnie Legg Ditterline; Doniel Drazin; Nicholas Dietz
Journal: Acta Neurochir Suppl Date: 2022

2. Early Detection of Pancreatic Cancers Using Liquid Biopsies and Hierarchical Decision Structure.

Authors: Deepesh Agarwal; Obdulia Covarrubias-Zambrano; Stefan H Bossmann; Balasubramaniam Natarajan
Journal: IEEE J Transl Eng Health Med Date: 2022-06-27

3. Radiomics Model for Frontotemporal Dementia Diagnosis Using T1-Weighted MRI.

Authors: Benedetta Tafuri; Marco Filardi; Daniele Urso; Roberto De Blasi; Giovanni Rizzo; Salvatore Nigro; Giancarlo Logroscino
Journal: Front Neurosci Date: 2022-06-20 Impact factor: 5.152

Review 4. Neuroimaging in Frontotemporal Lobar Degeneration: Research and Clinical Utility.

Authors: Sheena I Dev; Bradford C Dickerson; Alexandra Touroutoglou
Journal: Adv Exp Med Biol Date: 2021 Impact factor: 2.622

Review 5. Imaging biomarkers in neurodegeneration: current and future practices.

Authors: Peter N E Young; Mar Estarellas; Emma Coomans; Meera Srikrishna; Helen Beaumont; Anne Maass; Ashwin V Venkataraman; Rikki Lissaman; Daniel Jiménez; Matthew J Betts; Eimear McGlinchey; David Berron; Antoinette O'Connor; Nick C Fox; Joana B Pereira; William Jagust; Stephen F Carter; Ross W Paterson; Michael Schöll
Journal: Alzheimers Res Ther Date: 2020-04-27 Impact factor: 6.982

6. Convolution neural network-based Alzheimer's disease classification using hybrid enhanced independent component analysis based segmented gray matter of T2 weighted magnetic resonance imaging with clinical valuation.

Authors: Shaik Basheera; M Satya Sai Ram
Journal: Alzheimers Dement (N Y) Date: 2019-12-28

7. An MRI-based strategy for differentiation of frontotemporal dementia and Alzheimer's disease.

Authors: Qun Yu; Yingren Mai; Yuting Ruan; Yishan Luo; Lei Zhao; Wenli Fang; Zhiyu Cao; Yi Li; Wang Liao; Songhua Xiao; Vincent C T Mok; Lin Shi; Jun Liu
Journal: Alzheimers Res Ther Date: 2021-01-12 Impact factor: 6.982

8. Deep Learning-Based Classification and Voxel-Based Visualization of Frontotemporal Dementia and Alzheimer's Disease.

Authors: Jingjing Hu; Zhao Qing; Renyuan Liu; Xin Zhang; Pin Lv; Maoxue Wang; Yang Wang; Kelei He; Yang Gao; Bing Zhang
Journal: Front Neurosci Date: 2021-01-21 Impact factor: 4.677

Review 9. Artificial Intelligence in Health Care: Current Applications and Issues.

Authors: Chan Woo Park; Sung Wook Seo; Noeul Kang; BeomSeok Ko; Byung Wook Choi; Chang Min Park; Dong Kyung Chang; Hwiyoung Kim; Hyunchul Kim; Hyunna Lee; Jinhee Jang; Jong Chul Ye; Jong Hong Jeon; Joon Beom Seo; Kwang Joon Kim; Kyu Hwan Jung; Namkug Kim; Seungwook Paek; Soo Yong Shin; Soyoung Yoo; Yoon Sup Choi; Youngjun Kim; Hyung Jin Yoon
Journal: J Korean Med Sci Date: 2020-11-02 Impact factor: 2.153

10. Differential Diagnosis of Frontotemporal Dementia, Alzheimer's Disease, and Normal Aging Using a Multi-Scale Multi-Type Feature Generative Adversarial Deep Neural Network on Structural Magnetic Resonance Images.

Authors: Da Ma; Donghuan Lu; Karteek Popuri; Lei Wang; Mirza Faisal Beg
Journal: Front Neurosci Date: 2020-10-22 Impact factor: 4.677