Literature DB >> 34141110

Landmark annotation and mandibular lateral deviation analysis of posteroanterior cephalograms using a convolutional neural network.

Saori Takeda¹, Yuichi Mine¹, Yuki Yoshimi², Shota Ito², Kotaro Tanimoto², Takeshi Murayama¹.

Abstract

BACKGROUND/
PURPOSE: Facial asymmetry is relatively common in the general population. Here, we propose a fully automated annotation system that supports analysis of mandibular deviation and detection of facial asymmetry in posteroanterior (PA) cephalograms by means of a deep learning-based convolutional neural network (CNN) algorithm.
MATERIALS AND METHODS: In this retrospective study, 400 PA cephalograms were collected from the medical records of patients aged 4 years 2 months-80 years 3 months (mean age, 17 years 10 months; 255 female patients and 145 male patients). A deep CNN with two optimizers and a random forest algorithm were trained using 320 PA cephalograms; in these images, four PA landmarks were independently identified and manually annotated by two orthodontists.
RESULTS: The CNN algorithms had a high coefficient of determination (R 2 ), compared with the random forest algorithm (CNN-stochastic gradient descent, R 2  = 0.715; CNN-Adam, R 2  = 0.700; random forest, R 2  = 0.486). Analysis of the best and worst performances of the algorithms for each landmark demonstrated that the right latero-orbital landmark was most difficult to detect accurately by using the CNN. Based on the annotated landmarks, reference lines were defined using an algorithm coded in Python. The CNN and random forest algorithms exhibited similar accuracy for the distance between the menton and vertical reference line.
CONCLUSION: Our findings imply that the proposed deep CNN algorithm for detection of facial asymmetry may enable prompt assessment and reduce the effort involved in orthodontic diagnosis.

Entities: Chemical

Keywords: Artificial intelligence; Convolutional neural network; Deep learning; Mandibular deviation; Posteroanterior cephalograms

Year: 2020 PMID： 34141110 PMCID： PMC8189930 DOI： 10.1016/j.jds.2020.10.012

Source DB: PubMed Journal: J Dent Sci ISSN： 1991-7902 Impact factor: 2.080

Introduction

Facial asymmetry is commonly encountered in the general population. Generally, lateral deviation of the mandible is easily recognized in patients with severe facial asymmetry; this manifestation causes concern for patients and leads to functional impairment. Therefore, facial symmetry affects satisfaction with orthodontic treatment., Cephalometric radiography is a diagnostic tool used to quantitatively analyze the craniomaxillofacial skeleton during orthodontic treatment planning. Broadbent introduced cephalometric radiography in 1931; cephalometric analysis has since been an important diagnostic procedure during orthodontic treatment. In orthodontic treatment planning, assessment of the midline and mandibular deviation in posteroanterior (PA) cephalograms are complex but clinically important steps. Notably, additional orthodontic training is needed to accurately evaluate the midline and mandibular deviation. In recent years, deep learning (a branch of artificial intelligence) has developed rapidly and has shown potential for solving complicated medical tasks. Convolutional neural networks (CNNs), a type of deep learning inspired by human vision, have performed well in image classification tasks because of their robust abilities to automatically learn important features from images in healthcare, including the dental field. Research involving artificial intelligence algorithms is ongoing with respect to the detection of caries, and apical lesions, diagnosis of periodontal disease, and oral cancer,, and implementation of maxillofacial prosthetic rehabilitation. Overall, lateral cephalometric analysis has been demonstrated to automatically identify anatomical landmarks.14, 15, 16, 17, 18 In 2014 and 2015, public competitions for automatic lateral cephalometric landmark detection were held at the Institute of Electrical and Electronics Engineers International Symposium on Biomedical Imaging, with the goal of establishing an integrative approach to biomedical image analysis., These competitions encouraged the continuing development of various approaches for automatic detection of lateral cephalometric landmarks. Here, we propose a fully automated annotation system that supports analysis of mandibular deviation and detection of facial asymmetry in PA cephalograms by means of a deep learning-based CNN algorithm. In the first part of this study, four landmarks used for mandibular deviation analysis were annotated in PA cephalograms; the CNN algorithm with a stochastic gradient descent (SGD) optimizer (i.e., CNN-SGD) showed the best experimental performance. In the second part of this study, two reference lines were automatically defined and mandibular deviation was measured to aid in prompt detection of facial asymmetry.

Materials and methods

Dataset

This study was approved by the Ethical Committee for Epidemiology of Hiroshima University (Approval Number: E−2119). Four hundred PA cephalograms were collected from the medical records of patients aged 4 years 2 months–80 years 3 months (mean age, 17 years 10 months; 255 female patients and 145 male patients). All images were recorded in DICOM format by using a cephalometric scanner (CX-150 W; Asahi Roentgen Ind. Co., Ltd, Kyoto, Japan). The original image resolution was 1648 × 1980 pixels with pixel spacing of 0.15 mm; images were resized to 256 × 256 pixels. Subsequently, the images were randomly divided into a training set (320 images) and a test set (80 images). Two orthodontists (12 and 6 years of experience, respectively) independently identified and manually annotated four PA cephalometric landmarks: neck of crista galli, right latero-orbital, left latero-orbital, and menton (Me) (Fig. 1). The X, Y coordinate values for each landmark were recorded as datasets and defined as the ground truth locations. Then, the horizontal reference line (a straight line connecting the right and left latero-orbital landmarks) and vertical reference line (VRL; a perpendicular line to the horizontal reference line through the neck of crista galli) were defined (Fig. 1). All landmark and line definitions employed Sassouni analysis, which is the most commonly used method for assessment of asymmetry.

Figure 1

Landmarks and prediction points on posteroanterior cephalograms. NC, neck of crista galli; Lo, right latero-orbital; Lo’, left latero-orbital; Me, menton; HRL, horizontal reference line; VRL, vertical reference line.

Deep learning-based CNN and random forest algorithms

All procedures were performed with an Intel Core i7-9750H 2.60 GHz CPU (Intel, Santa Clara, CA, USA), 16.0 GB RAM, and NVIDIA GeForce RTX 2070 MAX-Q 8.0 GB GPU (NVIDIA, Santa Clara, CA, USA). CNNs were constructed using Python and implemented using the Keras framework for deep learning, with TensorFlow as the backend. Supervised learning was implemented by means of two machine learning approaches: a deep learning-based CNN and a random forest algorithm (i.e., a robust decision tree-based machine learning algorithm). The overall CNN architecture is shown in Fig. 2. In the CNN learning process, optimizers play crucial roles in model training., Therefore, two CNN optimizers were employed in this study: SGD and Adam, with learning rates of 1.8 × 10−3 and 1.8 × 10−6, respectively. The total epoch number was 5000. The output layer had eight nodes, defined as individual pairs of X, Y coordinate values for right latero-orbital, left latero-orbital, neck of crista galli, and Me landmarks. The random forest hyper parameters were implemented with default settings. Following landmark annotation, the horizontal reference line and VRL were automatically defined with an algorithm coded in Python.

Figure 2

Architecture of the convolutional neural network used in this study. ReLU, rectified linear unit.

Performance metrics

Eighty test images were used to validate accuracy and computational efficacy. The CNN and random forest performances were assessed based on the coefficient of determination (R2). All annotation results were analyzed as the successful detection rate (in %) for four precision measurements, in accordance with previous reports., The successful detection rate was defined as the proportion of corresponding landmarks within 2 mm, 2.5 mm, 3 mm, and 4 mm from the ground truth location. The distance from the VRL to the Me was measured to determine mandibular deviation. To evaluate algorithm performance, mean absolute error (MAE) was calculated using the following formula:where D is the ground truth distance from the VRL to the Me and D’ is the predicted distance from the predicted VRL to the predicted Me.

Results

Table 1 shows the algorithm performances during prediction of the four landmarks. The CNN algorithms had high R values, compared with the random forest algorithm (CNN-SGD, R = 0.715; CNN-Adam, R = 0.700; random forest, R = 0.486). Table 2 summarizes the best and worst performances of the algorithms for each landmark. Notably, the right latero-orbital landmark was most difficult to detect accurately by using the CNN. The successful detection rates of the CNN and random forest algorithms are shown in Fig. 3. Compared with the random forest algorithm, the CNN-SGD algorithm exhibited an approximately 5% higher successful detection rate across all precision ranges.

Table 1

Performance evaluation of proposed algorithms for landmark prediction.

Algorithm	Optimizer	R^2
CNN	SGD	0.715
	Adam	0.700
Random forest	–	0.486

CNN, convolutional neural network; SGD, stochastic gradient descent.

Table 2

Successful detection rates for four landmarks at four precision ranges.

Algorithm	Optimizer	Landmark	SDR (%)
Algorithm	Optimizer	Landmark	<2.0 mm	<2.5 mm	<3.0 mm	<4.0 mm
CNN	SGD	Mn	26	33	45	63
		Nc	30	41	50	62
		Lo	26	35	42	60
		Lo'	36	48	55	70
	Adam	Mn	27	41	45	57
		Nc	32	38	47	65
		Lo	22	27	37	51
		Lo'	23	31	38	58
Random forest	–	Mn	12	13	26	35
		Nc	37	48	53	67
		Lo	22	36	40	55
		Lo'	26	37	43	60

SDR, successful detection rate; CNN, convolutional neural network; SGD, stochastic gradient descent.

Figure 3

Success detection rates of the proposed convolutional neural network and random forest algorithms for 2.0-mm, 2.5-mm, 3.0-mm, and 4.0-mm precision ranges for the four landmarks assessed in this study. CNN, convolutional neural network; SGD, stochastic gradient descent.

Performance evaluation of proposed algorithms for landmark prediction. CNN, convolutional neural network; SGD, stochastic gradient descent. Successful detection rates for four landmarks at four precision ranges. SDR, successful detection rate; CNN, convolutional neural network; SGD, stochastic gradient descent. Success detection rates of the proposed convolutional neural network and random forest algorithms for 2.0-mm, 2.5-mm, 3.0-mm, and 4.0-mm precision ranges for the four landmarks assessed in this study. CNN, convolutional neural network; SGD, stochastic gradient descent. Based on the annotated landmarks, the horizontal reference line and VRL were defined using an algorithm coded in Python. Representative predicted reference lines are shown in Fig. 4. The distances between the Me and VRL were determined for ground truth and predicted landmarks, respectively. MAEs between ground truth and predicted distances were similar for CNN and random forest algorithms (Table 3).

Figure 4

Table 3

Performance evaluation of proposed algorithm in terms of mean absolute error for distance from the vertical reference line.

Algorithm	Optimizer	MAE	Max	Min	Median
CNN	SGD	1.67 ± 1.77	7.957	0.038	1.147
	Adam	1.69 ± 1.63	8.374	0.013	1.174
Random forest	–	1.80 ± 1.81	8.298	0.005	1.295

CNN, convolutional neural network; SGD, stochastic gradient descent; MAE, mean absolute error.

Representative annotations of landmarks and definitions of reference lines. Ground truth landmarks (blue), convolutional neural network-stochastic gradient descent predictions (yellow), and predicted reference lines are shown. NC, neck of crista galli; Lo, right latero-orbital; Lo’, left latero-orbital; Me, menton; HRL, horizontal reference line; VRL, vertical reference line. Performance evaluation of proposed algorithm in terms of mean absolute error for distance from the vertical reference line. CNN, convolutional neural network; SGD, stochastic gradient descent; MAE, mean absolute error.

Discussion

Various artificial intelligence algorithms have been proposed and applied to medical research. Here, we applied two artificial intelligence algorithms (i.e., CNN and random forest) to orthodontic diagnosis; our proposed system was able to automatically annotate landmarks and measure mandibular lateral deviation by means of PA cephalograms. CNNs typically consist of three types of layers: convolution, pooling, and fully connected. Convolution layers are composed of multiple feature maps, whereas pooling layers are inserted periodically between convolution layers to reduce the number of parameters in the network. These two types of layers perform feature map extraction of the images; extracted features are then transformed into the fully connected layer. The algorithm does not require manual feature extraction and does not necessarily require manual segmentation of the target (e.g., tumor or organ) by human experts. However, CNNs are computationally intensive because they require large amounts of data to estimate millions of trainable parameters. Random forest is an ensemble algorithm that builds randomized decision trees and incorporates a variety of features into its classification process; importantly, it can deter overfitting by a “majority voting” approach. Thus, the random forest algorithm has crucial advantages relative to other algorithms; these include effectiveness in multiclass classification and regression tasks, rapid training speed, ease of implementation in parallel computation, tuning simplicity, robustness to noise, and ability to handle highly non-linear biological data. Although prior studies have used lateral cephalograms, rather than PA cephalograms, machine learning-based algorithms such as random forest and CNN have been proposed for landmark annotation. Random forest algorithms have been proposed for some landmark annotation systems., In a recent study, You-Only-Look-Once version 3, a CNN specialized for real-time object detection, was applied to lateral cephalograms for landmark detection. These systems are effective for a broad range of landmark annotation. Selection of an optimizer is an important step in deep learning-based CNNs that influences model performance., The SGD and Adam optimizers were employed in this study because of their widespread use in prior investigations.,,,, However, there is no established guideline for optimizer selection. Thus, researchers rely on empirical studies and comparative benchmarking. In the present study, the SGD optimizer showed the best experimental performance, in terms of the successful detection rate. Orthodontists may choose any of several methods for setting the facial midline, such as drawing a perpendicular line at the midpoint between two landmarks on either side, or connecting the left and right landmarks with a horizontal line and drawing a perpendicular line passing through a landmark located near the midline of the face. It is challenging to define facial midline due to the influences of vertical and horizontal errors in setting each landmark. Thus, automatic detection of the neck of crista galli and other PA landmarks by means of artificial intelligence may be a promising clinical technique that facilitates definition of the facial midline. A limitation of this study was that the successful detection rates were relatively moderate, compared with previously reported landmark detection systems that used lateral cephalograms; this might have had substantial effects on the final measurement of mandibular deviation. Despite the use of symmetrical landmarks, the right latero-orbital landmark showed lower successful detection rates, compared with the left latero-orbital landmark. The difference in annotations between the two experts may have been greater for the right latero-orbital landmark in our dataset. In addition, craniofacial growth continues with advancing age in humans, a phenomenon widely accepted in current medical literature., We included 400 patients in our small dataset; their ages ranged from 4 to 80 years. Given the changes that occur in the facial skeleton over time, this is a fairly wide age range. We presume that many sophisticated datasets evaluated by consensuses involving several experts will contribute to the improvement of successful detection rates in the future. The Sassouni analysis employed in this study is a widely used method in which lines connecting the left and right latero-orbital landmarks are used as the horizontal reference plane, while the perpendicular line passing through the neck of crista galli is regarded as the facial midline. Some investigators have concluded that the neck of crista galli is among the landmarks with the greatest inter-inspector error in PA cephalometric analysis. It is important to emphasize that the final evaluation of facial symmetry requires hard tissue evaluation by PA cephalometric analysis, as well as soft tissue evaluation with facial photographs., Recently, several methods have been reported for evaluating facial symmetry via simultaneous analysis of hard and soft tissue characteristics in computed tomography images., The use of CNNs for three-dimensional evaluation may provide an important diagnostic tool in the future. In this study, we annotated PA cephalometric landmarks that contribute to the determination of reference lines in the Sassouni analysis, using deep learning-based CNN algorithms; we evaluated the precision of this annotation. Additionally, we described systems that could automatically measure mandibular deviation to aid in the detection of facial asymmetry. Although further improvement may be necessary for clinical implementation, the proposed application of deep CNNs for detection of facial asymmetry offers a promising technique that might reduce the effort involved in orthodontic diagnosis. Future studies should focus on building a comprehensive diagnostic system that includes lateral cephalometric analysis and three-dimensional evaluation.

Declaration of competing interest

All authors declare no conflicts of interest.

27 in total

1. Discriminative thresholds of cephalometric indexes in the subjective evaluation of facial asymmetry.

Authors: Naoya Masuoka; Atsushi Muramatsu; Yoshiko Ariji; Hiroyuki Nawa; Shigemi Goto; Eiichiro Ariji
Journal: Am J Orthod Dentofacial Orthop Date: 2007-05 Impact factor: 2.650

Review 2. Improvement of oral cancer screening quality and reach: The promise of artificial intelligence.

Authors: Ankita Kar; Volkert B Wreesmann; Vineeth Shwetha; Shalini Thakur; Vishal U S Rao; Gururaj Arakeri; Peter A Brennan
Journal: J Oral Pathol Med Date: 2020-05-28 Impact factor: 4.253

Review 3. A survey on deep learning in medical image analysis.

Authors: Geert Litjens; Thijs Kooi; Babak Ehteshami Bejnordi; Arnaud Arindra Adiyoso Setio; Francesco Ciompi; Mohsen Ghafoorian; Jeroen A W M van der Laak; Bram van Ginneken; Clara I Sánchez
Journal: Med Image Anal Date: 2017-07-26 Impact factor: 8.545

4. Artificial intelligence in orthodontics : Evaluation of a fully automated cephalometric analysis using a customized convolutional neural network.

Authors: Felix Kunz; Angelika Stellzig-Eisenhauer; Florian Zeman; Julian Boldt
Journal: J Orofac Orthop Date: 2019-12-18 Impact factor: 1.938

5. A benchmark for comparison of dental radiography analysis algorithms.

Authors: Ching-Wei Wang; Cheng-Ta Huang; Jia-Hong Lee; Chung-Hsing Li; Sheng-Wei Chang; Ming-Jhih Siao; Tat-Ming Lai; Bulat Ibragimov; Tomaž Vrtovec; Olaf Ronneberger; Philipp Fischer; Tim F Cootes; Claudia Lindner
Journal: Med Image Anal Date: 2016-02-28 Impact factor: 8.545

6. A new classification of mandibular asymmetry and evaluation of surgical-orthodontic treatment outcomes in Class III malocclusion.

Authors: Yi-Jane Chen; Chung-Chen Yao; Zwei-Chieng Chang; Hsiang-Hua Lai; Shao-Chun Lu; Sang-Heng Kok
Journal: J Craniomaxillofac Surg Date: 2016-03-29 Impact factor: 2.078

7. Applying deep artificial neural network approach to maxillofacial prostheses coloration.

Authors: Yuichi Mine; Shunsuke Suzuki; Toru Eguchi; Takeshi Murayama
Journal: J Prosthodont Res Date: 2019-09-22 Impact factor: 4.642

8. Random Forest ensembles for detection and prediction of Alzheimer's disease with a good between-cohort robustness.

Authors: A V Lebedev; E Westman; G J P Van Westen; M G Kramberger; A Lundervold; D Aarsland; H Soininen; I Kłoszewska; P Mecocci; M Tsolaki; B Vellas; S Lovestone; A Simmons
Journal: Neuroimage Clin Date: 2014-08-28 Impact factor: 4.881

9. Deep learning-based survival prediction of oral cancer patients.

Authors: Dong Wook Kim; Sanghoon Lee; Sunmo Kwon; Woong Nam; In-Ho Cha; Hyung Jun Kim
Journal: Sci Rep Date: 2019-05-06 Impact factor: 4.379

10. Deep Learning for the Radiographic Detection of Periodontal Bone Loss.

Authors: Joachim Krois; Thomas Ekert; Leonie Meinhold; Tatiana Golla; Basel Kharbot; Agnes Wittemeier; Christof Dörfer; Falk Schwendicke
Journal: Sci Rep Date: 2019-06-11 Impact factor: 4.379

1 in total

1. Automated segmentation of articular disc of the temporomandibular joint on magnetic resonance images using deep learning.

Authors: Shota Ito; Yuichi Mine; Yuki Yoshimi; Saori Takeda; Akari Tanaka; Azusa Onishi; Tzu-Yu Peng; Takashi Nakamoto; Toshikazu Nagasaki; Naoya Kakimoto; Takeshi Murayama; Kotaro Tanimoto
Journal: Sci Rep Date: 2022-01-07 Impact factor: 4.379

1 in total