Literature DB >> 35079025

A novel intelligent system based on adjustable classifier models for diagnosing heart sounds.

Shuping Sun¹, Tingting Huang², Biqiang Zhang², Peiguang He², Long Yan², Dongdong Fan², Jiale Zhang², Jinbo Chen².

Abstract

A novel intelligent diagnostic system is proposed to diagnose heart sounds (HSs). The innovations of this system are primarily reflected in the automatic segmentation and extraction of the first complex sound [Formula: see text] and second complex sound [Formula: see text]; the automatic extraction of the secondary envelope-based diagnostic features [Formula: see text], [Formula: see text], and [Formula: see text] from [Formula: see text] and [Formula: see text]; and the adjustable classifier models that correspond to the confidence bounds of the Chi-square ([Formula: see text]) distribution and are adjusted by the given confidence levels (denoted as [Formula: see text]). The three stages of the proposed system are summarized as follows. In stage 1, the short time modified Hilbert transform (STMHT)-based curve is used to segment and extract [Formula: see text] and [Formula: see text]. In stage 2, the envelopes [Formula: see text] and [Formula: see text] for periods [Formula: see text] and [Formula: see text] are obtained via a novel method, and the frequency features are automatically extracted from [Formula: see text] and [Formula: see text] by setting different threshold value ([Formula: see text]) lines. Finally, the first three principal components determined based on principal component analysis (PCA) are used as the diagnostic features. In stage 3, a Gaussian mixture model (GMM)-based component objective function [Formula: see text] is generated. Then, the [Formula: see text] distribution for component k is determined by calculating the Mahalanobis distance from [Formula: see text] to the class mean [Formula: see text] for component k, and the confidence region of component k is determined by adjusting the optimal confidence level [Formula: see text] and used as the criterion to diagnose HSs. The performance evaluation was validated by sounds from online HS databases and clinical heart databases. The accuracy of the proposed method was compared to the accuracies of other state-of-the-art methods, and the highest classification accuracies of [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], [Formula: see text], 99.67[Formula: see text] and 99.91[Formula: see text] in the detection of MR, MS, ASD, NM, AS, AR and VSD sounds were achieved by setting [Formula: see text] to 0.87,0.65,0.67,0.65,0.67,0.79 and 0.87, respectively.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35079025 PMCID： PMC8789933 DOI： 10.1038/s41598-021-04136-4

Source DB: PubMed Journal: Sci Rep ISSN： 2045-2322 Impact factor: 4.379

Introduction

Background

As an efficient method, using heart sound (HS) analysis is often used to evaluate heart function; this approach has been widely used to diagnose heart disease and evaluate heart functions, such as congenital heart disease classification[1], ventricular septal defect detection[2], blood pressure estimation[3] and congenital heart disease screening[4], for children and adults. A normal HS is primarily composed of two basic sounds: the first sound () which is generated by the closing of aortic valves and the vibrations associated with tensing of the chordate trendiness and the ventricular walls, the second sound () is produced by the closure of the aortic and pulmonic valves at the beginning of is volumetric ventricular relaxation. However, HSs with unitary murmurs generally occur between and with different noise patterns[5]. Therefore, analyses of , , and the period between and play important roles in characterizing HS features with different types of information. Detailed information for , , and the sounds between and can be used to accurately classify HS. Additionally, to avoid analyzing the sounds between and , which are generally segmented from HSs with low accuracy, and part of the period between and are integrated to obtain , and and the part of the period between and are integrated to form . Then, the features are efficiently extracted from and . Finally, a classification method is established to diagnose heart diseases.

Need for research

and extraction The studies regarding HS segmentation can be summarized into two branches: one branch includes studies that segment each cardiac cycle into a sequence of four heart stages: Systole period Diastole period[6,7]. As a result, the four fundamental stages to be segmented are different due to the nonstationary nature of an abnormal HSs signal and the effect of background noise. The other branch includes studies that segment a periodic HSs into a sequence of two heart stages, which are expressed as based on the STMHT algorithm; this approach was reported to be successfully applied in diagnosing heart diseases, such as in ventricular septal defect (VSD) diagnosis[8] and several kinds of heart disease diagnosis[9]. Moreover, study[9] noted that the use of frequency features was more efficient in distinguishing normal from abnormal sounds than was the use of time features. Therefore, an efficient frequency feature extraction method should be developed. Feature extraction As an important component of efficient feature extraction, the frequency width of the envelope over a given threshold value () has been verified to be useful for detecting heart diseases[8-11]. However, for many types of HSs, it is difficult to extract frequency widths with an unsuitable due to the existence of a non smooth envelope. To extract the frequency widths for a smooth envelope without setting different values, the smooth envelope can be treated as a secondary envelope, as proposed in[9], and used to automatically extract the frequency feature matrix based on the STMHT technique; this method was successfully applied to detect different types of heart diseases. However, for mitral stenosis and mitral regurgitation noises, the feature matrix was not easily extracted because the second frequency component was missing. Therefore, to improve the classification accuracy for diagnosing different types of heart disease and simplify the complexity of the diagnostic method, the smooth envelopes for and extraction in the frequency domain must be considered; additionally, more frequency widths corresponding to different values should be used, and dimensionality reduction should be employed to reduce the number of features considered . Such a classification method could be applied in the efficient extraction of features for diagnosing heart diseases. Classifier model Gaussian mixture models (GMMs) have been used in a wide variety of clustering applications[12-18] due to their powerful mathematical characteristics. Confidence regions are used to diagnose the detection data x in GMMs, and the optimal confidence regions is determined based on Mahalanobis distance following the Chi-square () distribution. Thus, classifier models with adjustable sizes corresponding to the confidence bounds of the Chi-square () distribution, which can be adjusted by changing the desired confidence level (denoted as ), are proposed. The confidence bounds used as the classification criteria are employed to diagnose heart diseases.

Major contributions and organization

In summary, this study proposes an innovative and intelligent system. The major contributions in this study are (1) the STMHT-based and are automatically located and extracted; (2) a novel method for obtaining the secondary curves of and are extracted in the frequency domain; (3) frequency features are automatically extracted over the given threshold value; (4) the diagnostic features , and are determined based on PCA; and (5) the confidence region of the distribution, which are adjusted based on the desired , is determined and used as the classification criterion for diagnosing a given HS. The remainder of this paper is organized as follows. Section “Methodology” presents the approach for determining the diagnostic features , and a definition of the confidence region-based diagnostic method for diagnosing heart diseases. In “Performance evaluation” section, the performance of the proposed method is compared with that of other efficient methods for diagnosing heart diseases. In “Conclusion” section, the conclusions are provided. Finally, the future study is pointed out in “Future study”.

Methodology

This study was approved by the ethics committee of Nanyang Institute of Technology (Approval Number:2016-06) and the informed consent was waived by the ethics committee of Nanyang Institute of Technology. The present study was also conducted in accordance with the tenets of the 1975 Declaration of Helsinki, as revised in 2008[19]. The flow chart of the proposed intelligent system, shown in Fig. 1, consists of three stages: the automatic location and extraction of and ; the automatic determination of frequency features , and ; and the establishment of the Mahalanobis distance criterion-based diagnostic method. In stage 1, the STMHT-based curve (denoted as ), which is extracted for the envelope generated by the HS, is used to segment and extract and from the HS (Fig. 1A). In stage 2, the envelopes and for every period and are obtained via a novel method, and the frequency features are automatically extracted from and by setting different lines. Finally, the first three principal components, , and , which express of the information, are determined and used as diagnostic features (Fig. 1B, C). In stage 3, the GMM-based mixed classification objective function which combines component k with respect to the parameters , , and and the features , is generated. Then, the distribution for component k is determined by calculating the Mahalanobis distance from x to the class mean of component k, and the adjustable confidence bound (denoted as shown in Fig. 1E) is determined to diagnose heart diseases.

Figure 1

Flow chart of the proposed methodology.

Stage 1: Automatic extraction of and

As shown in Fig. 2, five steps consisting of heart sound auscultation, heart sound preprocessing, heart sound envelope extraction, STMHT extraction, and and extraction, which is used to construct the procedure of the and extraction and is detailed in the following steps.

Figure 2

Flow chart of the and extraction.

Step A: Heart sound auscultation

Auscultation is performed for the purposes of examination cardiovascular. As described in previous study[8], the original heart sound, denoted as (colored in blue line as shown in Fig. 2), are collected by 3M-3200 electronic stethoscope with a Hz sample rate which is widely used by many doctors and produced by American 3M company[20], and the tricuspid area is selected as the auscultation area due to the tricuspid area reported to supply more important information[21]. Meanwhile, you can hear the sounds when auscultating heart sounds, ensuring that we avoid as much environmental noise as possible during the auscultation procedure. Even so, the collected heart sounds still need to be preprocessing for canceling the invalid components.

Step B: WD-based heart sound preprocessing

HSs are reported to be primarily dispersed in the frequency range of 20700 Hz[2,8,9]. Therefore, according to the sampling frequency ( kHz), WD-based HSs are filtered to obtain the efficient frequency components ( Hz). The Daubechies wavelet 10 (dB10) has been used to give the maximum signal-to-noise ratio and minimum root-mean-square error for HSs[22]. Therefore, dB10 is selected for use as the mother wavelet for preprocessing HSs. A filtered and normalized sound, colored by gray and denoted as , is shown in Fig. 2.

Step C: heart sound envelope extraction

The Viola integral-based envelope, denoted as , is extracted from the heart sound , as reported in studies[8,9]; this envelope can effective overcome amplitude variations and complex backgrounds and noise. This concept is described as follows. Consider a filtered sound for , where M denotes the number of HSs. In a neighborhood of time m, called the width time scale, the M-point envelope is obtained by Eq. (1):where if the duration of or greater than 0.13 s. Finally, normalization is performed by setting the maximum amplitude of to 1 (Fig. 2).

Step D: STMHT extraction for HS

Given an M-point HS, the STMHT for the HSs , , is computed from Eq. (3)where , and is a moving window of odd length N. According to studies[2,8], the length N is set to 44101.

Step E: Automatic extraction of and

The characteristics of considered in studies[2,8], as shown in Fig. 3A, C, are summarized as follows: The negative-to-positive (N2P) points of , denoted by , correspond to the geometry center peaks of and ; The geometry center between and , denoted by is determined by the positive-to-negative P2N points of . Moreover, the interval from to is generally greater than that from to in one period of an HS[23-25]. Therefore, the N2P and P2N-based and features can be automatically segmented from one period of an HS and extracted by two procedures, as described as follows. The automatic extraction procedures for and are illustrated in Fig. 3. Figure 3(A, B) show a typical AR sound, and the typical NM sound is shown in Fig. 3(C, D).

Figure 3

The automatic extraction procedures for and . A-B show the procedure for an example of a typical AR from the database in[26]. C-D show the procedure for an example of a typical normal sound database[27].

and location The algorithm for detecting and is detailed as follows. First, the signum function of , denoted as , is calculated by Then, the variation in () is determined from Eq. (6) Finally, N2P and P2N are determined by Automatic extraction of and Calculate the difference between two adjacent N2Ps, denoted as , with Eq. (8) Determine the points and that are used for segmentation from to and from to , respectively, by using Eq. (9). Extract (denoted as ) and (denoted as ) for the ith period of an HS as follows The automatic extraction procedures for and . A-B show the procedure for an example of a typical AR from the database in[26]. C-D show the procedure for an example of a typical normal sound database[27]. Example of feature definition and automatic extraction.

Stage 2: Automatic feature generation

Feature definition

To extract the efficient frequency widths, as shown in Fig. 4, the smooth envelopes for and in the frequency domain are firstly generated, and then the frequency widths corresponding to different Thv values are extracted. The frequency widths over a given threshold value are defined and calculated bywhere and are the left and right intersections, respectively, of and over the lines (=0.3, 0.5 and 0.8). Moreover, the frequency features are expressed based on Eq. (15) and described in Table 1.

Figure 4

Example of feature definition and automatic extraction.

Table 1

Description of the frequency domain feature matrix .

Feature index	Feature’s symbol	Feature description	Unit
1	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{1}}_{\mathrm{FW1}}$$\end{document}CS1FW1	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{1}$$\end{document}CS1 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.3$$\end{document}Thv=0.3	Hz
2	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{1}}_{\mathrm{FW2}}$$\end{document}CS1FW2	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{1}$$\end{document}CS1 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.5$$\end{document}Thv=0.5	Hz
3	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{1}}_{\mathrm{FW3}}$$\end{document}CS1FW3	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{1}$$\end{document}CS1 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.8$$\end{document}Thv=0.8	Hz
4	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{1}}_{\mathrm{G}}$$\end{document}CS1G	The Center of gravity of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{1}$$\end{document}CS1 in frequency-domain	Hz
5	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{2}}_{\mathrm{FW1}}$$\end{document}CS2FW1	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{2}$$\end{document}CS2 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.3$$\end{document}Thv=0.3	Hz
6	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{2}}_{\mathrm{FW2}}$$\end{document}CS2FW2	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{2}$$\end{document}CS2 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.5$$\end{document}Thv=0.5	Hz
7	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{2}}_{\mathrm{FW3}}$$\end{document}CS2FW3	The frequency width of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{2}$$\end{document}CS2 corresponding to \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${Thv}=0.8$$\end{document}Thv=0.8	Hz
8	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS _{2}}_{\mathrm{G}}$$\end{document}CS2G	The Center of gravity of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{2}$$\end{document}CS2 in frequency-domain	Hz

Secondary envelopes and generation: Given an M-point HS, the secondary envelope in the frequency-domain, denoted as , can be calculated from Eq. (11): where , and are defined by Eq. (12): is the absolute value sign, is the first window width, and is the second window width. According to studies[9,28], and are set to 9 and 17, respectively. Moreover, is also normalized by setting the maximum amplitude of to 1. The secondary envelopes for and , denoted as and respectively, are illustrated by using the examples described in Fig. 3 which are first automatically generated based on Eq. (11), are shown in Fig. 4, where the plots in Fig. 4A.2 describe the results of corresponding to in Fig. 4A.1, the plots in Fig. 4B.2 describe the results of which corresponds to in Fig. 4B.1, the plots in Fig. 4A.3 describe the results of corresponding to in Fig. 4A.1, and the plots in Fig. 4B.3 describe the results of which corresponds to in Fig. 4B.1. Definition and automatic extraction of frequency features: The frequency features are illustrated in Fig. 4(B, C), and their gravities are calculated by Description of the frequency domain feature matrix .

Experimental results for several typical types of heart disease

The features of six typical and normal sounds are illustrated in Fig. 5. From Fig. 5, and are first automatically located and extracted, then, the envelopes for every and are extracted by Eq. (11). Finally, the features defined by Eq. (15) for and in the frequency domain are automatically extracted with Eqs. (13-14). The experimental sounds are 665-period AR sounds (3M database[29], medical sound library[30], heart auscultation sounds[31], auscultation sound[32], continuing medical implementation[26], sounds Database of the University of Dundee[27], and patients only with AR disease from the Nanyang First People’s Hospital), 381-period AS sounds (continuing medical implementation[26], sounds database of the University of Dundee[27], 3M database[29], medical sound library[30], auscultation sound[32], and patients only with AS disease from the Nanyang first People’s Hospital, and heart auscultation sounds[31]), 315-period ASD sounds (Medical sound library[30], heart auscultation sounds[31], 3M database[29], patients only with ASD disease from the Nanyang First People’s Hospital, and medical sound library[30]), 769-period MR sounds (3M database[29], sounds database of the University of Dundee[27], heart auscultation sounds[31], medical sound library[30], and auscultation sound[32]), 439-period MS sounds(3M database[29], auscultation sound[32], medical sound library[30], and continuing medical implementation[26]), and 1056-period NM sounds(3M database[29], Michigan database[33], medical sound library[30], ThinkLabs database[34], and healthy undergraduates from Nanyang Institute of Technology, China)(whom I thank for the data used in this study)). Moreover, the boxplots for the features are plotted in Fig. 6, where Fig. 6A shows the features extracted from and Fig. 6B shows the features from for each type of heart disease. The scatter plots of features in Fig. 6 illustrate the discrimination ability of the model in distinguishing among different heart diseases and highlighting the following findings: The MS and VSD sounds are easy to distinguish from the other sounds by using (Fig. 6A), and by using the (Fig. 6B), the VSD sound is easy to distinguish from the other sounds; The MS sound is easy to distinguish from the other sounds based on (Fig. 6A), and by using the (Fig. 6B), the AR and VSD sounds are distinguished from other sounds; The NM sound is easy to distinguish from other sounds using (Fig. 6A). The AR and VSD sounds are easy to distinguish from the other sounds using , as shown in Fig. 6B; Fig. 6A indicates that can be used to easily distinguish MR from other sounds and the AS and ASD sounds from other sounds; Fig. 6B shows that the distribution of from AS sounds is different from that for other sounds, except NM sounds. The analysis results discussed above indicate that different combinations of several features defined by Eq. (15) can be used to distinguish among various types of heart disease. Therefore, to simplify features and develop a diagnostic method that is simple and effective, dimension reduction is used to determine new features; this process is described in detail as follows.

Figure 5

Examples of a typical normal sound and six types of typical heart disease sounds.

Figure 6

Box plot representation of FF for each type of heart disease. shows the box plots for features from . In addition, represents the features from .

Examples of a typical normal sound and six types of typical heart disease sounds. Box plot representation of FF for each type of heart disease. shows the box plots for features from . In addition, represents the features from .

Diagnostic feature determination

To simplify the computation when using features to diagnose heart diseases, PCA, a linear dimensionality reduction technique for finding principal components and replacing high-dimension data in many studies, such as studies on heart arrhythmias classification[35], heart disease classification[2,36], emotion recognition[37], respiratory rate extraction[38] and electrocardiogram heart disease diagnosis[39], is employed to generate a few efficient principal components to characterize HS features and diagnose heart diseases. The algorithm corresponding to the generation of new features via PCA for a given data set FF is described as Algorithm 1. The eigenvector in Algorithm 1 , which corresponds to the eigenvalue and is calculated for the matrix Z in step 2, as shown in Table 3, is the actual weighted coefficient for the ith principal component . Table 3 shows that the largest absolute coefficients in the first principal component are , and ; the second principal component is mainly weighted based on , , and ; and the third component is mainly weighted based on , and (Table 3). To determine the smallest number of principal components m should be considered, the Pareto chart is used; this chart provides a tool for visualizing the Pareto principle, which states that observing a small set of variables that influence a common outcome is more common than detecting many variables that influence the same outcome. This approach has been used to determine the percent variability explained by each principal component (Fig. 7A). Therefore, according to the smallest m value such that [40], combined with the scatter plot for the first m principal components, the smallest m is determined. The Pareto chart of the PCA results in Fig. 7A shows the explained variance and accumulated variance for each principal component , where . According to Fig. 7A, of the total variance is captured by the first two components, and , and of the total variance is captured by the first three components , and . Therefore, the following conclusions can be obtained.Therefore, m is set to 3, and the new 3-dimensional feature matrices consisting of , and (see Fig. 7C) are used to diagnose heart diseases.

Table 3

Eigenvector and eigenvalue for in descending order of eigenvalues.

Features	Eigenvector (eigenvalue) in descending order of eigenvalues
Features	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{1}(\lambda _{1}=3.6423)$$\end{document}ξ1(λ1=3.6423)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{2}(\lambda _{2}=1.7643)$$\end{document}ξ2(λ2=1.7643)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{3}(\lambda _{3}=1.5316)$$\end{document}ξ3(λ3=1.5316)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{4}(\lambda _{4}=0.4671)$$\end{document}ξ4(λ4=0.4671)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{5}(\lambda _{5}=0.3226)$$\end{document}ξ5(λ5=0.3226)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{6}(\lambda _{6}=0.1748)$$\end{document}ξ6(λ6=0.1748)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{7}(\lambda _{7}=0.0606)$$\end{document}ξ7(λ7=0.0606)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\xi _{8}(\lambda _{8}=0.0368)$$\end{document}ξ8(λ8=0.0368)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }1}_{\mathrm{FW1}}$$\end{document}S1FW1	0.4309	-0.2169	0.0818	-0.2179	0.8062	-0.2394	-0.0499	0.0616
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }1}_{\mathrm{FW2}}$$\end{document}S1FW2	0.3716	-0.0757	0.5303	0.0431	-0.0306	0.6778	0.2873	-0.1735
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }1}_{\mathrm{FW3}}$$\end{document}S1FW3	0.3411	-0.0983	0.5431	0.0444	-0.4364	-0.6126	-0.1001	-0.0375
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }1}_{\mathrm{G}}$$\end{document}S1G	0.2385	0.6501	-0.0157	-0.2122	0.0212	0.0986	-0.5500	-0.4031
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }2}_{\mathrm{FW1}}$$\end{document}S2FW1	0.3313	0.0487	-0.2471	0.8958	0.0977	-0.0299	-0.0699	-0.0944
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }2}_{\mathrm{FW2}}$$\end{document}S2FW2	0.4475	-0.2390	-0.2924	-0.1628	-0.3044	0.2633	-0.3928	0.5607
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }2}_{\mathrm{FW3}}$$\end{document}S2FW3	0.3517	-0.2191	-0.5142	-0.2650	-0.2357	-0.1047	0.3750	-0.5353
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ S }2}_{\mathrm{G}}$$\end{document}S2G	0.2632	0.1026	-0.0768	-0.0662	-0.0207	-0.1308	0.5506	0.4386

Figure 7

PCA results. A shows the Pareto chart of the variance by contribution of each principal component, B plots the scatter diagram of the first two components and , and C shows the first three components , and .

and lead to a dimensionality reduction of (from 8 to 2 variables) and only information loss. The scatter diagram of and given in Fig. 7B indicates that although the distribution region corresponding to each type of heart disease is obviously different and the overlaps between MR and other diseases, AR and other diseases, and VSD and other diseases are small, the overlaps among MS, ASD, NM, and AS are relatively large; therefore, it is difficult to accurately distinguish among these four types of heart diseases. However, the scatter diagram of , and , plotted in Fig. 7C, shows that there are different distribution regions for these types of heart diseases. In addition, , as shown in Fig. 7A, based on feature number determination[40]. Thus, , and lead to a dimensionality reduction of (from 8 to 3 variables) with only information loss. The scatter diagram of , and in Fig. 7C is used to verify the different distribution regions corresponding to these types of heart diseases. Mean () and standard deviation () of the features. PCA results. A shows the Pareto chart of the variance by contribution of each principal component, B plots the scatter diagram of the first two components and , and C shows the first three components , and . Eigenvector and eigenvalue for in descending order of eigenvalues. Flow chart of the diagnostic determination and 3-dimensional surface classifier results.

Stage 3: classification based on the squared Mahalanobis distance criterion

Classifier determination

The squared Mahalanobis distance classification criterion-based diagnostic methodology, consisting of the five sequential steps as shown in the flow chart (Fig. 8A), is proposed to diagnose HSs and is described in the following 5 steps.

Figure 8

Flow chart of the diagnostic determination and 3-dimensional surface classifier results.

Step 1: GMM-based and generation

In the design step of GMM, the estimated target function, , is a mixture of d-dimensional normal Gaussian distributions that reflect the training pattern of each component; it is assumed that components can be modeled by mixtures of normal Gaussian distributions bywhereexpresses the posterior probabilities corresponding to each component; K is the number of components; corresponds to the mixed weights, such that ; and and are the mean value and covariance matrix of the component, respectively. Because the goal is to maximize the function , the parameters (, , and ) are determined based on the EM algorithm[41] for a set of sample records. Based on the types of heart disease described in Sect. 2.2 and the scatter diagram plotted in Fig. 7C, the number of Gaussian mixture components is set to , and the fitgmdist function in MATLAB 2018b is used to return a GMM with components fitted to the features established in Sect. 2.2 using the EM algorithm by assigning a posterior probability to each component density with respect to each observation. Furthermore, the regularization value is set as 0.01 to avoid ill-conditioned covariance estimates, and the number of optimization iterations is set to 1000 based on experience. The Gaussian mixture parameter estimates for and are obtained and shown in Table 4. To characterize the 3-dimensional interspace corresponding to each 3-dimensional Gaussian component for diagnosing heart diseases, the 3-dimensional interspaces can be used as 3-dimensional classifiers to diagnose heart diseases with high classification accuracy; the overlapping interspace between two random components is made as small as possible, and the independent 3-dimensional interspace corresponding to each component is considered.

Table 4

The Gaussian mixture parameter estimates are achieved for the new features by setting the number of Gaussian mixture components as 7.

Components	Component number	Gaussian mixture parameter estimates
Components	Component number	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\pi _{_k}$$\end{document}πk		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu _{k}$$\end{document}μk		\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\Sigma _{k}$$\end{document}Σk
MR Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=1$$\end{document}k=1	0.1947	0.7056	2.7126	1.4950	0.0425	-0.0007	0.0013
						-0.0007	0.2343	-0.0126
						0.0013	-0.0126	0.2122
MS Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=2$$\end{document}k=2	0.0827	3.2981	-2.6064	-3.7382	0.3310	-0.0094	-0.0122
						-0.0094	0.3906	-0.0210
						-0.0122	-0.0210	0.5386
ASD Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=3$$\end{document}k=3	0.1130	2.3453	-0.3484	0.5773	0.5373	-0.0172	-0.0039
						-0.0172	0.0608	-0.0053
						-0.0039	-0.0053	0.1883
NM Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=4$$\end{document}k=4	0.1683	2.7874	1.8620	-0.9829	0.1403	0.0107	0.0063
						0.0107	0.2549	0.0016
						0.0063	0.0016	0.1301
AS Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=5$$\end{document}k=5	0.0783	0.7511	0.3199	-0.5341	0.0972	0.0077	-0.0161
						0.0077	0.0344	-0.0050
						-0.0161	-0.0050	0.2634
AR Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=6$$\end{document}k=6	0.2676	-1.2294	0.1198	0.3222	0.3301	-0.0011	0.0025
						-0.0011	0.0230	0.0005
						0.0025	0.0005	0.3255
VSD Classifier	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$k=7$$\end{document}k=7	0.0954	-0.1631	-1.1167	0.9454	0.1338	0.0048	-0.0155
						0.0048	0.1449	-0.0095
						-0.0155	-0.0095	0.1573

The Gaussian mixture parameter estimates are achieved for the new features by setting the number of Gaussian mixture components as 7.

Step 2: determination for the component in 3-dimensional interspace

Since the squared Mahalanobis distances for each Gaussian component follow the Chi-square distribution () in 3-dimensional interspace, to determine the decision region for classifying the test data x via the components estimated in the above step, the squared Mahalanobis distance in 3-dimensional interspace for the component with mean and full covariance matrix , , is computed as follows:Therefore, , which is constructed based on component k and denoted as , is determined byTherefore, the squared Mahalanobis distance specified based on the desired confidence level, denoted as , can be used as the kth classifier criterion for determining whether feature x belongs to the kth class. The achieved accuracies corresponding to classifying the heart sounds described in Sect. 2.2 by setting form 0.63 to 0.97 with a step of 0.02.

Step 3: The confidence level determination

Actually, the kth confidence region, as specified by the kth desired confidence level , is surrounded by the kth ellipsoid, and this relation is expressed aswhere is the inverse of for a given confidence level , and represents the classification criterion for component k and satisfies the following equationFor the distribution, although the confidence regions corresponding to the confidence levels of , , and are widely used classification criteria in many studies[2,42-45], the optional is identified by setting combined with the following rules: 1) each ellipsoid should be as large as possible; 2) each common region should be as small as possible; and 3) the classification accuracy defined in Eq. (26) should be as high as possible. The classification accuracies for classifying sound data summarized in Sect. 2.2 are plotted in Figs. 9, and 9 shows the following results: For VSD sounds, high accuracy can be achieved by setting the desired confidence level to each value within the interval of (), as shown in Fig. 9(VSD); For AR and MR sounds, by setting the desired confidence level based on , high classification accuracy could be achieved (Fig. 9(MR and AR) ); For MS, AS and NM sounds, to achieve the accurate classification of HSs, the interval of the desired confidence level should be set as [0.63, 0.65] (Fig. 9); For ASD sounds, Fig. 9(ASD) shows that the highest classification accuracy is achieved by setting the desired confidence level to . Furthermore, the desired confidence level can be adjusted to improve the classification accuracy and fit new datasets without reperforming the computations for the objective function, especially for VSD sounds and MR sounds (Fig. 9(VSD and MR)). In this study, according to the rules described above combined with the accuracy analysis results plotted in Fig. 9, the values are set as 0.87, 0.65, 0.67, 0.65, 0.67, 0.79 and 0.87, respectively.

Figure 9

The achieved accuracies corresponding to classifying the heart sounds described in Sect. 2.2 by setting form 0.63 to 0.97 with a step of 0.02.

Step 4: determination corresponding to

Based on the confidence level achieved for in the above step, by using the function ’chi2inv’ in MATLAB 2018b, the inverse of , denoted as , is determined. The analysis results for the confidence region in the 3-dimensional interspace, which is surrounded by the ellipsoid corresponding to the desired confidence level , are determined and shown in Fig. 8B. Furthermore, Fig. 8B shows that the common regions between two random ellipsoids are almost zero; thus, a faulty decision process is avoided because the input will not fall into two or more categories.

Step 5: -based diagnostic result determination

Based on the ellipsoid surfaces region shown in Fig. 8B, the diagnosis method is described as follows. The 3-dimensional diagnostic features [, , ] are first transformed from the features FF (denoted as ) of the testing sample and calculated with the following equation where and are shown in Table 2.

Table 2

Mean () and standard deviation () of the features.

Statistics	Frequency features (\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu _{_\mathrm{FF}}+\sigma _{_\mathrm{FF}}$$\end{document}μFF+σFF)
	Features from \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{1}}$$\end{document}CS1				Features from \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ CS }_{2}$$\end{document}CS2
	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{1}}_{\mathrm{FW1}}$$\end{document}CS1FW1	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{1}}_{\mathrm{FW2}}$$\end{document}CS1FW2	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{1}}_{\mathrm{FW3}}$$\end{document}CS1FW3	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{1}}_{\mathrm{G}}$$\end{document}CS1G	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{2}}_{\mathrm{FW1}}$$\end{document}CS2FW1	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{2}}_{\mathrm{FW2}}$$\end{document}CS2FW2	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{2}}_{\mathrm{FW3}}$$\end{document}CS2FW3	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${{ CS }_{2}}_{\mathrm{G}}$$\end{document}CS2G
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\mu _{_\mathrm{FF}} \pm \sigma _{_\mathrm{FF}}$$\end{document}μFF±σFF	45.3± 11.8	33.1± 5.8	18.8± 3.6	80.6± 21.7	44.1± 23.1	32.2± 9.1	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$18.4 \pm 6.5$$\end{document}18.4±6.5	79.8±18.9

Then, according to the confidence region shown in Fig. 8B, the -based diagnostic result for a test features is determined. -based diagnostic results for a test feature are determined by where class k corresponding to the type of heart disease is detailed in Table 4, and (1, 2, , 7) is 5.6489, 3.2831, 3.4297, 3.2831, 3.4297, 4.5258 and 5.6489.

Performance evaluation criteria

To evaluate the performance of these ellipsoids in 3-dimensional space, the classification accuracy (), sensitivity () and specificity () values are calculated bywhere , , and are the numbers of true positives, false positives, true negatives and false negatives, respectively. Experimental sounds used to evaluate the performance.

Performance evaluation

To evaluate the performance of the proposed methodology, the comparison between the proposed methodology and the state-of-the-art methods on the clinical sounds and online sounds data was conducted as follows.Overall, the efficiency of the proposed method in diagnosing MR, MS, ASD, NM, AS, AR and VSD diseases was evaluated by comparison with the other efficient methods listed in Table 7.

Table 7

Comparative analysis of eight different methods for the diagnosis of heart diseases summarized in Table 5.

Method	MR			MS			ASD			NM			AS			AR			VSD
Method	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${ Se }\%$$\end{document}Se%	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$CA (\%)$$\end{document}CA(%)	\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Sp (\%)$$\end{document}Sp(%)
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 1$$\end{document}♯1	92.1	86.34	87.6	88.2	86.81	87.3	91.2	84.53	85.1	86.3	82.90	81.6	90.9	98.25	99.1	88.2	86.05	85.4	95.2	96.31	96.8
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 2$$\end{document}♯2	90.6	89.93	88.3	88.8	90.32	91.3	83.9	86.12	87.21	95.9	98.3	97.7	96.3	96.1	96.02	87.1	86.31	85.9	90.6	89.3	88.1
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 3$$\end{document}♯3	90.1	88.6	87.5	85.1	85.69	99.2	81.3	80.97	80.8	93.6	91.95	90.3	87.9	85.04	83.3	92.1	88.69	85.6	88.6	86.9	85.9
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 4$$\end{document}♯4	90.1	87.34	86.7	83.2	86.81	87.3	90.2	83.13	82.5	85.7	81.90	80.6	91.3	90.40	90.3	87.2	84.04	83.4	96.2	97.66	97.8
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 5$$\end{document}♯5	88.6	85.93	85.3	86.1	89.72	90.2	79.8	82.10	82.3	96.1	98.93	99.9	98.3	91.95	96.2	85.1	86.97	83.6	87.5	87.19	82.1
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 6$$\end{document}♯6	88.1	87.6	87.8	83.1	89.36	90.2	80.3	81.67	81.8	92.6	91.63	91.3	83.9	86.04	86.3	90.1	84.69	83.6	87.6	86.03	85.9
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp 7$$\end{document}♯7	89.7	91.64	92.1	85.2	83.52	83.3	86.3	87.49	87.6	93.7	91.61	90.9	90.1	87.05	86.7	85.2	82.21	81.6	90.8	91.63	91.7
This	100	99.43	99.3	99.2	98.93	98.9	99.6	99.13	99.1	100	99.85	99.8	98.8	98.62	98.6	100	99.67	99.6	100	99.91	99.9

Total sounds: The total sounds, consisting of sounds described in Sect. 2.2 and new sounds, were summarized in Table 5 to evaluate the performance of this proposed methodology.

Table 5

Experimental sounds used to evaluate the performance.

Data source	Period numbers of every type of heart disease/Patients
Data source	MR	MS	ASD	NM	AS	AR	VSD
Sounds in Sect. 2.2	769/10	439/5	315/7	1056/45	381/10	665/15	327/10
New sounds	156/3	132/2	82/2	183/8	126/3	153/4	70/3
Total sounds	925/13	571/7	397/9	1239/53	507/13	818/19	397/13

State-of-the-art methods: To highlight the efficiency of the proposed methodology for diagnosing the seven typical heart diseases, the state-of-the-art methods, published in recent five years and described in Table 6, were comparatively analyzed.

Table 6

Efficient methods successfully used in diagnosing normal sounds from other common heart diseases.

Method	Year	Performance evaluation
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 1[46]	2021	The Fano-factor constrained tunable quality wavelet transform (TQWT) was the sensitivity and specificity of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$86.32\%$$\end{document}86.32% and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$99.44\%$$\end{document}99.44% respectively and overall score of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.88\%$$\end{document}92.88% to detect abnormal heart sounds.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 2[47]	2021	This study proposed a heart sound classification method based on improved MFCC features and convolutional recurrent neural networks, which achieved classification accuracy of \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$98\%$$\end{document}98% in the 2016 PhysioNeT/CinC Challenge database with dropout rate of 0.5.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 3[48]	2020	A deep WaveNet model was proposed to classify five heart sound types and achieve high classification accuracies: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$98.20\%$$\end{document}98.20% for diagnosing Normal, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.20\%$$\end{document}95.20% for diagnosing MVP, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$97.80\%$$\end{document}97.80% for diagnosing MS, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$96.10\%$$\end{document}96.10% for diagnosing MR, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$97.70\%$$\end{document}97.70% for diagnosing AS.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 4[8]	2018	The higher CA, achieved in this study, was \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.5\%$$\end{document}95.5%, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$92.1\%$$\end{document}92.1%, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$96.2\%$$\end{document}96.2% and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$99.0\%$$\end{document}99.0% for diagnosing small ventricular septal defect (VSD), moderate VSD, large VSD and normal sounds, respectively.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 5[49]	2017	A rule-based classification tree method proposed by this study achieved very high CA: \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.45\%$$\end{document}95.45% for diagnosing VSD, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$100\%$$\end{document}100% for diagnosing normal, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$100\%$$\end{document}100% for diagnosing aortic stenosis and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$95.45\%$$\end{document}95.45% for diagnosing aortic insufficiency.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 6[50]	2016	Artificial neural networks (ANNs) was reported to achieve the second-best score compared to the other methods in classifying the phonocardiogram recordings provided by the CinC Challenge.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 7[51]	2016	Random forest, a meta-learning approach that uses multiple random decision trees as base learners and aggregates them to compute the final ensemble prediction, was successfully used in sound classification such as studies.

Comparsion results: The comparison results were summarized in Table 7, where the parameters corresponding to the state-of-the-art methods were described in Table 8. The results in Table 7 support the following conclusions.

Table 8

The highest accuracies corresponding to the parameters set in every state-of-the-art method.

Method	Performance evaluation
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 1[46]	The highest classification accuracies were obtained by using the features described in Table 3 on page 28.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 2[47]	The highest classification accuracies were obtained by using the 13-features extracted using MFCC algorithm.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 3[48]	The highest classification accuracies were obtained by using the porposed WaveNet model consists of 6 residual blocks.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 4[8]	The highest classification accuracies were obtained based on the rules described in a previous study[8].
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 5[49]	The highest CA results were obtained based on the following rules.
	Rule 1: If the \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$8^{th}$$\end{document}8th value of Lyapunov exponent \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(\mathrm{LPE}_{8})$$\end{document}(LPE8) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\ge 0.79$$\end{document}≥0.79 and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(\mathrm{LPE}_{9})$$\end{document}(LPE9) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\le 0.38$$\end{document}≤0.38 then the heart is normal.
	Rule 2: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{2}\le 0.17$$\end{document}LPE2≤0.17 and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$(\mathrm{LPE}_{8})$$\end{document}(LPE8) \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\le 0.79$$\end{document}≤0.79, then the heart disease is VSD.
	Rule 3: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{4}\ge 0.17$$\end{document}LPE4≥0.17, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{6}\le 0.39$$\end{document}LPE6≤0.39, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{3}\le 0.56$$\end{document}LPE3≤0.56, then the heart disease is MR.
	Rule 4: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{5}\ge 0.17$$\end{document}LPE5≥0.17, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{4}\ge 0.67$$\end{document}LPE4≥0.67, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{3}\ge 0.37$$\end{document}LPE3≥0.37, then the heart disease is MS.
	Rule 5: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{7}\ge 0.54$$\end{document}LPE7≥0.54, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{3}\ge 0.29$$\end{document}LPE3≥0.29, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{5}\ge 0.49$$\end{document}LPE5≥0.49, then the heart disease is AR.
	Rule 6: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{8}\ge 0.39$$\end{document}LPE8≥0.39, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{5}\ge 0.72$$\end{document}LPE5≥0.72, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{2}\le 0.68$$\end{document}LPE2≤0.68, then the heart disease is ASD.
	Rule 7: If \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\text {LPE}}_{9}\ge 0.64$$\end{document}LPE9≥0.64, \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{3} \ge 0.39$$\end{document}LPE3≥0.39, and \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$${\mathrm{LPE}}_{7}\ge 0.21$$\end{document}LPE7≥0.21, then the heart disease is AS.
	Rule 8: If none of these conditions are met, the HS is undefined.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 6[50]	The most accurate results were obtained by the structure consisting of one input layer with 60 neurons, one hidden layer with 11 neurons and one output layer with five neurons.
\documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$\sharp$$\end{document}♯ 7[51]	The most accurate results were obtained by setting the number of features at each node, the number of trees and the maximum depth of trees to 1, 108, and 36, respectively.
This method	The most accurate results were obtained for the diagnosis of MR, MS, ASD, NM, AS, AR and VSD at \documentclass[12pt]{minimal} \usepackage{amsmath} \usepackage{wasysym} \usepackage{amsfonts} \usepackage{amssymb} \usepackage{amsbsy} \usepackage{mathrsfs} \usepackage{upgreek} \setlength{\oddsidemargin}{-69pt} \begin{document}$$Thv =0.4, 0.3,0.2, 0.2, 0.4, 0.1$$\end{document}Thv=0.4,0.3,0.2,0.2,0.4,0.1 and 0.2, respectively.

Although using the method to diagnose AS yielded a higher than that of the proposed method, the was lower than that of the proposed method, partially due to the high achieved by the proposed method. Although using the method to diagnose MS yielded a higher than that of the proposed method, the was lower than that of the proposed method, partially due to the high achieved by the proposed method. Although using the method to diagnose NM yielded a higher than that of the proposed method, the was lower than that of the proposed method, partially due to the high achieved by the proposed method. For other sounds, the classification accuracies achieved in the proposed method were all greater than those of the other methods listed in Table 7 . Efficient methods successfully used in diagnosing normal sounds from other common heart diseases. Comparative analysis of eight different methods for the diagnosis of heart diseases summarized in Table 5. The highest accuracies corresponding to the parameters set in every state-of-the-art method.

Conclusion

A novel intelligent system was proposed for diagnosing heart diseases with high . The innovation of this approach is primarily reflected in: 1) the automatic extraction of secondary envelope-based frequency features; 2) the automatic determination of PCA-based diagnostic features , and ; and 3) the determination of adjustable confidence regions corresponding to the distribution. The confidence regions are obtained by calculating the Mahalanobis distance, which is adjusted by the desired confidence level , and the results were used as the classification criteria for diagnosing heart diseases. The procedure for the implementation of the intelligent system involved three stages. Stage 1 described the location and extraction of STMHT-based and . In stage 2, in the frequency domain, a novel method was first proposed to generate the envelopes and ; then, based on the lines, was automatically extracted. Finally, based on PCA, the first three principal components, , and , which expressed of the information, were determined and used as diagnostic features. In stage 3, the GMM-based objective function with respect to the features and the parameters [, , ], where , was generated. Then, the distribution for component k was determined by calculating the Mahalanobis distance from to the class mean of component k, and the confidence region for component k was determined by adjusting the optimal confidence level and used as the criterion (denoted as ) to diagnose a given HS. The performance evaluation was validated by sounds from online HS databases and clinical heart databases. The accuracy of the proposed method was compared to the accuracies of other well-known classifiers, and the highest classification accuracies of , , , , , 99.67 and 99.91 in the detection of MR, MS, ASD, NM, AS, AR and VSD sounds were achieved by setting to 0.87,0.65,0.67,0.65,0.67,0.79 and 0.87, respectively. Therefore, this proposed intelligent diagnosis system provided an efficient way to diagnose seven types of heart diseases. The advantages and limitations were summarized as follows: Advantages: and were automatically extracted to reduce difficulty in segmenting each cardiac cycle into a sequence of four heart stages: Systole period Diastole period; More features could be extended by setting even more threshold values for the unknown heart diseases, especially for the heart sound with the compound heart diseases; Every classifier achieved in this study could be adjusted based on the desired for fitting incremental new features without being retrained via huge training features. Limitations: This methodology was impossible to diagnose the sounds when and cannot be segmented and extracted via the STMHT method for a given heart sound such as that plotted in Fig. 10; The proposed classifier might not be satisfied with the compound heart diseases due to the distribution of features extracted from which can not fit a single Gaussian distribution.

Figure 10

An example of a AR sound from database[26].

Future study

Future study focused on how to handle the sounds (such as some AR sounds) when and cannot be segmented and extracted via the STMHT method will be explored, and on how to build the classifier model for fitting the compound heart diseases will be further studied.

Research statement

The study was conducted at Nanyang Institute of Technology and Nanyang First People’s Hospital, Henan, China from December 2017 to June 2021, and was approved by the ethics committee of Nanyang Institute of Technology and First People’s Hospital (Approval Number: V6.0). Informed consent was waived due to the retrospective design of the study. The study complies with the Declaration of Helsinki.

18 in total

1. Automated Diagnosis of Heart Sounds Using Rule-Based Classification Tree.

Authors: Mohamed Esmail Karar; Sahar H El-Khafif; Mohamed A El-Brawany
Journal: J Med Syst Date: 2017-03-01 Impact factor: 4.460

2. Heart Sound Segmentation-An Event Detection Approach Using Deep Recurrent Neural Networks.

Authors: Elmar Messner; Matthias Zohrer; Franz Pernkopf
Journal: IEEE Trans Biomed Eng Date: 2018-06-01 Impact factor: 4.538

3. Principal component analysis-based features generation combined with ellipse models-based classification criterion for a ventricular septal defect diagnosis system.

Authors: Shuping Sun; Haibin Wang
Journal: Australas Phys Eng Sci Med Date: 2018-09-20 Impact factor: 1.430

4. On the characterization of flowering curves using Gaussian mixture models.

Authors: Frédéric Proïa; Alix Pernet; Tatiana Thouroude; Gilles Michel; Jérémy Clotault
Journal: J Theor Biol Date: 2016-04-22 Impact factor: 2.691

5. Ensemble Empirical Mode Decomposition With Principal Component Analysis: A Novel Approach for Extracting Respiratory Rate and Heart Rate From Photoplethysmographic Signal.

Authors: Mohammod Abdul Motin; Chandan Kumar Karmakar; Marimuthu Palaniswami
Journal: IEEE J Biomed Health Inform Date: 2017-03-07 Impact factor: 5.772

6. [Classification of heart sound signals in congenital heart disease based on convolutional neural network].

Authors: Zhaowen Tan; Weilian Wang; Rong Zong; Jiahua Pan; Hongbo Yang
Journal: Sheng Wu Yi Xue Gong Cheng Xue Za Zhi Date: 2019-10-25

7. Systolic and diastolic time intervals in the critically ill patient.

Authors: J A Máttar; W C Shoemaker; D Diament; A Lomar; A C Lopes; E De Freitas; F P Stella; L A Factore
Journal: Crit Care Med Date: 1991-11 Impact factor: 7.598

8. Systolic and diastolic time intervals measured from Doppler tissue imaging: normal values and Z-score tables, and effects of age, heart rate, and body surface area.

Authors: Wei Cui; David A Roberson; Zhen Chen; Luisa F Madronero; Bettina F Cuneo
Journal: J Am Soc Echocardiogr Date: 2007-07-12 Impact factor: 5.251

9. Segmentation of small ground glass opacity pulmonary nodules based on Markov random field energy and Bayesian probability difference.

Authors: Shaorong Zhang; Xiangmeng Chen; Zhibin Zhu; Bao Feng; Yehang Chen; Wansheng Long
Journal: Biomed Eng Online Date: 2020-06-17 Impact factor: 2.819

10. A Fast Incremental Gaussian Mixture Model.

Authors: Rafael Coimbra Pinto; Paulo Martins Engel
Journal: PLoS One Date: 2015-10-07 Impact factor: 3.240

1 in total

Review 1. Artificial Intelligence in Cardiovascular Medicine: Current Insights and Future Prospects.

Authors: Ikram U Haq; Karanjot Chhatwal; Krishna Sanaka; Bo Xu
Journal: Vasc Health Risk Manag Date: 2022-07-12

1 in total