Literature DB >> 35224099

A Comprehensive Review: Computational Models for Obstructive Sleep Apnea Detection in Biomedical Applications.

E Smily JeyaJothi¹, J Anitha², Shalli Rani³, Basant Tiwari⁴.

Abstract

Obstructive sleep apnea (OSA) is a sleep disorder characterized by periodic episodes of partial or complete upper airway obstruction caused by narrowing or collapse of the pharyngeal airway despite ongoing breathing efforts during sleep. Fall in the blood oxygen saturation and cortical arousals are prompted by this reduction in the airflow which lasts for at least 10 seconds. Impaired labor performance, debilitated quality of life, excessive daytime sleepiness, high snoring, and tiredness even after a whole night's sleep are the primary symptoms of OSA. In due course, the long-standing contributions of OSA culminate in hypertension, arrhythmia, cerebrovascular disease, and heart failure. The traditional diagnostic approach of OSA is the laboratory-based polysomnography (PSG) overnight sleep study, which is a tedious and labor-intensive process that exaggerates the discomfort to the patient. With the advent of computer-aided diagnosis (CAD), automatic detection of OSA has gained increasing interest among researchers in the area of sleep disorders as it influences both diagnostic and therapeutic decisions. The research literature on sleep apnea published during the last decade has been surveyed, focusing on the varied screening approaches accustomed to identifying OSA events and the developmental knowledge offered by multiple contributors from the software perspective. The current study presents an overview of the pathophysiology of OSA, the detection methods, physiological signals related to OSA, the different preprocessing, feature extraction, feature selection, and classification techniques employed for the detection and classification of OSA. Consequently, the research challenges and research gaps in the diagnosis of OSA are identified, critically analyzed, and presented in the best possible light.

Entities: Chemical

Mesh：

Year: 2022 PMID： 35224099 PMCID： PMC8866013 DOI： 10.1155/2022/7242667

Source DB: PubMed Journal: Biomed Res Int Impact factor: 3.411

1. Clinical Background

Sleep disorders on account of obstructive sleep apnea (OSA) count on specific disturbances of the normal functioning of the upper respiratory airway and are associated with sleep deprivation, fragmented sleep, and primary hypersomnia. Recurrent occurrences of partial or complete cessation of breath during sleep are the characteristics of OSA, and the literature estimates it to be one of the most common sleep disorders prevailing in 17% women and 34% men of the adult population. The initial symptoms are essence with change in behavioural patterns that eventually lead to change in breathing patterns, insomnia, and may also trigger narcolepsy [1]. The duration of the involuntary and nocturnal respiratory pause varies based on the pathophysiology of the patient and usually extends from 10 seconds (reduction in airflow-hypopnea) to 2 minutes (complete cessation-apnea), with 20 to 40 seconds being the common duration. Disproportionate fat depositions in the pharyngeal muscles tend to constrict the upper air passage and account for the apneic events in OSA patients. The apnea-hypopnea episodes are usually accompanied by a drop in the blood oxygen saturation which thereupon triggers the autonomic neural response leading to micro arousals with nocturnal choking and gasping for breath. Unrefreshing and fragmented sleep, tiredness, fatigue, and lack of energy are reported by 90% of the OSA patients. A larger fraction of OSA patients are asymptomatic and contribute to the fact that most of them are presently underdiagnosed. Repetitive episodes of apnea-hypopnea result in primary events like intermittent hypoxemia, micro arousals, and increased intrathoracic pressure, which further initiates a cascade of interacting processes that contribute to developmental origins of secondary disease manifestations and disease end points as depicted in Figure 1. Activation of sympathetic nervous system (SNS) is the major contributor to metabolic dysfunction and elevated blood pressure in OSA patients. Sleep apnea is a severe medical condition where the complications gear up cardiac problems, kidney and liver-related disorders, diabetes, and most importantly daytime fatigue [2, 3].

Figure 1

Obstructive sleep apnea and cardiovascular disease associations.

The standard diagnostic approach for sleep apnea is an overnight sleep study that is usually carried out in the sleep labs. It simultaneously records electrocardiography (ECG), electromyography (EMG), electroencephalography (EEG), electrooculography (EOG), oxygen saturation (SpO2), abdominal and thoracic movement, body posture, snoring level, and other signals and visually analyzed by medical experts. Due to the limitation of sleep labs in hospitals and taking into account the intricacy of the above diagnostic technique, the procedure is quite expensive and cumbersome to use. A minimum of 22 lead wires are connected to the patient's body to acquire 12 different PSG signals, which makes the signal analysis part quite tricky and also cause discomfort to the patient [4, 5]. Apnea-Hypopnea Index (AHI) is used to analyze the severity of the disease, which is measured as the number of apneic episodes in the PSG recordings per hour of sleep, in comparison with the American Academy of Sleep Medicine (AASM). The frequency of apneic events for the detection of OSA is categorized as mild (5 ≤ AHI < 15 e/h), moderate (15 ≤ AHI < 30 e/h), or severe (AHI ≥ 30 e/h) [6]. Considering the complexity based on the number and variability of signals involved, PSG recording and analysis is time consuming and involves the expertise of the clinician. The quality of analysis is ultimately reduced due to the accumulated fatigue throughout the review and due to the complexity of the work itself. In this regard, the development of automated systems, at least in part, PSG's analysis represents significant savings in time, money, and effort, greatly assisting physician work and increasing his time. Indeed, in the best case scenario of a fully capable and appropriately compliant PSG automated analysis, the use of diagnostic support tools will help the scorer to focus only on relevant real information (e.g., AHI). Indeed, in the end, the work of the physician can be reduced to a single task of testing and/or verifying the results of computer analysis [7]. This study is aimed at covering the diagnostic approaches in the field of OSA, which are to some extent supported by automated computer-assisted diagnostic procedures. With this objective, the paper addresses the following in detail: Physiological signals associated with OSA detection Signal processing algorithms used for extraction of features that aid in apneic event detection Hardware devices that source the biological signals used for OSA detection Classifiers used for discrimination of normal and abnormal signals The objective of this study is to reflect the current state of research in a variety of ways and computer-assisted diagnosis for OSA. The abstraction of literature analysis under this review is organized as follows. Section 2 confers about the various biological signals utilized in the study for the detection of OSA, a detailed study on various databases available for sleep apnea detection in Section 3. Section 4 discusses the preprocessing technique used in signal processing, including the details on signal decomposition and extraction of feature components from the signal in Section 5 and followed by feature selection in Section 6. Different classification techniques used for the classification of sleep apnea are discussed in Section 7 and followed by the summarization of other researchers' work highlighting the challenges faced and the future scope in Section 8, and Section 9 concludes the work of the study.

2. Physiological Signals for OSA Detection

The fundamental physiological signals that are employed in various approaches, for the screening of OSA, are discussed in the following session. Based on the inference from the survey, there have been few studies on novel OSA detection techniques, over the past two decades, with a limited set of signals analyzed during PSG procedure. Thus, ECG, PPG, SpO2, and audio signals have been used to assist in the diagnosis of sleep apnea.

2.1. Polysomnography

Polysomnography (PSG) is a multiparametric examination that incorporates recording of several signals of neuro-physiological origin. Signals recorded from the head include EEG, EOG, and EMG. Other biophysiological signals included in the PSG are jaw movement, body position, snoring, respiratory movements, and oxygen saturation levels [8-10]. EEG signals are acquired using the fundamental 10-20 electrode system [11, 12]. Sleep is a combination of events that includes light sleep, deep sleep, and rapid eye movement (REM) sleep that occur in a cyclic manner, without conscious effort of the individual. The different sleep stages depend on the metabolic processes and function of sympathetic and para-sympathetic nervous system. The PSG recordings also include the horizontal EOG signal and submental chin EMG signal [13]. It is evident from the pie chart portrayed in Figure 2 that most of the researchers have made use of the PSG signal for OSA detection. The amount of data required for the detection of OSA is huge and has to be recorded when the person is asleep, on account of which the recording of PSG is done overnight. A sample of PSG signal with only five parametric signals is depicted in Figure 3.

Figure 2

Signal sources used in the literature.

Figure 3

Polysomnography signal.

2.2. Photoplethysmography

For the measurement of respiratory and hemodynamic function, the best opted method is photoplethysmography (PPG). As an alternative to traditional arterial oxygen saturation, pulse photoplethysmography (PPG) signal has been proposed which indicates vasoconstriction changes. In the study by Grote and Zou, the suitability of this signal in monitoring the amplitude changes in airflow and oxygen saturation is tested and compared with other acquisition methods as well as the default apneic event detector. Peripheral arterial tone (PAT) technique is widely used in monitoring the pulsatile changes in the arterial volume. Using PAT technology, a combined algorithm has been proposed by Varon et al. [14] to identify OSA, where the algorithm included the data from pulse wave attenuations, heart rate (HR) responses, and SpO2 levels [15]. By carefully analyzing the PPG signal and identifying the morphological parameters from the shape of the waveform assist in extracting the details such as the duration of the apneic period, the number of recurrences, the variation in heart rate, and breathing rate. Basically, a PPG signal is obtained by illuminating the skin of either the thumb or fore finger and thereby measuring the changes in light absorption. The signal obtained is based on the transmission, absorption, and dispersion of light as it passes through the skin, the underlying tissues, and the blood. The PPG is obtained from a device which utilizes small light emitting diodes (red and near infrared band) transmitting light through the finger to a photodiode. The characteristic parameters extracted from the PPG signal are pulse wave amplitude in both systolic and diastolic, the pulse propagation time, and peak waveform in systolic and diastolic pulse wave. The absorption of light depends purely on the heartbeat, as the blood capillary in the finger contracts and expands with each pulsation of the heart. The signal traces the minimum absorption and the peak absorption of the light intensity, and it is clearly proportional to the cardiac pulse [16]. Most of the previously developed approaches using pulse oximetry signal for screening of OSA are based on percentage oxygen saturation measurement and have been proved to be less accurate as measurement of SpO2 is less accurate at low values [17].

2.3. Electrocardiography

The OSA diagnostic approach based on the single-channel ECG recording provides a high diagnostic accuracy when compared to other physiological signals [12]. An easier way to monitor heart rate function is an ECG signal that can indicate the level of oxygen coming to the heart. RR interval and ECG-derived respiration (EDR) signals extracted from the ECG signal are employed for the detection of sleep apnea [18]. The ECG signals recorded in combination with PSG signals have gained interest among most of the researchers [19-21]. Of all the research articles compared, majority of the authors used ECG- and PSG-based signal for the detection and classification of OSA. Motivation to use ECG signal preferred over other signals is due to the effortless acquisition of real-time signals and ample number of sophisticated hardware available to trace the electrical activity of the heart. These measurements have the potential to reduce the cost of diagnosis of OSA.

2.4. Oxygen Saturation (SpO2)

Many patients with obstructive sleep apnea (OSA) report a history of shortness of breath associated with hypopnea. Blood oxygen levels below 90% are considered hazardous and require immediate remedial actions. Considering the cost-effectiveness and ease to access, the extraction of SpO2 signals is the best choice in the diagnosis of sleep apnea [13]. The oxygen saturation signals are obtained by using a simple pulse oximetry device, in case of real-time monitoring, or from publicly available databases like the PhysioNet Apnea-ECG database and University College Dublin (UCD) database. The Apnea-ECG database contains few recordings of SpO2 [22]. Time domain and frequency domain oximetry features are extracted for OSA detection. The drop in the level of SpO2 is measured through spectral analysis to identify the pattern of sleep apnea [23, 24].

2.5. Audio Signals

Investigating snore sounds can be the first step in testing for OSA as soft tissue structures and tone of voice influence the acoustic parameters. Audio signal analysis of snoring sounds can be distributed in a variety of ways, and contactless audio recordings are considered for the OSA severity measurement. Rosenwein et al. [25, 26] have developed an algorithm to identify the normal breathing pattern to that of abnormal breathing pattern based on acoustic signals, and the severity of OSA is estimated by in-home recordings compared with PSG recordings. Kalkbrenner et al. [27] have utilized a microphone to acquire the tracheal sounds. The breathing sound is clear near the vicinity of the neck, and accordingly, this method paved way for the acquisition of pulse signals from the carotid arteries. The low amplitude signal due to shallow breathing of the patient is perfectly tapped with proper signal conditioning devices which reduce the noise and the aliasing effect of the signal. The tracheal sound well correlates with the respiratory sound and hence are deployed for the diagnosis of OSA. In 1980, researchers found that laryngeal sounds proved effective in sleep apnea diagnosis [28]. Recent development in the sleep apnea machine has tracheal sound sensors for the recording of respiratory sounds. This tracheal sound sensor incorporates acoustic sensors like that of electronic stethoscopes. In addition to the measurement of tracheal sounds, breathing, snoring, and intrathoracic pressure variations can also be detected if appropriate sensors are used. These acquired parameters are essential for the classification of stages of sleep apnea. The PneaVoX sensor is used for this purpose [29].

3. Signal Source

All the physiological signals employed for OSA detection contain essential information along with the prevalence of artifacts. These signals are either collected directly from hospitals with patient's consent or taken from publicly available databases. This section discusses the databases that source apnea signals for the detection of sleep apnea.

3.1. PhysioNet Apnea-ECG Database

PhysioNet is a repository of open source medical research data that provides access to various biomedical signals for signal processing and analysis. Of course, a deeper understanding and analysis of the disease is an integral part of the diagnosis besides using appropriate data for accurate diagnosis. The PhysioNet Apnea database consists of 70 annotated night time ECG recordings. In sleep apnea, the diaphragm's upper airway muscles and neural activation function are imbalanced and consequently result in arousal, where the brain receives an insufficient supply of oxygen, and hence, visual scoring of the breathing pattern is also taken into consideration. Thirty-five recordings are provided with annotations, eight of which contain respiratory and O2 saturation signals. [21]]. The potential benefit of using the PhysioNet database is that it allows for a broader challenge for analyzing biomedical signals, including indications like arrhythmias, neurological disorders, cardiovascular diseases, and aging [19]. The PhysioNet database is used along with the UCD repository signals. UCD database is taken from Dublin University Hospital [7, 30]. Along with ECG and EMG signals, tracheal sounds, and nasal airway sounds are also deployed for the diagnosis of sleep apnea [27, 29, 31–33]. Graphical images of PSG signals are also available and used for the analysis [34]. In the Multi-Ethnic Study of Atherosclerosis (MESA) study database, 24480 samples of nasal flow signals have been taken from 10 subjects along with the sleep questionnaire for effective diagnosis of sleep apnea [31].

3.2. Case Study (Sleep Study)

Different case study approaches have been taken on board by the researchers. The diagnostic strategies used for all the acquired signals are relevant to PSG, nasal sounds, breathing movements, abdominal wall movements, chest wall movements, tracheal sounds, and diaphragm movements. These signals are recorded using various devices viz., a microphone, Home Sleep Apnea Testing (HSAT) machine, WatchPat system, and by using sensors for measuring the breathing pattern and accelerometer for recording the breathing movement. The subjects are categorized into several groups based on the BMI (body mass index), age, sex, pre-OSA conditions, sleep disorders, and psychological conditions, including narcolepsy. Table 1 summarizes the different sleep study approaches followed by various researchers. PSG signals contain ECG, EEG, EMG, nasal sounds, and breathing patterns [6, 21, 35]. Figure 2 portrays the distribution of the signal source used in the literature.

Table 1

Summary of sleep study.

Ref. no.	Author and year	Sleep study	Signals/sample acquired
[4]	[4]	253 patientsSTOP-Bang Questionnaire	PSG
[5]	[5]	5114 patientsSTOP-Bang Questionnaire	PSG
[36]	[36]	320 patients	SpO2
[37]	[37]	WatchPat deviceSTOP-Bang Questionnaire	SpO2, ECG
[38]	[38]	160 children	PSG, SpO2
[39]	[39]	188 patients	PSG
[31]	[31]	100 patientsSleep Questionnaire	Nasal flow signal
[40]	[40]	3 study groups, 2066 subjectsBNSQ Questionnaire	PSG, SpO2
[41]	[42]	455 patients	PSG
[27]	[27]	Ten subjects	PSG
[43]	[43]	1970 signals	SpO2
[44]	[44]	28 patients	PSG
[45]	[45]	79 subjects	SpO2
[46]	[47]	Eight subjects	SpO2
[12]	[12]	33 patients	Snoring sound
[48]	[48]	HSAT device	PSG
[1]	[1]	3 sensors	PSG signals, respiratory sounds, respiratory-related movements
[49]	[49]	144 patients	PSG, venous blood sample
[50]	[50]	186 subjects	PSG, audio signals
[51]	[51]	Eight males and females	PSG
[52]	[13]	Ten patients	Breathing movement

3.3. Other Modes of Signal Acquisition

3.3.1. Hardware Devices for Signal Acquisition

In many applications, sensors are used for the acquisition of physiological signals. These hardware set-ups have built-in amplifiers and filter circuits for efficient filtering and amplification of the signals. The sensors commonly used are accelerometer sensors for measuring the breathing movement and diaphragm movement [53-55]. Pulse oximeters are used to measure the oxygen saturation level [38, 53]. Microphones are used to record the nasal and tracheal sounds [29, 31, 53]. PneaVoX sensor is made use to sense the acoustic sounds and suprasternal pressure [29]. In line with the development of laboratory systems, a growing interest over the years has been observed in the design of outpatient equipment which aids in home-based tests. Generally, documents relating to the certification of portable equipment for OSA diagnostics are also extensive, and previous reviews on this can be found [56, 57].

3.3.2. Acquisition Based on Questionnaires

The STOP-Bang Questionnaire is considered as the gold standard for sleep apnea diagnosis along with PSG recordings. The risk factors for OSA are divided into two parts: with the objective question (yes or no) [5]. The acronym for these STOP questions is as follows. Demographic queries are included in the BANG questionnaire [58, 59]. For each of the questions presented, “yes” is marked 1, and “no” is marked 0. The score ranges from 0 to 8 for men and 0 to 7 for women [5]. Basic Nordic Sleep Questionnaire (BNSQ) provides a five-point scaling to determine sleep apnea. Habitual snoring is considered along with the stress level indicated by the person. The scaling range used for the sleep apnea detection is as follows [5, 60].

4. Preprocessing and Decomposition

Filtering, windowing, and sampling are the most widely used preprocessing techniques. The above section has highlighted different signals that are used for the analysis of sleep apnea. Various filtering methods have been used for effective utilization of the signals for the later stages of signal processing and thereby improve the efficiency of diagnosis. This section spotlights the preprocessing techniques used in the study. Low pass filters and bandpass filters are the commonly used filtering techniques for removing spurious noise due to motion artifacts and interferences. The signals are usually sampled at a sampling rate of 200 Hz and segmented for further analysis. The windowing technique used to segment the waveforms is min by min or sec by sec [61-63]. The sample length/window size is chosen in accordance with the type of features to be extracted. The entropy function has been predominantly used for feature extraction [62]. Bandpass Chebyshev filter II is applied on the PhysioNet apnea database, in addition to the preprocessing stages viz., filtering, denoising, and segmentation of the signal. A weight calculation algorithm is utilized for noise removal, in which each segment is assigned a weight, and segmentation of the signal is carried out through the thresholding technique [64]. Song et al. [65] have used the Markov model for segmentation of ECG signals, from which, QRS wave and the RR interval are identified. A median filter and a windowing technique show better results for the min-by-min extraction of the signal [18, 66]. The wavelet transform (WT) translates the signal duration of the signal to represent the frequency of time combined as a set of wave coefficients. These wavelet coefficients can be used in a frequency-dependent manner to achieve various digital signal processing results. Wavelet transform decomposes the signal into a set of low frequency coefficients and high frequency coefficients [7, 67]. The advantage of using wavelet transform is due to the fact that the value of SNR is quite high, and the filtering effect is better with high precision [45, 68–70]. Tunable Q Wavelet transform (TQWT) has been used to analyze oscillatory signals which deals with three flexible parameters: Q factor, number of decomposition levels, and redundancy or sampling rate. The transfer functions used in TQWT are rational [7]. Nasal airflow signals obtained from 100 subjects are randomly selected, and the Haar Wavelet function is employed which is an orthogonal wavelet, and its transfer function is redundant. In the initial stages of decomposition, the signal appears to be square-shaped while in the later sections, segments of the signal are obtained [31, 71]. From these segments, statistical coefficients are extracted for further analysis [31]. Pan–Tompkins algorithm is the widely used technique for the efficient extraction of QRS complex from the ECG signal. QRS complex waveform occurs due to ventricular depolarization, which is represented as a spike waveform. Pan–Tompkins algorithm highlights the frequency variation, indicating the spikes, and uses a series of filters to highlight the common content of this rapid heartbeat and remove background noise in the signal [72]. The block diagram of the algorithm is illustrated in Figure 4.

Figure 4

Illustration of Pan–Tompkins Algorithm [72].

Previous studies reveal that EDR signals and RR interval signal carry important information for OSA diagnosis. A median filter which is a nonlinear filter, where the output sample has been calculated from the median value of the input signal is utilized by [18, 65, 66, 73–75]. Along with the output from the Pan–Tompkins algorithm, local median filter responses are used to extract RR interval data from the ECG signal [75, 76]. Other preprocessing techniques are also discussed in this study; however, they are more likely to be a stand-alone system. Few of such algorithms are discussed in detail. Multistate General Gamma Cumulative Sum Scheme (CUSUM) [68] The amplitude of the extracted signals is investigated, and the gamma distribution is taken Two assumptions are made based on homogeneity and likelihood The cumulative sum of parameter shift in both the assumptions is considered, and the mean and variance of the signal are obtained The main feature of CUSUM was to obtain a cumulative sum of the sample deviations State-space reconstruction, heterogeneous recurrent analysis [77] Characterized by nonlinear function, dx/dt = F (x, θ), where F is nonlinear function and θ is the model parameter State-space reconstruction is performed, and the signal is segmented For iterative segmentation, a multidimensional indexing method is implemented A categorical variable is assigned to each segment, and by identifying the similarity patterns, the heterogeneous recurrence is recognized This method is evaluated by Principal Component Analysis (PCA) Several other signal processing techniques such as discrete wavelet transform (DWT), empirical mode decomposition (EMD) [78], variational mode decomposition (VMD), and empirical wavelet transform (EWT) for signal denoising and feature extraction have been considered. Of all the comparative methods used for filtering and windowing, simple bandpass filters are more preferred due to its efficiency in preserving information related to OSA, and WT gave a prominent result not only for denoising but also for the extraction of features [67].

5. Feature Extraction

This section summarizes the investigations on feature extraction and optimization for OSA detection. Time-domain, frequency-domain, nonlinear, and statistical features are considered more relevant in sleep apnea findings. After denoising, decomposition, and segmentation of the acquired signals, feature extraction plays a vital role in classifying signals. The features extracted based on the input signal viz., RR interval signals, ECG-derived respiration (EDR) signals, heart rate variability (HRV), oxygen saturation signal (SpO2), blood gas or blood oxygen saturation (SaO2), and autocorrelation function (ACF), are primarily extracted by a majority of authors. The time-domain features included in the study are (i) the difference in the root mean square of RR peak amplitude; (ii) the time interval between the former and latter HRV signal not exceeding 50 milliseconds; (iii) standard deviation of HRV signals; (iv) the mean, variance, and kurtosis of ECG signals; (v) mean of ECG signals, and (vi) variance coefficient [22, 79–86]. Most of the frequency domain features are extracted using WT, as WT uses a multiscale basis, and it is advantageous over Fourier transforms. The varying window size is taken for nonstationary signals, hence providing a better extraction of features [87, 88]. Statistical feature analysis is implemented to identify errors that are not identified during the initial stages of signal processing. The attributes in the signal are identified using three tests, Kolmogorov–Smirnov test, t-test, and Mann–Whitney U test [5]. Empirical mode decomposition (EMD) and Bivariate are adopted to evaluate the output of EEG signals. EMD performance improved significantly when the number of samples decreased [89]. The segmental error analyzed in event-related potential (ERP) indicated the occurrence of apnea. The delta energy associated with the immune system and the regulation of homeostasis is due to the depletion of oxygen in the event of apnea. Using the Hilbert Huang Transform, there is a wave of force in low waves when apnea occurs. This can be linked to delta energy related to the immune system and the regulation of homeostasis. The EMD and Bivariate methods were compared to reflect the key features associated with apnea for analytical purposes [31]. In a study on nonlinear feature extraction methods, three parameters are extracted: (i) the length of the longest diagonal, (ii) recurrent points needed to construct the diagonal lines, and (iii) percentage of recurrent lines to shape the vertical lines [8]. Multidomain feature extraction utilizes a combination of feature extraction methods and signal processing algorithms. Comparing the feature extraction methods and the features extracted, RR series signal and HRV data are extracted from PSG signal and ECG waveforms, Oxygen-related features from SpO2 and SaO2 signals and the rest of the features are extracted using the abovementioned feature extraction strategies. Automatic data handling has been proposed to handle electrical impedance tomography (EIT) images. EIT measurement is done invasively for the measurement of the upper airway during normal sleep. The upper airway closure signal is extracted as a feature along with statistical analysis. The features are analyzed by one-way analysis of variance (ANOVA) test, where the patients' features are displayed as a box and whisker plots and heat map clusters [87]. Mostafa et al. [47] extracted seven features from SpO2 signal using genetic algorithm. Genetic algorithm shows prominent results in the feature extraction arena and in classification. The summaries of features extracted for sleep apnea detection using various physiological signals are tabulated in Table 2.

Table 2

Summary of features extracted.

Input signal	Features extracted
SpO₂	(i) Desaturation events(ii) Speed of decline of SpO₂ concentration(iii) Time domain features such as regularity, variability, flexibility, and complexity(iv) ODI indices(v) Frequency domain features such as PSD(vi) Minimum and mean SpO₂ values
ECG	(i) R-R intervals(ii) R-peak amplitudes(iii) EDR(iv) HRV features(v) Statistical features like mean, kurtosis, variance, and skewness(vi) Mean heart rate
Nasal, respiratory, tracheal, and abdominal	(i) Length of respiratory signals(ii) HRV(iii) Respiration rate variability(iv) Temporal and spectral characteristics of acoustic signals(v) Time-series features including mean, variance, minimum, maximum, and median values(vi) Minimum, maximum, average inspiration/expiration amplitudes, and durations of nasal airflow signal
EEG	(i) Demographic information–frequency, percentage of every sleep stage, time in bed, total sleep time, sleep efficiency, and total number of one-step transitions overnight
Speech	(i) Speech/voice analysis(ii) Time-domain and frequency-domain features

6. Feature Selection

The role of feature selection is to reduce the redundancy in data and to reduce the time required to train the network. However, due to the redundancy, the accuracy of the classifier could be increased. Hence, feature selection plays a significant role in the classification stage. Li et al. [8] employed a two-stage procedure for feature selection. Initially, statistical analysis is done followed by SVM selection. Another method of feature selection has been carried out by combining the weighted average of the histogram. The weighted histogram features are combined from the Gabor filter bank, and they are represented as a feature network. This work is signatured as feature concatenation [90]. Minu and Amithab [45] used a mutual information-based feature selection to avoid redundancy and to increase the relevance. The mathematical expression has been defined as the joint probability to that of the marginal probability. In a peculiar feature selection technique proposed by García et al. [23], the partitions of signals that are counted to train that the network is split into two groups and are labelled as count_train and count_test. By this way, the dataset is partitioned to select the appropriate features required for apnea detection. The feature selection is done through two phases: in the initial stage, sequential forward feature selection method is adopted, and the features are ranked based on the number of times they have been included in the dataset. In the second stage, the selection method is repeated 250 times, and the error rate is calculated based on the feature set [91]. The step wise feature selection is done through Leave-One-Out Cross-Validation (LOOCV) where the insignificant features are ruled out, preventing the risk of incorrect deletion [18]. A total of fifteen features are selected, and the network is trained with N = 4 times, resulting in four error estimations, and successively eight most discriminating features are selected. A linear classifier is used for the feature selection process [38]. The relevant and nonredundant variables are dimensionally reduced using Fast Correlation-Based Filter (FCBF) technique. In the initial stage, log criterion is applied, followed by removal of redundant features. The symmetric uncertainty is calculated for each of the features by ranking using discriminant relevance (DR) method. Based on the rank obtained by them, the features are either selected or removed [36]. DR method is also used for feature ranking and selection. In this method, the discriminatory power of the model using all the features is compared with one feature that is under study. Equation (1) defines the formula used to calculate DR.

7. Classification

Physiological signals often contain a great deal of information necessary for disease detection, though they contain noises and artifacts. In recent years, computer-aided diagnosis (CAD) for healthcare applications has been gaining advantage [92, 93]. Various machine learning and deep learning algorithms have been used for the detection and classification of sleep apnea. From Figure 5, it is evident that neural network-based classification is predominantly used for classification and identification of apneic events. Long short-term memory (LSTM) networks yielded a higher accuracy of 99% [61, 94, 95], followed by random forest (RF), support vector machine (SVM), K-nearest neighbor (KNN), Adaboost, linear regression (LR), and ANFIS classifier with accuracy around 97% [18, 45, 80, 96–98]. The average accuracy of other classifiers is around 95%. Most of the classifiers used data from the PhysioNet Apnea-ECG database [19, 21]. The accuracy varies depending on the feature extraction and the feature selection strategy.

Figure 5

Summary of classifier models used in the study.

7.1. Mathematical Classifier

The upper airway signal along with the closure of the eye lids is also considered for the sleep apnea detection. EIT data is collected from patients using swallowing maneuver. The PSG and the EIT signals are compared based on Bland-Altman plots and the Pearson correlation [87]. To identify the significant features representing the apnea, statistical analysis is done on the control groups. The most prominent test is Mann–Whitney U test (UMW), which is a nonparameter test similar to that of t-test. It is marked by the difference in the median of the ranked data representing individual groups. The performance of the network is analyzed on voice-based network [99], and statistical analysis is done using LR. In a logistic model, considering two predictors, x1 and x2 and the output variable Y, the linear relationship in mathematical form is given in Here, l are the odds in the classification, b is the base of log, and β is the model parameter. p is the probability function and is given in Equation (2) gives the model description of LR. The advantage of LR is that it does not require multivariate assumption [39, 77, 100].

7.2. Clustering-Based Classifier

The clustering-based classifier that has been discussed in this study includes SVM, KNN, and LDA where the highly preferred classifier is support vector machine (SVM). The major purpose of classification is to indicate the category of the input signal to which it belongs to. SVM is a supervised learning method which is used for classification and regression challenges. The main advantage of using SVM is its effectiveness in identifying the number of dimensions when there is limited number of data samples or images [101]. AI-Ratrout and Hossen [102] used SVM along with fivefold wavelet decomposition using dbl filters for the classification of sleep apnea. By this method, 100% accuracy is obtained. Kumar and Kanhangad [90] have proposed a novel method to obtain single dimensional and phase descriptor (PD) model from ECG lead form. These PDs are extracted from Gabor filter responses and are fed to SVM. The type of SVM classifier used in this study is least square SVM. They are effective in the discrimination of normal and apneic signals with an accuracy of 93.31%. In the study that involved RR interval series for the classification of sleep apnea and using SVM as the classifier, the average accuracy achieved is 92.05%. The performance of SVM was compared with linear regression (LR), linear discriminant analysis (LDA), neural networks (NN), and K-nearest neighbors (KNN), of which SVM proves to be the best classifier [16, 65, 66, 68, 103]. Standard statistical features extracted have also proven to be effective in the classification of sleep apnea. Kumari et al. [96] have extracted only statistical mean from the signal and, using the SVM classifier, has achieved an accuracy of 98%, and the result is compared with KNN. In the case study where HR signals are used for the classification purpose, the accuracy achieved is 82.12% using SVM [104], while heart rate variability signal extracted using wavelet decomposition claimed to have an accuracy, sensitivity, and specificity of 93.34%, 90%, and 100%, respectively [102]. In few cases, KNN outperformed SVM; by using the oxygen saturation signals as the input, KNN achieved a maximum accuracy of 93% and, in collaborative performance along with least square SVM, achieved an accuracy of 86% [17]. Another interesting classifier that is used by majority of the researchers is LDA. Linear combination of features that specifies an object is extracted using LDA, and it is used to separate two or more classes [105]. LDA basically is developed to overcome the limitations faced in LR. The advantage of using LDA is that samples without class labels can be used under the LDA model. On the other hand, the problem faced with LDA is that it can only be used for binary classification and is not intended for multiclass classification. Hence, classification of stages of sleep apnea cannot be done using LDA. To overcome this problem, CNN is considered as the best option for the classification of stages of sleep apnea considering that the data available is sufficiently large. Benavides et al. [99] extracted features like signal-to-dis-periodicity ratio, a nasality measure, harmonic-to-noise ratio, jitter, difference between third and second formants on a specific vowel, duration of two of the sentences, and the percentage of silence in one of the sentences for classification of OSA using LDA. The accuracy achieved is 85%, and specificity is 75%. García et al. [23] extracted the time and frequency domain variables from blood oxygen saturation and used LDA for the classification. The classification accuracy obtained is 87% with sensitivity of 76% and specificity of 91%. In another study, González et al. [91] extracted heart rate variability signal and used three different algorithms viz., LR, LDA, and quadratic discriminant analysis (QDA) for the classification of OSA. LDA outperformed both the classifiers with an accuracy of 84.76%, sensitivity of 81.45%, and sensitivity of 86.82%. The respiratory features extracted by phase space reconstruction (PSR) feature extraction method along with RR signals and heart rate signals are given to the classifier. Five different classification networks such as LD, KNN, SVM, ANN, and QD are used for the classification. The performance of the classifier is evaluated by cross-validation function and by training them independently. By using thirty extracted features, the accuracy, sensitivity, and specificity of the work are found to be 90.9%, 89.6%, and 91.8%, respectively [66].

7.3. Neural Network-Based Classifier

Random forest was grown with 18 features, of which five important features are selected with three different time intervals. It generates replications of data, and for each replicated data, the network is trained. Each node is split based upon the randomly chosen features. This work has been carried out for the purpose of “in-home detection of sleep apnea” [25]. The time domain, frequency domain, and time-frequency and the wavelet features are extracted from SpO2 signals and are fed to artificial neural network (ANN) classifier. According to the thumb rule, the number of features required to train the network must be ten times larger than the weights of the network. The optimization of the network is performed using genetic algorithm (GA). The classification error has been reduced by implementing genetic algorithm. ANN model along with GA can yield better accuracy of 97.7% [47]. Extreme learning machine (ELM) is feed forward neural network that can be used for regression, classification, clustering, and feature learning. The input layer is randomly chosen, and the hidden parameters are set, and the output weights are determined analytically. The nonlinear activation function that is used along with ELM is sigmoid, sine, and hard limit. The proposed method using ELM and statistical feature extraction has shown an accuracy of 83.77% compared to the conventional method using wavelet features [106]. In another model designed for in-home monitoring of sleep apnea using the signals from pulse oximeter, the features are fed to Bayesian MLP-NN. The statistical analysis of the data is done using Kolmogorov-Smirnoff and Levene tests. The nonparametric Mann–Whitney U test is also done to identify the significant differences in the features [36]. The ANN along with the wavelet transforms is used for the detection of sleep apnea from EEG signals. The network has been configured to give three outputs referring to the condition of sleep apnea syndrome (SAS). Three different techniques are considered: Characteristic Part Analysis (CPA), Continual Characteristic Segment Analysis (CCSA), and Characteristic Segment Part Analysis (CSPA) [69].

7.4. Fuzzy-Based Classifier

Minu and Amitabh [45] used wavelet transform for the initial stages of signal processing, and ANFIS classifier is used for the classification of sleep apnea. Adaptive neuro-fuzzy inference system is a fuzzy model that which is been adapted for learning. The architecture of ANFIS consists of five layers/nodes. These nodes are referred as fixed nodes and adaptive nodes. The first and fourth layers have adaptive nodes, while second, third, and fifth layers have fixed nodes. ANFIS gives a better accuracy when compared to Adaboost classifier. The architecture of ANFIS is represented in Figure 6 where x and y represent the input and f represents the output. There are 5 distinct layers in the ANFIS architecture. Every node in the layer 1 is an adaptive node with an adaptive node function, and A1, A2, B1, and B2 are linguistic labels. Every node in layer 2 is a fixed node, labelled as M, whose output is the product of all the incoming signals. Each node output represents the firing strength of a rule. Every node in the third layer is a fixed node labelled as N, and it calculates the ratio of the firing strengths to the sum of all the rules firing strength. Every node in the fourth layer is an adaptive node with a node function. The single node in this layer is a fixed node labelled S, which computes the overall output. The overall output is the summation of all the incoming signals. The overall output is given in where w corresponds to the weights or the output from each layer and f is the output from layer 4.

Figure 6

Basic architecture of CNN.

SVM along with Gaussian Radial Basis Function (RBF) kernel is employed for the classification of sleep apnea by selecting the features from LOOCV. The RBF kernel-based model is compared with the performance of Hidden Markov Model (HMM). The fuzziness in the fuzzy set is determined by membership function (MF), and the shapes of MF are significant as they affect the interference of the fuzzy system. These shapes might include triangular, trapezoidal, Gaussian, etc. The main condition satisfied by MFs is the value ranges between 0 and 1. Tahani et al. introduced Sugeno λ-measure technique for extraction of effective fuzzy measure. Mukherjee et al. [107] used ensembled network method and Choquet integral-based fuzzy fusion technique for the identification of sleep apnea. The network combines the confidence values and all the possible combinations of data. For each classifier, individual weights are designed, called the fuzzy measures. The fuzzy measure is calculated using the entropy of the classifier's probability vector. The entropy value is subtracted from the maximum entropy value. The fuzzy measure is divided by the sum of total fuzzy measures. Hence, the fuzzy measure is evaluated based on the Sugeno λ-measure technique. The advantage of using fuzzy-based classifiers is that the computation power is saved, and they are often robust. They are not sensitive to the changing environments and often forget the rules (erroneous). They have less development time when compared to other conventional methods.

7.5. Deep Learning-Based Classifier

The major layers in CNN are convolutional layer, pooling layer, and fully connected layer. The layers of CNN are modified according to the input signals and the target. In addition to CNN, other deep learning models are recurrent neural network (RNN), auto encoder-based classifier, and deep generative models. Figure 6 shows the basic CNN architecture which consists of three main layers: the convolutional layer, the max pooling layer, and the SoftMax layer. The CNN is used for both feature extraction and classification, and few articles showcase the significance of CNN in segmentation also. Depending upon the size of the signal used for the analysis, dropout layer is introduced to reduce the problem of overfitting. The activation functions used are unique for the input signal and are based on the researchers. The most commonly used activation function is ReLU and SoftMax activation. However, the optimizer depends on the training accuracy of the network. Erdenebayar et al. [108] used deep belief network (DBN) to classify the sleep apnea using SpO2 signals. It consists of three layers among which the first two layers are constructed using restricted Boltzmann machine (RBM), and the final layer is SoftMax layer. Mostafa et al. (2017) have proposed an unsupervised auto encoder, where each of the stacks of RBM is used as auto encoder. The outputs obtained correspond to the input of the decoder. The unsupervised technique proved better results when compared to feature-based technique. Pathinarupothi et al. [94] used LSTM-RNN (long short-term memory recurrent neural network) for the diagnosis of sleep apnea from a single sensor technique. The proposed architecture has three layers: 30 input neurons, 34 memory blocks in the hidden layer, and two neurons in the output layer. Each memory block has memory units to store information over a long range of sequences. Yildirim et al. [13] used a one-dimensional CNN network for the classification of sleep apnea, considering EEG and EOG signals. Similarly, Haidar et al. (2016) used statistical features for training CNN and SVM, of which CNN yielded an accuracy of 75% to that of SVM. In general, the comparison of machine learning to deep learning algorithms is not appreciated because deep learning always outperforms the former [13]. Mostafa et al. (2017), Pathinarupothi et al. [94], Leino et al. [43], and Yildirim et al. [13] used deep learning-based algorithms for the classification of sleep apnea. Table 3 summarizes the available performance metrics of the deep learning algorithms and other related algorithms used for the detection of sleep apnea. Cheng, Maowei et al. [109, 110] used recurrent neural network (RNN) for the detection of sleep apnea. The extracted features are fed as input to RNN, and this layer produces the output for the final detection of sleep apnea. The accuracy of RNN model is 97.80% at the 50th training epoch. The increase in the number of epochs has increased the accuracy of classification, but the accuracy of classification remains unchanged after 50 epochs.

Table 3

Deep learning networks.

Author and citation	Classifiers	Database (DB)	Signals	Acc.%	Sen. %	Spec.%	Others
Song et al. [18]	CNN-LSTM	Apnea-ECG DB	ECG	96.1	96.1	96.2	Recognition rate—94.39
Zhang et al. [112]	CNN-LSTM	Apnea-ECG DB	Lead II ECG	99.80	96.94	98.97
Novak et al. (2018)	LSTM	Heart rate signals	HR	82.1	85.5	80.1
Yin et al. [70]	CNN-LSTM	Apnea-ECG DB	ECG	97.21	94.1	98.94
Yin et al. [70]	CNN-LSTM	UCD DB	ECG	93.70	90.69	95.82
Shen et al. [113]	LSTM	Snoring sound	Respiratory and tracheal sounds	87	84	91
	3-layer CNN			85	82	81
	5-layer CNN			82	83	84
Morgan and Scofield [114]	CNN-LSTM	Apnea-ECG DB	ECG	85.58	—	88.26	Recall—84.43
Pinho et al. [104]	LSTM-RNN	MIT-BIH arrhythmia DB	Lead II ECG	99	—	—	AUC—0.98Precision—99.2Recall—92.9
Faust et al. [115]	LSTM	Apnea-ECG DB	ECG	99.80	99.85	99.73
Pinho et al. [104]	Bi-LSTM	PSG and respiration signals	SpO₂, PSG, and respiratory signals		90.3	83.7
Acharya et al. [116]	DNN	Nocturnal ECG recordings	ECG	93.1	9.	94
	1D-CNN			98.5	99	99
	2D-CNN			95.9	96	96
	RNN			85.4	97	87
	LSTM			98	98	98
	GRU			99	99	99
AI-Ratrout and Hossen [102]	LSTM	Sleep-heart-health-study-1 DB	ECG and HR	98	—	—
Wu et al. [75]	1D-CNN	EEG and EOG signals	EEG, EOG	97.62	94.34	92.33
Song et al. [18]	RNN	Measurements from eight healthy subjects (EMG signal)	EMG
Zhai et al., 2017	CNN	NinaPro DB	EMG	83
Acharya et al. [117]	CNN	Freiburg EEG DB	EEG	88.67
Kalkbrenner et al. [27]	CNN	EEG and rs-fMRI measurements from the ECoG dataset [118]	EEG	—	—	—	Normal p value 1.85e-14, p seizure value 4.64e-27
Hassan [7]	CNN	37 subjects, 70 sessions	EEG				Area under the receiver operating characteristics: 82.78
Acharya et al. [116]	CNN	MIT-BIH arrhythmia DB	ECG	92.50	98.09	93.13
Tran et al. [119]	LSTM, CNN	PhysioNet DB: Fantasia (normal) and St.-Petersburg Institute of Cardiology Technics (CAD)	ECG	99.85
Hafezi et al. [111]	CNN	PhysioNet DB: Fantasia (normal) and St.-Petersburg Institute of Cardiology Technics (CAD)	ECG	94.9595.11	—	—
Pourbabaee et al., 2017	CNN	The PAF prediction challenge	Two-channel ECG				Precision—93.6
Rosenwein et al. [25]	Stacked denoising autoencoder	Fetal-ECG signal reconstruction	ECG	99.5	—	—
Singh et al. [49]	CNN	EOG, EEG, ECG—6 subjects and 2 sessions each	EOG, EEG, ECG	93.8
Shen et al. [113]	CNN	22 subjects and 22 sessions each	EOG	—	—	—	Mean correlation coefficient—0.73
Zhang et al. [112]	Feature extraction with DNN, classification by SVM	Apnea-ECG DB	ECG	84.7		94.5	Recall—68.8
Morales et al. [17]	DBN	Apnea-ECG DB	ECG	97.64	78.75	95.89
Yin et al. [70]	LSTM–CNN	70 subjects	PSG	100	100	100
Jifara et al. [51]	CNN-LSTM	Apnea-ECG DB	ECG	99.80	96.94	98.97	—
Chaw et al. [120]	CNN	50 patients—SpO₂ values were obtained	SpO₂	91.31			Loss rate -2.3 using cross-entropy cost function

The classification of sleep apnea is carried out by analyzing ECG signals, SpO2 signals, PSG signals, nasal, and tracheal sounds and by taking the AHI index. The cross validation of the acquired data is done using the databases. Deep learning-based network has proved efficient in the classification and detection of sleep apnea. The design of CNN architecture plays a vital role in achieving the accuracy of the network. Acharya et al. (2017) used ECG signals as the input and constructed a 9-layer CNN network and evaluated the performance with and without noise removal. Data augmentation performance has also been evaluated. The accuracy achieved is 94.03% with data augmentation and 89.07% with the original data. Chaw et al. (2019) designed a 10-layer CNN network to classify the SpO2 signals for sleep apnea detection. The layers included are 2D-CNN, flatten layer, dense layer, and dropout layer. The proposed network is compared with different types of classification networks such as LDA, SVM, bagging rep tree, and ANN. The CNN is based on cross-entropy cost function-based network. The overall accuracy of the network is 91.38%. Hafezi et al. [111] identified OSA by using tracheal and abdominal movements. The CNN model consists of seven layers with 128 hidden units in LSTM. The performance of the classifier is evaluated based on Pearson or Spearman's correlation coefficient, Bland-Altman Test, and Kruskal-Wallis one-way analysis of variance, to compare the values of AHI signals. The sensitivity, specificity, and accuracy of diagnosis are 81%, 87%, and 84%, respectively. Single-channel ECG lead signals are acquired and analyzed using CNN-LSTM network. The Kappa coefficient value of CNN was 0.89, to that of CNN-LSTM network was 0.92 [112]. Erdenebayar et al. [108] diagnosed sleep apnea by using four different deep learning architectures. Their performance has been evaluated based on sensitivity, specificity, and accuracy. 1D-CNN exhibited an accuracy of 98.5%, 96.4%, and 96.3% for apnea, hypopnea, and AHI events. The accuracy of LSTM network is 97%, and significant difference is found in the performance of 1D-CNN and 2D-CNN. At least 20 iterations are required to achieve the maximum performance of LSTM and Gated Recurrent Unit (GRU) models. 1D-CNN and GRU models are preferred for the automatic detection of sleep apnea using time domain features. Table 3 gives the summary of deep learning networks used and the number of layers specified in the architecture. The performance metrics are compared in the best possible way to identify the best architecture in the diagnosis of OSA. Ensemble networks are also utilized for OSA diagnosis, and the network techniques used are based on (i) majority voting, (ii) sum rule, (iii) Choquet integral based fuzzy fusion, and (iv) trainable MLP. Maximum accuracy of 85.58% is achieved using MLP technique. To the preexisting CNN model, the “flatten” layer is introduced to increase the diagnostic accuracy. The ensemble technique has two broad approaches, viz., for decision level fusion, only one decision is considered for each class, and for the score level fusion, for each model, all the prediction scores are considered. Data augmentation is done to increase the number of training samples. The trainable ensemble network using MLP has achieved precision of 84.80, recall of 84.43%, F1 score of 84.67%, and specificity of 88.26% [107].

7.6. Performance Metrics

The performance metrics that are used for the evaluation of the performance of the classifier are accuracy, sensitivity, and specificity. In addition to this, the mean absolute error (MAE) and the Pearson linear correlation coefficient (CC) are also considered for the estimation of values of Apnea-Hypopnea Index (AHI) [34]. The other performance measure includes F1 measure, kappa, precision, and recall [31, 80, 121]. Accuracy refers to the closeness of the obtained result to that of the standard value. It has been calculated by measuring the true positives (TP), false positives (FP), true negatives (TN), and true positives (TP) using Equation (5). Sensitivity and specificity are calculated using Equation (6) and Equation (7), respectively. The recall can be calculated from Equation (8) and precision from Equation (9). F1 measure of F1 score can be derived from recall and precision. The best value for F1 score is 1, and the worst is 0. Equation (10) gives the formula to calculate F1 score. Kappa is generally calculated in ensemble networks. It is used to quantify the degree of agreement. It is expressed in Equation (11), where Pr(a) is the proportion when there is an agreement and Pr(e) is the proportion of units that is expected to be agreed. Jung et al. [42] used real-time automatic detection of apnea using nocturnal pulse oximetry signals. The performance has been evaluated using accuracy, sensitivity, specificity, positive predictive value (PPV), negative predictive value (NPV), Cohen's Kappa coefficient, and ROC-AUC (area under the receiver operating characteristics curve). The AUC was calculated through true positive rates (TPR) versus false positive rates (FPR). The true positive rate is calculated in The AUC is calculated by Equation (10).

8. Inferences from the Survey

All through the comprehensive literature review, the state-of-the-art technology used in signal acquisition, signal preprocessing, feature extraction, and classification techniques that influence the automated computer-aided diagnosis of the OSA has been elaborately discussed. The growing interest in the arena of sleep studies and their therapeutic measures has resulted in advances in computer-based analytical methods, contributing to the increase of the number of developments in the sleep sector over the period of 10 years. The objective of the survey is to encourage young researchers interested in the development of computer-aided diagnostic techniques of OSA. The term diagnosis has to be preferably denoted as the potentiality of the computer-based methods to expropriate the severity of OSA based on AHI index. The American Academy of Sleep Medicine has clearly stated that the diagnosis of OSA cannot be exclusively based on AHI index, rather would require additional systematic analysis based on the symptoms [79]. From the literature analysis, it is noted that the so-called CAD approaches are more focused on two significant aspects. One trend is aimed at reducing the time required in monitoring multiple parameters as in the case of PSG studies, which is performed overnight in an attended hospital environment [5]. On the other hand, the emergence of sophisticated signal processing algorithms and predictive classifiers has paved way for a break-through for automatic detection of OSA. Although reducing the number of signals to be monitored has a significance in terms of reduced costs, increased patient comfort, and reduced waiting lists, yet, manual analysis of still images as in the case of PSG is more complex and time consuming. For this purpose, automation of the screening process is more preferred. Based on the analysis carried out in the previous sections, Figures 1–8 and Tables 1–3 deliberately present a quick summary of different methodologies covered. In particular, Tables 3 enlists the type of signals used, a number of signals used for the study, and the performance metrics of different classification approaches for reference and comparison purposes. It is well understood that usually validation results are significantly based on the database used, demographic details of the study group, number of subjects, number of signals used for training, etc., and on the type of validation metrics used. Changes that are made in the recording setup and changes in the criteria used for measuring the score or in the validation metrics make it demanding to carry out meaningful comparisons among the different methods.

Figure 7

Articles reviewed on OSA detection over the period of 2014 to 2021.

Figure 8

Architecture of ANFIS.

After a collective analysis of the literature survey carried out in this work, majority of the authors have opted for dataset from PhysioNet Apnea-ECG database along with publicly available questionnaire. This is due to the difficulty in acquiring real-time data and availability of patients. While sleep study is conducted in hospitals, the major challenge faced is the ethical concern. Moreover, these data are always imposed with noise and artefacts. A proper noise removal method is needed to make the signal to be an effective one and also to make the signal camera ready for feature extraction and classification. Concerned with the next stage, few authors have decomposed the signal for faster analysis, where ECG waveform is normally segmented for every one-minute time duration. Pan–Tompkins algorithm is predominantly used by the researchers for obtaining the information of QRS complex in ECG waveform. Due to the high precision filtering, wavelet transforms are highly preferred. All new technologies emerging today have their pros and cons. When compared with sleep apnea monitors, overnight acquisition of signals is a tedious task and also considered as a challenging one. The major advantage of using nocturnal polysomnography is it helps in the monitoring of the heart, lungs, brain, breathing pattern, leg, and arm movements. The disadvantage is that the patient gets hooked up with wires and sensors, which would cause lot of discomfort to the individuals, and most of the patients have to repeat the test again on account of improper sleep. In the feature extraction stage, RR interval signals are extracted mostly from ECG recordings. Not all the researchers preferred PSG recordings; a variety of signals are also used for the sleep apnea detection. When blood oxygen saturation or oxygen saturation signals are considered for the analysis, it yields an effective outcome, while the EEG recordings depicted artefacts; hence, a separate preprocessing unit needs to be installed for better acquisition of signals. During the recording of EEG, the signal gets disrupted if the patient suffered suffocation, muscle pains, or limb twitches and hence requires a surrogate technique to monitor each parameter separately [69, 122]. Most of the researchers failed to mention the methodology adapted to evaluate the segmentation and decomposition accuracy. With the increase in stress level of people and OSA becoming a significant health problem, the data available for the diagnosis or detection seems to be insufficient. Considering the cost for monitoring and the discomfort caused during the recording of PSG signals, many patients refuse to take up the procedure. With the availability of two main databases, the Sleep Apnea-ECG database and University of Dublin database (UCD) of sleep apnea, many researchers have presented a comparative study. Different sets of machine learning and deep learning algorithms are implemented using these databases. The future scope of work as quoted by majority of the researchers is that implementation of noninvasive, home-bound techniques is considered as a reliable technique for effective diagnosis of OSA [79, 123, 124]. With more than 130 research articles compared in this study, machine learning algorithms are more significantly used for OSA screening. The recent research papers from year 2020 acknowledge the emergence of deep learning methods for OSA detection, and they also outperform the machine learning techniques. Ensemble classifier also showed significant performance. CAD procedure is proved to show good results in the classification of stages of OSA as well as detection of OSA signals from that of normal signals. In few related events, the classifier has not been efficient enough to identify the hypopnea. The apnea and the normal signals are identified efficiently by CNN-LSTM network. The score transitions between each epoch are considered as a hideous process [112].

9. Conclusion

Throughout the sections discussed in the review, the detection of OSA is handled in a different way according to the researcher's point of view. In this paper, more than 136 research articles have been reviewed, focusing on the preprocessing, decomposition, feature extraction, and classification techniques. In the initial phase of analysis, different preprocessing techniques along with the signal decomposition are discussed. Median filtering and wavelet transform are highly preferred among the researchers for initial phase of signal processing. In the second phase of the analysis, different feature extraction methods and the features extracted are discussed. Of the research articles published, the majority of the papers are focused on classification techniques. Perhaps, for data acquisition, more than 70% of the researchers have utilized PhysioNet Apnea-ECG database. PSG signals and RR interval signals yield better diagnostic result. In the comparison of classification techniques, neural network-based classification has provided better accuracy. With increased number of data samples, deep learning and reinforcement learning have proved to be efficient. The selection of layers in the classifiers plays a greater role in achieving accuracy. Additionally, incorrect selection of hidden layers leads to poor approximations and overfitting [19]. For the futuristic analysis, the current approaches must be validated with real time data. More number of heterogeneous cases can be included, involving standard database and patient sample. In this way, meaningful results can be obtained thereby, improving the overall accuracy of the classification of OSA.

S	Do you snore loudly?
T	Do you feel tired, even after a night sleep?
O	The observed absence of breathing during sleep?
P	Presence of high blood pressure

B	Body mass index (BMI) over 35 kg/m²
A	Age over 50 years
N	Neck circumference over 40 cm
G	Gender (male)

I	Never or less than once per month
II	Less than once per week
III	On 1-2 nights per week
IV	On 3-5 nights per week
V	Almost every night

57 in total

Review 1. A review of signals used in sleep analysis.

Authors: A Roebuck; V Monasterio; E Gederi; M Osipov; J Behar; A Malhotra; T Penzel; G D Clifford
Journal: Physiol Meas Date: 2013-12-17 Impact factor: 2.833

2. Cascaded LSTM recurrent neural network for automated sleep stage classification using single-channel EEG signals.

Authors: Nicola Michielli; U Rajendra Acharya; Filippo Molinari
Journal: Comput Biol Med Date: 2019-01-19 Impact factor: 4.589

3. Obstructive Sleep Apnea Screening and Airway Structure Characterization During Wakefulness Using Tracheal Breathing Sounds.

Authors: Ahmed Elwali; Zahra Moussavi
Journal: Ann Biomed Eng Date: 2016-09-06 Impact factor: 3.934