| Literature DB >> 29693559 |
Raj Rakshit1, Anwesha Khasnobish2, Arijit Chowdhury3, Arijit Sinharay4, Arpan Pal5, Tapas Chakravarty6.
Abstract
Smoking causes unalterable physiological abnormalities in the pulmonary system. This is emerging as a serious threat worldwide. Unlike spirometry, tidal breathing does not require subjects to undergo forceful breathing maneuvers and is progressing as a new direction towards pulmonary health assessment. The aim of the paper is to evaluate whether tidal breathing signatures can indicate deteriorating adult lung condition in an otherwise healthy person. If successful, such a system can be used as a pre-screening tool for all people before some of them need to undergo a thorough clinical checkup. This work presents a novel systematic approach to identify compromised pulmonary systems in smokers from acquired tidal breathing patterns. Tidal breathing patterns are acquired during restful breathing of adult participants. Thereafter, physiological attributes are extracted from the acquired tidal breathing signals. Finally, a unique classification approach of locally weighted learning with ridge regression (LWL-ridge) is implemented, which handles the subjective variations in tidal breathing data without performing feature normalization. The LWL-ridge classifier recognized compromised pulmonary systems in smokers with an average classification accuracy of 86.17% along with a sensitivity of 80% and a specificity of 92%. The implemented approach outperformed other variants of LWL as well as other standard classifiers and generated comparable results when applied on an external cohort. This end-to-end automated system is suitable for pre-screening people routinely for early detection of lung ailments as a preventive measure in an infrastructure-agnostic way.Entities:
Keywords: locally weighted learning; pulmonary ailments; ridge regression; tidal breathing pattern
Mesh:
Year: 2018 PMID: 29693559 PMCID: PMC5981858 DOI: 10.3390/s18051322
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.576
Figure 1Schematic of the overall system for healthy and unhealthy pulmonary system recognition from tidal breathing signals. TBPR: tidal breathing pattern recorder; LWL-Ridge: locally weighted learning classifier with ridge estimator.
Demographics of Participants.
| Demographic Variable | Cohort 1 | Cohort 2 | ||
|---|---|---|---|---|
| Smokers | Non-Smokers | Smokers | Non-Smokers | |
| Number | 10 | 10 | 10 | 10 |
| Age | 35.3 ± 7.96 | 34.7 ± 6.32 | 29 ± 7.08 | 30 ± 7.07 |
| Gender | 8 M/2 F | 6 M/4 F | 9 M/1 F | 4 M/6 F |
| Smoking Years | 15.5 ± 6.86 | - | 13.4 ± 3.04 | - |
| CPD | 16.3 ± 4.74 | - | 14.6 ± 1.94 | - |
| Lifetime-Usage | 13.61 ± 9.63 | - | 10.06 ± 3.12 | - |
M/F: male/female; CPD: cigarettes per day; Lifetime Usage: measured in pack-years (=CPD × smoking years/20).
Figure 2Preprocessed tidal breathing flow rate (TBF(t)) signals of (a) two non-smokers (NS1, NS2) and (b) two smokers (S1, S2). The red portions of the signal indicate inspiration and the green portion indicates expiration.
List of used features.
| Feature No. | Features | Description |
|---|---|---|
| F1 | Inspiratory time (TI) | Mean duration of all the acquired Inspiration phases in seconds |
| F2 | Expiratory time (TE) | Mean duration of all the acquired Expiration phases in seconds |
| F3 | Breathing rate (BR) | Number of breaths per minute |
| F4 | Duty Cycle (DCy) | Mean of the ratios of inspiration time to total breath time of all the acquired breath cycles |
| F5 | Peak Inspiratory Flow (PIF) | Maximum flow rate attained during the inspiratory period. |
| F6 | Peak Expiratory Flow (PEF) | Maximum flow rate attained during the expiratory period. |
| F7 | Time to Peak Inspiratory Flow (TP IF) | Mean time from onset to peak of inspiration of all inspiratory phases. |
| F8 | Time to Peak Expiratory Flow (TP EF) | Mean time from onset to peak of expiration of all expiratory phases. |
| F9 | Inspiratory Tidal volume (TVins) | Mean volume of air inspired of all the acquired inspiration phases |
| F10 | Expiratory Tidal volume (TVexp) | Mean volume of air expired of all the acquired expiration phases |
| F11 | Inspiratory velocity (Velins) | Mean velocity of inspiration from onset to peak of inspiration flow of all the acquired inspiration phases |
| F12 | Expiratory velocity (Velexp) | Mean velocity of expiration from onset to peak of expiration flow of all the acquired expiration phases |
Figure 3Projection of features on the Fisher’s Linear Discriminant Line.
Comparison of different classification models.
| Scheme | % Acc | TPR | TNR | F | Kappa | AUC | AUP |
|---|---|---|---|---|---|---|---|
| LWL + L-R | 0.80 | 0.70 | |||||
| LWL + L-O | 80.83 | 0.80 | 0.62 | 0.85 | 0.86 | ||
| L-R | 64.00 | 0.60 | 0.68 | 0.61 | 0.28 | 0.69 | 0.73 |
| L-O | 63.83 | 0.58 | 0.70 | 0.60 | 0.28 | 0.67 | 0.72 |
L-O = Ordinary Logistic Regression, L-R = Logistic Regression with Ridge regression, TPR = true positive rate, TNR = true negative rate, AUC = area under the receiver operating characteristic curve, AUP = area under the precision-recall curve. Bold blue fonts denote the highest value for each metric (i.e., column-wise).
Variation of accuracy with k, R, and percentage of test data.
| Choices of { | |||||
|---|---|---|---|---|---|
| (i.e., 60 × 1/ | |||||
| 1/3 | 1/4 | 1/5 | 1/6 | 1/7 | |
| {10, 10−5} | 65 | 73.33 | 100 | 50 | 88.89 |
| {5, 10−4} | 85 | 80 | 100 | 70 | 88.89 |
| {5, 10−3} | 85 | 80 | 100 | 70 | 88.89 |
Figure 4Surface plot for 80–20 split.
Measures of the performance of classification.
|
| % Acc | TPR | TNR | F | Kappa | AUC | AUP |
|---|---|---|---|---|---|---|---|
| 10−3 | 0.88 | 0.92 | |||||
| 10−4 | 85.02 | 0.80 | 0.82 | 0.70 | 0.90 |
Bold blue fonts denote the highest value for each metric (i.e., column-wise).
Statistical measures for the binary classification test with simulated data using SMOTE.
| No. of Instances | % Acc | TPR | TNR | Kappa | AUC | AUP |
|---|---|---|---|---|---|---|
| 60 + 60 | 94.08 | 0.96 | 0.92 | 0.88 | 0.97 | 0.96 |
| 60 + 120 | 93.22 | 0.97 | 0.90 | 0.86 | 0.96 | 0.95 |
| 60 + 180 | 95.12 | 0.97 | 0.94 | 0.90 | 0.98 | 0.98 |
| 60 + 240 | 95.87 | 0.94 | 0.92 | 0.99 | 0.99 | |
| 60 + 300 | 95.44 | 0.95 | 0.95 | 0.91 | 0.98 | 0.98 |
| 60 + 360 | ||||||
| 60 + 420 | 96.04 | 0.97 | 0.96 | 0.92 | 0.99 | 0.98 |
| 60 + 480 | 96.56 | 0.97 | 0.96 | 0.93 | 0.99 | 0.99 |
| 60 + 540 | 96.40 | 0.95 | 0.93 | 0.99 | 0.98 |
Bold blue fonts denote the highest value for each metric (i.e., column-wise).
Comparison with standard classifiers and Friedman Test ranks.
| Method | % Acc | TPR | TNR | F | Kappa | AUC | AUP | Rank (Rj) |
|---|---|---|---|---|---|---|---|---|
| SVM-RBF | 51.67 | 0.67 | 0.37 | 0.55 | 0.03 | 0.52 | 0.51 | 4 |
| RF | 78.33 | 0.77 | 0.30 | 0.76 | 0.57 | 0.77 | 0.79 | 2.5 |
| kNN | 78.33 | 0.67 | 0.90 | 0.72 | 0.57 | 0.78 | 0.76 | 2.5 |
| LWL-ridge |
|
|
|
|
|
|
|
|
Bold blue fonts denote the best value for each metric (i.e., column-wise). SVM-RBF = support vector machine with radial basis function kernel, RF = random forest, kNN = k-nearest neighbor, LWL-ridge = locally weighted learning with ridge regression.
Evaluation of originally devised LWL-ridge classifier on external cohort.
| % Acc | TPR | TNR | F | Kappa | AUC | AUP | |
|---|---|---|---|---|---|---|---|
|
| 81.33 | 0.79 | 0.84 | 0.79 | 0.63 | 0.81 | 0.85 |
The values in parentheses are the standard deviations.
Comparison with the works related to monitoring adult lungs by tidal breathing analysis (TBA).
| Year of Study | Tidal Breathing Parameters Utilized | Remarks |
|---|---|---|
| Ours | 12: TI, TE, BR, DCy, PIF, TPIF, PEF, TPEF, TVins, TVexp, Velins, Velexp | Complete automated system to intelligently recognize smokers from healthy individuals directly from tidal breathing features. |
| 2014 [ | TI, TE, BR, DCy, PIF, TPIF, PEF, TPEF, TTOT, TPTEF/TE, TPTIF/TI, VE, VT, IP PEF, TP PEF, TPPEF20, TPPEF80, | Structural analysis of tidal expirograms was carried out to quantify COPD. |
| 2010 [ | PEF, VT, VE and several others related to forced breathing | Breath-by-breath structural analysis of expiratory signal during incremental exercise in COPD patients. |
| 2004 [ | TE, BR, PIF, TPTEF, TVexp, TPTIF, TTOT, TPTEF /TE, IPPEF, TPPEF20, TPPEF80, TPPEF20, TPPEF80, | Inter-relationships between body size, age, and tidal breathing profile in obstructive airway disease was established using |
COPD: chronic obstructive pulmonary disease. TTOT = Total time of one complete breathing cycle = TI +TE, TPTEF/TE = Ratio of TPTEF to TE, TPTIF/TI = Ratio of TPTIF to TI, VT = Tidal volume = TVins + TVexp, VT/TI = Ratio of tidal volume to inspiratory time, IP PEF = Integral of expiratory signal from peak to end, TP PEF20(80) = Post-peak expiratory flow at time 20%(80%), VE = Minute ventilation.
Relative comparison with studies aimed towards the detection of smokers.
| Year of Study | Modality of Study | Breathing Gesture | Remarks |
|---|---|---|---|
| Ours (10S, 10NS) | Physiological parameters extraction and binary classification | Tidal breathing for 1 min through a hollow, both-sides-open pipe | Classification accuracy 86.17%. |
| 2017 [ | Forensic analysis via Gas chromatography (GC)/Mass Spectrometry (MC) of breathing signal | Prolonged breaths | 12 compounds were determined to be statistically significant between groups. Nicotine was found to be the most significant discriminant. Smokers were detected with an accuracy of 72%, while non-smokers were detected with 100% accuracy. GC/MC analysis took 21 min. |
| 2016 [ | Environmental carbon monoxide (CO) gas sensor paired with smart Phone | Forceful breaths with 15 secs of breath-hold between inhale and exhale | Twelve statistical features along with several ensemble techniques were used. Average classification accuracy of 79.6%. |
| 2015 [ | Magnetic Resonance Imaging (MRI) of subjects. | NA | Maximum accuracy obtained was 69.6% with 139 highest-ranked features, SVM-RFE, and 10-fold CV. |
| 2011 [ | Analysis of breath odor using electronic Nose and GC/MC | One single exhaled breath was collected in a sampling bag. | Principle component analysis (PCA) and Linear discriminant function analysis (LDA) yields 100% accuracy. Forensic analysis of each breath sample took around 35 min. |
| 2004 [ | Forensic analysis of Exhaled-Breath Condensate (EBC) | Tidal breathing for 20 min | The concentrations of total protein and nitrite and neutrophil chemotactic activity were significantly higher in the EBC of smokers. Only statistical analysis. |