Literature DB >> 35767259

Diagnostic Performance of Electronic Noses in Cancer Diagnoses Using Exhaled Breath: A Systematic Review and Meta-analysis.

Max H M C Scheepers1, Zaid Al-Difaie1, Lloyd Brandts2, Andrea Peeters2, Bart van Grinsven3, Nicole D Bouvy4.   

Abstract

Importance: There has been a growing interest in the use of electronic noses (e-noses) in detecting volatile organic compounds in exhaled breath for the diagnosis of cancer. However, no systematic evaluation has been performed of the overall diagnostic accuracy and methodologic challenges of using e-noses for cancer detection in exhaled breath. Objective: To provide an overview of the diagnostic accuracy and methodologic challenges of using e-noses for the detection of cancer. Data Sources: An electronic search was performed in the PubMed and Embase databases (January 1, 2000, to July 1, 2021). Study Selection: Inclusion criteria were the following: (1) use of e-nose technology, (2) detection of cancer, and (3) analysis of exhaled breath. Exclusion criteria were (1) studies published before 2000; (2) studies not performed in humans; (3) studies not performed in adults; (4) studies that only analyzed biofluids; and (5) studies that exclusively used gas chromatography-mass spectrometry to analyze exhaled breath samples. Data Extraction and Synthesis: PRISMA guidelines were used for the identification, screening, eligibility, and selection process. Quality assessment was performed using Quality Assessment of Diagnostic Accuracy Studies 2. Generalized mixed-effects bivariate meta-analysis was performed. Main Outcomes and Measures: Main outcomes were sensitivity, specificity, and mean area under the receiver operating characteristic curve.
Results: This review identified 52 articles with a total of 3677 patients with cancer. All studies were feasibility studies. The sensitivity of e-noses ranged from 48.3% to 95.8% and the specificity from 10.0% to 100.0%. Pooled analysis resulted in a mean (SE) area under the receiver operating characteristic curve of 94% (95% CI, 92%-96%), a sensitivity of 90% (95% CI, 88%-92%), and a specificity of 87% (95% CI, 81%-92%). Considerable heterogeneity existed among the studies because of differences in the selection of patients, endogenous and exogenous factors, and collection of exhaled breath. Conclusions and Relevance: Results of this review indicate that e-noses have a high diagnostic accuracy for the detection of cancer in exhaled breath. However, most studies were feasibility studies with small sample sizes, a lack of standardization, and a high risk of bias. The lack of standardization and reproducibility of e-nose research should be addressed in future research.

Entities:  

Mesh:

Year:  2022        PMID: 35767259      PMCID: PMC9244610          DOI: 10.1001/jamanetworkopen.2022.19372

Source DB:  PubMed          Journal:  JAMA Netw Open        ISSN: 2574-3805


Introduction

Interest in the use of volatile organic compounds (VOCs) in exhaled breath to diagnose cancer has been increasing.[1,2] Volatile organic compounds are degradation products of biochemical processes in the human body that can be detected in exhaled breath. The composition of VOCs in exhaled breath changes because of pathological processes, such as cancer.[3] Because of their low solubility in blood, VOCs diffuse easily into alveolar air and are subsequently excreted via exhaled breath, enabling detection.[4] Disease-specific differences in VOC profiles enable scientists to investigate breath for the detection of specific conditions, including cancer. Breath analysis has several advantages compared with analysis of other clinical samples. Breath testing is painless and noninvasive. In addition, unlike blood and urine samples, breath samples do not need any workup, allowing for immediate analysis and rapid results. The most accurate method for identifying VOCs is a combination of gas chromatography–mass spectrometry (GC-MS),[5] which is highly accurate and allows for the selective detection of individual VOCs. Previous studies[6,7,8,9] that used GC-MS have identified potentially disease-specific VOCs as indicators of several malignant tumors. However, GC-MS is both time-consuming and expensive and can be performed only by trained experts.[10,11,12] A relatively new, emerging technique for analyzing VOCs in exhaled breath can be performed by electronic noses (e-noses). E-noses are portable, cheap, and easy-to-use diagnostic tests that are capable of producing rapid results. Detection of VOC patterns is possible because of binding of VOCs to sensors within e-noses. The binding of VOCs to these sensors generates an electrical response. This electrical response can then be measured. Different types of sensors are used in e-noses, with each sensor type having its own distinct advantages and disadvantages. Unlike GC-MS, e-noses are incapable of identifying individual VOCs. Furthermore, the accuracy of e-noses is affected by endogenous and exogenous factors, such as comorbidities, smoking, diet, body mass index, and ambient air.[13] These factors may alter detected VOC patterns in exhaled breath.[14] E-nose technologies analyzing exhaled breath have been extensively studied for oncologic indications and have been demonstrated to have promising results with high diagnostic accuracies.[15,16,17,18] Several reviews[13,15,19] have investigated the diagnostic accuracy of available e-noses for different indications. Reviewers have uniformly concluded that e-noses have the potential to become promising diagnostic tools in everyday clinical practice. Although these results are encouraging, at the time of this writing, no e-nose is being used in clinical practice to detect malignant tumors. No systematic review is currently available that provides a pooled analysis of the diagnostic performance of e-noses for the detection of cancer. This systematic review with meta-analysis provides an overview of the diagnostic performance of all e-nose technologies currently used for cancer diagnosis in exhaled breath. Furthermore, this review identifies current obstacles and limitations found in this field of research.

Methods

Search Strategy

On January 7, 2021, an electronic search was performed in the PubMed and Embase databases. Keywords such as lung diseases, gastrointestinal diseases, neoplasms, e-nose, electronic nose, volatile organic compounds, diagnosis, sensitivity and specificity, and ROC curve were used as search terms, combined using AND/OR operators. The full search strategy can be found in eTable 1 in the Supplement. This study followed the Preferred Reporting Items for Systematic Reviews and Meta-analyses (PRISMA) reporting guideline. A total of 6253 articles were found. All articles were screened for eligibility by reading the title and abstract. A total of 113 articles were screened for eligibility by 2 independent reviewers (M.H.M.C.S. and Z.A.-D.) who read the full text. Diagnostic studies that met the following inclusion criteria were included: (1) use of e-nose technology, (2) detection of cancer, and (3) analysis of exhaled breath. The exclusion criteria were (1) studies published before 2000, (2) studies not performed in humans, (3) studies not performed in adults, (4) studies that only analyzed biofluids (including urine, blood, and/or feces), and (5) studies that exclusively used GC-MS to analyze breath samples. A total of 52 articles were deemed eligible for inclusion. Figure 1 shows the flow diagram of the identification, screening, eligibility, and selection process using the PRISMA guidelines. The following information was gathered independently by 2 reviewers (M.H.M.C.S. and Z.A.-D.) and tabulated from the articles: author, type of cancer, cancer stage, country of publication, year of publication, sample size, type of e-nose, reference test, method of data analysis, control groups, sensitivity, specificity, accuracy, area under the curve, true-positive results, false-positive results, true-negative results, and false-negative results.
Figure 1.

Identification, Screening, Eligibility, and Selection Process

e-Nose indicates electronic nose; GC-MS, gas chromatography–mass spectrometry.

Identification, Screening, Eligibility, and Selection Process

e-Nose indicates electronic nose; GC-MS, gas chromatography–mass spectrometry.

Quality Assessment

The methodologic quality of the selected studies was assessed by a modified version of the Quality Assessment of Diagnostic Studies 2 tool (QUADAS-2).[20] This modified version was previously constructed by Hanna et al[1] to improve the quality assessment of feasibility studies. Specific changes to the QUADAS-2 tool were focused on the importance given to the inclusion of benign conditions and healthy controls, internal and/or external validation of results, assessment before therapeutic intervention, and reproducibility of the chosen index test. Quality assessment was performed independently by 2 reviewers (M.H.M.C.S. and Z.A.-D.). Any discrepancies were resolved in a consensus meeting between these 2 reviewers. The full version of the modified QUADAS-2 can be found in eTable 2 in the Supplement. The quality of the evidence was assessed by the Rational Clinical Examination Levels of Evidence scale.[21] Quality of evidence is scored from 1/A (high quality) to 5/C (low quality). The full rating scale with a definition of the level of evidence scores is provided in eTable 3 in the Supplement.

Statistical Analysis

A meta-analysis of all e-noses reporting diagnostic performance values was conducted on 46 studies, which included Aeonose (n = 11),[18,22,23,24,25,26,27,28,29,30,31] Cyranose 320 (n = 11),[16,17,32,33,34,35,36,37,38,39,40] PEN3 (n = 3),[41,42,43] and custom-made e-noses (n = 21).[6,8,9,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61] Six studies[62,63,64,65,66,67] did not report diagnostic performance values and, therefore, could not be included in the meta-analysis. Pooled estimates of diagnostic accuracy were calculated using bivariate generalized mixed-effects regression models to account for the negative correlation between sensitivity and specificity.[68] Sensitivity and specificity are estimated with their respective CIs and prediction in the receiver operating characteristic curve. A scatterplot using standardized predicted random effects (standardized level 2 residuals) was used to check for outliers. A spike plot was used for investigating particularly influential studies using Cook distance. Outliers and influential studies were excluded, and a meta-analysis of all available e-noses was performed again. The Deeks funnel plot asymmetry test was performed to test for publication bias. A 2-sided P < .10 was assumed to be statistically significant.[69] To quantify statistical heterogeneity among pooled studies, the I2 index was used. Stata software, version 17 (StataCorp LLC) was used to pool data, and the statistical package MIDAS was used for bivariate meta-analysis.[70] Because of heterogeneity in used sensors, analytical methods, and sampling techniques, separate meta-analyses were performed for the Aeonose and Cyranose 320 to determine the individual diagnostic accuracy of these 2 e-noses. Furthermore, separate pooled analyses were performed for lung cancer (LC), colorectal cancer (CRC), and head and neck cancer (HNC) and for advanced and early tumor stages to determine the diagnostic accuracy of e-noses in detecting different cancer types and stages. No separate analysis was performed for other cancer types and the PEN3 e-nose because of an insufficient number of studies (n < 4) to perform bivariate meta-analysis. Additional pooled analysis of studies with a low or unclear risk of bias in the patient selection domain of the QUADAS-2 tool was performed to assess the influence of studies with a high risk of bias in patient selection on the diagnostic accuracy. Similar pooled analyses were performed for 3 other QUADAS-2 domains: index test, reference test, and flow and timing.

Results

Study Characteristics

The literature search identified 52 publications[6,16,17,18,22,67] that met the inclusion criteria with a total of 3677 patients with cancer across all studies. All studies were feasibility studies. The most commonly used e-noses were the Cyranose 320 (n = 12),[16,17,32,33,34,35,36,37,38,39,40,66] Aeonose (n = 11),[18,22,23,24,25,26,27,28,29,30,31] and PEN3 (n = 3).[41,42,43] Furthermore, a large heterogeneous group of custom-made e-noses were used (n = 26).[6,8,9,44,45,46,47,48,49,50,51,52,53,54,55,56,57,58,59,60,61,62,63,64,65,67] The most commonly used sensors were metal oxide sensors. Aeonose, PEN3, and several e-nose prototypes use metal oxide sensors to detect VOCs. Other sensor types reported were quartz microbalance sensors, conducting polymers, and a variety of nanomaterial-based sensors. Characteristics of the studies, including used sensor types, are summarized in the Table.
Table.

Characteristics and Outcomes of All Studies Included in the Qualitative Analysis

SourceCancer type/stagePatients with cancer, No.ControlsModelSensitivity, %Specificity, %AUC, %Accuracy, %eNoseStatistical methodQuality of evidencea
Altomare et al,[41] 2016CRC/primarily advanced stage15HCs (n = 15), benign (n = 15)CRC (n = 15) vs HCs (n = 10)9310NR38PEN3: 10 MOSPNN4/C
Amal et al,[6] 2014OC/mixed48Benign (n = 86), HCs (n = 48)OC (n = 48) vs HC and benign (n = 134)7171NR71Prototype: 9 nanomaterial sensors (GNP and SWCNTs)DFA4/C
Amal et al,[8] 2015CRC/primarily early stage65Benign (n = 22), HCs (n = 122)CRC (n = 16) vs benign (n = 16)9488NR91Prototype: 6 nanomaterial sensors (GNP and SWCNTs)DFA4/C
Amal et al,[44] 2016GC/primarily advanced stage99Benign (n = 385)GC (n = 30) vs OLGIM 0-IV (n = 95)7398NR92Prototype: 8 nanomaterial sensors (GNP and SWCNTs)DFA4/C
Barash et al,[62] 2015BC/NR169Benign (n = 52), HCs (n = 30)BC vs benign and HCs (n = 140)84809083Prototype: 40 nanomaterial sensors (GNP and SWCNTs)DFA4/C
Broza et al,[45] 2012LC/early stage12Benign (n = 5)LC (n = 12) vs benign (n = 5)10080NR94Prototype: 25 nanomaterial sensors (GNP and PtNPs)DFA4/C
Capuano et al,[63] 2015LC/NR20Benign (n = 10)LC (n = 20) vs benign (n = 10)NRNRNR93LibraNose: 8 QMB sensorsPLS-DA4/C
Chapman et al,[32] 2012MPM/early stage20HCs (n = 42), benign (n = 18)MPM (n = 10) vs HCs (n = 32)9091NR95Cyranose 320: 32 conducting polymer sensorsPCA4/C
Chen et al,[46] 2021LC/primarily advanced stage101HCs (n = 134)LC (n = 101) vs HCs (n = 134)95.691.1NR93.6Prototype: 11 sensors: MOS, EGS, HWGS CCGSKPCA-XGBoost4/C
Chen Q et al,[47] 2020LC/NR48HC (n = 48)LC (n = 24) vs HCs (n = 25)9696100NRPrototype: GO sensorLDA4/C
de Kort et al,[23] 2018LC/primarily advanced stage144HCs and suspected (n = 146)NSCLC (n = 144) vs HCs and suspected (n = 146)94.432.976NRAeonose: 3 MOSANN–cross-validation (Aethena)4/C
de Kort et al,[22] 2020LC/primarily advanced stage138HCs (n = 84), suspected (n = 59)NSCLC (n = 138) vs non-LC (n = 143)94.244.175NRAeonose: 3 MOSANN (Aethena)4/C
de Vries et al,[64] 2015LC/primarily advanced stage31Benign (n = 68), HCs (n = 45)LC (n = 31) vs benign (n = 31)NRNR9588SpiroNose: 4 MOS sensorsPCA4/C
Di Natale et al,[65] 2003LC/mixed35HCs (n = 18)LC (n = 35) and HCs (n = 18)NRNRNR94Prototype: 8 QMB sensorsPLS-DA4/C
Diaz de Leon-Martinez et al,[33] 2020BC/mixed262HCs (n = 181)BC (n = 262) vs HCs (n = 181)100100NR98.7Cyranose 320: 32 conducting polymer sensorsCDA4/C
Dragonieri et al,[66] 2009LC/mixed10Benign (n = 10), HCs (n = 10)LC (n = 10) vs benign (n = 10)NRNRNR85Cyranose 320: 32 conducting polymer sensorsPCA4/C
Dragonieri et al,[16] 2012MPM/primarily early stage13HCs (n = 13), AEx (n = 13)MPM (n = 13) vs AEx (n = 13)92869281Cyranose 320: 32 conducting polymer sensorsCVA4/C
Gasparri et al,[48] 2016LC/primarily early stage70HCs (n = 76)LC (n = 21) and HCs (n = 20)8110087NRPrototype: 8 QMB sensorsPLS-DA4/C
Gruber et al,[9] 2014HNSCC/mixed22Benign (n = 21), HCs (n = 19)HNSCC (n = 22) and HCs (n = 19)7790NR83Prototype:6 nanomaterial sensors (GNP and SWCNTs)DFA4/C
Hakim et al,[49] 2011HNSCC/primarily advanced stage22HCs (n = 40)HNSCC (n = 16) and HCs (n = 24)92100NR95NA-NOSE: 5 GNP sensorsPCA4/C
Herman-Saffar et al,[34] 2018BC/early stage48HCs (n = 45)BC (n = 33) vs HCs (n = 32)4862NR55Cyranose 320: 32 conducting polymer sensorsFE-ANN (1000-fold)4/C
Huang et al,[35] 2018LC/primarily early stage56Nontumor controls (n = 188)LC (n = 12) vs non-LC (n = 29)8386NR85Cyranose 320: 32 conducting polymer sensorsSVM (external validation)4/C
Hubers et al,[36] 2014LC/primarily advanced stage20Benign (n = 31)LC (n = 18) vs benign (n = 8)941366NRCyranose 320: 32 conducting polymer sensorsPCA4/C
Kononov et al,[50] 2019LC/mixed65HCs (n = 53)LC (n = 65) vs HCs (n = 53)9510095.697.2Prototype: 6 MOSLRA without PCA decomposition4/C
Krauss et al,[24] 2020LC/primarily advanced stage120Benign (n = 197), HCs (n = 33)LC (n = 91) vs HCs (n = 33)84979273Aeonose: 3 MOSANN (Aethena)4/C
Lamote et al,[37] 2017MPM/NR14HCs (n = 16), AEx (n = 19), benign ARD (n = 15)MPM (n = 11) vs benign (n = 27)82557574Cyranose 320: 32 conducting polymer sensorsPCA4/C
Leja et al,[51] 2021GC/primarily advanced stage94HCs (n = 180)GC (n = 31) vs HCs (n = 65)87859285SniffPhone (SGNPs)LDA4/C
Leunis et al,[52] 2014HNSCC/primarily advanced stage36Benign (n = 23)HNSCC (n = 36) vs benign (n = 23)90808985Prototype:12 MOSLRA4/C
Li et al,[53] 2017LC/NR24HCs (n = 23), benign (n = 5)LC (n = 24) vs non-LC (n = 28)9292NR92Prototype: 14 sensors, MOS HWG, CCGS, EGSLDA (fuzzy 5-NN)4/C
Li et al,[67] 2020LC/primarily advanced stage115HCs (n = 153)LC (n = 115) vs HCs (n = 153)NRNR87NRPrototype: 10 sensors, MOS HWG, CCGS, EGSLDA combined with PCA, Fast ICA, NMF, and Kbest4/C
Liu et al,[54] 2021LC/advanced stage98HCs (n = 116)LC (n = 98) vs HCs (n = 116)95.397.2NR96.1Prototype: 11 sensors, MOS HWG, CCGS, EGSPCA-SVE4/C
Machado et al,[17] 2005LC/primarily advanced stage14Benign (n = 62)LC (n = 14) vs non-LC (n = 62)7192NR85Cyranose 320: 32 conducting polymer sensorsSVM4/C
Marzorati et al,[55] 2019LC/early stage6HCs (n = 10)LC (n = 6) vs HCs (n = 10)86100NR94Prototype: 4 MOSANN (LOOCV)4/C
Mohamed et al,[25] 2021OSCC/primarily advanced stage49HCs (n = 35)OSCC (n = 49) vs HCs (n = 35)88718681Aeonose: 3 MOSANN (Aethena)4/C
Mohamed et al,[42] 2019LC/primarily advanced stage50Benign (n = 50)LC (n = 28) vs benign (n = 20)9390NR92PEN3: 10 MOSPCA and ANN4/C
Peled et al,[56] 2012LC/mixed53Benign (n = 19)LC (n = 50) vs benign (n = 19)86969988Prototype: 18 nanomaterial sensors (GNPs and SWCNTs)DFA4/C
Raspagliesi et al,[43] 2020OC/primarily advanced stage86HCs (n = 114), benign (n = 51)OC (n = 28) vs HCs and benign (n = 55)8293NR87PEN3: 10 MOSKNN (strict prediction)4/C
Rocco et al,[57] 2016LC/advanced stage23HCs (n = 77)LC (n = 23) vs HCs (n = 77)869587NRBIONOTE: 7 Acoustic-mass sensorsPLS-DA4/C
Schuermans et al,[26] 2018GC/NR16HCs (n = 28)GC (n = 16) vs benign (n = 28)8171NR75Aeonose: 3 MOSANN (Aethena)4/C
Shehada et al,[58] 2016LC/primarily advanced stage149Benign (n = 56), HCs (n = 129)LC n = 149 vs non-LC n = 568975NR86Prototype: SNSDFA4/C
Shlomi et al,[59] 2017LC/primarily advanced stage89Benign (n = 30)LC (n = 16) vs benign (n = 30)7593NR87Prototype: 40 nanomaterial sensorsDFA4/C
Steenhuis et al,[27] 2020Recurrent CRC/primarily advanced stage26No recurrence (n = 36)CRC positive (n = 26) vs CRC negative (n = 36)88758681Aeonose: 3 MOSANN (Aethena)4/C
Tan et al,[60] 2016LC/advanced stage12Benign (n = 12), HCs (n = 13)LC (n = 12) vs non-LC (n = 25)838886NRPrototype: chemiresistor-based alkane sensorMANOVA4/C
Tirzite et al,[38] 2017LC/NR165HCs (n = 79), benign (n = 91)LC (n = 45) vs HCs (n = 16)9869NR90Cyranose 320: 32 conducting polymer sensorsSVM4/C
Tirzite et al,[39] 2019LC/advanced stage252Benign and HCs (n = 223)LC (n = 119) vs non-LC (n = 91)95.892.3NRNRCyranose 320: 32 conducting polymer sensorsLRA4/C
van de Goor et al,[28] 2018LC/primarily advanced stage52HCs (n = 93)LC (n = 8) vs HCs (n = 14)8886NR86Aeonose: 3 MOSANN (Aethena)4/C
van de Goor et al,[18] 2019Recurrent HNSCC/mixed20No recurrence (n = 20)HNSCC recurrence positive (n = 20) vs HNSCC recurrence negative (n = 20)85808583Aeonose: 3 MOSANN (Aethena)4/C
van de Goor et al,[29] 2020HNSCC/mixed91HCs (n = 72)HNSCC (n = 91) vs HCs (n = 72)79637572Aeonose: 3 MOSANN (Aethena)4/C
van Keulen et al,[30] 2020CRCr/mixed70Benign (n = 234), HCs (n = 128)CRC (n = 62) vs HCs (n = 104)956484NRAeonose: 3 MOSANN (Aethena)4/C
Waltman,[31] 2020PC/primarily early stage32Benign (n = 53)PC (n = 32) and benign (n = 53)84707977Aeonose: 3 MOSANN (Aethena)4/C
Xu et al,[61] 2013GC/mixed37Benign (n = 93)GC (n = 37) vs benign (n = 93)8990NR90Prototype:14 nanomaterial sensors (GNP and SWCNTs);DFA4/C
Yang et al,[40] 2021BC/NR351HCs (n = 88)BC (n = 351) vs HCs (n = 88)86979991Cyranose 320: 32 conducting polymer sensorsRandom forest4/C

Abbreviations: AEx, asymptomatic former asbestos exposure; ANN, artificial neural network; ARD, asbestos-related disease; BC, breast cancer; CCGS, catalytic combustion gas sensor; CDA, canonical discriminant analysis; CRC, colorectal cancer; DFA, discriminate function analysis; EGS, electrochemical gas sensor; FE, feature extraction; GC, gastric cancer; GNP, gold nanoparticles; GO, graphene oxide; HCs, healthy controls; HNSCC, head and neck squamous cell carcinoma; HWGS, hot wire gas sensor; ICA, independent component analysis; KNN, K-nearest neighbors; KPCA, kernel principal component analysis; LC, lung cancer; LDA, linear discriminant analysis; LOOCV, leave-one-out cross-validation; LRA, logistic regression analysis; MANOVA, multivariate analysis of variance; MOS, metal oxide sensor; MPM, malignant pleural mesothelioma; NMF, nonnegative matrix factorization; NN, neural network; NR, not reported; NSCLC, non–small cell lung carcinoma; OC, ovarian cancer; OLGIM, operative link on gastric intestinal metaplasia; OSCC, oral squamous cell carcinoma; PCA, principal component analysis; PLS-DA, partial least-squares discriminant analysis; PC, prostate cancer; PNN, probabilistic neural network; PtNP, platinum nanoparticles; QMB, quartz microbalance; SNS, silicon nanowire sensor; SVE, statistical volume element; SVM, support vector machines; SWCNT, single-walled carbon nanotubes; XGBoost, eXtreme Gradient Boosting.

Quality of evidence rating conducted by the Rational Clinical Examination Levels of Evidence scale (for full details of the scale, see eTable 3 in the Supplement).

Abbreviations: AEx, asymptomatic former asbestos exposure; ANN, artificial neural network; ARD, asbestos-related disease; BC, breast cancer; CCGS, catalytic combustion gas sensor; CDA, canonical discriminant analysis; CRC, colorectal cancer; DFA, discriminate function analysis; EGS, electrochemical gas sensor; FE, feature extraction; GC, gastric cancer; GNP, gold nanoparticles; GO, graphene oxide; HCs, healthy controls; HNSCC, head and neck squamous cell carcinoma; HWGS, hot wire gas sensor; ICA, independent component analysis; KNN, K-nearest neighbors; KPCA, kernel principal component analysis; LC, lung cancer; LDA, linear discriminant analysis; LOOCV, leave-one-out cross-validation; LRA, logistic regression analysis; MANOVA, multivariate analysis of variance; MOS, metal oxide sensor; MPM, malignant pleural mesothelioma; NMF, nonnegative matrix factorization; NN, neural network; NR, not reported; NSCLC, non–small cell lung carcinoma; OC, ovarian cancer; OLGIM, operative link on gastric intestinal metaplasia; OSCC, oral squamous cell carcinoma; PCA, principal component analysis; PLS-DA, partial least-squares discriminant analysis; PC, prostate cancer; PNN, probabilistic neural network; PtNP, platinum nanoparticles; QMB, quartz microbalance; SNS, silicon nanowire sensor; SVE, statistical volume element; SVM, support vector machines; SWCNT, single-walled carbon nanotubes; XGBoost, eXtreme Gradient Boosting. Quality of evidence rating conducted by the Rational Clinical Examination Levels of Evidence scale (for full details of the scale, see eTable 3 in the Supplement). Fourteen studies[18,22,23,24,25,26,27,28,29,30,31,51,57,64] used an e-nose that enabled direct breath sampling. Eleven of these studies[18,22,23,24,25,26,27,28,29,30,31] used the Aeonose, whereas other studies used Spironose (Breathomix),[64] Bionote (Campus Bio-Medico University),[57] and SniffPhone (Technion Institute of Technology).[51] Thirty-four studies[6,8,9,16,18,32,35,36,37,38,39,40,41,42,43,44,45,46,48,49,52,53,54,55,56,58,59,61,62,63,64,65,66,67] used sampling bags to collect exhaled breath. Of these, 19 studies[6,16,35,36,37,40,41,46,48,52,53,54,55,58,61,62,63,66,67] used Tedlar bags, 6 studies[9,17,45,49,56,58] used Mylar bags, 3 studies[8,44,59] used GasAmpler bags, and 6 studies[32,38,39,42,43,65] used other or unspecified bags. A variety of statistical methods were used to analyze VOC patterns in exhaled breath. Artificial neural networks were the most frequently reported analytical method. Other examples of reported analytical methods were discriminate function analysis, principal component analysis, partial least squares discriminant analysis, and canonical discriminant analysis. The Table provides an overview of the analytical methods reported. The number of included patients with cancer ranged from 10 to 351. Types of cancer studied were lung (n = 28), head and neck (n = 5), gastric (n = 4), breast (n = 4), colorectal (n = 4), mesothelioma (n = 3), oral cavity (n = 2), ovarian (n = 1), and prostate (n = 1) (Table). Most studies compared patients with cancer with a control group of healthy volunteers and/or patients with benign disease. Most studies did not perform any diagnostic tests to exclude malignant tumors in healthy volunteers. Histopathological analysis was used for diagnostic confirmation of malignant tumors in 50 studies; 2 studies[47,65] did not report the method used for diagnostic confirmation of malignant tumors. Studies included patients with early and advanced tumor stages, often with various histological types. Tumor stage was not reported in 8 studies.[26,37,38,40,45,47,53,63] Consideration and reporting of exogenous and endogenous factors during exhaled breath collection differed largely among studies. A variety of measures were implemented to reduce possible effects of confounding factors on exhaled VOCs. Fasting before breath sampling was performed in 30 studies (58%).[6,8,9,16,32,33,35,36,41,43,44,45,46,47,48,50,51,52,53,54,55,57,58,59,60,61,62,65,66,67] Duration of fasting differed largely among these studies. Cessation of smoking was applied in 24 studies (46%).[8,9,24,33,35,36,40,43,44,45,47,48,50,51,53,54,55,57,58,59,60,61,66,67] Minimum duration of cessation also differed among studies. In 5 studies (10%),[32,51,53,61,67] patients were asked to rest for a certain amount of time before breath sampling. Other measures to reduce potential exogenous confounding factors included rinsing of the mouth, cessation of perfume, cessation of pungent food, and use of inspiratory VOC filters. Furthermore, a variety of comorbidities were considered as having a potential confounding effect on VOCs in exhaled breath. In most studies, patients with certain comorbidities were excluded from participation. Examples of several endogenous and exogenous factors and measures to reduce their influence are given in eTable 4 in the Supplement.

Diagnostic Accuracy

Sensitivity of all e-noses ranged from 48.3% to 95.8% for the detection of cancer. Specificity ranged from 10.0% to 100.0%. Pooled receiver operating characteristic analysis of all e-noses resulted in a pooled area under the curve of 94% (95% CI, 92%-96%), sensitivity of 90% (95% CI, 88%-92%), and specificity of 87% (95% CI, 81%-92%) (Figure 2 and Figure 3). Pooled studies had an I2 index of 76.41% for sensitivity and 95.53% for specificity, which corresponds to a high statistical heterogeneity. Additional meta-analysis after exclusion of outliers and influential studies (eFigure 1 in the Supplement) showed similar results, with a pooled sensitivity of 90% (95% CI, 87%-91%) and a pooled specificity of 89% (95% CI, 84%-92%), indicating limited influence of outliers and influential studies on the results (eFigure 2 in the Supplement). The Deeks funnel plot asymmetry test showed a significant funnel plot asymmetry (intercept, 7.01%; 95% CI, 5.01%- 9.01%; P = .02) (eFigure 3 in the Supplement).
Figure 2.

Summary Receiver Operating Characteristic (SROC) Curve Analysis of All Electronic Noses

For the summary operating point, sensitivity was 0.90 (95% CI, 0.88-0.92) and specificity was 0.87 (95% CI, 0.81-0.92). For the SROC curve, the area under the curve was 0.94 (95% CI, 0.92-0.95).

Figure 3.

Pooled Sensitivity and Specificity Analyses of All Electronic Noses

Summary Receiver Operating Characteristic (SROC) Curve Analysis of All Electronic Noses

For the summary operating point, sensitivity was 0.90 (95% CI, 0.88-0.92) and specificity was 0.87 (95% CI, 0.81-0.92). For the SROC curve, the area under the curve was 0.94 (95% CI, 0.92-0.95). A separate pooled analysis for the Cyranose 320 demonstrated its ability to detect cancer with a sensitivity of 93% (95% CI, 85%-97%; I2 = 83.28%) and a specificity of 89% (95% CI, 72%-96%; I2 = 89.80%) (eFigure 4 in the Supplement). Separate pooled analysis for the Aeonose showed a sensitivity of 88% (95% CI, 83%-92%; I2 = 71.92%) and a specificity of 70% (95% CI, 57%-80%; I2 = 90.25%) (eFigure 5 in the Supplement). Separate pooled analyses of LC, HNC, and CRC resulted in a sensitivity of 92% (95% CI, 89%-94%; I2 = 80.87%) and a specificity of 91% (95% CI, 83%-95%; I2 = 96.07%) for LC, a sensitivity of 85% (95% CI, 77%-90%; I2 = 50.49%) and a specificity of 85% (95% CI, 68%-94%; I2 = 82.86%) for HNC, and a sensitivity of 93% (95% CI, 87%-97%; I2 = 0.00%) and a specificity of 59% (95% CI, 24%-87%; I2 = 92.56%) for CRC (eFigures 6, 7, and 8 in the Supplement, respectively). Additional sensitivity analysis for advanced and early tumor stage resulted in similar accuracies, with a sensitivity of 90% (95% CI, 87%-93%; I2 = 75.11%) and a specificity of 86% (95% CI; 75%-93%; I2 = 96.20%) for advanced stage tumors and a sensitivity of 89% (95% CI, 83%-93%; I2 = 0.00%) and a specificity of 86% (95% CI, 77%-92%; I2 = 57.94%) for early stage tumors (eFigures 9 and 10 in the Supplement). Pooled analysis of studies with a low or unclear risk of bias in the patient selection domain of the QUADAS-2 tool resulted in a sensitivity of 91% (95% CI, 0.87%-0.94%; I2 = 83.56%) and a specificity of 87% (95% CI, 0.77%-0.93%; I2 = 96.69%) (eFigure 11 in the Supplement). Separate analyses of the other QUADAS-2 risk of bias domains resulted in similar diagnostic accuracies (eFigures 12-14 in the Supplement). The results of the quality assessment using QUADAS-2 are shown in eTables 5 and 6 in the Supplement. Overall, a high risk of bias was found in most studies. Regarding the use of the e-nose (index test), a total of 43 studies[6,16,17,18,22,23,24,25,26,27,28,29,30,31,32,33,34,35,36,37,38,39,40,41,42,43,45,47,48,49,50,52,54,55,56,57,58,59,60,63,64,65,66] (83%) had a high risk of bias, mainly because of a lack of standardization and reproducibility of methods. Regarding flow and timing, a high risk of bias was found in 28 studies[16,17,22,23,24,25,28,29,31,32,34,36,40,45,46,48,49,52,53,54,55,56,58,60,63,64,65,67] (54%). The main cause of the high risk of bias for flow and timing was insufficient consideration of endogenous and exogenous factors. Regarding patient selection, a high risk of bias was found in 21 studies[6,16,28,29,31,34,35,40,42,46,48,52,53,55,58,61,63,64,65,66,67] (40%). The main cause of the high risk of bias for patient selection was the lack of matching patient groups. The risk of bias was lowest for the reference standard criterion, with only 2 studies[40,46] (4%) showing high risk of bias. Other causes of bias included the absence of a validation set and the failure to report a time interval between the index test and the reference test. Furthermore, a significant concern was raised regarding the applicability of the study design to the study question as a result of inadequate patient selection criteria. This concern resulted in a high-applicability concern for patient selection criteria in 22 studies[9,16,17,22,23,24,28,29,35,38,39,43,46,48,50,51,58,63,64,65,66,67] (42%). No significant concerns were found regarding the applicability of the index test and reference test to the study question. All studies had a Rational Clinical Examination Level of Evidence rating of 4/C (Table; eTable 3 in the Supplement).

Discussion

We conducted a systematic review and meta-analysis of studies that investigated the diagnostic accuracy of e-noses in detecting cancer from exhaled breath samples. Pooled analysis of all e-noses demonstrated a high diagnostic accuracy of e-noses for the detection of cancer, with a pooled sensitivity of 90% and specific of 87%. The Aeonose and Cyranose 320 demonstrated similarly high diagnostic accuracies in detecting cancer. Furthermore, e-noses showed a high diagnostic accuracy in detecting LC, HNC, and CRC, although e-noses had a relatively low specificity in detecting CRC compared with LC and HNC. The high diagnostic accuracy of e-noses found in this study is in line with results from previous reviews[2,19] that demonstrated similar diagnostic performance in detecting cancer in exhaled breath. Although these results are promising, they should be interpreted with caution because of high heterogeneity among studies, a high risk of bias found in most studies, and the potential presence of publication bias. Before e-noses can be implemented in daily clinical practice, several important issues must be addressed. Various analytical methods, such as machine learning, were used to analyze VOCs in exhaled breath. Most studies here did not adequately explain how these analytical techniques were used, which means these e-nose studies have limited reproducibility. Because of the complex nature of machine learning, in their review, Sar et al[13] recommended that an open-source database be constructed wherein data and know-how about machine learning could be freely and fairly exchanged. Such a development would improve the reproducibility and standardization of the analytical methods used in e-nose research. One of the main technical disadvantages inherent to e-noses is sensor drift, which is defined as a gradual change in sensor output, independent of changes in measured sample or input. This change may lead to a gradual decrease in instrument sensitivity, which could lead to a false diagnosis.[19,71] Sensor drift is caused by a variety of factors. Although several techniques have been proposed and developed to compensate for sensor drift,[71,72] few studies in this review mentioned implementing these techniques. Furthermore, it is important to note that e-nose sensors have a limited sensor life, after which their sensitivity decreases.[73] A potential solution to limited sensor life and sensor drift could be the use of disposable sensors.[19] In concordance with several other reviews,[13,15,19] a general lack of standardization was observed in study design and reporting of methods and results. Studies differed substantially with regard to patient selection criteria, consideration of endogenous and exogenous factors, exhaled breath collection, and analysis. Differences in study design might be the cause of the high statistical heterogeneity found in the pooled analysis. Separate analysis of the Cyranose 320 and Aeonose did not result in a significant reduction of statistical heterogeneity. This finding suggests that technological differences among e-noses are not the main cause of the statistical heterogeneity found. A lack of standardization in study design and reporting resulted in an overall low methodologic quality, with most studies having a risk of bias. Recently, Hanna et al[1] constructed a comprehensive framework for the standardization of VOC-based studies. Conducting e-nose studies in a standardized manner using such a framework would improve the overall quality of e-nose research. Many of the studies did not report on factors that could influence VOC analysis. Several endogenous and exogenous factors may affect breath profile, such as smoking, comorbidities, diet, age, sex, body mass index, and medication.[74] However, the influence of such factors on breath prints needs to be investigated further. The reported measures to minimize the effects of these factors varied among studies. Insufficient consideration of these factors limits the validity and generalizability of results. Future research should thus focus on conducting more preclinical studies to investigate the effects of potential confounders on patients’ breath prints. Most studies did not perform any diagnostic tests to exclude malignant tumors in healthy volunteers, so it is possible that healthy controls had an underlying malignant tumor. Most studies were feasibility studies with relatively small sample sizes used for internal validation, and although calculating correct sample sizes for internal validation fell outside the scope of this review, an extensive guide for sample size calculations has recently been published.[75] Adequately powered external validation studies to confirm the results of these feasibility studies are rarely conducted. Because e-noses are particularly sensitive to changes in endogenous and exogenous factors, the diagnostic accuracy may vary among different research settings and patient groups. Therefore, it is important to conduct large, multicenter external validation studies to investigate the generalizability and reproducibility of the results in different research settings and different patient groups. Separate analyses of LC, HNC, and CRC studies demonstrated that e-noses have a slightly higher diagnostic accuracy in detecting LC, with a pooled sensitivity of 92% and a pooled specificity of 91%. However, the diagnostic accuracy of the other cancer types was similarly high, with the exception of CRC, which had a lower specificity compared with other cancer types. Future studies should further investigate which cancer type could benefit most from the use of e-noses using exhaled breath.

Limitations

This study has limitations. Substantial heterogeneity was observed among studies, making the interpretation of the results of the meta-analysis difficult. Furthermore, significant funnel plot asymmetry was detected, which might indicate the presence of publication bias. Although funnel plots are widely used to investigate publication bias, funnel plot asymmetry might have other possible causes, such as poor methodologic design, heterogeneity, and chance.[76] In addition, a large number of studies had a high risk of bias, although sensitivity analyses by excluding studies with a high risk of bias had a limited effect on the pooled diagnostic accuracy. All studies included here were diagnostic cross-sectional studies with a case-control study design. This study design is not an accurate representation of daily practice in which consecutive patients would undergo a diagnostic test. Furthermore, a significant number of studies primarily included patients with advanced stages of cancer. The discriminatory ability of e-noses for patients with early and advanced stages of disease is still under investigation. This review did not include studies that used e-noses to analyze biofluids, such as urine and blood. There is a possibility that biofluids could potentially be more suitable than exhaled breath for VOC analysis for certain cancers.

Conclusions

The results of this systematic review with meta-analysis indicate that e-noses have a relatively high diagnostic accuracy in the detection of cancer in exhaled breath. However, the existing e-nose studies generally consist of feasibility studies with small sample sizes, a lack of standardization, and a high risk of bias. Thus, there is a need for adequately powered, multicenter external validation studies to establish the potential of e-noses in the diagnostic workup of cancer. Before clinical implementation can be realized, the lack of standardization and reproducibility in the field of e-nose research must be addressed.
  73 in total

1.  Detection of lung cancer by sensor array analyses of exhaled breath.

Authors:  Roberto F Machado; Daniel Laskowski; Olivia Deffenderfer; Timothy Burch; Shuo Zheng; Peter J Mazzone; Tarek Mekhail; Constance Jennings; James K Stoller; Jacqueline Pyle; Jennifer Duncan; Raed A Dweik; Serpil C Erzurum
Journal:  Am J Respir Crit Care Med       Date:  2005-03-04       Impact factor: 21.405

2.  The performance of tests of publication bias and other sample size effects in systematic reviews of diagnostic test accuracy was assessed.

Authors:  Jonathan J Deeks; Petra Macaskill; Les Irwig
Journal:  J Clin Epidemiol       Date:  2005-09       Impact factor: 6.437

3.  Recognizing lung cancer and stages using a self-developed electronic nose system.

Authors:  Ke Chen; Lei Liu; Bo Nie; Binchun Lu; Lidan Fu; Zichun He; Wang Li; Xitian Pi; Hongying Liu
Journal:  Comput Biol Med       Date:  2021-02-23       Impact factor: 4.589

4.  Integration of electronic nose technology with spirometry: validation of a new approach for exhaled breath analysis.

Authors:  R de Vries; P Brinkman; M P van der Schee; N Fens; E Dijkers; S K Bootsma; F H C de Jongh; P J Sterk
Journal:  J Breath Res       Date:  2015-10-15       Impact factor: 3.262

5.  Recognizing lung cancer using a homemade e-nose: A comprehensive study.

Authors:  Wang Li; Ziru Jia; Dandan Xie; Ke Chen; Jianguo Cui; Hongying Liu
Journal:  Comput Biol Med       Date:  2020-03-19       Impact factor: 4.589

6.  Identification of profiles of volatile organic compounds in exhaled breath by means of an electronic nose as a proposal for a screening method for breast cancer: a case-control study.

Authors:  Lorena Díaz de León-Martínez; Maribel Rodríguez-Aguilar; Patricia Gorocica-Rosete; Carlos Alberto Domínguez-Reyes; Verónica Martínez-Bustos; Juan Alberto Tenorio-Torres; Omar Ornelas-Rebolledo; José Alfonso Cruz-Ramos; Berenice Balderas-Segura; Rogelio Flores-Ramírez
Journal:  J Breath Res       Date:  2020-09-22       Impact factor: 3.262

7.  Breath testing as potential colorectal cancer screening tool.

Authors:  Haitham Amal; Marcis Leja; Konrads Funka; Ieva Lasina; Roberts Skapars; Armands Sivins; Guntis Ancans; Ilze Kikuste; Aigars Vanags; Ivars Tolmanis; Arnis Kirsners; Limas Kupcinskas; Hossam Haick
Journal:  Int J Cancer       Date:  2015-08-07       Impact factor: 7.396

8.  A survey on gas sensing technology.

Authors:  Xiao Liu; Sitian Cheng; Hong Liu; Sha Hu; Daqiang Zhang; Huansheng Ning
Journal:  Sensors (Basel)       Date:  2012-07-16       Impact factor: 3.576

9.  Detecting head and neck squamous carcinoma using a portable handheld electronic nose.

Authors:  Rens M G E van de Goor; Michel R A van Hooren; Darius Henatsch; Bernd Kremer; Kenneth W Kross
Journal:  Head Neck       Date:  2020-06-03       Impact factor: 3.147

10.  Lung Cancer Screening Based on Type-different Sensor Arrays.

Authors:  Wang Li; Hongying Liu; Dandan Xie; Zichun He; Xititan Pi
Journal:  Sci Rep       Date:  2017-05-16       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.