Yan Wang1, Qing Wang, Jing Feng, Qi Wu. 1. Respiratory Department of Tianjin Medical University General Hospital, Tianjin, People's Republic of China.
Abstract
BACKGROUND: The purpose of this paper was to compare the sensitivity, specificity, and overall diagnostic performance of autofluorescence imaging bronchoscopy (AFI) versus white light bronchoscopy (WLB) in the detection of lung cancers and precancerous lesions by meta-analysis. METHODS: We performed a literature search using the PubMed and EMBASE databases to identify studies published between March 1991 and March 2012. Article selection, quality assessment, and data extraction were then performed. The pooled sensitivity, specificity, diagnostic odds ratio, and area under the curve of the summary receiver operating characteristic for AFI versus WLB were calculated using Stata version 12.0 software. RESULTS: Six studies were included in the meta-analysis. The pooled sensitivity of AFI and WLB was 0.89 (95% confidence interval [CI] 0.81-0.94) and 0.67 (95% CI 0.46-0.83) and the pooled specificity of AFI and WLB was 0.64 (95% CI 0.37-0.84) and 0.84 (95% CI 0.74-0.91), respectively. The diagnostic odds ratio for AFI and WLB was 14.5 (95% CI 3.76-55.63) and 10.9 (95% CI 3.12-38.21), and the area under the curve for AFI and WLB was 0.89 (95% CI 0.86-0.92) and 0.85 (95% CI 0.81-0.88), respectively. The pooled positive and negative likelihood ratios were 2.5 (95% CI 1.21-4.97) and 0.17 (95% CI 0.08-0.36) for AFI, and the corresponding values for WLB were 4.3 (95% CI 2.13-8.52) and 0.39 (95% CI 0.21-0.73). The pooled positive likelihood ratio for AFI and WLB was not higher than 10, and the pooled negative likelihood ratio for AFI and WLB was not lower than 0.1. CONCLUSION: The sensitivity of AFI is higher than that of WLB, while the specificity of AFI is lower than that of WLB. The overall diagnostic performance of AFI is slightly better than that of WLB in detecting lung cancers and precancerous lesions. AFI should find its place in routine bronchoscopic examination and may improve the diagnostic outcome on endoscopy.
BACKGROUND: The purpose of this paper was to compare the sensitivity, specificity, and overall diagnostic performance of autofluorescence imaging bronchoscopy (AFI) versus white light bronchoscopy (WLB) in the detection of lung cancers and precancerous lesions by meta-analysis. METHODS: We performed a literature search using the PubMed and EMBASE databases to identify studies published between March 1991 and March 2012. Article selection, quality assessment, and data extraction were then performed. The pooled sensitivity, specificity, diagnostic odds ratio, and area under the curve of the summary receiver operating characteristic for AFI versus WLB were calculated using Stata version 12.0 software. RESULTS: Six studies were included in the meta-analysis. The pooled sensitivity of AFI and WLB was 0.89 (95% confidence interval [CI] 0.81-0.94) and 0.67 (95% CI 0.46-0.83) and the pooled specificity of AFI and WLB was 0.64 (95% CI 0.37-0.84) and 0.84 (95% CI 0.74-0.91), respectively. The diagnostic odds ratio for AFI and WLB was 14.5 (95% CI 3.76-55.63) and 10.9 (95% CI 3.12-38.21), and the area under the curve for AFI and WLB was 0.89 (95% CI 0.86-0.92) and 0.85 (95% CI 0.81-0.88), respectively. The pooled positive and negative likelihood ratios were 2.5 (95% CI 1.21-4.97) and 0.17 (95% CI 0.08-0.36) for AFI, and the corresponding values for WLB were 4.3 (95% CI 2.13-8.52) and 0.39 (95% CI 0.21-0.73). The pooled positive likelihood ratio for AFI and WLB was not higher than 10, and the pooled negative likelihood ratio for AFI and WLB was not lower than 0.1. CONCLUSION: The sensitivity of AFI is higher than that of WLB, while the specificity of AFI is lower than that of WLB. The overall diagnostic performance of AFI is slightly better than that of WLB in detecting lung cancers and precancerous lesions. AFI should find its place in routine bronchoscopic examination and may improve the diagnostic outcome on endoscopy.
Recent epidemiological research shows that lung cancer is the most common cause of cancer-related death, and there is no doubt that early detection followed by prompt treatment can improve the outcome for patients with lung cancer and afford a better chance of cure.1 In patients with invasive forms of the disease, earlier detection of synchronous lesions may lead to significant changes in the clinical management protocol.2,3There are several types of lung cancer, including small and nonsmall cell lung cancer. These cancers may arise from the peripheral (bronchioles and alveoli) or central (the major bronchi, central airways) structures of the lung. Adenocarcinoma and large cell carcinoma usually arise peripherally, whereas squamous cell carcinoma and small cell lung cancer usually arise centrally. Different approaches are required to detect cancer in different compartments of the lung.Bronchoscopy is a desirable technology for detecting lung cancers arising from the central airways. However, even when performed by experienced bronchoscopists, it is still a challenge to detect early lung cancers and precancerous lesions using conventional white light bronchoscopy (WLB). It has been reported that only 29% of carcinoma in situ and 69% of microinvasive tumors are identified endoscopically on WLB, even by experienced bronchoscopists.4 Autofluorescence bronchoscopy is an advanced technology that exploits the autofluorescent nature of the bronchial mucosa to detect tiny and subtle superficial lesions. The difference in superficial autofluorescence intensity between normal tissue and tumor tissue was reported as far back as 1965,5 and in the last two or three decades, autofluorescence bronchoscopy has been used to detect early-stage carcinomatous lesions through endoscopy workstations.6,7 Thus far, several types of medical autofluorescence bronchoscopy systems have been designed and developed, including the LIFE (Xillix Technologies, Richmond, Canada), D-Light (Karl Storz, Tuttlingen, Germany), and SAFE (Pentax, Tokyo, Japan) systems. In comparison with conventional WLB, these autofluorescence bronchoscopy systems have much higher sensitivity, which is interesting and exciting, but at the same time, the specificity is decreased,8–13 which is obviously a significant drawback.Autofluorescence imaging bronchoscopy (AFI, Olympus Optical Corporation, Tokyo, Japan) is a newly developed autofluorescence bronchoscopy system. Major indications for AFI include: detection of early-stage lung cancers and precancerous lesions in the large central airways in high-risk patients; detecting subtle synchronous malignancy preoperatively in patients with lung cancer; evaluation of extension of lung cancer or evaluation of margins in endoscopically visible tumors; selection of the optimal surgical procedure in patients with early-stage lung cancer; and bronchoscopic surveillance during follow-up of patients after curative surgery for nonsmall cell lung cancer. AFI has been shown to be a far more sensitive method for detecting superficial lung cancers and precancerous lesions compared with WLB (see Figure 1).14–16 However, when AFI is compared with WLB for specificity, the literature yields conflicting results.14–19
Figure 1
(A and B) WLB and AFI findings of adenocarcinoma at the bifurcation of the right middle lobe and lower lobe. (A) WLB image shows nonspecific swelling or thickening, may be treated as normal, and is not generally biopsied. (B) AFI image shows significant magenta color (white arrows indicate bifurcation of the right middle lobe and lower lobe). (C and D) WLB and AFI findings of small cell lung cancer on the wall of the left upper lobe. (C) Nonspecific protruding nodule can be found at the wall of the left upper lobe, may be treated as normal and is not generally biopsied. (D) AFI image shows significant magenta color (white arrows indicate a protruding nodule on the wall of the left upper lobe).
Abbreviations: AFI, autofluorescence imaging bronchoscopy; WLB, white light bronchoscopy.
The AFI system consists of three parts, ie, an autofluorescence video bronchoscope (BF-F260), a video processor unit (Evis Lucera Spectrum, CV-260SL), and a xenon light source. The autofluorescence video bronchoscopic image is presented on a 19-inch LCD monitor (OEV-191). The AFI system incorporates autofluorescence light (blue excitation light produces an autofluorescence signal at 490–700 nm, which is treated as green in the video processor), G′ reflected light (green, treated as red in the video processor) and R′ reflected light (red, treated as blue in the video processor). These three signals are synthesized by the video processor and a composite image is displayed on the LCD monitor.20 The onscreen image shows normal bronchial mucosa as green, ie, the color of general autofluorescence light. Because hemoglobin absorbs a large volume of autofluorescence light and G′ reflected light and little R′ reflected light, areas containing more hemoglobin show decreased autofluorescence (green) and G′ reflected light (red) but not R′ reflected light (blue). Therefore, areas containing more hemoglobin (local bleeding or bronchitis) produce a blue or green-blue color on AFI imaging.14 Loss of normal green autofluorescence can be caused by increased thickness of the epithelial layer and accumulation of cancerous material in superficial local tissue,21 which are the characteristics of early lung cancers and precancerous lesions. As a result, early lung cancers and precancerous lesions produce a magenta color, ie, a combination of red (G′ reflected light) and blue (R′ reflected light), in the AFI image (see Figure 1).Overall, autofluorescence bronchoscopy has much higher sensitivity and decreased specificity in comparison with WLB.22 However, some reports14,17 have demonstrated that AFI, a newly developed specialized type of autofluorescence bronchoscopy, has relatively superior specificity compared with WLB, while others15,16 have shown similar or even lower specificity. To determine if AFI can be used in routine practice to improve the outcome of endoscopic diagnosis of lung cancers and precancerous lesions, we performed a meta-analysis comparing the sensitivity, specificity, and overall diagnostic performance between AFI and WLB, and to our knowledge, ours is the first exclusive meta-analysis of AFI.
Materials and methods
Data source and search strategy
We did a literature search using the PubMed and EMBASE databases to identify studies comparing the diagnostic results of AFI and WLB with regard to detecting lung cancers or precancerous lesions. We used the search terms “autofluorescence bronchoscopy”, “autofluorescence imaging”, and “AFI”. Because autofluorescence bronchoscopy was first introduced in 1991,23,24 the literature published between March 1991 and March 2012 was examined. We then compared sources to exclude duplicate articles, and the search was further augmented by scanning references of identified articles or reviews for additional relevant papers.
Eligibility criteria
Papers were included if they used an AFI system and compared it with a WLB system, used histologic criteria as the golden standard when diagnosing lung cancers and precancerous lesions, and provided sufficient data (including true positives, false positives, false negatives, true negatives) to assess the diagnostic yield of AFI. When multiple publications involving the same study population were identified or study populations overlapped, only the larger study was included, unless the reported outcomes were mutually exclusive.
Data extraction
Data were extracted independently by two of the authors (YW, QW) and the final decision regarding inclusion was made by consensus. Data extracted from eligible studies included authors, study year, mean age and number of study participants, number of biopsies performed, biopsy strategy, and positive histologic standard used. True positive, false positive, false negative, and true negative numbers of AFI and WLB were extracted from the studies and two-dimensional tables were constructed.
Assessment of quality
The quality of each included study was assessed using the Quality Assessment of Diagnostic Accuracy Studies (QUADAS) tool.25 This tool is structured as a list of 14 questions which should be answered with, “yes”, “no”, or “unclear”.
Data synthesis and statistical analysis
Stata version 12.0 (Stata Corporation, College Station, TX, USA) was used for all statistical analyses, in particular the Midas commands. We evaluate publication bias and heterogeneity, pooled sensitivity, specificity, and positive and negative likelihood ratios, and evaluated two global measures of accuracy of the tests, ie, the diagnostic odds ratio and summary receiver operating characteristic.Publication bias is defined as the tendency on the part of investigators to submit and/or editors and reviewers to accept manuscripts based on the direction or strength of the study findings. This definition concentrates on the fact that the most positive and strongest studies are likely to be published, while negative results are less likely to be published in the journals, so publication bias is ubiquitous. Publication bias is a common problem for meta-analyses and cannot be avoided completely. Publication bias is generally described with funnel plots, and a symmetrical funnel plot indicates that publication bias does not exist. Linear regression as a test for funnel plot asymmetry was performed in our meta-analysis.Heterogeneity can be expected in the results of studies of diagnostic test accuracy.26 Heterogeneity of results between studies is generally assessed graphically using forest plots and statistically by the quantity I2 that describes the percentage of total variation across studies which is attributable to heterogeneity rather than chance. A value of 0% indicates no observed heterogeneity, and values greater than 50% are considered to indicate substantial heterogeneity.27Sensitivity is defined as the number of true positives divided by the sum of true positives and false negatives. Specificity is defined as the number of true negatives divided by the sum of true negatives and false positives. The positive likelihood ratio is sensitivity divided by 1 – specificity, and indicates how much the odds of the disease increase when a test is positive. The likelihood ratio can be used to assess the impact of a positive test result on diagnosis for an individual. When the likelihood ratio is higher than 10, we can make an almost definite diagnosis of a certain disease. The negative likelihood ratio is 1 – sensitivity divided by specificity, and indicates how much the odds of the disease decrease when a test is negative. When the likelihood ratio is <0.1, we can almost eliminate a certain disease. The diagnostic odds ratio of a test is the ratio of the positive likelihood divided by the negative likelihood ratio. The diagnostic odds ratio can be put forward as a useful single index of test performance indicating the strength of the association between test results and disease;28 it is difficult to interpret clinically, but useful from the statistical point of view for evaluation of overall test accuracy in meta-analysis.28–30 The summary receiver operating characteristic is also a global tool for measuring the accuracy of tests. The results are described as the area under the curve (AUC) and the Q-point (point on curve where sensitivity equals specificity) obtained from the summary receiver operating characteristic curve.31 A bivariate binomial mixed model was selected to calculate the pooled sensitivity, specificity, positive likelihood ratio, negative likelihood ratio, and diagnostic odds ratio on a per lesion basis, and takes into account variability within and between studies and the dependency between sensitivity and specificity.
Results
Study selection
As seen in Figure 2, 2339 articles were identified by searching the PubMed and EMBASE electronic databases. Of these, 158 were retained for abstract review and then 21 for full text review. A further six full texts were identified by searching the references of these articles. After full text review, six studies were included in the meta-analysis. To avoid language bias, we did not exclude articles not reported in English. However, all the six articles finally included in our study were published in English.
Figure 2
Summary of literature and study selection.
Abbreviations: AFB, autofluorescence bronchoscopy; AFI, autofluorescence imaging bronchoscopy; n, number.
Quality of included studies
The QUADAS tool results are shown in Table 1. Generally speaking, the studies included were ideal, in that they all fulfilled at least 12 of the 14 methodological criteria.
Table 1
quality of studies included using the quality assessment of diagnostic accuracy studies tool
Chiyo et al14
Ueno et al18
Li et al15
Herth et al16
Cetti et al19
Zaric et al17
Spectrum of patients representative of the patients who will receive the test in practice?
Yes
Yes
Yes
Yes
Yes
Yes
Were selection criteria clearly described?
Unclear
Unclear
Unclear
Yes
Yes
Yes
Is the reference standard likely to classify the target condition correctly?
Yes
Yes
Yes
Yes
Yes
Yes
Is the time period between reference standard and index test short enough to prevent change of the target condition between the two tests?
Yes
Yes
Yes
Yes
Yes
Yes
Did all subjects receive verification using a reference standard of diagnosis
Yes
Yes
Yes
Yes
Yes
Yes
Did all subjects receive the same reference standard regardless of the index test result?
Yes
Yes
Yes
Yes
Yes
Yes
Was the reference standard independent of the index test?
Yes
Yes
Yes
Yes
Yes
Yes
Execution of the index test described in sufficient detail to permit replication?
Yes
Yes
Yes
Yes
Yes
Yes
Execution of the reference standard described in sufficient detail to permit replication?
Yes
Yes
Yes
Yes
Yes
Yes
Were the index test results interpreted without knowledge of the results of the reference standard?
Yes
Yes
Yes
Yes
Yes
Yes
Were the reference standard results interpreted without knowledge of the results of the index test?
Yes
Yes
Yes
Yes
Yes
Yes
Were the same clinical data available when test results were interpreted as would be available when the test is used in practice?
Yes
Yes
Yes
Yes
Yes
Yes
Were uninterpretable/intermediate test results reported?
Unclear
Unclear
Unclear
Unclear
Unclear
Unclear
Were withdrawals from the study explained?
Yes
Yes
Yes
Yes
Yes
Yes
Characteristics of included studies
The characteristics of each study are listed in Table 2. A total of 1170 biopsy specimens from 409 patients examined with the AFI system and WLB system were included in the analysis. In all of the six articles, each sample was imaged with both systems. The mean age of the patients ranged from 56 to 69 years across the studies.
Table 2
Features of studies included in meta-analysis
Reference
Mean age
Patients (n)
Biopsies (n)
Control
Positive histologic standard
WLB
AFB
TP
FP
FN
TN
TP
FP
FN
TN
Chiyo et al14
66.6
32
62
No
MI
18
15
14
15
26
5
6
25
Ueno et al18
69
31
64
Yes
SEV
14
4
5
41
18
13
1
32
Li et al15
61.5
136
241
Yes
SEV
50
27
26
138
72
71
4
94
Herth et al16
56.2
57
98
No
MOC
3
5
14
35
11
24
6
16
Cetti et al19
ND
49
81
Yes
MO
15
5
1
60
15
53
1
11
Zaric et al17
56
104
624
Yes
Ca
242
46
70
266
286
23
26
289
Abbreviations: TP, true positive; FP, false positive; FN, false negative; TN, true negative; ND, not described; MI, mild dysplasia or worse; MO, moderate dysplasia or worse; SEV, severe dysplasia or worse; MOC, moderate to severe dysplasia or carcinoma in situ; Ca, cancer; AFB, autofluorescence bronchoscopy; WLB, white light bronchoscopy.
Publication bias
Linear regression testing of funnel plot asymmetry (Figure 3 and Table 3) showed that the funnel plot was asymmetric, indicating a certain degree of publication bias (P = 0.029). An asymmetric funnel plot would suggest that additional small studies may have been conducted but were not published because of unfavorable results.
Figure 3
Linear regression test of funnel plot asymmetry.
Table 3
Statistical results for small study effects or publication bias
Parameter
AFI
WLB
Coefficient
SE
t
P > |t|
Coefficient
SE
t
P > |t|
Bias
−30.0754
9.0667
−3.32
0.029
−7.3928
12.7558
−0.58
0.593
Intercept
5.9833
0.6866
8.71
0.001
3.1850
0.9658
3.30
0.030
Abbreviations: AFI, autofluorescence imaging bronchoscopy; WLB, white light bronchoscopy; SE, standard error of the mean.
Indices of diagnostic accuracy
The pooled sensitivity and specificity of AFI was 0.89 (95% confidence interval [CI] 0.81–0.94) and 0.64 (95% CI 0.37–0.84, Figure 4); the corresponding values for WLB were 0.67 (95% CI 0.46–0.83) and 0.84 (95% CI 0.74–0.91, Figure 5). The pooled positive and negative likelihood ratios were 2.5 (95% CI 1.21–4.97) and 0.17 (95% CI 0.08–0.36) for AFI; the corresponding values for WLB were 4.3 (95% CI 2.13–8.52) and 0.39 (95% CI 0.21–0.73). The pooled positive likelihood ratio for both AFI and WLB was not higher than 10 and the pooled negative likelihood ratio for both AFI and WLB was not lower than 0.1. The diagnostic odds ratio and AUC represent the overall diagnostic performance of a test. In our meta-analysis, the diagnostic odds ratio for AFI and WLB was 14.5 (95% CI 3.76–55.63) and 10.9 (95% CI 3.12–38.21), respectively, and the AUC of AFI (Figure 6A) and WLB (Figure 6B) was 0.89 (95% CI 0.86–0.92) and 0.85 (95% CI 0.81–0.88), respectively, indicating slightly better diagnostic performance for AFI.
Figure 4
Forest plot of sensitivity and specificity of AFI for detecting lung cancers and precancerous lesions.
Abbreviation: AFI, autofluorescence imaging bronchoscopy; CI, confidence interval; df, degree of freedom; Q, cochran’s heterogeneity statistic.
Figure 5
Forest plot of sensitivity and specificity of WLB for detecting lung cancers and precancerous lesions.
Abbreviation: WLB, white light bronchoscopy; CI, confidence interval; df, degree of freedom; Q, Cochran’s heterogeneity statistic.
Figure 6
Summary receiver operating characteristic curves for (A) AFI and (B) WLB.
Abbreviations: AFI, autofluorescence imaging bronchoscopy; WLB, white light bronchoscopy; SROC, summary receiver operating characteristic; SENS, sensitivity; SPEC, specificity; AUC, area under the curve.
Evaluation of heterogeneity
For AFI, the studies had substantial heterogeneity, with I2 values of 81.8% and 97.4% for sensitivity and specificity, respectively (Figure 4); there was also substantial heterogeneity for WLB, with I2 values of 87.1% and 83.9% for sensitivity and specificity, respectively (Figure 5).
Discussion
Survival rates in patients with lung cancer are strongly associated with stage of disease at the time of diagnosis.32 The five-year survival rate for patients with stage IA disease is about 73%; however, for those with stage II–IV, it ranges from 46% down to as low as 9%. Currently, only 16% of lung cancers are diagnosed when the disease is localized, and fewer at stage 0, resulting in a general five-year survival rate of only about 15%.33 As a newly developed technology, autofluorescence bronchoscopy shines some light, albeit not strongly, on this problem. It allows rapid scanning of large areas of the bronchial surface for subtle abnormalities that are not visible on WLB.34 However, in comparison with conventional WLB, some autofluorescence bronchoscopy systems have increased sensitivity but significantly reduced specificity.8–13 As an example, in a meta-analysis of autofluorescence bronchoscopy systems, Sun et al35 compared the accuracy of WLB ± autofluorescence bronchoscopy in the diagnosis of lung cancers. It should be noted that the AFI system was not included in these autofluorescence bronchoscopy systems. Twenty-one studies were collected for final analysis. The pooled relative sensitivity of WLB ± autofluorescence bronchoscopy in detecting intraepithelial neoplasia and invasive cancer was 2.04 and 1.15. The pooled relative specificity of WLB ± autofluorescence bronchoscopy was as low as 0.65. WLB combined with autofluorescence bronchoscopy seems to improve the sensitivity for detection of intraepithelial neoplasia significantly. However, this advantage over WLB alone seems much less in detecting invasive lung cancers. For example, Chen et al22 performed a meta-analysis and also concluded that autofluorescence bronchoscopy had significantly higher sensitivity and lower specificity in comparison with WLB. However, only two of the 15 studies included in their study actually included the AFI system. Including AFI with WLB when comparing autofluorescence bronchoscopy systems as a whole seems reasonable for obtaining relative sensitivity, because higher sensitivity is a characteristic common to all autofluorescence bronchoscopy systems, including AFI, LIFE, D-Light, and SAFE. However, taking the relative specificity of autofluorescence bronchoscopy systems as a whole is not appropriate. Some reports14,17 have demonstrated that AFI may have higher specificity in comparison with other types of autofluorescence bronchoscopy. AFI displays a light-green image for normal epithelium, blue to green-blue for bleeding and inflammation, and magenta for dysplasia or cancerous lesions. This ability to distinguish dysplasia and preinvasive malignant lesions from bronchitis, hyperplasia, bleeding, and other benign lesions appears to represent a great advantage in technology which may make the specificity of AFI better. However, generally speaking, the specificity of AFI for detecting lung cancers and preneoplastic lesions remains controversial. Some reports14,17 have demonstrated that AFI has superior specificity compared with WLB, while others15,16 have shown lower specificity. It was for this reason that we felt it necessary to undertake the present meta-analysis comparing the sensitivity, specificity, and overall diagnostic performance of AFI and WLB in the diagnosis of lung cancers and precancerous lesions.In the present meta-analysis of six studies, the pooled sensitivity of AFI (0.89) was significantly superior to that of WLB (0.67), while the pooled specificity of AFI (0.64) was lower than that of WLB (0.84). The diagnostic odds ratio for AFI and WLB was 14.5 and 10.9 and the AUC of AFI and WLB was 0.89 and 0.85, respectively. The diagnostic odds ratio and AUC represent the overall diagnostic performance of the studies included. Therefore, the pooled results demonstrate that the sensitivity of AFI is higher than for WLB while the specificity of AFI is lower, and indicate slightly better overall diagnostic performance for AFI. The pooled positive likelihood ratio for both AFI and WLB was not higher than 10, and the pooled negative likelihood ratio for both AFI and WLB was not lower than 0.1. For both AFI and WLB, a positive endoscopic finding cannot make a definite diagnosis of lung cancer or a precancerous lesion, while a negative endoscopic finding cannot exclude lung cancer or a precancerous lesion.The quality of WLB images has improved with the advent of video bronchoscopy, but is still plagued by limitations in diagnosing early-stage mucosal lesions. More specifically, because WLB is ineffective for detecting lesions smaller than 5 mm in diameter, the surface mucosa typically appears relatively normal on WLB when early-stage mucosal lesions are only a few cell layers thick. AFI is a new technology exploiting the autofluorescent nature of the bronchial mucosa to detect tiny and superficial lesions. The AFI can identify abnormal mucosa in different colors. Therefore, compared with WLB, AFI may be better for identifying early-stage mucosal lesions, thereby improving the diagnostic sensitivity of bronchoscopy for early-stage cancer and precancerous lesions.24,36 The increased sensitivity of AFI bronchoscopy can further improve the ability of trainees to make a correct diagnosis, and suggests an important role for AFI in the training of bronchoscopists at teaching hospitals.19Although AFI can detect early-stage lung cancers and precancerous lesions on the bronchial surface mucosa in the central airways with quite high sensitivity, its relatively low specificity may be a problem. Increased blood content or vessel growth and increased thickening of the bronchial epithelium are thought to be involved in areas of abnormal autofluorescence irrespective of malignancy. Increased blood content and thickening of the epithelium also occurs in other chronic inflammatory bronchial lesions arising from chronic bronchitis, bronchial asthma, or other benign morbidities, so incorrect diagnoses may be made when these “abnormal” AFI images are detected, and may account for why the false positive rate for AFI is high and the specificity is relatively lower than for WLB. Another explanation is that lesions identified under autofluorescence bronchoscopy and confirmed as false positives may carry molecular genetic lesions associated with malignancy, despite a normal histological appearance.37,38 The relative lower specificity of AFI may lead to more biopsies being taken, at greater cost to the health care system. However, the pattern of high sensitivity and low specificity of AFI is similar to that of some other imaging modalities, such as CT, which is generally used in the diagnosis of small malignant nodules, or other cancer screening tools, such as mammography and occult blood testing.39In our opinion, the combined use of AFI and WLB may be superior to WLB alone, and could result in better diagnostic yields with higher sensitivity and without significantly compromising specificity.40 In clinical practice, AFI and WLB are incorporated into one endoscopic system, making it convenient to switch between these two modes of examination by just pressing a button. Systematic training in AFI may be necessary for endoscopists because not only is subjective assessment of AFI images needed but also objective color tone analysis is important for distinguishing lung cancers and precancerous lesions from bronchitis to improve specificity.14Although heterogeneity among studies of diagnostic accuracy is an inevitable problem for meta-analysis,26 we discuss here the possible sources of heterogeneity as limitations of this review.
Inclusion criteria
Chiyo et al14 and Ueno et al18 included patients with suspected or known lung cancers in their studies of the efficacy of AFI in the diagnosis of cancers and precancerous lesions. Li et al15 did a prospective clinical study assessing the clinical value of AFI in an examination of the airways and recruited patients who had undergone both AFI and WLB for general diagnosis and for postoperative follow-up. Herth et al16 compared the yields of AFI and WLB in the diagnosis of intraepithelial neoplasia, and included patients who were at high risk for lung cancer but had no clinical signs raising suspicion for central airway malignancy. A study by Cetti et al19 evaluated the diagnostic performance of AFI in patients with suspected lung cancer, and excluded those with an established diagnosis of lung cancer involving the central airways. A study by Zaric et al17 evaluated the performance of AFI in the assessment of tumor extent (margins) and recruited patients with suspected lung cancer but excluded those with suspected or known lung cancer metastases. Although these authors recruited patients for different purposes using different inclusion criteria, the common key point of these studies is that they compared the sensitivity and specificity of AFI and WLB in detecting lung cancers and precancerous lesions, so we believe that it was acceptable to collect them together for a meta-analysis.
Endoscopic classification and positive histological standard
In study by Chiyo et al,14 a magenta image was classified as positive for malignant or preinvasive lesions, whereas biopsies with a histological grading of dysplasia or worse were considered positive. Ueno et al18 evaluated their AFI findings using a four-point scale and evaluated WLB finding using a three-point scale. AFI-III (images appeared magenta) and WLB-III (changes suggesting severe dysplasia and cancer) were considered as positive findings endoscopically; biopsies with severe dysplasia or worse were considered as positive histologically. Li et al15 classified endoscopic lesions into three grades in AFI or WLB. For AFI, AFI-II (images appeared pink or brown) and AFI-III (images appeared classic magenta) were considered as abnormal findings and for WLB, WLB-II (including hyperemia, edema, thickness, color changes, and regression or buckling of mucosal vessels) and WLB-III (including granulation of the bronchial mucosa or visible neoplasm) were considered as abnormal findings. In this study, biopsies with a histological grading of severe dysplasia or worse were considered positive. Herth et al16 evaluated AFI and WLB findings using a four-point scale, ie, normal, abnormal but not suspicious, suspicious for intraepithelial neoplasia, or tumor. Any lesion that met the criteria of “suspicious for intraepithelial malignancy” was classified as a positive finding on bronchoscopy for preinvasive neoplasia; biopsies with a histological grading of moderate to severe dysplasia or carcinoma in situ were considered positive for intraepithelial neoplasia. Cetti et al19 graded the appearance of the mucosa on bronchoscopy as normal, nonspecific abnormality or inflammation suspicious for cancer, or a definite tumor. Lesions meeting the criteria of “suspicious of cancer” or “definite tumor” were classified as a positive finding on bronchoscopy, and biopsies with a histological grading of moderate dysplasia or worse were considered positive. In the study by Zaric et al,17 red-brownish or magenta-colored areas were defined as pathologic areas by AFI, and biopsies with a histologic grading of carcinoma were considered as positive. As the authors of a meta-analysis, we accepted the endoscopic classification and positive histological standard made by the authors of the original articles because they had their own appropriate considerations regarding research purposes and were best placed to make decisions when choosing their positive criteria.
Biopsy strategy
In five of the six studies14–16,18,19 included in our meta-analysis, endobronchial biopsies were taken from the cores of areas that appeared suspicious under either AFI or WLB mode. The remaining study17 evaluated the value of AFI versus WLB in assessing tumor extension in the central airways for surgical decision-making, and biopsies were then taken from the margins of visualized pathologically changed mucosa. However, the key objective of all these studies was still to compare the sensitivity and specificity of AFI and WLB in detecting lung cancers and precancerous lesions.
Conclusion
Our meta-analysis indicates the sensitivity of AFI is higher than that of WLB and that the specificity of AFI is lower than that of WLB in detecting lung cancers or precancerous lesions. The overall diagnostic performance of AFI is slightly better than that of WLB. Combined use of AFI and WLB is recommended and may result in better diagnostic yields with higher sensitivity and without compromising specificity to a significant extent.40 The additional sensitivity provided by AFI may improve the ability of trainees to make a correct diagnosis during training19 and the false positive rate of AFI may be decreased by objective color tone analysis.14 For all these reasons, AFI should find its place in routine bronchoscopic examination and may improve the outcome of endoscopic diagnosis of lung cancers and precancerous lesions.
Authors: Afina S Glas; Jeroen G Lijmer; Martin H Prins; Gouke J Bonsel; Patrick M M Bossuyt Journal: J Clin Epidemiol Date: 2003-11 Impact factor: 6.437
Authors: K Häussinger; H Becker; F Stanzel; A Kreuzer; B Schmidt; J Strausz; S Cavaliere; F Herth; M Kohlhäufl; K-M Müller; R-M Huber; U Pichlmeier; Ch T Bolliger Journal: Thorax Date: 2005-06 Impact factor: 9.139
Authors: C I Henschke; D I McCauley; D F Yankelevitz; D P Naidich; G McGuinness; O S Miettinen; D M Libby; M W Pasmantier; J Koizumi; N K Altorki; J P Smith Journal: Lancet Date: 1999-07-10 Impact factor: 79.321
Authors: R L Keith; Y E Miller; R M Gemmill; H A Drabkin; E C Dempsey; T C Kennedy; S Prindiville; W A Franklin Journal: Clin Cancer Res Date: 2000-05 Impact factor: 12.531
Authors: Sean Blandin Knight; Phil A Crosbie; Haval Balata; Jakub Chudziak; Tracy Hussell; Caroline Dive Journal: Open Biol Date: 2017-09 Impact factor: 6.411