| Literature DB >> 31208050 |
Xue Zhang1, Yang Yang2, Yalan Wang3, Qi Fan4.
Abstract
This paper proposes a sensitive, sample preparation-free, rapid, and low-cost method for the detection of the B-rapidly accelerated fibrosarcoma (BRAF) gene mutation involving a substitution of valine to glutamic acid at codon 600 (V600E) in colorectal cancer (CRC) by near-infrared (NIR) spectroscopy in conjunction with counter propagation artificial neural network (CP-ANN). The NIR spectral data from 104 paraffin-embedded CRC tissue samples consisting of an equal number of the BRAF V600E mutant and wild-type ones calibrated and validated the CP-ANN model. As a result, the CP-ANN model had the classification accuracy of calibration (CAC) 98.0%, cross-validation (CACV) 95.0% and validation (CAV) 94.4%. When used to detect the BRAF V600E mutation in CRC, the model showed a diagnostic sensitivity of 100.0%, a diagnostic specificity of 87.5%, and a diagnostic accuracy of 93.8%. Moreover, this method was proven to distinguish the BRAF V600E mutant from the wild type based on intrinsic differences by using a total of 312 CRC tissue samples paraffin-embedded, deparaffinized, and stained. The novel method can be used for the auxiliary diagnosis of the BRAF V600E mutation in CRC. This work can expand the application of NIR spectroscopy in the auxiliary diagnosis of gene mutation in human cancer.Entities:
Keywords: BRAF V600E mutation; auxiliary diagnosis; colorectal cancer; counter propagation artificial neural network; deparaffinized; detection; near-infrared spectroscopy; paraffin-embedded; stained; tissue
Mesh:
Substances:
Year: 2019 PMID: 31208050 PMCID: PMC6631977 DOI: 10.3390/molecules24122238
Source DB: PubMed Journal: Molecules ISSN: 1420-3049 Impact factor: 4.411
Figure 1The structural formulas for valine (a) and glutamic acid (b).
The numbers of models calibrated and validated using 312 colorectal cancer (CRC) tissue samples.
| Model Number | Class of Samples | Number of Calibration Samples | Number of Validation Samples | ||
|---|---|---|---|---|---|
| Mutant | Wild-type | Mutant | Wild-type | ||
| 1 | Class 1 | 40 | 40 | 12 | 12 |
| 2 | Class 2 | 40 | 40 | 12 | 12 |
| 3 | Class 3 | 40 | 40 | 12 | 12 |
| 4 | Class 2&1 | 20&20 | 20&20 | NA | NA |
| 5 | Class 2&3 | 20&20 | 20&20 | NA | NA |
Note: NA for not available.
Figure 2Mean near-infrared (NIR) transflectance spectra for colorectal cancer (CRC) tissue sections. Red, light red, and dark red represent, respectively, the mutant samples of Class 1, Class 2, and Class 3. Blue, light blue, and dark blue represent, respectively, the wild-type samples of Class 1, Class 2, and Class 3.
Vital preprocessing strategies, spectral subranges, numbers of principal components (PCs), numbers of neurons on each side, and corresponding model performances of the counter propagation artificial neural network (CP-ANN) models built respectively using NIR data for Class 1, Class 2, Class 3, Class 2&1, and Class 2&3 samples.
| Model Number | Preprocessing | Spectral Subrange (cm−1) | Number of PCs/ | Number of Neurons on Each Side | Model Performances | ||
|---|---|---|---|---|---|---|---|
| CAC (%) | CACV (%) | CAV (%) | |||||
|
|
|
|
|
|
|
|
|
| 1.1 | MSC + MC | 9000–6800, 6500–4000 | 6/99.9 | 12 | 97.0 | 93.0 | 90.3 |
| 1.2 | SNV + MC | 9000–6800, 6500–4000 | 6/99.9 | 12 | 97.0 | 94.0 | 81.9 |
| 1.3 | FD + MC | 9000–6800, 6500–4000 | 6/98.8 | 12 | 93.0 | 86.0 | 88.9 |
| 1.4 | SD + MC | 9000–6800, 6500–4000 | 6/95.7 | 12 | 89.0 | 71.0 | 73.6 |
| 1.5 | SGS + MC | 9000–6800, 6500–4000 | 6/100.0 | 12 | 98.0 | 94.0 | 90.3 |
| 1.6 | SGS + FD + MC | 9000–6800, 6500–4000 | 6/99.1 | 12 | 94.0 | 88.0 | 90.3 |
| 1.7 | NDS + FD + MC | 9000–6800, 6500–4000 | 3/100.0 | 12 | 92.0 | 85.0 | 87.5 |
| 1.8 | MSC + SD + MC | 9000–6800, 6500–4000 | 6/ 96.0 | 12 | 90.0 | 74.0 | 77.8 |
| 1.9 | SNV + NDS + FD + MC | 9000–6800, 6500–4000 | 6/100.0 | 12 | 95.0 | 88.0 | 90.3 |
| 1.10 | MC | 9000–4000 | 6/100.0 | 12 | 98.0 | 94.0 | 91.7 |
| 1.11 | MC | 9000–6800, 6500–4000 | 6/100.0 | 10 | 97.0 | 94.0 | 88.9 |
| 1.12 | MC | 9000–6800, 6500–4000 | 6/100.0 | 15 | 98.0 | 96.0 | 88.9 |
|
|
|
|
|
|
|
|
|
| 2.1 | MSC + MC | 9000–6800, 6500–4000 | 6/100.0 | 12 | 94.0 | 85.0 | 79.2 |
| 2.2 | SNV + MC | 9000–6800, 6500–4000 | 6/ 99.9 | 12 | 89.0 | 83.0 | 83.3 |
| 2.3 | FD + MC | 9000–6800, 6500–4000 | 6/97.2 | 12 | 90.0 | 82.0 | 86.1 |
| 2.4 | SD + MC | 9000–6800, 6500–4000 | 20/84.6 | 12 | NA | NA | NA |
| 2.5 | SGS + MC | 9000–6800, 6500–4000 | 6/100.0 | 12 | 96.0 | 94.0 | 90.3 |
| 2.6 | SGS + FD + MC | 9000–6800, 6500–4000 | 6/97.8 | 12 | 92.0 | 88.0 | 81.9 |
| 2.7 | NDS + FD + MC | 9000–6800, 6500–4000 | 2/100.0 | 12 | 88.0 | 80.0 | 79.2 |
| 2.8 | MSC + SD + MC | 9000–6800, 6500–4000 | 20/80.6 | 12 | NA | NA | NA |
| 2.9 | SNV + NDS + FD + MC | 9000–6800, 6500–4000 | 3/100.0 | 12 | 90.0 | 85.0 | 87.5 |
| 2.10 | MC | 9000–4000 | 6/100.0 | 12 | 96.0 | 91.0 | 93.1 |
| 2.11 | MC | 9000–6800, 6500–4000 | 6/100.0 | 10 | 96.0 | 90.0 | 87.5 |
| 2.12 | MC | 9000–6800, 6500–4000 | 6/100.0 | 15 | 97.0 | 92.0 | 94.4 |
|
|
|
|
|
|
|
|
|
| 3.1 | MSC + MC | 9000–6800, 6500–4000 | 5/99.9 | 12 | 86.0 | 71.0 | 66.7 |
| 3.2 | SNV + MC | 9000–6800, 6500–4000 | 5/99.9 | 12 | 85.0 | 72.0 | 68.1 |
| 3.3 | FD + MC | 9000–6800, 6500–4000 | 13/85.5 | 12 | 90.0 | 77.0 | 79.2 |
| 3.4 | SD + MC | 9000–6800, 6500–4000 | 20/75.0 | 12 | NA | NA | NA |
| 3.5 | SGS + MC | 9000–6800, 6500–4000 | 5/100.0 | 12 | 93.0 | 89.0 | 90.3 |
| 3.6 | SGS + FD + MC | 9000–6800, 6500–4000 | 10/ 85.6 | 12 | 88.0 | 79.0 | 76.4 |
| 3.7 | NDS + FD + MC | 9000–6800, 6500–4000 | 2/100.0 | 12 | 90.0 | 82.0 | 77.8 |
| 3.8 | MSC + SD + MC | 9000–6800, 6500–4000 | 20/74.5 | 12 | NA | NA | NA |
| 3.9 | SNV + NDS + FD + MC | 9000–6800, 6500–4000 | 4/100.0 | 12 | 87.0 | 65.0 | 72.2 |
| 3.10 | MC | 9000–4000 | 5/100.0 | 12 | 95.0 | 88.0 | 87.5 |
| 3.11 | MC | 9000–6800, 6500–4000 | 5/100.0 | 10 | 93.0 | 89.0 | 86.1 |
| 3.12 | MC | 9000–6800, 6500–4000 | 5/100.0 | 15 | 95.0 | 89.0 | 88.9 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
Notes: MC for mean centering; MSC for multiplicative scatter correction; SNV for standard normal variate; FD for first derivative; SD for second derivative; SGS for Savitzky-Golay smoothing; NDS for Norris derivative smoothing; PC for principal component; CAC, CACV and CAV respectively for the classification accuracy of calibration, cross-validation and validation; NA for not available.
Figure 3The differences between the mean spectra for the mutant and wild-type samples. The full, long dashed, and short dashed lines represent respectively Class 1, Class 2, and Class 3 samples.
Figure 4Projection maps for the 12 × 12 CP-ANN models: (a) Model 1; (b) Model 2; and (c) Model 3. The uppercase letter “M” and the lowercase letter “m” for the mutant samples, respectively, in calibration and validation; “W” and “w” for the wild-type samples, respectively, in calibration and validation; “○” for the samples assigned incorrectly; the gray region for mutant; the white region for wild type.
The diagnostic performances of five CP-ANN models built sequentially using an equal number of Class 1, Class 2, Class 3, Class 2&1, and Class 2&3 samples.
| Model Number | Diagnostic Performances | ||
|---|---|---|---|
| Sensitivity (%) | Specificity (%) | Accuracy (%) | |
| 1 |
| 87.5 | 93.8 |
| 2 |
| 95.0 | 97.5 |
| 3 |
| 82.5 | 91.3 |
| 4 |
| 92.5 | 96.3 |
| 5 |
| 85.0 | 92.5 |