| Literature DB >> 18989714 |
Juan M García-Gómez1, Jan Luts, Margarida Julià-Sapé, Patrick Krooshof, Salvador Tortajada, Javier Vicente Robledo, Willem Melssen, Elies Fuster-García, Iván Olier, Geert Postma, Daniel Monleón, Angel Moreno-Torres, Jesús Pujol, Ana-Paula Candiota, M Carmen Martínez-Bisbal, Johan Suykens, Lutgarde Buydens, Bernardo Celda, Sabine Van Huffel, Carles Arús, Montserrat Robles.
Abstract
JUSTIFICATION: Automatic brain tumor classification by MRS has been under development for more than a decade. Nonetheless, to our knowledge, there are no published evaluations of predictive models with unseen cases that are subsequently acquired in different centers. The multicenter eTUMOUR project (2004-2009), which builds upon previous expertise from the INTERPRET project (2000-2002) has allowed such an evaluation to take place.Entities:
Mesh:
Substances:
Year: 2008 PMID: 18989714 PMCID: PMC2797843 DOI: 10.1007/s10334-008-0146-y
Source DB: PubMed Journal: MAGMA ISSN: 0968-5243 Impact factor: 2.310
Number of training (INTERPRET) and test (eTUMOUR) cases per class used in the study
| Class | INTERPRET | eTUMOUR |
|---|---|---|
| GBM | 84 | 28 |
| MEN | 57 | 17 |
| MET | 37 | 32 |
| LGG | 33 | 20 |
| 211 | 97 |
Short TE 1HMRS data were acquired according to a consensus protocol during the INTERPRET (2000–2002) and eTUMOUR (2004–2009) projects
Breakdown of cases per manufacturer included in the training (INTERPRET) and test (eTUMOUR) datasets
| Manufacturer | INTERPRET (%) | eTumour (%) |
|---|---|---|
| GE | 53.1 | 54.6 |
| Siemens | 6.6 | 12.4 |
| Philips | 40.3 | 33.0 |
Percentage of cases per acquisition center included in the training (INTERPRET) and test (eTUMOUR) datasets
| CENTERS | Training from INTERPRET (%) | Test from eTUMOUR (%) |
|---|---|---|
| UMC Nijmegen | 2.8 | 1.0 |
| St. George’s Hospital | 27.0 | 18.6 |
| Medical University of LODZ | 3.8 | 10.3 |
| FLENI | 1.9 | 6.2 |
| IDI-Bellvitge | 40.3 | |
| Centre de Diag. Pedralbes | 24.2 | |
| Centre de Diag. Pedralbes + IAT | 28.9 | |
| IDI-Badalona | 17.5 | |
| Univ. de Valencia | 16.5 | |
| Hospital Sant Joan de DEU | 1.0 | |
| Cases of project | ||
| exclusive centers (%) | 40.3 | 35.1 |
Last row indicates the percentage of training cases that belong to centers that did not produce eTUMOUR cases, and the percentage of test cases that belong to centers that did not acquired training data for INTERPRET
Best results obtained for the six pairwise classification problems
| Task | id | Features | Classif | CV | IT | ||
|---|---|---|---|---|---|---|---|
| ERR | BER | ERR | BER | ||||
| GBM versus MEN | 1.6 | 190 | MLP | 0.06 | 0.07 | 0.07 | 0.09 |
| GBM versus MET | 2.13 | PI | LDA | 0.33 | 0.40 | 0.22 | 0.21 |
| GBM versus LGG | 3.16 | PI | LS-SVM | 0.12 | 0.18 | 0.08 | 0.09 |
| MEN versus MET | 4.21 | PCA | MLP | 0.05 | 0.05 | 0.06 | 0.07 |
| MEN versus LGG | 5.10 | ICA | LS-SVM | 0.08 | 0.09 | 0.08 | 0.08 |
| MET versus LGG | 6.13, 21, 25–26 | PI | LDA/FLDA/MLP/LS-SVM | [0.01, 0.04] | [0.01, 0.04] | 0.06 | 0.07 |
The ERR and BER estimation based on CV over the INTERPRET data and based on the eTUMOUR IT set are shown. The columns of the table are: task: classification problem defined by the classes to discriminate by the classifiers; id, identification of the classifier; features: acronym of the feature extraction method, classif, acronym of the classification method; CV, results estimated by means of a tenfold CV in the INTERPRET database; IT, results estimated by means of the independent test, with the INTERPRET database as training and the eTUMOUR dataset as test; ERR, error rate; and BER, balanced error rate. ▭interval within every result falls
Fig. 1Box-whisker plots of the performance for each problem in the eTUMOUR dataset (based on the detailed list of results included in Sect. 1 of the on-line Supplementary Material). Performance is measured in BER. The box indicates the region between the lower (X 0.25) and the upper (X 0.75) quartiles. The horizontal line inside the box indicates the median of the distribution, and the vertical lines (the “whiskers”) extend to at most 1.5 times the box width. Any outlier of the distribution is displayed with a cross
Fig. 2Scatter plot of the performance measured in BER estimated by the IT set consisting of new eTUMOUR cases and the BER estimated by the CV using the INTERPRET cases. BER(IT) = BER(CV) is represented by the solid-blue line and the trend of the (BER(CV) < 0.2, BER(IT) < 0.3) region is indicated by the black-dashed line
Fig. 3Box-whisker plots of the performance for each feature extraction method in the eTUMOUR dataset. Performance is measured in BER and the box-whisker characteristics are the same as in Fig. 1
Fig. 4Box-whisker plots of the performance for each classification method in the eTUMOUR dataset. Performance is measured in BER and the box-whisker characteristics are the same as in Fig. 1
Fig. 5Unimodal prototypes of the short TE spectra for the four tumour groups of the training and test datasets. Each prototype is represented by the unsmoothed mean function and the mean ± SD function. The view is zoomed in the [0.5, 4.1] ppm region used in our experiments
Fig. 6Potential outliers (1/2) detected as a consequence of this study. Case numbering corresponds to eTUMOUR database (http://www.etumour. net) entries. For each case, the reference image and voxel location is shown on the left, and the region of interest of the real part of the short TE spectrum is shown on the right. For an easier visualization of the spectrum, vertical dashed lines indicate the position of the main resonances: Cho (3.21 ppm), Cr (3.02), NAA (2.01 ppm), L1 (1.29 ppm), L2 (0.92 ppm)
Fig. 7Potential outliers (2/2) detected as a consequence of this study. Figure characteristics are the same as in Fig. 6