| Literature DB >> 28768897 |
Elisa Kasbohm1, Sina Fischer, Anne Küntzel, Peter Oertel, Andreas Bergmann, Phillip Trefz, Wolfram Miekisch, Jochen K Schubert, Petra Reinhold, Mario Ziller, Andreas Fröhlich, Volkmar Liebscher, Heike Köhler.
Abstract
Modern statistical methods which were developed for pattern recognition are increasingly being used for data analysis in studies on emissions of volatile organic compounds (VOCs). With the detection of disease-related VOC profiles, novel non-invasive diagnostic tools could be developed for clinical applications. However, it is important to bear in mind that not all statistical methods are equally suitable for the investigation of VOC profiles. In particular, univariate methods are not able to discover VOC patterns as they consider each compound separately. The present study demonstrates this fact in practice. Using VOC samples from a controlled animal study on paratuberculosis, the random forest classification method was applied for pattern recognition and disease prediction. This strategy was compared with a prediction approach based on single compounds. Both methods were framed within a cross-validation procedure. A comparison of both strategies based on these VOC data reveals that random forests achieves higher sensitivities and specificities than predictions based on single compounds. Therefore, it will most likely be more fruitful to further investigate VOC patterns instead of single biomarkers for paratuberculosis. All methods used are thoroughly explained to aid the transfer to other data analyses.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28768897 DOI: 10.1088/1752-7163/aa83bb
Source DB: PubMed Journal: J Breath Res ISSN: 1752-7155 Impact factor: 3.262