| Literature DB >> 25171332 |
Zhenjiang Xu1, Daniel Malmer1, Morgan G I Langille2, Samuel F Way3, Rob Knight4.
Abstract
Entities:
Mesh:
Year: 2014 PMID: 25171332 PMCID: PMC4260698 DOI: 10.1038/ismej.2014.157
Source DB: PubMed Journal: ISME J ISSN: 1751-7362 Impact factor: 10.302
Figure 1Accuracy of supervised classification. Random Forest classification model was performed, using caret (Kuhn, 2008) R package, with 5 repeats of 10-fold cross validation on CBH, CS, CSS, FSH, FS (from Knights and following the same naming) and HMP (from Langille ; other data sets with paired shotgun and 16S sequences are too small in sample size for classification) data sets. Kappa statistic is used here as measure of accuracy to assess the agreement between predicted classification and reality. (a) The average accuracies using as input the OTUs clustered at 97% sequence similarity, the functional profile predicted from the OTUs using PICRUSt or the annotation from shotgun metagenomic sequences (for HMP data set only). (b) The pairwise comparisons between the accuracies using those three inputs as predictive features. The error bars indicate 95% confidence interval.