| Literature DB >> 23555553 |
Zhenqiu Liu1, Dechang Chen, Li Sheng, Amy Y Liu.
Abstract
The amount of metagenomic data is growing rapidly while the computational methods for metagenome analysis are still in their infancy. It is important to develop novel statistical learning tools for the prediction of associations between bacterial communities and disease phenotypes and for the detection of differentially abundant features. In this study, we presented a novel statistical learning method for simultaneous association prediction and feature selection with metagenomic samples from two or multiple treatment populations on the basis of count data. We developed a linear programming based support vector machine with L(1) and joint L(1,∞) penalties for binary and multiclass classifications with metagenomic count data (metalinprog). We evaluated the performance of our method on several real and simulation datasets. The proposed method can simultaneously identify features and predict classes with the metagenomic count data.Entities:
Mesh:
Year: 2013 PMID: 23555553 PMCID: PMC3608598 DOI: 10.1371/journal.pone.0053253
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Frequencies of Correctly Identified features with Different numbers of classes.
| Features | 2-Classes | 4-Classes |
| 1 | 99 | 96 |
| 2 | 100 | 97 |
| 3 | 97 | 100 |
| 4 | 100 | 100 |
| 5 | 100 | 99 |
| Av. # of Features | 4.9 | 4.86 |
Figure 1Test ROC curves and AUCs for simulation data: Left: 2-Classes; Right: 4-Classes.
Identified OTUs for hand surface bacteria data.
| Firmicutes;“Bacilli”; “Lactobacillales”;Lactobacillaceae;Lactobacillus (100) |
| Proteobacteria;Gammaproteobacteria;Pseudomonadaceae;Pseudomonas(83) |
| Firmicutes; “Bacilli”; “Lactobacillales”;Streptococcaceae;Streptococcus (100) |
| Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae;Neisseria (78) |
| Firmicutes; “Bacilli”;Bacillales; “Listeriaceae”;Brochothrix (76) |
| Firmicutes; “Bacilli”; “Lactobacillales”;Streptococcaceae;Lactococcus (100) |
| Firmicutes; “Bacilli”;Bacillales; “Staphylococcaceae”;Staphylococcus (100) |
| Proteobacteria;Betaproteobacteria;Burkholderiales;Comamonadaceae;Acidovorax (92) |
| Proteobacteria;Betaproteobacteria;Burkholderiales;Incertae sedis 5 (100) |
Identified OTUs for keyboard data.
| ID | OTU Name |
| 1 | Bacteria;Firmicutes;Bacilli;Lactobacillales;Carnobacteriaceae (100) |
| 2 | Bacteria;Proteobacteria;Betaproteobacteria;Neisseriales;Neisseriaceae (88) |
| 3 | Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Propionibacteriaceae (100) |
| 4 | Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Corynebacteriaceae (100) |
| 5 | Bacteria;Actinobacteria;Actinobacteria;Actinomycetales;Micrococcaceae (100) |
| 6 | Bacteria;Firmicutes;Bacilli;Bacillales;Staphylococcaceae (100) |
| 7 | Bacteria;Firmicutes;Bacilli;Lactobacillales;Streptococcaceae (100) |
| 8 | Bacteria;Cyanobacteria;Chloroplast;Streptophyta (100) |
Figure 2Relative abundances of the identified features for three healthy individuals: Left: Individual 1, Middle: 2, Right: 3.