| Literature DB >> 28179014 |
Anna Plantinga1, Xiang Zhan2, Ni Zhao3, Jun Chen4, Robert R Jenq5, Michael C Wu6,7.
Abstract
BACKGROUND: Community-level analysis of the human microbiota has culminated in the discovery of relationships between overall shifts in the microbiota and a wide range of diseases and conditions. However, existing work has primarily focused on analysis of relatively simple dichotomous or quantitative outcomes, for example, disease status or biomarker levels. Recently, there is also considerable interest in the relationship between the microbiota and censored survival outcomes, such as in clinical trials. How to conduct community-level analysis with censored survival outcomes is unclear, since standard dissimilarity-based tests cannot accommodate censored survival times and no alternative methods exist.Entities:
Keywords: Community-level analysis; Distance-based analysis; Kernel machine regression; Survival data
Mesh:
Substances:
Year: 2017 PMID: 28179014 PMCID: PMC5299808 DOI: 10.1186/s40168-017-0239-9
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Empirical type I errors for n=100, 200, or 500
| Number |
|
|
|
|
|---|---|---|---|---|
| 100 | 0.0544 | 0.0540 | 0.0530 | 0.0542 |
| 200 | 0.0494 | 0.0480 | 0.0470 | 0.0462 |
| 500 | 0.0506 | 0.0478 | 0.0536 | 0.0442 |
Empirical type I errors for sample sizes n=100, 200, and 500 with approximately 25% censoring. Results are based on 5000 simulated datasets. K w, K u, K BC, and K 0.5 represent results for the weighted UniFrac kernel, unweighted UniFrac kernel, Bray-Curtis kernel, and generalized UniFrac kernel with α=0.5, respectively
Empirical type I errors for n<100
| Number | Method |
|
|
|
|
|---|---|---|---|---|---|
| 25 | MiRKAT-S | 0.054 | 0.055 | 0.055 | 0.056 |
| Permutation | 0.046 | 0.048 | 0.046 | 0.049 | |
| 50 | MiRKAT-S | 0.045 | 0.058 | 0.051 | 0.051 |
| Permutation | 0.041 | 0.052 | 0.045 | 0.045 | |
| 75 | MiRKAT-S | 0.054 | 0.058 | 0.051 | 0.053 |
| Permutation | 0.051 | 0.053 | 0.048 | 0.049 |
Empirical type I errors for small sample sizes (n<100) with approximately 25% censoring. Results are based on 5000 simulated datasets, and permutation p values were obtained using 1000 permutations. K w, K u, K 0.5, and K BC represent results for the weighted UniFrac kernel, unweighted UniFrac kernel, Bray-Curtis kernel, and generalized UniFrac kernel with α=0.5, respectively
Fig. 1Empirical power. Empirical power was evaluated in all simulation settings, using a sample size of n=100 and 25% censoring. K w, K u, K BC, and K 0.5 represent results for the weighted UniFrac kernel, unweighted UniFrac kernel, Bray-Curtis kernel, and generalized UniFrac kernel with α=0.5, respectively. γ is the true effect size for the associated cluster or OTUs. a Setting 1; survival is associated with OTU counts in a common cluster containing 19.7% of all reads. b Setting 3; survival is associated with the presence or absence of each taxon in a common cluster. c Setting 1; survival is associated with OTU counts in a rare cluster containing 0.9% of all reads. d Setting 3; survival is associated with the presence or absence of each taxon in a rare cluster. e Setting 2; survival is associated with the ten most common OTUs, regardless of cluster membership. f Setting 4; survival is associated with 40 OTUs selected at random, regardless of cluster membership
Analysis of gut microbiome after allogeneic transplant
| Outcome | Method |
|
|
|
|
|---|---|---|---|---|---|
| Overall Survival | Uncorrected | 0.049 | 0.008 | 0.065 | 0.029 |
| Corrected | 0.046 | 0.007 | 0.065 | 0.022 | |
| Grade III aGVHD | Uncorrected | 0.496 | 0.514 | 0.472 | 0.849 |
| Corrected | 0.560 | 0.575 | 0.518 | 0.933 |
p values from MiRKAT-S using the weighted (K w) and unweighted (K u) UniFrac kernels, the generalized UniFrac kernel with α=0.5 (K 0.5), and the Bray-Curtis kernel (K BC) with outcomes of overall survival and severe (at least grade 3) graft-versus-host disease. “Corrected” indicates the p values are based on the modified score statistic with proper type I error; “uncorrected” indicates the p values are based on the original score statistic
Fig. 2Cluster analysis. a Clustering of individuals using Ward’s hierarchical clustering method, based on generalized UniFrac distances with α=0.5. b Kaplan-Meier curves for the two clusters of individuals with an outcome of overall survival