| Literature DB >> 35741811 |
Quran Wu1, James O'Malley2, Susmita Datta1, Raad Z Gharaibeh3, Christian Jobin3, Margaret R Karagas4, Modupe O Coker4, Anne G Hoen4, Brock C Christensen4, Juliette C Madan4, Zhigang Li1.
Abstract
BACKGROUND: The human microbiome can contribute to pathogeneses of many complex diseases by mediating disease-leading causal pathways. However, standard mediation analysis methods are not adequate to analyze the microbiome as a mediator due to the excessive number of zero-valued sequencing reads in the data and that the relative abundances have to sum to one. The two main challenges raised by the zero-inflated data structure are: (a) disentangling the mediation effect induced by the point mass at zero; and (b) identifying the observed zero-valued data points that are not zero (i.e., false zeros).Entities:
Keywords: mediation; microbiome; relative abundance; sparse data; zero-inflated composition
Mesh:
Year: 2022 PMID: 35741811 PMCID: PMC9223163 DOI: 10.3390/genes13061049
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Figure 1Potential causal mediation pathways of a zero-inflated mediator.
Simulation results for comparison between MarZIC and IKT with sample size of . Bias, percentage of the bias, the empirical standard errors, the the mean of estimated standard errors and the empirical coverage probability of the CI for each estimator is respectively reported under the columns Bias, Bias %, SE, Mean SE and CP(%). Mediation effects from the IKT approach are provided at the bottom part of the table.
| Low Relative Abundance (Mean = 0.0025) | High Relative Abundance (Mean = 0.5) | |||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Parameter | True | Mean | Bias | Bias | SE | Mean | CP (%) | True | Mean | Bias | Bias | SE | Mean | CP (%) |
| / |
|
|
|
|
|
| ||||||||
| MarZIC | ||||||||||||||
| NIE1 | 0.10 | 0.11 | 0.01 | 10.0 | 0.08 | 0.07 | 91 | 9.30 | 9.11 | −0.18 | −1.98 | 2.68 | 2.70 | 96 |
| NIE2 | 0.55 | 0.52 | −0.03 | −5.67 | 0.55 | 0.56 | 97 | 0.55 | 0.50 | −0.06 | −10.15 | 0.62 | 0.56 | 94 |
| NIE | 0.65 | 0.63 | −0.02 | −3.31 | 0.58 | 0.58 | 96 | 9.85 | 9.61 | −0.24 | −2.44 | 3.25 | 3.20 | 95 |
|
| −2.00 | −2.05 | −0.05 | −2.45 | 0.32 | 0.33 | 96 | −2.00 | −1.92 | 0.07 | 3.82 | 0.32 | 0.29 | 94 |
|
| 100.00 | 101.89 | 1.89 | 1.89 | 18.04 | 19.04 | 97 | 100.00 | 99.96 | −0.04 | −0.04 | 1.89 | 1.74 | 91 |
|
| 4.00 | 4.05 | 0.05 | 1.37 | 0.38 | 0.36 | 94 | 4.00 | 3.93 | −0.07 | −1.73 | 0.58 | 0.57 | 91 |
|
| 5.00 | 5.08 | 0.08 | 1.53 | 0.53 | 0.51 | 94 | 5.00 | 4.97 | −0.03 | −0.62 | 0.46 | 0.46 | 99 |
|
| 3.00 | 2.93 | −0.07 | −2.40 | 0.58 | 0.55 | 92 | 3.00 | 3.02 | 0.02 | 0.55 | 0.53 | 0.54 | 99 |
|
| 1.00 | 0.99 | −0.01 | −1.00 | 0.07 | 0.07 | 90 | 1.00 | 0.97 | −0.03 | −2.99 | 0.07 | 0.07 | 89 |
|
| −6.20 | −6.24 | −0.04 | −0.69 | 0.36 | 0.36 | 94 | −1.00 | −1.01 | −0.01 | −0.93 | 0.05 | 0.05 | 90 |
|
| 0.40 | 0.42 | 0.02 | 5.52 | 0.33 | 0.29 | 92 | 0.40 | 0.41 | 0.01 | 1.69 | 0.06 | 0.07 | 95 |
|
| 50.00 | 56.42 | 6.42 | 12.83 | 24.21 | 19.35 | 97 | 50.00 | 53.37 | 3.37 | 6.74 | 8.22 | 8.40 | 96 |
|
| −1.16 | −1.23 | −0.07 | −5.75 | 0.35 | 0.36 | 99 | −1.16 | −1.20 | −0.04 | −3.18 | 0.37 | 0.34 | 95 |
|
| −0.50 | −0.53 | −0.03 | −5.10 | 0.55 | 0.55 | 97 | −0.50 | −0.47 | 0.03 | 6.91 | 0.58 | 0.53 | 91 |
| IKT | ||||||||||||||
| NIE | 0.65 | 0.10 | −0.55 | −84.81 | - | - | 9 | 9.85 | 9.20 | −0.65 | −6.62 | - | - | 94 |
Simulation results for the comparison of MarZIC with CCMM and IKT. Here n denotes the sample size and denotes the number of taxa.
| Recall (%) | Precision (%) | F1 (%) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
|
| MarZIC | MarZIC | CCMM | IKT | MarZIC | MarZIC | CCMM | IKT | MarZIC | MarZIC | CCMM | IKT |
| (NIE1) | (NIE2) | (NIE1) | (NIE2) | (NIE1) | (NIE2) | ||||||||
| 10 | 200 | 99.00 | 100.00 | 100.00 | 58.00 | 97.70 | 98.00 | 38.80 | 99.70 | 97.90 | 98.60 | 55.30 | 68.10 |
| 25 | 200 | 99.50 | 100.00 | 96.00 | 39.50 | 98.20 | 99.50 | 52.40 | 100.00 | 98.50 | 99.60 | 66.10 | 48.30 |
| 50 | 200 | 97.50 | 100.00 | 97.00 | 44.00 | 100.00 | 100.00 | 46.40 | 100.00 | 98.30 | 100.00 | 60.60 | 54.70 |
| 100 | 200 | 96.00 | 98.90 | 100.00 | 32.50 | 95.50 | 100.00 | 42.80 | 100.00 | 94.50 | 98.90 | 58.00 | 41.30 |
| 300 | 200 | 86.00 | 97.80 | - | 25.00 | 90.80 | 99.50 | - | 100.00 | 85.80 | 97.50 | - | 31.30 |
| 500 | 200 | 77.50 | 94.70 | - | 23.50 | 97.80 | 87.20 | - | 99.00 | 83.00 | 87.30 | - | 30.00 |
Simulation results for the comparison of MarZIC with CCMM and IKT.
| Recall (%) | Precision (%) | F1 (%) | |||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
|
| Number of Taxa | MarZIC | MarZIC | CCMM | IKT | MarZIC | MarZIC | CCMM | IKT | MarZIC | MarZIC | CCMM | IKT |
| with Non-Zero NIE1 | (NIE1) | (NIE2) | (NIE1B) | (NIE2) | (NIE1) | (NIE2) | |||||||
| 50 | 5 | 95.00 | 100.00 | 89.00 | 66.20 | 99.00 | 98.50 | 27.90 | 99.60 | 96.60 | 99.00 | 42.20 | 78.20 |
| 50 | 10 | 95.70 | 92.00 | 66.00 | 62.40 | 98.80 | 91.80 | 33.20 | 99.60 | 97.10 | 86.20 | 43.90 | 75.70 |
| 100 | 5 | 96.60 | 99.00 | 89.40 | 60.60 | 92.70 | 98.30 | 19.00 | 99.10 | 94.10 | 97.80 | 31.20 | 73.30 |
| 100 | 10 | 92.10 | 91.00 | 80.10 | 46.00 | 93.70 | 97.80 | 27.20 | 100.00 | 92.50 | 89.50 | 40.40 | 61.20 |
| 300 | 5 | 94.20 | 96.00 | - | 56.10 | 80.50 | 97.00 | - | 99.70 | 85.20 | 94.00 | - | 69.90 |
| 300 | 10 | 85.30 | 93.00 | - | 29.30 | 77.10 | 91.00 | - | 99.60 | 79.60 | 86.60 | - | 43.40 |
Figure 2Heatmap of mediation strength based on NIE1 in VSL#3 study. The mediation strength is measured by (1-p) where p is the unadjusted p-value. Negative sign indicates negative NIE1. Taxonomic assignment is labeled on the vertical axis. Samples are labeled on the horizontal axis. Absence of an OTU in a sample is left blank in the heatmap.