| Literature DB >> 35885966 |
Dan Luo1, Wenwei Liu2, Tian Chen3, Lingling An1,2,4.
Abstract
Longitudinal metagenomics has been widely studied in the recent decade to provide valuable insight for understanding microbial dynamics. The correlation within each subject can be observed across repeated measurements. However, previous methods that assume independent correlation may suffer from incorrect inferences. In addition, methods that do account for intra-sample correlation may not be applicable for count data. We proposed a distribution-free approach, namely CorrZIDF, which extends the current method to model correlated zero-inflated metagenomic count data, offering a powerful and accurate solution for detecting significance features. This method can handle different working correlation structures without specifying each margin distribution of the count data. Through simulation studies, we have shown the robustness of CorrZIDF when selecting a working correlation structure for repeated measures studies to enhance the efficiency of estimation. We also compared four methods using two real datasets, and the new proposed method identified more unique features that were reported previously on the relevant research.Entities:
Keywords: correlation structure; distribution-free; longitudinal; metagenomic; microbial; zero-inflated count model
Mesh:
Year: 2022 PMID: 35885966 PMCID: PMC9316307 DOI: 10.3390/genes13071183
Source DB: PubMed Journal: Genes (Basel) ISSN: 2073-4425 Impact factor: 4.141
Summary of parameter settings for the simulation studies. Two correlation structures, AR (1) and exchangeable were generated.
| Setting | AR (1) | Exchangeable | ||
|---|---|---|---|---|
| 25 subjects per condition | Moderately Correlated | Highly Correlated | Moderately Correlated | Highly Correlated |
| 50 subjects per condition | ||||
Figure 1Boxplots for the power under various settings based on 20 replicated simulations with 1000 features (including 200 DAFs) after adjusting multiple comparisons. The p-values are adjusted by the BH procedure. Assume AR (1) correlation structure across different sampling points.
Figure 2Boxplots of Type I error rates under various settings based on 20 replicated simulations with 1000 features (including 200 DAFs) after adjusting multiple comparisons. The dashed line represents the cutoff of 0.05. Assume AR (1) correlation structure across different sampling points. The p-values are adjusted by the BH procedure.
Figure 3Bar plots of the numbers of detected true and false positives under various settings with 1000 features and 20 replicated simulations. Each bar represents the total number of features that are detected as statistically significant post BH adjustment, and the short error bars represent the standard deviation from 20 replications. Note: the true number of DAFs in the simulation is 200. Assume AR (1) correlation structure across different sampling points.
Figure 4A Venn diagram of the distribution of the significantly differentiated results by the CorrZIDF, ZIDF and FZINBMM for a pregnancy dataset.
List of unique genera by each method for the pregnancy data.
| Method | Genus | Relevance | Reference |
|---|---|---|---|
| CorrZIDF | Acinetobacter | Acinetobacter infection in adverse pregnancy and perinatal outcomes | [ |
| Aerococcus | Low abundance in preterms | [ | |
| Atopobium | High relative abundance of Atopobium vaginae at the midtrimester was highly predictive of preterm birth | [ | |
| Bacteroides | Abundance reduction in Bacteroides in women who delivered preterm | [ | |
| Brevibacterium | Occasionally found in the placenta, considered as contaminants | [ | |
| Campylobacter | Associated with an increased risk of spontaneous abortion, stillbirth, and preterm delivery | [ | |
| Fusobacterium | Associated with preterm birth and has been isolated from the amniotic fluid, placenta, and chorioamnionic membranes of women delivering prematurely | [ | |
| Mobiluncus | For women with a prior preterm delivery, high level of Mobiluncus significantly indicate a spontaneous preterm delivery | [ | |
| Oligella | Mostly found as a commensal organism of the human genitourinary tract, which is also the main infection site | [ | |
| Peptostreptococcus | Pregnant women with Bacterial vaginosis including | [ | |
| Porphyromonas | Significantly high abundance in preterms | [ | |
| Sneathia | Low abundance found in preterm | [ | |
| Sutterella | Associated with metabolic/inflammatory variables across pregnancy in Gestational diabetes mellitus patients; | [ | |
| ZIDF | Facklamia | More abundant in animals that failed to establish a pregnancy | [ |
| Ureaplasma | High abundance of | [ | |
| FZINBMM | Actinomyces | Actinomyces infections in pregnancy are rare but, if they occur, have been linked primarily with preterm deliveries. | [ |
| Anaerococcus | The vaginal microbiota of Non-aboriginal women had higher relative abundance of the taxa Anaerococcus | [ | |
| Finegoldia | Associated with bacterial vaginosis, which is linked to an increased risk of preterm birth: | [ |
Figure 5A Venn diagram of the distribution of the significantly differentiated results in the CorrZIDF, ZIDF, ZIBR and FZINBMM in diet dataset.
List of unique genera by each method for the mouse diet data.
| Method | Genus | Relevance | Reference |
|---|---|---|---|
| CorrZIDF | Anaerofilum | The relative abundances of Anaerofilum were significantly lower in the obese group. | [ |
| Bilophila | Increased abundance of Bilophila has been associated with fat feeding and inflammation | [ | |
| Clostridium | High fat diet lowers C. butyricum levels; C. butyricum maybe one of the species that constitute a core microbiota involved in energy storage and metabolism through mechanisms that are not yet known; Clostridium XIVb is more abundant in high fat diet group than the control group. | [ | |
| Eggerthella | It metabolized amino acids rather than sugar | [ | |
| ZIBR | Akkermansia | Akkermansia muciniphila abundance was strongly and negatively affected by high-fat diet feeding | [ |
| ErysipelotrichaceaeIncertaeSedis | Aaccelerated postnatal growth suppressed the abundance of Erysipelotrichaceae_incertae_sedi | [ | |
| FZINBMM | Alistipes | Were significantly different between the high-fat diet and low-fat diet groups | [ |
| Bryantella | Relatively high abundance in the gut in high protein fed mice | [ | |
| Mogibacterium | In overweight people, Mogibacterium is associated with PUFA-rich (polyunsaturated fatty acid) diets | [ |