| Literature DB >> 30081949 |
Emmanuel Montassier1,2, Gabriel A Al-Ghalith3, Benjamin Hillmann3, Kimberly Viskocil4, Amanda J Kabage4, Christopher E McKinlay3, Michael J Sadowsky5,6, Alexander Khoruts4,5, Dan Knights7,8.
Abstract
BACKGROUND: Dysbiosis of the human gut microbiome is defined as a maladaptive or clinically relevant deviation of the community profile from the healthy or normal state. Dysbiosis has been implicated in an extensive set of metabolic, auto-immune, and infectious diseases, and yet there is substantial inter-individual variation in microbiome composition even within body sites of healthy humans. An individual's microbiome varies over time in a high-dimensional space to form their personal microbiome cloud. This cloud may or may not be similar to that of other people, both in terms of the average microbiome profile (conformity) and the diameter of the cloud (stability). However, there is currently no robust non-parametric test that determines whether a patient's microbiome cloud is an outlier with respect to a reference group of healthy individuals with widely varying microbiome profiles.Entities:
Keywords: Conformity; Dysbiosis; Fecal microbiota transplantation; Microbiome; Outlier; Stability
Mesh:
Year: 2018 PMID: 30081949 PMCID: PMC6080375 DOI: 10.1186/s40168-018-0514-4
Source DB: PubMed Journal: Microbiome ISSN: 2049-2618 Impact factor: 14.650
Fig. 1Comparison of univariate blood and microbiome tests with the multivariate CLOUD test. a, b In contrast to a univariate blood panel, in which healthy ranges for individual metabolites are well defined, the normal ranges for individual bacterial species in the human microbiome are too wide to be meaningful, with many taxa being completely absent from some individuals’ guts and dominating other individuals’ guts. c, d In contrast, CLOUD uses a high-dimensional representation of the whole microbiome profile to define the normal range of healthy microbiomes
Fig. 2a First two dimensions of principal coordinates analysis (PCoA) of a simulated outlier and 29 reference microbiomes. Even though the outlier has no species in common with the reference samples, two-dimensional PCoA obscures the fact that it is an outlier, b PC6 PCoA axe plotted against PC1, showing that in this particular case six dimensions are sufficient to observe the outlier. In real clinical data, the true number of dimensions required is not known. c Example of an outlier that would not be detected by a centroid-based test
Fig. 3Graphical illustration of how with certain high-dimensional manifolds setting k too high can cause actual outliers to be classified as normal (false negative) and can cause normal points to be classified as outliers (false positive). Using large k approaching n defeats the purpose of the local distance measure, which is to allow the test to use only local regions in ecological distance space and can cause normal reference samples at the extremes of the distributions to be classified as outliers. On the other hand, if k is too small, then it is not robust to subtle variations in the reference group. By default, the CLOUD test sets the neighborhood size to 5% of the size of the reference set
Fig. 4a Number of nearest healthy neighbors chosen in the CLOUD test prediction to find outliers in an international cohort (the HMP data set). We randomly selected a test dataset of 50 subjects and randomly selected a training dataset of 100 subjects 30 times in the full dataset of 200 subjects. We repeated the random selection of the training dataset 30 times. We did not identify outliers, excepting for extreme value of k in several random training datasets. This analysis demonstrates the robustness of the CLOUD test to the neighborhood size. The vertical bar represents 5% of the training dataset, the default neighborhood size for the test. b The same analysis was performed for the Global gut data set. c Principal coordinate plot of the Global gut dataset, from Unweighted UniFrac distance, demonstrating that the CLOUD test is robust to strong clustering effects with a reference group
Fig. 5a The first 10 nearest independent healthy neighbors. This plot shows the restoration of the microbiome in responders to FMT, as the distances were very similar between samples from healthy subjects and those with successful FMT. This also shows failed restoration of the microbiome in the non-responder patient, as the distances were very different between the samples from failed patient and from the healthy subjects. b Plot of the log10 outlier percentile in patients who received FMT. The dashed line represents an outlier percentile of 0.05. When using k = 5% of the population, non-responder patient was considered as outlier. Using large neighborhood sizes classified 1 responder patient as outlier. c Patient stability as measured by self-similarity over time. Plot of the distance of a day to the corresponding previous day using Unweighted UniFrac distance. The figure shows stability between two consecutive samples of the fecal microbiome in healthy controls and in responder patients among days whereas the non-responder patient showed instability between two consecutive samples