| Literature DB >> 34900137 |
Jayanth Kumar Narayana1, Micheál Mac Aogáin2,3, Wilson Wen Bin Goh1,4, Kelin Xia5, Krasimira Tsaneva-Atanasova6, Sanjay H Chotirmall1,7.
Abstract
Traditionally, human microbiology has been strongly built on the laboratory focused culture of microbes isolated from human specimens in patients with acute or chronic infection. These approaches primarily view human disease through the lens of a single species and its relevant clinical setting however such approaches fail to account for the surrounding environment and wide microbial diversity that exists in vivo. Given the emergence of next generation sequencing technologies and advancing bioinformatic pipelines, researchers now have unprecedented capabilities to characterise the human microbiome in terms of its taxonomy, function, antibiotic resistance and even bacteriophages. Despite this, an analysis of microbial communities has largely been restricted to ordination, ecological measures, and discriminant taxa analysis. This is predominantly due to a lack of suitable computational tools to facilitate microbiome analytics. In this review, we first evaluate the key concerns related to the inherent structure of microbiome datasets which include its compositionality and batch effects. We describe the available and emerging analytical techniques including integrative analysis, machine learning, microbial association networks, topological data analysis (TDA) and mathematical modelling. We also present how these methods may translate to clinical settings including tools for implementation. Mathematical based analytics for microbiome analysis represents a promising avenue for clinical translation across a range of acute and chronic disease states.Entities:
Keywords: Integration; Machine learning; Mathematical modelling; Microbial association analysis; Microbiome; Topological data analysis
Year: 2021 PMID: 34900137 PMCID: PMC8637001 DOI: 10.1016/j.csbj.2021.11.029
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Fig. 1Overview of the analytical approaches to microbiome data. (A) Microbiome community samples can be assessed by (1) whole genome shotgun metagenomics: where the whole DNA content is sequenced or (2) Targeted amplicon sequencing: where a targeted region (i.e. 16S in bacteria or ITS in fungi) is amplified by polymerase chain reaction (PCR) followed by sequencing. (B) The derived sequences are next mapped to reference databases to yield taxonomic, anti-microbial resistance or functional profiles of the microbiome (whole genome shotgun metagenomics) or taxonomic profile (targeted amplicon sequencing). Derived microbiome profiles suffer from compositionality, high-dimensionality, over-dispersion, sparsity, and batch effects. (C) Various computational approaches for microbiome analytics can be leveraged including integrative microbiome analysis, machine learning, microbial association analysis, topological data analysis and mathematical modelling.
Table summarizing open-source tools and software available for the methods described in this manuscript.
| Method | Tools/Softwares available |
|---|---|
| Compositional Data Analysis | ‘compositions’ – a R package |
| Similarity Network Fusion (SNF) | ‘SNFtool’ – a R package |
| Data Integration Analysis for Biomarker discovery using Latent cOmponents (DIABLO) | Part of ‘mixOmics’ – a R package |
| Multi-Omics Factor Analysis (MOFA) | ‘MOFA2’ – a R package |
| DeepMicro | ‘DeepMicro’ – a python package |
| Co-occurrence network analysis including renormalization and bootstrap (CoNet) | ‘CoNet’ – a cytoscape app |
| Sparse inverse covariance estimation for ecological association inference (SPIEC-EASI) | ‘SpeicEasi’ – a R package |
| Microbial dynamic systems inference engine (MDSINE) | ‘mdsine’ – available as standalone and MATLAB library |
| Mapper | ‘Kepler Mapper’ – a python package |