Literature DB >> 28025202

Calypso: a user-friendly web-server for mining and visualizing microbiome-environment interactions.

Martha Zakrzewski1, Carla Proietti1, Jonathan J Ellis2, Shihab Hasan1,2, Marie-Jo Brion2, Bernard Berger3, Lutz Krause1,2.   

Abstract

Calypso is an easy-to-use online software suite that allows non-expert users to mine, interpret and compare taxonomic information from metagenomic or 16S rDNA datasets. Calypso has a focus on multivariate statistical approaches that can identify complex environment-microbiome associations. The software enables quantitative visualizations, statistical testing, multivariate analysis, supervised learning, factor analysis, multivariable regression, network analysis and diversity estimates. Comprehensive help pages, tutorials and videos are provided via a wiki page. Availability and Implementation: The web-interface is accessible via http://cgenome.net/calypso/ . The software is programmed in Java, PERL and R and the source code is available from Zenodo ( https://zenodo.org/record/50931 ). The software is freely available for non-commercial users. Contact: l.krause@uq.edu.au. Supplementary information: Supplementary data are available at Bioinformatics online.
© The Author 2016. Published by Oxford University Press.

Entities:  

Mesh:

Substances:

Year:  2017        PMID: 28025202      PMCID: PMC5408814          DOI: 10.1093/bioinformatics/btw725

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


1 Introduction

We present the web-application Calypso, a powerful, yet easy-to-use tool for the higher-level analysis of microbial community composition data (e.g. Ainsworth ; Cantacessi ; Swe ). The software has a focus on multivariate methods and allows the analysis of bacterial, archaeal, viral and eukaryotic communities. Several software packages are already available for the analysis and visualization of metagenomic datasets (Arndt ; Kristiansson ; McMurdie and Holmes, 2013; Parks ; Paulson ; Sanli ). Compared to existing tools, Calypso is unique by providing access to an extensive range of data-mining methods via an easy-to-use web-interface (Tables S1–S3, Fig. S11). The software can easily be explored using a demo project.

2 Summary of features

2.1 Input files and output formats

As input, Calypso requires a counts file providing taxonomic assignments of metagenomic (or 16S rDNA) sequences and a metadata file providing meta-information for each sample. An optional matrix of pair-wise community distances can be uploaded, including UniFrac distances. Various file formats are supported, including the common biom-format, which allows direct upload of pre-processed files generated by other analysis pipelines, such as QIIME, mothur, MG-RAST or MetaPhlAn. Uploaded data can be normalized and transformed to account for the generally non-normal distribution of microbial community composition data. Publication-quality images can be generated in either PNG, PDF or SVG format.

2.2 Quantitative representations

Microbial composition data is presented as heatmap, bubble plot, scatter plot, strip chart, bar chart and boxplot. Calypso implements a newly developed module for visualizing hierarchical relationships as interactive dendrograms or interactive radial trees (Fig. 1A, Supplementary Text). Hierarchical relationships can further be explored using Krona charts.
Fig. 1

Analysis of intestinal 16S rDNA data in Calypso. (A) Interactive trees visualize hierarchical structures in microbial communities. Edges depict the relative abundance of the corresponding taxon. (B) Hierarchical clustering of microbial community profiles. (C) Principal Coordinates Analysis (PCoA) of intestinal microbiota of subjects from Malawi, USA and Venezuela. (D) Network analysis describing positive (yellow edges) and negative associations (blue edges) between bacterial taxa. Nodes are highlighted based on association between OTU abundance and geographic location (red: USA, blue: Venezuela, yellow: Malawi)

Analysis of intestinal 16S rDNA data in Calypso. (A) Interactive trees visualize hierarchical structures in microbial communities. Edges depict the relative abundance of the corresponding taxon. (B) Hierarchical clustering of microbial community profiles. (C) Principal Coordinates Analysis (PCoA) of intestinal microbiota of subjects from Malawi, USA and Venezuela. (D) Network analysis describing positive (yellow edges) and negative associations (blue edges) between bacterial taxa. Nodes are highlighted based on association between OTU abundance and geographic location (red: USA, blue: Venezuela, yellow: Malawi)

2.3 Cluster analysis and sample ordination

Unsupervised clustering of microbial community profiles is achieved by hierarchical clustering (Fig. 1B). Heatmaps can be fine-tuned for components such as the colour palette, trimming of outliers and the centre value of the colour palette. Community composition data is ordinated by principal components analysis, principal coordinates analysis (PCoA) (Fig. 1C), and non-metric multidimensional scaling.

2.4 Microbiome–environment associations

Associations between microbial community composition and multiple environmental variables can be identified using a wide range of multivariate methods, including redundancy analysis, canonical correspondence analysis, and permutational MANOVA. Abundance of individual taxa is compared by standard parametric and non-parametric tests and using tests specifically developed for counts data (DESeq2, ANCOM and ALDEx2). Calculated P-values are adjusted for multiple testing. Abundance of individual taxa can be associated with multiple biological conditions or confounding factors using multiple linear regression. Mixed effect regression models are used for the analysis of repeated measurements to distinguish between group-specific effects (e.g. case/control) and subject-specific effects. Additionally, feature selection methods facilitate selection of the optimal subset of taxa predictive of an outcome of interest, including step-wise linear regression, LASSO regularized regression and random forest.

2.5 Network analysis

A newly developed network module allows the identification of mutual exclusive bacteria and clusters of co-occurring bacteria (Fig. 1D). Taxa are represented as nodes, taxa abundance as node size, and edges depict positive (yellow) and negative (blue) associations. Nodes can be coloured by the phylum or family of the represented bacterial taxon or based on their association to environmental variables. Networks are generated by first computing associations between taxa using Pearson’s correlation. The resulting pairwise correlations are used to ordinate nodes in a two dimensional plot by PCoA. In this way, correlating nodes are placed in close proximity and anti-correlating nodes are placed at distant locations. Nodes of correlating taxa are connected by edges.

2.6 Analysis of microbial diversity

Multiple metrics for measuring microbial alpha diversity are provided, including Shannon’s index, evenness, richness, Simpson’s index, Chao 1 and Fisher’s Alpha. Community richness is estimated by rarefaction analysis to account for differences in sample sizes. Complex associations between microbial diversity and multiple explanatory variables are examined by multiple linear regression.

3 Conclusions

Calypso provides an easy-to-use statistical and visualization toolbox that allows rapid, robust and thorough analyses of compositional information from microbial datasets. Customized figures of publication-quality can be generated without requiring any programming knowledge. Click here for additional data file.
  9 in total

1.  ShotgunFunctionalizeR: an R-package for functional comparison of metagenomes.

Authors:  Erik Kristiansson; Philip Hugenholtz; Daniel Dalevi
Journal:  Bioinformatics       Date:  2009-08-20       Impact factor: 6.937

2.  STAMP: statistical analysis of taxonomic and functional profiles.

Authors:  Donovan H Parks; Gene W Tyson; Philip Hugenholtz; Robert G Beiko
Journal:  Bioinformatics       Date:  2014-07-23       Impact factor: 6.937

3.  METAGENassist: a comprehensive web server for comparative metagenomics.

Authors:  David Arndt; Jianguo Xia; Yifeng Liu; You Zhou; An Chi Guo; Joseph A Cruz; Igor Sinelnikov; Karen Budwill; Camilla L Nesbø; David S Wishart
Journal:  Nucleic Acids Res       Date:  2012-05-29       Impact factor: 16.971

4.  The coral core microbiome identifies rare bacterial taxa as ubiquitous endosymbionts.

Authors:  Tracy D Ainsworth; Lutz Krause; Thomas Bridge; Gergely Torda; Jean-Baptise Raina; Martha Zakrzewski; Ruth D Gates; Jacqueline L Padilla-Gamiño; Heather L Spalding; Celia Smith; Erika S Woolsey; David G Bourne; Pim Bongaerts; Ove Hoegh-Guldberg; William Leggat
Journal:  ISME J       Date:  2015-04-17       Impact factor: 10.302

5.  Differential abundance analysis for microbial marker-gene surveys.

Authors:  Joseph N Paulson; O Colin Stine; Héctor Corrada Bravo; Mihai Pop
Journal:  Nat Methods       Date:  2013-09-29       Impact factor: 28.547

6.  Impact of experimental hookworm infection on the human gut microbiota.

Authors:  Cinzia Cantacessi; Paul Giacomin; John Croese; Martha Zakrzewski; Javier Sotillo; Leisa McCann; Matthew J Nolan; Makedonka Mitreva; Lutz Krause; Alex Loukas
Journal:  J Infect Dis       Date:  2014-05-03       Impact factor: 5.226

7.  FANTOM: Functional and taxonomic analysis of metagenomes.

Authors:  Kemal Sanli; Fredrik H Karlsson; Intawat Nookaew; Jens Nielsen
Journal:  BMC Bioinformatics       Date:  2013-02-01       Impact factor: 3.169

8.  phyloseq: an R package for reproducible interactive analysis and graphics of microbiome census data.

Authors:  Paul J McMurdie; Susan Holmes
Journal:  PLoS One       Date:  2013-04-22       Impact factor: 3.240

9.  Scabies mites alter the skin microbiome and promote growth of opportunistic pathogens in a porcine model.

Authors:  Pearl M Swe; Martha Zakrzewski; Andrew Kelly; Lutz Krause; Katja Fischer
Journal:  PLoS Negl Trop Dis       Date:  2014-05-29
  9 in total
  240 in total

1.  A Consistent and Predictable Commercial Broiler Chicken Bacterial Microbiota in Antibiotic-Free Production Displays Strong Correlations with Performance.

Authors:  Timothy J Johnson; Bonnie P Youmans; Sally Noll; Carol Cardona; Nicholas P Evans; T Peter Karnezos; John M Ngunjiri; Michael C Abundo; Chang-Won Lee
Journal:  Appl Environ Microbiol       Date:  2018-05-31       Impact factor: 4.792

2.  A Vegetable Fermentation Facility Hosts Distinct Microbiomes Reflecting the Production Environment.

Authors:  Jonah E Einson; Asha Rani; Xiaomeng You; Allison A Rodriguez; Clifton L Randell; Tammy Barnaba; Mark K Mammel; Michael L Kotewicz; Christopher A Elkins; David A Sela
Journal:  Appl Environ Microbiol       Date:  2018-10-30       Impact factor: 4.792

3.  Fungi form interkingdom microbial communities in the primordial human gut that develop with gestational age.

Authors:  Kent A Willis; John H Purvis; Erin D Myers; Michael M Aziz; Ibrahim Karabayir; Charles K Gomes; Brian M Peters; Oguz Akbilgic; Ajay J Talati; Joseph F Pierre
Journal:  FASEB J       Date:  2019-08-31       Impact factor: 5.191

4.  Low dietary fiber intake increases Collinsella abundance in the gut microbiota of overweight and obese pregnant women.

Authors:  Luisa F Gomez-Arango; Helen L Barrett; Shelley A Wilkinson; Leonie K Callaway; H David McIntyre; Mark Morrison; Marloes Dekker Nitert
Journal:  Gut Microbes       Date:  2018-03-13

5.  Dynamic assessment of microbial ecology (DAME): a web app for interactive analysis and visualization of microbial sequencing data.

Authors:  Brian D Piccolo; Umesh D Wankhade; Sree V Chintapalli; Sudeepa Bhattacharyya; Luo Chunqiao; Kartik Shankar
Journal:  Bioinformatics       Date:  2018-03-15       Impact factor: 6.937

Review 6.  Microbiome data science.

Authors:  Sudarshan A Shetty; Leo Lahti
Journal:  J Biosci       Date:  2019-10       Impact factor: 1.826

Review 7.  Visual exploration of microbiome data.

Authors:  Bhusan K Kuntal; Sharmila S Mande
Journal:  J Biosci       Date:  2019-10       Impact factor: 1.826

8.  Diversity and functional profile of bacterial communities at Lancaster acid mine drainage dam, South Africa as revealed by 16S rRNA gene high-throughput sequencing analysis.

Authors:  Thabile Lukhele; Ramganesh Selvarajan; Hlengilizwe Nyoni; Bheki Brilliance Mamba; Titus Alfred Makudali Msagati
Journal:  Extremophiles       Date:  2019-09-13       Impact factor: 2.395

9.  Gut Microbiome Analysis Identifies Potential Etiological Factors in Acute Gastroenteritis.

Authors:  Natalia Castaño-Rodríguez; Alexander P Underwood; Juan Merif; Stephen M Riordan; William D Rawlinson; Hazel M Mitchell; Nadeem O Kaakoush
Journal:  Infect Immun       Date:  2018-06-21       Impact factor: 3.441

10.  Supplementation with dairy matrices impacts on homocysteine levels and gut microbiota composition of hyperhomocysteinemic mice.

Authors:  Paola Zinno; Vincenzo Motta; Barbara Guantario; Fausta Natella; Marianna Roselli; Cristiano Bello; Raffaella Comitato; Domenico Carminati; Flavio Tidona; Aurora Meucci; Paola Aiello; Giuditta Perozzi; Fabio Virgili; Paolo Trevisi; Raffaella Canali; Chiara Devirgiliis
Journal:  Eur J Nutr       Date:  2019-01-30       Impact factor: 5.614

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.