| Literature DB >> 27279729 |
Karel Sedlar1, Petra Videnska2, Helena Skutkova1, Ivan Rychlik3, Ivo Provaznik1.
Abstract
Visualization analysis plays an important role in metagenomics research. Proper and clear visualization can help researchers get their first insights into data and by selecting different features, also revealing and highlighting hidden relationships and drawing conclusions. To prevent the resulting presentations from becoming chaotic, visualization techniques have to properly tackle the high dimensionality of microbiome data. Although a number of different methods based on dimensionality reduction, correlations, Venn diagrams, and network representations have already been published, there is still room for further improvement, especially in the techniques that allow visual comparison of several environments or developmental stages in one environment. In this article, we represent microbiome data by bipartite graphs, where one partition stands for taxa and the other stands for samples. We demonstrated that community detection is independent of taxonomical level. Moreover, focusing on higher taxonomical levels and the appropriate merging of samples greatly helps improving graph organization and makes our presentations clearer than other graph and network visualizations. Capturing labels in the vertices also brings the possibility of clearly comparing two or more microbial communities by showing their common and unique parts.Entities:
Keywords: 16S rRNA; OTU table; bipartite graph; graph modularity; metagenomics; visualization analysis
Year: 2016 PMID: 27279729 PMCID: PMC4888752 DOI: 10.4137/EBO.S38546
Source DB: PubMed Journal: Evol Bioinform Online ISSN: 1176-9343 Impact factor: 1.625
Figure 1Flowchart describing proposed workflow. Every main step is implementable in several scripting languages/software according to the preferences of users.
Summary of parameters describing the reconstructed graphs for reduction of taxa partition.
| WHOLE OTU | GENUS | FAMILY | ORDER | CLASS | PHYLUM | |
|---|---|---|---|---|---|---|
| No. of vertices | 18,503 | 280 | 139 | 98 | 73 | 66 |
| No. of edges | 37,356 | 5,776 | 2,268 | 1,155 | 656 | 407 |
| Average degree | 4.037 | 41.257 | 32.633 | 23.571 | 17.973 | 12.333 |
| Graph density | <0.000 | 0.148 | 0.236 | 0.243 | 0.250 | 0.190 |
| Average path length | 3.713 | 2.118 | 2.042 | 2.053 | 1.916 | 1.960 |
| Modularity | 0.577 | 0.281 | 0.287 | 0.303 | 0.288 | 0.263 |
| No. of communities | 4 | 4 | 4 | 4 | 4 | 4 |
Summary of parameters describing the reconstructed graphs for reduction by abundance threshold.
| THRESHOLD | 0 | 0.005 | 0.01 | 0.05 | 0.1 |
|---|---|---|---|---|---|
| No. of vertices | 64 | 21 | 16 | 12 | 10 |
| No. of edges | 156 | 33 | 25 | 16 | 10 |
| Average degree | 4.875 | 3.143 | 3.125 | 2.667 | 2 |
| Graph density | 0.077 | 0.157 | 0.208 | 0.242 | 0.222 |
| Average path length | 2.105 | 2.181 | 2.133 | 2.242 | 2.711 |
| Modularity | 0.224 | 0.338 | 0.340 | 0.377 | 0.411 |
| No. of communities | 4 | 4 | 4 | 4 | 4 |
Figure 2Bipartite graph reconstructed from the entire OTU table with four detected communities.
Notes: Due to different sequencing depth for particular samples, relative abundances of OTUs were used. Communities were detected based on modularity maximization. Vertices (samples and OTUs) within the same community are colored with the same color.
Figure 3Bipartite graph representing four stages of microbiota development.
Notes: Four communities were detected based on modularity maximization. The color coding of particular communities (yellow, blue, red, and green) corresponds to the color coding used in Figure 2. Other colors represent intercommunity connections. Partitions are distinguished by the size of the vertices.
Figure 4Bipartite graph showing bacterial families exceeding the threshold of 0.5% abundance.
Notes: The color coding is similar to the color coding used in the previous figures. The threshold t = 0.5% was used to omit any vertices as well as edges that do not meet the requirement for sufficient abundance.