| Literature DB >> 29085573 |
Damayanthi Herath1,2, Duleepa Jayasundara3, David Ackland4, Isaam Saeed1, Sen-Lin Tang5, Saman Halgamuge6.
Abstract
Assessing biodiversity is an important step in the study of microbial ecology associated with a given environment. Multiple indices have been used to quantify species diversity, which is a key biodiversity measure. Measuring species diversity of viruses in different environments remains a challenge relative to measuring the diversity of other microbial communities. Metagenomics has played an important role in elucidating viral diversity by conducting metavirome studies; however, metavirome data are of high complexity requiring robust data preprocessing and analysis methods. In this review, existing bioinformatics methods for measuring species diversity using metavirome data are categorised broadly as either sequence similarity-dependent methods or sequence similarity-independent methods. The former includes a comparison of DNA fragments or assemblies generated in the experiment against reference databases for quantifying species diversity, whereas estimates from the latter are independent of the knowledge of existing sequence data. Current methods and tools are discussed in detail, including their applications and limitations. Drawbacks of the state-of-the-art method are demonstrated through results from a simulation. In addition, alternative approaches are proposed to overcome the challenges in estimating species diversity measures using metavirome data.Entities:
Keywords: Biodiversity; Bioinformatics; Metagenomics; Metavirome data; Phage studies; Species diversity
Year: 2017 PMID: 29085573 PMCID: PMC5650650 DOI: 10.1016/j.csbj.2017.09.001
Source DB: PubMed Journal: Comput Struct Biotechnol J ISSN: 2001-0370 Impact factor: 7.271
Summary of existing tools for estimating species diversity measures in metavirome studies.
| Tool | Estimated species diversity measures | Published in | Resource | |
|---|---|---|---|---|
| Sequence similarity- independent methods | PHACCS ( | Species richness | 2005 | |
| Shannon-Wiener index | ||||
| Evenness | ||||
| Rank-abundance distribution | ||||
| CatchAll | Species Richness | 2012 | ||
| Sequence similarity-dependent methods | UCLUST | Clusters of similar sequences | 2010 | |
| GAAS | Genome relative abundance | 2009 | ||
| GRAMMy | Genome relative abundance | 2011 | ||
| GASiC | Genome relative abundance | 2012 |
Fig. 1A schematic diagram summarising the stages where existing tools for measuring viral diversity can be integrated in a metagenomics data analysis pipeline.
Summary of implementation details of the existing tools.
| Tool | Input data | Programmed in | Operating system/s supported | Interface |
|---|---|---|---|---|
| PHACCS | Contig spectrum | Matlab, Perl | Linux,Mac OS, Windows | Web based GUI |
| Average genome length | ||||
| Sequencing and assembling settings | ||||
| CatchAll | Contig spectrum | .Net Framework | Linux,Mac OS,Windows | GUI, CLI |
| UCLUST | Metagenomic reads | – | Linux, Mac OS, Windows | CLI |
| GAAS | Metagenomic reads | Perl | Linux, Mac OS, Windows | CLI |
| GRAMMy | Metagenomic reads | C++ | Linux | CLI |
| Python | ||||
| GASiC | Metagenomics reads | Python | Linux | CLI |
Implementation details of the tool is not available.
Fig. 2The effect of variation of genome lengths on the accuracy of species richness estimates from PHACCS.