| Literature DB >> 25983555 |
Anastasis Oulas1, Christina Pavloudi2, Paraskevi Polymenakou1, Georgios A Pavlopoulos3, Nikolas Papanikolaou3, Georgios Kotoulas1, Christos Arvanitidis1, Ioannis Iliopoulos3.
Abstract
Advances in next-generation sequencing (NGS) have allowed significant breakthroughs in microbial ecology studies. This has led to the rapid expansion of research in the field and the establishment of "metagenomics", often defined as the analysis of DNA from microbial communities in environmental samples without prior need for culturing. Many metagenomics statistical/computational tools and databases have been developed in order to allow the exploitation of the huge influx of data. In this review article, we provide an overview of the sequencing technologies and how they are uniquely suited to various types of metagenomic studies. We focus on the currently available bioinformatics techniques, tools, and methodologies for performing each individual step of a typical metagenomic dataset analysis. We also provide future trends in the field with respect to tools and technologies currently under development. Moreover, we discuss data management, distribution, and integration tools that are capable of performing comparative metagenomic analyses of multiple datasets using well-established databases, as well as commonly used annotation standards.Entities:
Keywords: computational tools; data analysis; metagenomics; next-generation sequencing
Year: 2015 PMID: 25983555 PMCID: PMC4426941 DOI: 10.4137/BBI.S12462
Source DB: PubMed Journal: Bioinform Biol Insights ISSN: 1177-9322
Tools grouped according to their main functionality.
| Assembly | EULER | |
| Velvet | ||
| SOAP | ||
| ABySS | ||
| MetaVelvet | ||
| MetaVelvet-SL | ||
| Meta-IDBA | ||
| IDBA-UD | ||
| Newbler (Roche) | ||
| MIRA | ||
| Mapsembler | ||
| ALLPATHS | ||
| MetaORFA | ||
| MetAMOS | ||
| Binning | TETRA | |
| S-GSOM | ||
| PhylopythiaS | ||
| TACOA | ||
| PCAHIER | ||
| ESOM | ||
| ClaMS | ||
| CARMA | ||
| WGSQuikr | ||
| SPHINX | ||
| MetaPhyler | ||
| SOrt-ITEMS | ||
| PhymmBL | ||
| MetaCluster | ||
| Annotation | FASTX-Toolkit | |
| FastQC | ||
| SolexaQA | ||
| Lucy 2 | ||
| DUST | ||
| Bowtie | ||
| MetaGeneMark | ||
| LEfSe | ||
| TACOA | ||
| Metagene | ||
| CREST | ||
| Prodigal | ||
| mOTU-LG | ||
| Orphelia | ||
| Kraken | ||
| FragGeneScan | ||
| CRT | ||
| NBC | ||
| MyTaxa | ||
| RITA | ||
| PILER-CR | ||
| tRNAscan | ||
| KEGG | ||
| MetaCluster TA | ||
| SEED | ||
| eggNOG | ||
| ProViDE | ||
| COG/KOG | ||
| PFAM | ||
| TIGRFAM | ||
| MetaPhlAn | ||
| HighSSR | ||
| Blat | ||
| Analysis pipelines | IMG/MER | |
| MG-RAST | ||
| MEGAN 5 | ||
| CAMERA | ||
| Parallel-META | ||
| EBI Metagenomics | ||
| METAREP | ||
| PHACCS | ||
| Standalone software | QIIME | |
| Mothur | ||
| JAguc | ||
| M-pick | ||
| OTUbase | ||
| CopyRighter | ||
| AbundantOTU | ||
| UniFrac | ||
| ESPRIT | ||
| Analysis pipelines | SILVA | |
| FunFrame | ||
| PANGEA | ||
| FastGroupII | ||
| CLOTU | ||
| Denoising | AmpliconNoise | |
| DADA | ||
| JATAC | ||
| UCHIME | ||
| Bellerophon | ||
| CANGS | ||
| Databases | SILVA | |
| Greengenes | ||
| Ribosomal Database Project (RDP) | ||
| Unite |
Figure 1Flowchart of basic metagenomics steps and tools currently in practice.
Notes: The analysis pipeline can take two different routes depending on the type of sequencing data (marker gene or shotgun metagenomics) available. The flowchart outlines the basic steps in the analysis pipeline starting with preprocessing of the data to the final extraction of results and concurrent storage and management of the data. Some popular tools that have been used extensively by the metagenomics community are shown for every step, as a well as the databases and algorithms in common practice.