| Literature DB >> 31696235 |
Alex L Mitchell1, Alexandre Almeida1,2, Martin Beracochea1, Miguel Boland1, Josephine Burgin1, Guy Cochrane1, Michael R Crusoe3, Varsha Kale1, Simon C Potter1, Lorna J Richardson1, Ekaterina Sakharova1, Maxim Scheremetjew1, Anton Korobeynikov4, Alex Shlemov4, Olga Kunyavskaya4, Alla Lapidus4, Robert D Finn1.
Abstract
MGnify (http://www.ebi.ac.uk/metagenomics) provides a free to use platform for the assembly, analysis and archiving of microbiome data derived from sequencing microbial populations that are present in particular environments. Over the past 2 years, MGnify (formerly EBI Metagenomics) has more than doubled the number of publicly available analysed datasets held within the resource. Recently, an updated approach to data analysis has been unveiled (version 5.0), replacing the previous single pipeline with multiple analysis pipelines that are tailored according to the input data, and that are formally described using the Common Workflow Language, enabling greater provenance, reusability, and reproducibility. MGnify's new analysis pipelines offer additional approaches for taxonomic assertions based on ribosomal internal transcribed spacer regions (ITS1/2) and expanded protein functional annotations. Biochemical pathways and systems predictions have also been added for assembled contigs. MGnify's growing focus on the assembly of metagenomic data has also seen the number of datasets it has assembled and analysed increase six-fold. The non-redundant protein database constructed from the proteins encoded by these assemblies now exceeds 1 billion sequences. Meanwhile, a newly developed contig viewer provides fine-grained visualisation of the assembled contigs and their enriched annotations.Entities:
Year: 2020 PMID: 31696235 PMCID: PMC7145632 DOI: 10.1093/nar/gkz1035
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Krona plots showing ITS-based taxonomic analysis results for sequencing run ERR2237853 from agricultural soil. Panel A shows the analysis results obtained using the UNITE database and B shows those produced with ITSoneDB. These plots are integrated into the taxonomy tab within the analysis results section, https://www.ebi.ac.uk/metagenomics/analyses/MGYA00383253#taxonomic.
Figure 2.Screenshot showing the percentage completion for a series of KEGG pathways based on the presence of KEGG orthologues, https://www.ebi.ac.uk/metagenomics/analyses/MGYA00383254#path-systems.
Figure 3.The MGnify contig viewer allows visualization of functional annotation of contigs, https://www.ebi.ac.uk/metagenomics/analyses/MGYA00383254#contigs-viewer. In this particular view, the contig ERZ477576.854-NODE-854-length-9986-cov-4.0004:1-9986 has been selected and the proteins coloured according to the COG category, with the four cytochrome c subunits highlighted.