Vladimir I Ulyantsev1, Sergey V Kazakov1, Veronika B Dubinkina2, Alexander V Tyakht2, Dmitry G Alexeev3. 1. ITMO University, Saint Petersburg, Russian Federation. 2. Federal Research and Clinical Centre of Physical-Chemical Medicine, Moscow, Russian Federation Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russian Federation. 3. Moscow Institute of Physics and Technology (State University), Dolgoprudny, Russian Federation.
Abstract
MOTIVATION: High-throughput metagenomic sequencing has revolutionized our view on the structure and metabolic potential of microbial communities. However, analysis of metagenomic composition is often complicated by the high complexity of the community and the lack of related reference genomic sequences. As a start point for comparative metagenomic analysis, the researchers require efficient means for assessing pairwise similarity of the metagenomes (beta-diversity). A number of approaches were used to address this task, however, most of them have inherent disadvantages that limit their scope of applicability. For instance, the reference-based methods poorly perform on metagenomes from previously unstudied niches, while composition-based methods appear to be too abstract for straightforward interpretation and do not allow to identify the differentially abundant features. RESULTS: We developed MetaFast, an approach that allows to represent a shotgun metagenome from an arbitrary environment as a modified de Bruijn graph consisting of simplified components. For multiple metagenomes, the resulting representation is used to obtain a pairwise similarity matrix. The dimensional structure of the metagenomic components preserved in our algorithm reflects the inherent subspecies-level diversity of microbiota. The method is computationally efficient and especially promising for an analysis of metagenomes from novel environmental niches. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available for download at https://github.com/ctlab/metafast The code is written in Java and is platform independent (tested on Linux and Windows x86_64). CONTACT: ulyantsev@rain.ifmo.ru SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
MOTIVATION: High-throughput metagenomic sequencing has revolutionized our view on the structure and metabolic potential of microbial communities. However, analysis of metagenomic composition is often complicated by the high complexity of the community and the lack of related reference genomic sequences. As a start point for comparative metagenomic analysis, the researchers require efficient means for assessing pairwise similarity of the metagenomes (beta-diversity). A number of approaches were used to address this task, however, most of them have inherent disadvantages that limit their scope of applicability. For instance, the reference-based methods poorly perform on metagenomes from previously unstudied niches, while composition-based methods appear to be too abstract for straightforward interpretation and do not allow to identify the differentially abundant features. RESULTS: We developed MetaFast, an approach that allows to represent a shotgun metagenome from an arbitrary environment as a modified de Bruijn graph consisting of simplified components. For multiple metagenomes, the resulting representation is used to obtain a pairwise similarity matrix. The dimensional structure of the metagenomic components preserved in our algorithm reflects the inherent subspecies-level diversity of microbiota. The method is computationally efficient and especially promising for an analysis of metagenomes from novel environmental niches. AVAILABILITY AND IMPLEMENTATION: Source code and binaries are freely available for download at https://github.com/ctlab/metafast The code is written in Java and is platform independent (tested on Linux and Windows x86_64). CONTACT: ulyantsev@rain.ifmo.ru SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
Authors: Bilal Wajid; Faria Anwar; Imran Wajid; Haseeb Nisar; Sharoze Meraj; Ali Zafar; Mustafa Kamal Al-Shawaqfeh; Ali Riza Ekti; Asia Khatoon; Jan S Suchodolski Journal: Funct Integr Genomics Date: 2021-10-18 Impact factor: 3.410
Authors: Advait Balaji; Nicolae Sapoval; Charlie Seto; R A Leo Elworth; Yilei Fu; Michael G Nute; Tor Savidge; Santiago Segarra; Todd J Treangen Journal: Comput Struct Biotechnol J Date: 2022-06-17 Impact factor: 6.155
Authors: Illyoung Choi; Alise J Ponsero; Matthew Bomhoff; Ken Youens-Clark; John H Hartman; Bonnie L Hurwitz Journal: Gigascience Date: 2019-02-01 Impact factor: 6.524
Authors: Noriko A Cassman; Késia S Lourenço; Janaína B do Carmo; Heitor Cantarella; Eiko E Kuramae Journal: Biotechnol Biofuels Date: 2018-02-22 Impact factor: 6.040
Authors: Veronika B Dubinkina; Alexander V Tyakht; Vera Y Odintsova; Konstantin S Yarygin; Boris A Kovarsky; Alexander V Pavlenko; Dmitry S Ischenko; Anna S Popenko; Dmitry G Alexeev; Anastasiya Y Taraskina; Regina F Nasyrova; Evgeny M Krupitsky; Nino V Shalikiani; Igor G Bakulin; Petr L Shcherbakov; Lyubov O Skorodumova; Andrei K Larin; Elena S Kostryukova; Rustam A Abdulkhakov; Sayar R Abdulkhakov; Sergey Y Malanin; Ruzilya K Ismagilova; Tatiana V Grigoryeva; Elena N Ilina; Vadim M Govorun Journal: Microbiome Date: 2017-10-17 Impact factor: 14.650
Authors: Maria José de la Cruz Peña; Luis Ignacio Gonzalez-Granado; Inmaculada Garcia-Heredia; Lucia Maestre Carballa; Manuel Martinez-Garcia Journal: Sci Rep Date: 2021-07-21 Impact factor: 4.379