Literature DB >> 29947757

BMC3C: binning metagenomic contigs using codon usage, sequence composition and read coverage.

Guoxian Yu1, Yuan Jiang1, Jun Wang1, Hao Zhang2, Haiwei Luo2.   

Abstract

Motivation: Metagenomics investigates the DNA sequences directly recovered from environmental samples. It often starts with reads assembly, which leads to contigs rather than more complete genomes. Therefore, contig binning methods are subsequently used to bin contigs into genome bins. While some clustering-based binning methods have been developed, they generally suffer from problems related to stability and robustness.
Results: We introduce BMC3C, an ensemble clustering-based method, to accurately and robustly bin contigs by making use of DNA sequence Composition, Coverage across multiple samples and Codon usage. BMC3C begins by searching the proper number of clusters and repeatedly applying the k-means clustering with different initializations to cluster contigs. Next, a weight graph with each node representing a contig is derived from these clusters. If two contigs are frequently grouped into the same cluster, the weight between them is high, and otherwise low. BMC3C finally employs a graph partitioning technique to partition the weight graph into subgraphs, each corresponding to a genome bin. We conduct experiments on both simulated and real-world datasets to evaluate BMC3C, and compare it with the state-of-the-art binning tools. We show that BMC3C has an improved performance compared to these tools. To our knowledge, this is the first time that the codon usage features and ensemble clustering are used in metagenomic contig binning. Availability and implementation: The codes of BMC3C are available at http://mlda.swu.edu.cn/codes.php?name=BMC3C. Supplementary information: Supplementary data are available at Bioinformatics online.

Entities:  

Mesh:

Substances:

Year:  2018        PMID: 29947757     DOI: 10.1093/bioinformatics/bty519

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  12 in total

1.  SolidBin: improving metagenome binning with semi-supervised normalized cut.

Authors:  Ziye Wang; Zhengyang Wang; Yang Young Lu; Fengzhu Sun; Shanfeng Zhu
Journal:  Bioinformatics       Date:  2019-11-01       Impact factor: 6.937

2.  Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.

Authors:  Yi Yue; Hao Huang; Zhao Qi; Hui-Min Dou; Xin-Yi Liu; Tian-Fei Han; Yue Chen; Xiang-Jun Song; You-Hua Zhang; Jian Tu
Journal:  BMC Bioinformatics       Date:  2020-07-28       Impact factor: 3.169

3.  Binning Metagenomic Contigs Using Unsupervised Clustering and Reference Databases.

Authors:  Zhongjun Jiang; Xiaobo Li; Lijun Guo
Journal:  Interdiscip Sci       Date:  2022-05-31       Impact factor: 3.492

4.  Binning long reads in metagenomics datasets using composition and coverage information.

Authors:  Anuradha Wickramarachchi; Yu Lin
Journal:  Algorithms Mol Biol       Date:  2022-07-11       Impact factor: 1.721

Review 5.  Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses.

Authors:  Ana Elena Pérez-Cobas; Laura Gomez-Valero; Carmen Buchrieser
Journal:  Microb Genom       Date:  2020-07-24

6.  Combining Genetic Mutation and Expression Profiles Identifies Novel Prognostic Biomarkers of Lung Adenocarcinoma.

Authors:  Yun Liu; Fu Liu; Xintong Hu; Jiaxue He; Yanfang Jiang
Journal:  Clin Med Insights Oncol       Date:  2020-10-28

Review 7.  A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data.

Authors:  Chao Yang; Debajyoti Chowdhury; Zhenmiao Zhang; William K Cheung; Aiping Lu; Zhaoxiang Bian; Lu Zhang
Journal:  Comput Struct Biotechnol J       Date:  2021-11-23       Impact factor: 7.271

8.  METAMVGL: a multi-view graph-based metagenomic contig binning algorithm by integrating assembly and paired-end graphs.

Authors:  Zhenmiao Zhang; Lu Zhang
Journal:  BMC Bioinformatics       Date:  2021-07-22       Impact factor: 3.169

9.  MetaBCC-LR: metagenomics binning by coverage and composition for long reads.

Authors:  Anuradha Wickramarachchi; Vijini Mallawaarachchi; Vaibhav Rajan; Yu Lin
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

10.  MetaEuk-sensitive, high-throughput gene discovery, and annotation for large-scale eukaryotic metagenomics.

Authors:  Eli Levy Karin; Milot Mirdita; Johannes Söding
Journal:  Microbiome       Date:  2020-04-03       Impact factor: 14.650

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.