Literature DB >> 30977806

SolidBin: improving metagenome binning with semi-supervised normalized cut.

Ziye Wang1,2,3, Zhengyang Wang2, Yang Young Lu4, Fengzhu Sun1,3,4, Shanfeng Zhu2,3,5.   

Abstract

MOTIVATION: Metagenomic contig binning is an important computational problem in metagenomic research, which aims to cluster contigs from the same genome into the same group. Unlike classical clustering problem, contig binning can utilize known relationships among some of the contigs or the taxonomic identity of some contigs. However, the current state-of-the-art contig binning methods do not make full use of the additional biological information except the coverage and sequence composition of the contigs.
RESULTS: We developed a novel contig binning method, Semi-supervised Spectral Normalized Cut for Binning (SolidBin), based on semi-supervised spectral clustering. Using sequence feature similarity and/or additional biological information, such as the reliable taxonomy assignments of some contigs, SolidBin constructs two types of prior information: must-link and cannot-link constraints. Must-link constraints mean that the pair of contigs should be clustered into the same group, while cannot-link constraints mean that the pair of contigs should be clustered in different groups. These constraints are then integrated into a classical spectral clustering approach, normalized cut, for improved contig binning. The performance of SolidBin is compared with five state-of-the-art genome binners, CONCOCT, COCACOLA, MaxBin, MetaBAT and BMC3C on five next-generation sequencing benchmark datasets including simulated multi- and single-sample datasets and real multi-sample datasets. The experimental results show that, SolidBin has achieved the best performance in terms of F-score, Adjusted Rand Index and Normalized Mutual Information, especially while using the real datasets and the single-sample dataset.
AVAILABILITY AND IMPLEMENTATION: https://github.com/sufforest/SolidBin. SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author(s) 2019. Published by Oxford University Press. All rights reserved. For permissions, please e-mail: journals.permissions@oup.com.

Entities:  

Mesh:

Year:  2019        PMID: 30977806      PMCID: PMC6821242          DOI: 10.1093/bioinformatics/btz253

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  28 in total

1.  Efficient Semisupervised MEDLINE Document Clustering With MeSH-Semantic and Global-Content Constraints.

Authors:  Jun Gu; Wei Feng; Jia Zeng; Hiroshi Mamitsuka; Shanfeng Zhu
Journal:  IEEE Trans Cybern       Date:  2013-08       Impact factor: 11.448

2.  MaxBin 2.0: an automated binning algorithm to recover genomes from multiple metagenomic datasets.

Authors:  Yu-Wei Wu; Blake A Simmons; Steven W Singer
Journal:  Bioinformatics       Date:  2015-10-29       Impact factor: 6.937

3.  COCACOLA: binning metagenomic contigs using sequence COmposition, read CoverAge, CO-alignment and paired-end read LinkAge.

Authors:  Yang Young Lu; Ting Chen; Jed A Fuhrman; Fengzhu Sun
Journal:  Bioinformatics       Date:  2017-03-15       Impact factor: 6.937

4.  Binning_refiner: improving genome bins through the combination of different binning programs.

Authors:  Wei-Zhi Song; Torsten Thomas
Journal:  Bioinformatics       Date:  2017-06-15       Impact factor: 6.937

5.  Binning metagenomic contigs by coverage and composition.

Authors:  Johannes Alneberg; Brynjar Smári Bjarnason; Ino de Bruijn; Melanie Schirmer; Joshua Quick; Umer Z Ijaz; Leo Lahti; Nicholas J Loman; Anders F Andersson; Christopher Quince
Journal:  Nat Methods       Date:  2014-09-14       Impact factor: 28.547

6.  Structure, function and diversity of the healthy human microbiome.

Authors: 
Journal:  Nature       Date:  2012-06-13       Impact factor: 49.962

7.  Metagenomic binning and association of plasmids with bacterial host genomes using DNA methylation.

Authors:  John Beaulaurier; Shijia Zhu; Gintaras Deikus; Ilaria Mogno; Xue-Song Zhang; Austin Davis-Richardson; Ronald Canepa; Eric W Triplett; Jeremiah J Faith; Robert Sebra; Eric E Schadt; Gang Fang
Journal:  Nat Biotechnol       Date:  2017-12-11       Impact factor: 54.908

8.  Salt-responsive gut commensal modulates TH17 axis and disease.

Authors:  Nicola Wilck; Mariana G Matus; Sean M Kearney; Scott W Olesen; Kristoffer Forslund; Hendrik Bartolomaeus; Stefanie Haase; Anja Mähler; András Balogh; Lajos Markó; Olga Vvedenskaya; Friedrich H Kleiner; Dmitry Tsvetkov; Lars Klug; Paul I Costea; Shinichi Sunagawa; Lisa Maier; Natalia Rakova; Valentin Schatz; Patrick Neubert; Christian Frätzer; Alexander Krannich; Maik Gollasch; Diana A Grohme; Beatriz F Côrte-Real; Roman G Gerlach; Marijana Basic; Athanasios Typas; Chuan Wu; Jens M Titze; Jonathan Jantsch; Michael Boschmann; Ralf Dechend; Markus Kleinewietfeld; Stefan Kempa; Peer Bork; Ralf A Linker; Eric J Alm; Dominik N Müller
Journal:  Nature       Date:  2017-11-15       Impact factor: 49.962

9.  Recovery of genomes from metagenomes via a dereplication, aggregation and scoring strategy.

Authors:  Christian M K Sieber; Alexander J Probst; Allison Sharrar; Brian C Thomas; Matthias Hess; Susannah G Tringe; Jillian F Banfield
Journal:  Nat Microbiol       Date:  2018-05-28       Impact factor: 17.745

10.  Host-microbe interactions have shaped the genetic architecture of inflammatory bowel disease.

Authors:  Luke Jostins; Stephan Ripke; Rinse K Weersma; Richard H Duerr; Dermot P McGovern; Ken Y Hui; James C Lee; L Philip Schumm; Yashoda Sharma; Carl A Anderson; Jonah Essers; Mitja Mitrovic; Kaida Ning; Isabelle Cleynen; Emilie Theatre; Sarah L Spain; Soumya Raychaudhuri; Philippe Goyette; Zhi Wei; Clara Abraham; Jean-Paul Achkar; Tariq Ahmad; Leila Amininejad; Ashwin N Ananthakrishnan; Vibeke Andersen; Jane M Andrews; Leonard Baidoo; Tobias Balschun; Peter A Bampton; Alain Bitton; Gabrielle Boucher; Stephan Brand; Carsten Büning; Ariella Cohain; Sven Cichon; Mauro D'Amato; Dirk De Jong; Kathy L Devaney; Marla Dubinsky; Cathryn Edwards; David Ellinghaus; Lynnette R Ferguson; Denis Franchimont; Karin Fransen; Richard Gearry; Michel Georges; Christian Gieger; Jürgen Glas; Talin Haritunians; Ailsa Hart; Chris Hawkey; Matija Hedl; Xinli Hu; Tom H Karlsen; Limas Kupcinskas; Subra Kugathasan; Anna Latiano; Debby Laukens; Ian C Lawrance; Charlie W Lees; Edouard Louis; Gillian Mahy; John Mansfield; Angharad R Morgan; Craig Mowat; William Newman; Orazio Palmieri; Cyriel Y Ponsioen; Uros Potocnik; Natalie J Prescott; Miguel Regueiro; Jerome I Rotter; Richard K Russell; Jeremy D Sanderson; Miquel Sans; Jack Satsangi; Stefan Schreiber; Lisa A Simms; Jurgita Sventoraityte; Stephan R Targan; Kent D Taylor; Mark Tremelling; Hein W Verspaget; Martine De Vos; Cisca Wijmenga; David C Wilson; Juliane Winkelmann; Ramnik J Xavier; Sebastian Zeissig; Bin Zhang; Clarence K Zhang; Hongyu Zhao; Mark S Silverberg; Vito Annese; Hakon Hakonarson; Steven R Brant; Graham Radford-Smith; Christopher G Mathew; John D Rioux; Eric E Schadt; Mark J Daly; Andre Franke; Miles Parkes; Severine Vermeire; Jeffrey C Barrett; Judy H Cho
Journal:  Nature       Date:  2012-11-01       Impact factor: 49.962

View more
  9 in total

Review 1.  Application of computational approaches to analyze metagenomic data.

Authors:  Ho-Jin Gwak; Seung Jae Lee; Mina Rho
Journal:  J Microbiol       Date:  2021-02-10       Impact factor: 3.422

2.  Evaluating metagenomics tools for genome binning with real metagenomic datasets and CAMI datasets.

Authors:  Yi Yue; Hao Huang; Zhao Qi; Hui-Min Dou; Xin-Yi Liu; Tian-Fei Han; Yue Chen; Xiang-Jun Song; You-Hua Zhang; Jian Tu
Journal:  BMC Bioinformatics       Date:  2020-07-28       Impact factor: 3.169

3.  Binning long reads in metagenomics datasets using composition and coverage information.

Authors:  Anuradha Wickramarachchi; Yu Lin
Journal:  Algorithms Mol Biol       Date:  2022-07-11       Impact factor: 1.721

4.  vRhyme enables binning of viral genomes from metagenomes.

Authors:  Kristopher Kieft; Alyssa Adams; Rauf Salamzade; Lindsay Kalan; Karthik Anantharaman
Journal:  Nucleic Acids Res       Date:  2022-08-12       Impact factor: 19.160

Review 5.  Metagenomic approaches in microbial ecology: an update on whole-genome and marker gene sequencing analyses.

Authors:  Ana Elena Pérez-Cobas; Laura Gomez-Valero; Carmen Buchrieser
Journal:  Microb Genom       Date:  2020-07-24

Review 6.  A review of computational tools for generating metagenome-assembled genomes from metagenomic sequencing data.

Authors:  Chao Yang; Debajyoti Chowdhury; Zhenmiao Zhang; William K Cheung; Aiping Lu; Zhaoxiang Bian; Lu Zhang
Journal:  Comput Struct Biotechnol J       Date:  2021-11-23       Impact factor: 7.271

7.  A deep siamese neural network improves metagenome-assembled genomes in microbiome datasets across different environments.

Authors:  Shaojun Pan; Chengkai Zhu; Xing-Ming Zhao; Luis Pedro Coelho
Journal:  Nat Commun       Date:  2022-04-28       Impact factor: 17.694

8.  MAGNETO: An Automated Workflow for Genome-Resolved Metagenomics.

Authors:  Benjamin Churcheward; Maxime Millet; Audrey Bihouée; Guillaume Fertin; Samuel Chaffron
Journal:  mSystems       Date:  2022-06-15       Impact factor: 7.324

9.  MetaBCC-LR: metagenomics binning by coverage and composition for long reads.

Authors:  Anuradha Wickramarachchi; Vijini Mallawaarachchi; Vaibhav Rajan; Yu Lin
Journal:  Bioinformatics       Date:  2020-07-01       Impact factor: 6.937

  9 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.