Nicolas Terrapon1, Vincent Lombard2, Harry J Gilbert2, Bernard Henrissat1. 1. Centre National de la Recherche Scientifique, CNRS UMR 7257, 13288 Marseille, France, Aix-Marseille Université, AFMB, 13288 Marseille, France, Institute for Cell and Molecular Biosciences, The Medical School, Newcastle University, Newcastle upon Tyne NE2 4HH, UK and Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia Centre National de la Recherche Scientifique, CNRS UMR 7257, 13288 Marseille, France, Aix-Marseille Université, AFMB, 13288 Marseille, France, Institute for Cell and Molecular Biosciences, The Medical School, Newcastle University, Newcastle upon Tyne NE2 4HH, UK and Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia. 2. Centre National de la Recherche Scientifique, CNRS UMR 7257, 13288 Marseille, France, Aix-Marseille Université, AFMB, 13288 Marseille, France, Institute for Cell and Molecular Biosciences, The Medical School, Newcastle University, Newcastle upon Tyne NE2 4HH, UK and Department of Biological Sciences, Faculty of Science, King Abdulaziz University, Jeddah, Saudi Arabia.
Abstract
MOTIVATION: A bacterial polysaccharide utilization locus (PUL) is a set of physically linked genes that orchestrate the breakdown of a specific glycan. PULs are prevalent in the Bacteroidetes phylum and are key to the digestion of complex carbohydrates, notably by the human gut microbiota. A given Bacteroidetes genome can encode dozens of different PULs whose boundaries and precise gene content are difficult to predict. RESULTS: Here, we present a fully automated approach for PUL prediction using genomic context and domain annotation alone. By combining the detection of a pair of marker genes with operon prediction using intergenic distances, and queries to the carbohydrate-active enzymes database (www.cazy.org), our predictor achieved above 86% accuracy in two Bacteroides species with extensive experimental PUL characterization. AVAILABILITY AND IMPLEMENTATION: PUL predictions in 67 Bacteroidetes genomes from the human gut microbiota and two additional species, from the canine oral sphere and from the environment, are presented in our database accessible at www.cazy.org/PULDB/index.php.
MOTIVATION: A bacterial polysaccharide utilization locus (PUL) is a set of physically linked genes that orchestrate the breakdown of a specific glycan. PULs are prevalent in the Bacteroidetes phylum and are key to the digestion of complex carbohydrates, notably by the human gut microbiota. A given Bacteroidetes genome can encode dozens of different PULs whose boundaries and precise gene content are difficult to predict. RESULTS: Here, we present a fully automated approach for PUL prediction using genomic context and domain annotation alone. By combining the detection of a pair of marker genes with operon prediction using intergenic distances, and queries to the carbohydrate-active enzymes database (www.cazy.org), our predictor achieved above 86% accuracy in two Bacteroides species with extensive experimental PUL characterization. AVAILABILITY AND IMPLEMENTATION: PUL predictions in 67 Bacteroidetes genomes from the human gut microbiota and two additional species, from the canine oral sphere and from the environment, are presented in our database accessible at www.cazy.org/PULDB/index.php.
Authors: Didier Ndeh; Jose Munoz Munoz; Alan Cartmell; David Bulmer; Corinne Wills; Bernard Henrissat; Joseph Gray Journal: J Biol Chem Date: 2018-09-27 Impact factor: 5.157
Authors: Michael S Little; Samuel J Pellock; William G Walton; Ashutosh Tripathy; Matthew R Redinbo Journal: Proc Natl Acad Sci U S A Date: 2017-12-21 Impact factor: 11.205
Authors: Richard McLean; Joanne K Hobbs; Michael D Suits; Sami T Tuomivaara; Darryl R Jones; Alisdair B Boraston; D Wade Abbott Journal: J Biol Chem Date: 2015-07-09 Impact factor: 5.157
Authors: Olov Svartström; Johannes Alneberg; Nicolas Terrapon; Vincent Lombard; Ino de Bruijn; Jonas Malmsten; Ann-Marie Dalin; Emilie El Muller; Pranjul Shah; Paul Wilmes; Bernard Henrissat; Henrik Aspeborg; Anders F Andersson Journal: ISME J Date: 2017-07-21 Impact factor: 10.302
Authors: Alan Cartmell; Elisabeth C Lowe; Arnaud Baslé; Susan J Firbank; Didier A Ndeh; Heath Murray; Nicolas Terrapon; Vincent Lombard; Bernard Henrissat; Jeremy E Turnbull; Mirjam Czjzek; Harry J Gilbert; David N Bolam Journal: Proc Natl Acad Sci U S A Date: 2017-06-19 Impact factor: 11.205