Literature DB >> 34183047

ORFograph: search for novel insecticidal protein genes in genomic and metagenomic assembly graphs.

Tatiana Dvorkina1, Anton Bankevich2, Alexei Sorokin3, Fan Yang4,5, Boahemaa Adu-Oppong4,6, Ryan Williams4, Keith Turner4, Pavel A Pevzner7.   

Abstract

BACKGROUND: Since the prolonged use of insecticidal proteins has led to toxin resistance, it is important to search for novel insecticidal protein genes (IPGs) that are effective in controlling resistant insect populations. IPGs are usually encoded in the genomes of entomopathogenic bacteria, especially in large plasmids in strains of the ubiquitous soil bacteria, Bacillus thuringiensis (Bt). Since there are often multiple similar IPGs encoded by such plasmids, their assemblies are typically fragmented and many IPGs are scattered through multiple contigs. As a result, existing gene prediction tools (that analyze individual contigs) typically predict partial rather than complete IPGs, making it difficult to conduct downstream IPG engineering efforts in agricultural genomics.
METHODS: Although it is difficult to assemble IPGs in a single contig, the structure of the genome assembly graph often provides clues on how to combine multiple contigs into segments encoding a single IPG.
RESULTS: We describe ORFograph, a pipeline for predicting IPGs in assembly graphs, benchmark it on (meta)genomic datasets, and discover nearly a hundred novel IPGs. This work shows that graph-aware gene prediction tools enable the discovery of greater diversity of IPGs from (meta)genomes.
CONCLUSIONS: We demonstrated that analysis of the assembly graphs reveals novel candidate IPGs. ORFograph identified both already known genes "hidden" in assembly graphs and potential novel IPGs that evaded existing tools for IPG identification. As ORFograph is fast, one could imagine a pipeline that processes many (meta)genomic assembly graphs to identify even more novel IPGs for phenotypic testing than would previously be inaccessible by traditional gene-finding methods. While here we demonstrated the results of ORFograph only for IPGs, the proposed approach can be generalized to any class of genes. Video abstract.

Entities:  

Keywords:  Bacterial genomics; Bioinformatics; Bioinsecticides; Gene finding; Metagenomics

Year:  2021        PMID: 34183047     DOI: 10.1186/s40168-021-01092-z

Source DB:  PubMed          Journal:  Microbiome        ISSN: 2049-2618            Impact factor:   14.650


  45 in total

1.  SPAdes: a new genome assembly algorithm and its applications to single-cell sequencing.

Authors:  Anton Bankevich; Sergey Nurk; Dmitry Antipov; Alexey A Gurevich; Mikhail Dvorkin; Alexander S Kulikov; Valery M Lesin; Sergey I Nikolenko; Son Pham; Andrey D Prjibelski; Alexey V Pyshkin; Alexander V Sirotkin; Nikolay Vyahhi; Glenn Tesler; Max A Alekseyev; Pavel A Pevzner
Journal:  J Comput Biol       Date:  2012-04-16       Impact factor: 1.479

2.  Basic local alignment search tool.

Authors:  S F Altschul; W Gish; W Miller; E W Myers; D J Lipman
Journal:  J Mol Biol       Date:  1990-10-05       Impact factor: 5.469

3.  plasmidSPAdes: assembling plasmids from whole genome sequencing data.

Authors:  Dmitry Antipov; Nolan Hartwick; Max Shen; Mikhail Raiko; Alla Lapidus; Pavel A Pevzner
Journal:  Bioinformatics       Date:  2016-07-27       Impact factor: 6.937

4.  Comparative genomics of extrachromosomal elements in Bacillus thuringiensis subsp. israelensis.

Authors:  Alexandre Bolotin; Annika Gillis; Vincent Sanchis; Christina Nielsen-LeRoux; Jacques Mahillon; Didier Lereclus; Alexei Sorokin
Journal:  Res Microbiol       Date:  2016-11-01       Impact factor: 3.992

5.  New perspectives on Mega plasmid sequence (poh1) in Bacillus thuringiensis ATCC 10792 harbouring antimicrobial, insecticidal and antibiotic resistance genes.

Authors:  Ramachandran Chelliah; Shuai Wei; Byung-Jae Park; Joong-Hyun Park; Youn-Seo Park; Se-Hun Kim; Yong-Guo Jin; Deog-Hwan Oh
Journal:  Microb Pathog       Date:  2018-10-13       Impact factor: 3.738

6.  New strategy for identification of novel Cry-type genes from Bacillus thuringiensis strains.

Authors:  Corina M Berón; Leonardo Curatti; Graciela L Salerno
Journal:  Appl Environ Microbiol       Date:  2005-02       Impact factor: 4.792

7.  Prediction of insecticidal activity of Bacillus thuringiensis strains by polymerase chain reaction product profiles.

Authors:  N B Carozzi; V C Kramer; G W Warren; S Evola; M G Koziel
Journal:  Appl Environ Microbiol       Date:  1991-11       Impact factor: 4.792

8.  GeneMark: web software for gene finding in prokaryotes, eukaryotes and viruses.

Authors:  John Besemer; Mark Borodovsky
Journal:  Nucleic Acids Res       Date:  2005-07-01       Impact factor: 16.971

9.  Plasmid detection and assembly in genomic and metagenomic data sets.

Authors:  Dmitry Antipov; Mikhail Raiko; Alla Lapidus; Pavel A Pevzner
Journal:  Genome Res       Date:  2019-05-02       Impact factor: 9.043

10.  Geospatial Resolution of Human and Bacterial Diversity with City-Scale Metagenomics.

Authors:  Ebrahim Afshinnekoo; Cem Meydan; Shanin Chowdhury; Dyala Jaroudi; Collin Boyer; Nick Bernstein; Julia M Maritz; Darryl Reeves; Jorge Gandara; Sagar Chhangawala; Sofia Ahsanuddin; Amber Simmons; Timothy Nessel; Bharathi Sundaresh; Elizabeth Pereira; Ellen Jorgensen; Sergios-Orestis Kolokotronis; Nell Kirchberger; Isaac Garcia; David Gandara; Sean Dhanraj; Tanzina Nawrin; Yogesh Saletore; Noah Alexander; Priyanka Vijay; Elizabeth M Hénaff; Paul Zumbo; Michael Walsh; Gregory D O'Mullan; Scott Tighe; Joel T Dudley; Anya Dunaif; Sean Ennis; Eoghan O'Halloran; Tiago R Magalhaes; Braden Boone; Angela L Jones; Theodore R Muth; Katie Schneider Paolantonio; Elizabeth Alter; Eric E Schadt; Jeanne Garbarino; Robert J Prill; Jane M Carlton; Shawn Levy; Christopher E Mason
Journal:  Cell Syst       Date:  2015-03-03       Impact factor: 10.304

View more
  1 in total

1.  STRONG: metagenomics strain resolution on assembly graphs.

Authors:  Christopher Quince; Sergey Nurk; Sebastien Raguideau; Robert James; Orkun S Soyer; J Kimberly Summers; Antoine Limasset; A Murat Eren; Rayan Chikhi; Aaron E Darling
Journal:  Genome Biol       Date:  2021-07-26       Impact factor: 13.583

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.