Literature DB >> 33290720

Automated Prediction and Annotation of Small Open Reading Frames in Microbial Genomes.

Matthew G Durrant1, Ami S Bhatt2.   

Abstract

Small open reading frames (smORFs) and their encoded microproteins play central roles in microbes. However, there is a vast unexplored space of smORFs within human-associated microbes. A recent bioinformatic analysis used evolutionary conservation signals to enhance prediction of small protein families. To facilitate the annotation of specific smORFs, we introduce SmORFinder. This tool combines profile hidden Markov models of each smORF family and deep learning models that better generalize to smORF families not seen in the training set, resulting in predictions enriched for Ribo-seq translation signals. Feature importance analysis reveals that the deep learning models learn to identify Shine-Dalgarno sequences, deprioritize the wobble position in each codon, and group codon synonyms found in the codon table. A core-genome analysis of 26 bacterial species identifies several core smORFs of unknown function. We pre-compute smORF annotations for thousands of RefSeq isolate genomes and Human Microbiome Project metagenomes and provide these data through a public web portal.
Copyright © 2020 Elsevier Inc. All rights reserved.

Entities:  

Keywords:  deep learning; genome annotation; machine learning; microbiome; microproteins; small open reading frames

Mesh:

Substances:

Year:  2020        PMID: 33290720      PMCID: PMC7856248          DOI: 10.1016/j.chom.2020.11.002

Source DB:  PubMed          Journal:  Cell Host Microbe        ISSN: 1931-3128            Impact factor:   21.023


  38 in total

Review 1.  Profile hidden Markov models.

Authors:  S R Eddy
Journal:  Bioinformatics       Date:  1998       Impact factor: 6.937

Review 2.  Small proteins can no longer be ignored.

Authors:  Gisela Storz; Yuri I Wolf; Kumaran S Ramamurthi
Journal:  Annu Rev Biochem       Date:  2014-03-03       Impact factor: 23.643

3.  CDD/SPARCLE: the conserved domain database in 2020.

Authors:  Shennan Lu; Jiyao Wang; Farideh Chitsaz; Myra K Derbyshire; Renata C Geer; Noreen R Gonzales; Marc Gwadz; David I Hurwitz; Gabriele H Marchler; James S Song; Narmada Thanki; Roxanne A Yamashita; Mingzhang Yang; Dachuan Zhang; Chanjuan Zheng; Christopher J Lanczycki; Aron Marchler-Bauer
Journal:  Nucleic Acids Res       Date:  2020-01-08       Impact factor: 16.971

4.  Dual function of the McaS small RNA in controlling biofilm formation.

Authors:  Mikkel Girke Jørgensen; Maureen K Thomason; Johannes Havelund; Poul Valentin-Hansen; Gisela Storz
Journal:  Genes Dev       Date:  2013-05-10       Impact factor: 11.361

5.  Dual toxic-peptide-coding Staphylococcus aureus RNA under antisense regulation targets host cells and bacterial rivals unequally.

Authors:  Marie-Laure Pinel-Marie; Régine Brielle; Brice Felden
Journal:  Cell Rep       Date:  2014-04-03       Impact factor: 9.423

Review 6.  A primer on deep learning in genomics.

Authors:  James Zou; Mikael Huss; Abubakar Abid; Pejman Mohammadi; Ali Torkamani; Amalio Telenti
Journal:  Nat Genet       Date:  2018-11-26       Impact factor: 38.330

7.  MetaRibo-Seq measures translation in microbiomes.

Authors:  Brayon J Fremin; Hila Sberro; Ami S Bhatt
Journal:  Nat Commun       Date:  2020-06-29       Impact factor: 14.919

8.  MOXI Is a Mitochondrial Micropeptide That Enhances Fatty Acid β-Oxidation.

Authors:  Catherine A Makarewich; Kedryn K Baskin; Amir Z Munir; Svetlana Bezprozvannaya; Gaurav Sharma; Chalermchai Khemtong; Akansha M Shah; John R McAnally; Craig R Malloy; Luke I Szweda; Rhonda Bassel-Duby; Eric N Olson
Journal:  Cell Rep       Date:  2018-06-26       Impact factor: 9.423

9.  Analysis of effects of tRNA:message stability on frameshift frequency at the Escherichia coli RF2 programmed frameshift site.

Authors:  J F Curran
Journal:  Nucleic Acids Res       Date:  1993-04-25       Impact factor: 16.971

10.  Identifying Small Proteins by Ribosome Profiling with Stalled Initiation Complexes.

Authors:  Jeremy Weaver; Fuad Mohammad; Allen R Buskirk; Gisela Storz
Journal:  mBio       Date:  2019-03-05       Impact factor: 7.867

View more
  6 in total

1.  RiboReport - benchmarking tools for ribosome profiling-based identification of open reading frames in bacteria.

Authors:  Rick Gelhausen; Teresa Müller; Sarah L Svensson; Omer S Alkhnbashi; Cynthia M Sharma; Florian Eggenhofer; Rolf Backofen
Journal:  Brief Bioinform       Date:  2022-03-10       Impact factor: 11.622

2.  Thousands of small, novel genes predicted in global phage genomes.

Authors:  Brayon J Fremin; Ami S Bhatt; Nikos C Kyrpides
Journal:  Cell Rep       Date:  2022-06-21       Impact factor: 9.995

Review 3.  Simultaneous ribosome profiling of hundreds of microbes from the human microbiome.

Authors:  Brayon J Fremin; Cosmos Nicolaou; Ami S Bhatt
Journal:  Nat Protoc       Date:  2021-08-11       Impact factor: 13.491

4.  Comparative genomics identifies thousands of candidate structured RNAs in human microbiomes.

Authors:  Brayon J Fremin; Ami S Bhatt
Journal:  Genome Biol       Date:  2021-04-12       Impact factor: 13.583

5.  Engineering, decoding and systems-level characterization of chimpanzee cytomegalovirus.

Authors:  Quang Vinh Phan; Boris Bogdanow; Emanuel Wyler; Markus Landthaler; Fan Liu; Christian Hagemeier; Lüder Wiebusch
Journal:  PLoS Pathog       Date:  2022-01-04       Impact factor: 6.823

6.  Bakta: rapid and standardized annotation of bacterial genomes via alignment-free sequence identification.

Authors:  Oliver Schwengers; Lukas Jelonek; Marius Alfred Dieckmann; Sebastian Beyvers; Jochen Blom; Alexander Goesmann
Journal:  Microb Genom       Date:  2021-11
  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.