Literature DB >> 27737786

The combinatorics of overlapping genes.

Sophie Lèbre1, Olivier Gascuel2.   

Abstract

Overlapping genes exist in all domains of life and are much more abundant than expected upon their first discovery in the late 1970s. Assuming that the reference gene is read in frame +0, an overlapping gene can be encoded in two reading frames in the sense strand, denoted by +1 and +2, and in three reading frames in the opposite strand, denoted by -0, -1, and -2. This motivated numerous researchers to study the constraints induced by the genetic code on the various overlapping frames, mostly based on information theory. Our focus in this paper is on the constraints induced on two overlapping genes in terms of amino acids, as well as polypeptides. We show that simple linear constraints bind the amino-acid composition of two proteins encoded by overlapping genes. Novel constraints are revealed when polypeptides are considered, and not just single amino acids. For example, in double-coding sequences with an overlapping reading frame -2, each Tyrosine (denoted as Tyr or Y) in the overlapping frame overlaps a Tyrosine in the reference frame +0 (and reciprocally), whereas specific words (e.g. YY) never occur. We thus distinguish between null constraints (YY = 0 in frame -2) and non-null constraints (Y in frame +0 ⇔ Y in frame -2). Our equivalence-based constraints are symmetrical and thus enable the characterization of the joint composition of overlapping proteins. We describe several formal frameworks and a graph algorithm to characterize and compute these constraints. As expected, the degrees of freedom left by these constraints vary drastically among the different overlapping frames. Interestingly, the biological meaning of constraints induced on two overlapping proteins (hydropathy, forbidden di-peptides, expected overlap length …) is also specific to the reading frame. We study the combinatorics of these constraints for overlapping polypeptides of length n, pointing out that, (i) except for frame -2, non-null constraints are deduced from the amino-acid (length = 1) constraints and (ii) null constraints are deduced from the di-peptide (length = 2) constraints. These results yield support for understanding the mechanisms and evolution of overlapping genes, and for developing novel overlapping gene detection methods.
Copyright © 2016 Elsevier Ltd. All rights reserved.

Entities:  

Keywords:  Amino-acid composition; Codon usage; Double-coding sequences; Genetic code; Linear and logical constraints; Overlapping gene detection; Stop codons

Mesh:

Substances:

Year:  2016        PMID: 27737786     DOI: 10.1016/j.jtbi.2016.09.018

Source DB:  PubMed          Journal:  J Theor Biol        ISSN: 0022-5193            Impact factor:   2.691


  11 in total

1.  Combining genetic algorithm with machine learning strategies for designing potent antimicrobial peptides.

Authors:  Kyle Boone; Cate Wisdom; Kyle Camarda; Paulette Spencer; Candan Tamerler
Journal:  BMC Bioinformatics       Date:  2021-05-11       Impact factor: 3.169

2.  A novel short L-arginine responsive protein-coding gene (laoB) antiparallel overlapping to a CadC-like transcriptional regulator in Escherichia coli O157:H7 Sakai originated by overprinting.

Authors:  Sarah M Hücker; Sonja Vanderhaeghen; Isabel Abellan-Schneyder; Romy Wecko; Svenja Simon; Siegfried Scherer; Klaus Neuhaus
Journal:  BMC Evol Biol       Date:  2018-02-12       Impact factor: 3.260

3.  A Simple Method to Detect Candidate Overlapping Genes in Viruses Using Single Genome Sequences.

Authors:  Timothy E Schlub; Jan P Buchmann; Edward C Holmes
Journal:  Mol Biol Evol       Date:  2018-10-01       Impact factor: 16.240

4.  Molecular dissection of the replication system of plasmid pIGRK encoding two in-frame Rep proteins with antagonistic functions.

Authors:  Paweł Wawrzyniak; Agnieszka Sobolewska-Ruta; Piotr Zaleski; Natalia Łukasiewicz; Paulina Kabaj; Piotr Kierył; Agata Gościk; Anna Bierczyńska-Krzysik; Piotr Baran; Anna Mazurkiewicz-Pisarek; Andrzej Płucienniczak; Dariusz Bartosik
Journal:  BMC Microbiol       Date:  2019-11-13       Impact factor: 3.605

5.  Biological factors in the synthetic construction of overlapping genes.

Authors:  Stefan Wichmann; Siegfried Scherer; Zachary Ardern
Journal:  BMC Genomics       Date:  2021-12-11       Impact factor: 3.969

6.  Engineering gene overlaps to sustain genetic constructs in vivo.

Authors:  Antoine L Decrulle; Antoine Frénoy; Thomas A Meiller-Legrand; Aude Bernheim; Chantal Lotton; Arnaud Gutierrez; Ariel B Lindner
Journal:  PLoS Comput Biol       Date:  2021-10-08       Impact factor: 4.475

7.  Spotlight on alternative frame coding: Two long overlapping genes in Pseudomonas aeruginosa are translated and under purifying selection.

Authors:  Michaela Kreitmeier; Zachary Ardern; Miriam Abele; Christina Ludwig; Siegfried Scherer; Klaus Neuhaus
Journal:  iScience       Date:  2022-02-01

8.  Origin, Evolution and Stability of Overlapping Genes in Viruses: A Systematic Review.

Authors:  Angelo Pavesi
Journal:  Genes (Basel)       Date:  2021-05-26       Impact factor: 4.096

9.  Computational design of fully overlapping coding schemes for protein pairs and triplets.

Authors:  Vaitea Opuu; Martin Silvert; Thomas Simonson
Journal:  Sci Rep       Date:  2017-11-20       Impact factor: 4.379

10.  OLGenie: Estimating Natural Selection to Predict Functional Overlapping Genes.

Authors:  Chase W Nelson; Zachary Ardern; Xinzhu Wei
Journal:  Mol Biol Evol       Date:  2020-08-01       Impact factor: 16.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.