Literature DB >> 30758669

TA, GT and AC are significantly under-represented in open reading frames of prokaryotic and eukaryotic protein-coding genes.

Yong Wang1, Zhen Zeng2, Tian-Lei Liu3, Ling Sun3, Qin Yao2, Ke-Ping Chen2.   

Abstract

Genomes can be considered a combination of 16 dinucleotides. Analysing the relative abundance of different dinucleotides may reveal important features of genome evolution. In present study, we conducted extensive surveys on the relative abundances of dinucleotides in various genomic components of 28 bacterial, 20 archaean, 19 fungal, 24 plant and 29 animal species. We found that TA, GT and AC are significantly under-represented in open reading frames of all organisms and in intergenic regions and introns of most organisms. Specific dinucleotides are of greatly varied usage at different codon positions. The significantly low representations of TA, GT and AC are considered the evolutionary consequences of preventing formation of pre-mature stop codons and of reducing intron-splicing options in candidate primary mRNA sequences. These data suggest that a reduction of TA and GT occurred on both strands of the DNA sequence at an early stage of de novo gene birth. Interestingly, GT and AC are also significantly under-represented in current prokaryotic genomes, suggesting that ancient prokaryotic protein-coding genes might have contained introns. The greatly varied usages of specific dinucleotides at different codon positions are considered evolutionary accommodations to compensate the unavailability of specific codons and to avoid formation of pre-mature stop codons. This is the first report presenting data of dinucleotide relative abundance to indicate the possible existence of spliceosomal introns in ancient prokaryotic genes and to hypothesize early steps of de novo gene birth.

Entities:  

Keywords:  Composition; Dinucleotide; Gene birth; Genome evolution; Odds ratio

Mesh:

Substances:

Year:  2019        PMID: 30758669     DOI: 10.1007/s00438-019-01535-1

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   3.291


  2 in total

1.  Human SARS-CoV-2 has evolved to reduce CG dinucleotide in its open reading frames.

Authors:  Yong Wang; Jun-Ming Mao; Guang-Dong Wang; Zhi-Peng Luo; Liu Yang; Qin Yao; Ke-Ping Chen
Journal:  Sci Rep       Date:  2020-07-23       Impact factor: 4.379

2.  Codon usage bias and dinucleotide preference in 29 Drosophila species.

Authors:  Prajakta P Kokate; Stephen M Techtmann; Thomas Werner
Journal:  G3 (Bethesda)       Date:  2021-08-07       Impact factor: 3.154

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.