Literature DB >> 21132282

Introns form compositional clusters in parallel with the compositional clusters of the coding sequences to which they pertain.

Miguel A Fuertes1, José M Pérez, Emile Zuckerkandl, Carlos Alonso.   

Abstract

This report deals with the study of compositional properties of human gene sequences evaluating similarities and differences among functionally distinct sectors of the gene independently of the reading frame. To retrieve the compositional information of DNA, we present a neighbor base dependent coding system in which the alphabet of 64 letters (DNA triplets) is compressed to an alphabet of 14 letters here termed triplet composons. The triplets containing the same set of distinct bases in whatever order and number form a triplet composon. The reading of the DNA sequence is performed starting at any letter of the initial triplet and then moving, triplet-to-triplet, until the end of the sequence. The readings were made in an overlapping way along the length of the sequences. The analysis of the compositional content in terms of the composon usage frequencies of the gene sequences shows that: (i) the compositional content of the sequences is far from that of random sequences, even in the case of non-protein coding sequences; (ii) coding sequences can be classified as components of compositional clusters; and (iii) intron sequences in a cluster have the same composon usage frequencies, even as their base composition differs notably from that of their home coding sequences. A comparison of the composon usage frequencies between human and mouse homologous genes indicated that two clusters found in humans do not have their counterpart in mouse whereas the others clusters are stable in both species with respect to their composon usage frequencies in both coding and noncoding sequences.

Entities:  

Mesh:

Substances:

Year:  2010        PMID: 21132282     DOI: 10.1007/s00239-010-9411-6

Source DB:  PubMed          Journal:  J Mol Evol        ISSN: 0022-2844            Impact factor:   2.395


  36 in total

1.  Codon usage tabulated from international DNA sequence databases: status for the year 2000.

Authors:  Y Nakamura; T Gojobori; T Ikemura
Journal:  Nucleic Acids Res       Date:  2000-01-01       Impact factor: 16.971

Review 2.  Isochores and the evolutionary genomics of vertebrates.

Authors:  G Bernardi
Journal:  Gene       Date:  2000-01-04       Impact factor: 3.688

Review 3.  Combinatorial epigenetics, "junk DNA", and the evolution of complex organisms.

Authors:  Emile Zuckerkandl; Giacomo Cavalli
Journal:  Gene       Date:  2006-12-09       Impact factor: 3.688

4.  Natural selection and the origin of jingwei, a chimeric processed functional gene in Drosophila.

Authors:  M Long; C H Langley
Journal:  Science       Date:  1993-04-02       Impact factor: 47.728

Review 5.  The human genome: organization and evolutionary history.

Authors:  G Bernardi
Journal:  Annu Rev Genet       Date:  1995       Impact factor: 16.830

6.  Novel genes derived from noncoding DNA in Drosophila melanogaster are frequently X-linked and exhibit testis-biased expression.

Authors:  Mia T Levine; Corbin D Jones; Andrew D Kern; Heather A Lindfors; David J Begun
Journal:  Proc Natl Acad Sci U S A       Date:  2006-06-15       Impact factor: 11.205

7.  Human coding and noncoding DNA: compositional correlations.

Authors:  O Clay; S Cacciò; S Zoubak; D Mouchiroud; G Bernardi
Journal:  Mol Phylogenet Evol       Date:  1996-02       Impact factor: 4.286

8.  The 5' leader of plant PgiC has an intron: the leader shows both the loss and maintenance of constraints compared with introns and exons in the coding region.

Authors:  L D Gottlieb; V S Ford
Journal:  Mol Biol Evol       Date:  2002-09       Impact factor: 16.240

9.  Detection of potential positive regulatory motifs of transcription in yeast introns by comparative analysis of oligonucleotide frequencies.

Authors:  Jing Zhang; Jun Hu; Xiu-fan Shi; Huai Cao; Wei-bo Liu
Journal:  Comput Biol Chem       Date:  2003-10       Impact factor: 2.877

10.  Codon preference is but an illusion created by the construction principle of coding sequences.

Authors:  S Ohno
Journal:  Proc Natl Acad Sci U S A       Date:  1988-06       Impact factor: 11.205

View more
  4 in total

1.  Conserved Critical Evolutionary Gene Structures in Orthologs.

Authors:  Miguel A Fuertes; José R Rodrigo; Carlos Alonso
Journal:  J Mol Evol       Date:  2019-02-28       Impact factor: 2.395

2.  A Method for the Annotation of Functional Similarities of Coding DNA Sequences: the Case of a Populated Cluster of Transmembrane Proteins.

Authors:  Miguel Angel Fuertes; José Ramón Rodrigo; Carlos Alonso
Journal:  J Mol Evol       Date:  2016-11-03       Impact factor: 2.395

3.  Do Intron and Coding Sequences of Some Human-Mouse Orthologs Evolve as a Single Unit?

Authors:  Miguel Angel Fuertes; José Ramón Rodrigo; Carlos Alonso
Journal:  J Mol Evol       Date:  2016-05-24       Impact factor: 2.395

4.  Evolutionary conserved compositional structures hidden in genomes of the foot-and-mouth disease virus and of the human rhinovirus.

Authors:  Miguel Angel Fuertes; Silvia López-Arguello; Carlos Alonso
Journal:  Sci Rep       Date:  2019-11-12       Impact factor: 4.379

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.