Literature DB >> 12568727

Pentamer vocabularies characterizing introns and intron-like intergenic tracts from Caenorhabditis elegans and Drosophila melanogaster.

Emanuele Bultrini1, Elisabetta Pizzi, Paolo Del Giudice, Clara Frontali.   

Abstract

Overall compositional properties at the level of bases, dinucleotides and longer oligos characterize genomes of different species. In Caenorhabditis elegans, using recurrence analysis, we recognized the existence of a long-range correlation in the oligonucleotide usage of introns and intergenic regions. Through correlation analysis, this is confirmed here to be a genome-wide property of C. elegans non-coding portions. We then investigate the possibility of extracting a typical vocabulary through statistical analysis of experimentally confirmed introns of sufficient length (>1 kb), deprived of known splice signals, the focus being on distributed lexical features rather than on localized motifs. Lexical preferences typical of introns could be exposed using principal component analysis of pentanucleotide frequency distributions, both in C. elegans and in Drosophila melanogaster. In either species, the introns' pentamer preferences are largely shared by intergenic tracts. The pentamer vocabularies extracted for the two species exhibit interesting symmetry properties and overlap in part. A more extensive investigation of the interspecies relationship at the level of oligonucleotide preferences in non-coding regions, not related by sequence similarity, might form the basis of new approaches for the study of the evolutionary behaviour of these regions.

Entities:  

Mesh:

Substances:

Year:  2003        PMID: 12568727     DOI: 10.1016/s0378-1119(02)01206-4

Source DB:  PubMed          Journal:  Gene        ISSN: 0378-1119            Impact factor:   3.688


  8 in total

1.  Introns form compositional clusters in parallel with the compositional clusters of the coding sequences to which they pertain.

Authors:  Miguel A Fuertes; José M Pérez; Emile Zuckerkandl; Carlos Alonso
Journal:  J Mol Evol       Date:  2010-12-04       Impact factor: 2.395

2.  Sequence signature analysis of chromosome identity in three Drosophila species.

Authors:  Per Stenberg; Fredrik Pettersson; Anja O Saura; Anders Berglund; Jan Larsson
Journal:  BMC Bioinformatics       Date:  2005-06-23       Impact factor: 3.169

3.  K-mer Content, Correlation, and Position Analysis of Genome DNA Sequences for the Identification of Function and Evolutionary Features.

Authors:  Aaron Sievers; Katharina Bosiek; Marc Bisch; Chris Dreessen; Jascha Riedel; Patrick Froß; Michael Hausmann; Georg Hildenbrand
Journal:  Genes (Basel)       Date:  2017-04-19       Impact factor: 4.096

Review 4.  Neutralism versus selectionism: Chargaff's second parity rule, revisited.

Authors:  Donald R Forsdyke
Journal:  Genetica       Date:  2021-04-20       Impact factor: 1.633

5.  Eukaryotic Genomes Show Strong Evolutionary Conservation of k-mer Composition and Correlation Contributions between Introns and Intergenic Regions.

Authors:  Aaron Sievers; Liane Sauer; Michael Hausmann; Georg Hildenbrand
Journal:  Genes (Basel)       Date:  2021-10-01       Impact factor: 4.096

6.  Peptide vocabulary analysis reveals ultra-conservation and homonymity in protein sequences.

Authors:  Derek Gatherer
Journal:  Bioinform Biol Insights       Date:  2009-11-24

7.  The average mutual information profile as a genomic signature.

Authors:  Mark Bauer; Sheldon M Schuster; Khalid Sayood
Journal:  BMC Bioinformatics       Date:  2008-01-25       Impact factor: 3.169

8.  Conservation of k-mer Composition and Correlation Contribution between Introns and Intergenic Regions of Animalia Genomes.

Authors:  Aaron Sievers; Frederik Wenz; Michael Hausmann; Georg Hildenbrand
Journal:  Genes (Basel)       Date:  2018-10-04       Impact factor: 4.096

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.