Multiple Alignment of Promoter Sequences from the Arabidopsis thaliana L. Genome.

Eugene V Korotkov, Yulia M Suvorova, Dmitrii O Kostenko, Maria A Korotkova.

Abstract

In this study, we developed a new mathematical method for performing multiple alignment of highly divergent sequences (MAHDS), i.e., sequences that have on average more than 2.5 substitutions per position (x). We generated sets of artificial DNA sequences with x ranging from 0 to 4.4 and applied MAHDS as well as currently used multiple sequence alignment algorithms, including ClustalW, MAFFT, T-Coffee, Kalign, and Muscle to these sets. The results indicated that most of the existing methods could produce statistically significant alignments only for the sets with x < 2.5, whereas MAHDS could operate on sequences with x = 4.4. We also used MAHDS to analyze a set of promoter sequences from the Arabidopsis thaliana genome and discovered many conserved regions upstream of the transcription initiation site (from -499 to +1 bp); a part of the downstream region (from +1 to +70 bp) also significantly contributed to the obtained alignments. The possibilities of applying the newly developed method for the identification of promoter sequences in any genome are discussed. A server for multiple alignment of nucleotide sequences has been created.
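To make the divergence measure x concrete, the following is a minimal sketch of how a set of artificial DNA sequences with a chosen average number of substitutions per position could be produced. It is not the authors' generator: the abstract does not describe their procedure, so the uniform choice of positions, the rule that each substitution picks a different base, and the names `mutate`, `make_set` and `identity` are all assumptions made for illustration.

```python
import random

BASES = "ACGT"

def mutate(ancestor, x, rng):
    """Apply about x substitutions per position to a copy of ancestor.

    Assumption: substitution events hit uniformly random positions, and
    each event replaces the current base with a different one.
    """
    seq = list(ancestor)
    n_events = round(x * len(seq))  # total events over the whole sequence
    for _ in range(n_events):
        pos = rng.randrange(len(seq))
        seq[pos] = rng.choice([b for b in BASES if b != seq[pos]])
    return "".join(seq)

def make_set(n_seqs, length, x, seed=0):
    """Return a random ancestor and n_seqs independently mutated copies."""
    rng = random.Random(seed)
    ancestor = "".join(rng.choice(BASES) for _ in range(length))
    return ancestor, [mutate(ancestor, x, rng) for _ in range(n_seqs)]

def identity(a, b):
    """Fraction of positions at which two equal-length sequences agree."""
    return sum(p == q for p, q in zip(a, b)) / len(a)

if __name__ == "__main__":
    for x in (0.0, 1.0, 2.5, 4.4):
        ancestor, copies = make_set(n_seqs=20, length=600, x=x, seed=1)
        mean_id = sum(identity(ancestor, c) for c in copies) / len(copies)
        print(f"x = {x:.1f}  mean identity to ancestor = {mean_id:.3f}")
```

Under these assumptions, the copies are identical to the ancestor at x = 0, and as x grows their identity to the ancestor drifts towards the roughly 25% expected by chance for four equiprobable bases. That drift is why sets with large x are hard to align.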

Keywords:  dynamic programming; genetic algorithm; multiple sequence alignment; promoter

Year:  2021        PMID: 33494278      PMCID: PMC7909805          DOI: 10.3390/genes12020135

Source DB:  PubMed          Journal:  Genes (Basel)        ISSN: 2073-4425            Impact factor:   4.096


  5 in total

1.  Application of the MAHDS Method for Multiple Alignment of Highly Diverged Amino Acid Sequences.

Authors:  Dimitrii O Kostenko; Eugene V Korotkov
Journal:  Int J Mol Sci       Date:  2022-03-29       Impact factor: 5.923

Review 2.  Developments in Algorithms for Sequence Alignment: A Review.

Authors:  Jiannan Chao; Furong Tang; Lei Xu
Journal:  Biomolecules       Date:  2022-04-06

3.  Database of Potential Promoter Sequences in the Capsicum annuum Genome.

Authors:  Valentina Rudenko; Eugene Korotkov
Journal:  Biology (Basel)       Date:  2022-07-26

4.  Detection of Highly Divergent Tandem Repeats in the Rice Genome.

Authors:  Eugene V Korotkov; Anastasiya M Kamionskya; Maria A Korotkova
Journal:  Genes (Basel)       Date:  2021-03-25       Impact factor: 4.096

5.  Search for Highly Divergent Tandem Repeats in Amino Acid Sequences.

Authors:  Valentina Rudenko; Eugene Korotkov
Journal:  Int J Mol Sci       Date:  2021-07-01       Impact factor: 5.923
