Maria Stepanova1, Tatiana Tiazhelova, Mikle Skoblov, Ancha Baranova. 1. Center for Biomedical Genomics and BioInformatics, Molecular and Microbiology Department, College of Arts and Sciences, George Mason University, Fairfax, VA 22031, USA.
Abstract
MOTIVATION: The detection of transcription factor binding sites (TFBS) in genomic sequences is a basic task for elucidating the transcriptional aspects of gene regulation. Evaluation procedures applicable to the TFBS prediction outputs need improvement. Predicted TFBS located outside of the transcription associated areas are often neglected from the functional and the evolutionary points of view, therefore deserving a systematic overview. RESULTS: We calculated theoretical occurrences of 184 TFBS according to their position weight matrices and the dinucleotide statistics of the completed vertebrate genomes, then performed a TFBS prediction in the corresponding complete genomic sequences and their repeat-free, repetitive and regulatory fractions. Repeat-free fractions of the closely related mammalian genomes were characterized by strong similarities in TFBS occurrences. A significant over-representation of multiple TFBS was found in both repetitive and non-repetitive genome fractions. AVAILABILITY: F-values and real TFBS occurrences calculated for human, chimp, mouse, rat, zebrafish and fugu genomes are available for free download at http://www.gmu.edu/departments/mmb/baranova/pages/bioinformatics
MOTIVATION: The detection of transcription factor binding sites (TFBS) in genomic sequences is a basic task for elucidating the transcriptional aspects of gene regulation. Evaluation procedures applicable to the TFBS prediction outputs need improvement. Predicted TFBS located outside of the transcription associated areas are often neglected from the functional and the evolutionary points of view, therefore deserving a systematic overview. RESULTS: We calculated theoretical occurrences of 184 TFBS according to their position weight matrices and the dinucleotide statistics of the completed vertebrate genomes, then performed a TFBS prediction in the corresponding complete genomic sequences and their repeat-free, repetitive and regulatory fractions. Repeat-free fractions of the closely related mammalian genomes were characterized by strong similarities in TFBS occurrences. A significant over-representation of multiple TFBS was found in both repetitive and non-repetitive genome fractions. AVAILABILITY: F-values and real TFBS occurrences calculated for human, chimp, mouse, rat, zebrafish and fugu genomes are available for free download at http://www.gmu.edu/departments/mmb/baranova/pages/bioinformatics
Authors: Kamil C Kural; Neetu Tandon; Mikhail Skoblov; Olga V Kel-Margoulis; Ancha V Baranova Journal: BMC Genomics Date: 2016-12-28 Impact factor: 3.969
Authors: Mikhail Skoblov; Konstantin Shakhbazov; Dmitry Oshchepkov; Dmitry Ivanov; Anna Guskova; Dmitry Ivanov; Petr Rubtsov; Vladimir Prasolov; Nick Yankovsky; Ancha Baranova Journal: Biochem Biophys Res Commun Date: 2006-02-17 Impact factor: 3.575
Authors: Luiz F Goulart; Francesco Bettella; Ida E Sønderby; Andrew J Schork; Wesley K Thompson; Morten Mattingsdal; Vidar M Steen; Verena Zuber; Yunpeng Wang; Anders M Dale; Ole A Andreassen; Srdjan Djurovic Journal: BMC Genomics Date: 2015-04-16 Impact factor: 3.969