Literature DB >> 19052060

A predictive model for identifying mini-regulatory modules in the mouse genome.

Mahesh Yaragatti1, Ted Sandler, Lyle Ungar.   

Abstract

MOTIVATION: Rapidly advancing genome technology has allowed access to a large number of diverse genomes and annotation data. We have defined a systems model that integrates assembly data, comparative genomics, gene predictions, mRNA and EST alignments and physiological tissue expression. Using these as predictive parameters, we engineered a machine learning approach to decipher putative active regions in the genome.
RESULTS: Analysis of genomic sequences showed nucleosome-free region (NFR) modules containing a higher percentage of conserved regions, RNA-encoding sequences, CpG islands, splice sites and GC-rich areas. In contrast, random in silico fragments revealed higher percentages of DNA repeats and a lower conservation. The larger conserved sequences from the Vista enhancer browser (VEB) showed a greater percentage of short DNA sequence matches and RNA coding regions in multiple species. Our model can predict small regulatory regions in the genome with >95% prediction accuracy using NFR modules and >85% prediction accuracy with VEB elements. Ultimately, this systems model can be applied to any organism to identify candidate transcriptional modules on a genome scale.

Entities:  

Mesh:

Substances:

Year:  2008        PMID: 19052060     DOI: 10.1093/bioinformatics/btn622

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  1 in total

1.  Analysis of functional genomic signals using the XOR gate.

Authors:  Mahesh Yaragatti; Qi Wen
Journal:  PLoS One       Date:  2009-05-19       Impact factor: 3.240

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.