Literature DB >> 11673231

Enrichment of regulatory signals in conserved non-coding genomic sequence.

S Levy1, S Hannenhalli, C Workman.   

Abstract

MOTIVATION: Whole genome shotgun sequencing strategies generate sequence data prior to the application of assembly methodologies that result in contiguous sequence. Sequence reads can be employed to indicate regions of conservation between closely related species for which only one genome has been assembled. Consequently, by using pairwise sequence alignments methods it is possible to identify novel, non-repetitive, conserved segments in non-coding sequence that exist between the assembled human genome and mouse whole genome shotgun sequencing fragments. Conserved non-coding regions identify potentially functional DNA that could be involved in transcriptional regulation.
RESULTS: Local sequence alignment methods were applied employing mouse fragments and the assembled human genome. In addition, transcription factor binding sites were detected by aligning their corresponding positional weight matrices to the sequence regions. These methods were applied to a set of transcripts corresponding to 502 genes associated with a variety of different human diseases taken from the Online Mendelian Inheritance in Man database. Using statistical arguments we have shown that conserved non-coding segments contain an enrichment of transcription factor binding sites when compared to the sequence background in which the conserved segments are located. This enrichment of binding sites was not observed in coding sequence. Conserved non-coding segments are not extensively repeated in the genome and therefore their identification provides a rapid means of finding genes with related conserved regions, and consequently potentially related regulatory mechanism. Conserved segments in upstream regions are found to contain binding sites that are co-localized in a manner consistent with experimentally known transcription factor pairwise co-occurrences and afford the identification of novel co-occurring Transcription Factor (TF) pairs. This study provides a methodology and more evidence to suggest that conserved non-coding regions are biologically significant since they contain a statistical enrichment of regulatory signals and pairs of signals that enable the construction of regulatory models for human genes. CONTACT: samuel.levy@celera.com.

Entities:  

Mesh:

Substances:

Year:  2001        PMID: 11673231     DOI: 10.1093/bioinformatics/17.10.871

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937


  66 in total

1.  Analysis of similarity within 142 pairs of orthologous intergenic regions of Caenorhabditis elegans and Caenorhabditis briggsae.

Authors:  Colleen T Webb; Svetlana A Shabalina; Aleksey Yu Ogurtsov; Alexey S Kondrashov
Journal:  Nucleic Acids Res       Date:  2002-03-01       Impact factor: 16.971

2.  rVista for comparative sequence-based discovery of functional transcription factor binding sites.

Authors:  Gabriela G Loots; Ivan Ovcharenko; Lior Pachter; Inna Dubchak; Edward M Rubin
Journal:  Genome Res       Date:  2002-05       Impact factor: 9.043

3.  GALA, a database for genomic sequence alignments and annotations.

Authors:  Belinda Giardine; Laura Elnitski; Cathy Riemer; Izabela Makalowska; Scott Schwartz; Webb Miller; Ross C Hardison
Journal:  Genome Res       Date:  2003-04       Impact factor: 9.043

Review 4.  In silico identification of metazoan transcriptional regulatory regions.

Authors:  Wyeth W Wasserman; William Krivan
Journal:  Naturwissenschaften       Date:  2003-03-27

5.  Conserved noncoding sequences among cultivated cereal genomes identify candidate regulatory sequence elements and patterns of promoter evolution.

Authors:  Hena Guo; Stephen P Moose
Journal:  Plant Cell       Date:  2003-05       Impact factor: 11.277

6.  Predicting transcription factor synergism.

Authors:  Sridhar Hannenhalli; Samuel Levy
Journal:  Nucleic Acids Res       Date:  2002-10-01       Impact factor: 16.971

7.  Conserved noncoding sequences in the grasses.

Authors:  Dan Choffnes Inada; Ali Bashir; Chunghau Lee; Brian C Thomas; Cynthia Ko; Stephen A Goff; Michael Freeling
Journal:  Genome Res       Date:  2003-09       Impact factor: 9.043

8.  Identifying transcriptional regulatory sites in the human genome using an integrated system.

Authors:  Hsien-Da Huang; Jorng-Tzong Horng; Yi-Ming Sun; Ann-Ping Tsou; Shir-Ly Huang
Journal:  Nucleic Acids Res       Date:  2004-03-29       Impact factor: 16.971

9.  Intronic hormone response elements mediate regulation of FKBP5 by progestins and glucocorticoids.

Authors:  Tina R Hubler; Jonathan G Scammell
Journal:  Cell Stress Chaperones       Date:  2004       Impact factor: 3.667

10.  Identification and characterization of putative methylation targets in the MAOA locus using bioinformatic approaches.

Authors:  Elena Shumay; Joanna S Fowler
Journal:  Epigenetics       Date:  2010-05-05       Impact factor: 4.528

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.