Warning: Undefined array key "mm" in /www/wwwroot/www.ai-bt.com/si.php on line 10 Deprecated: trim(): Passing null to parameter #1 ($string) of type string is deprecated in /www/wwwroot/www.ai-bt.com/si.php on line 10 Into the heart of darkness: large-scale clustering of human non-coding DNA.

Literature DB >> 15262779

Into the heart of darkness: large-scale clustering of human non-coding DNA.

Gill Bejerano¹, David Haussler, Mathieu Blanchette.

Abstract

MOTIVATION: It is currently believed that the human genome contains about twice as much non-coding functional regions as it does protein-coding genes, yet our understanding of these regions is very limited.
RESULTS: We examine the intersection between syntenically conserved sequences in the human, mouse and rat genomes, and sequence similarities within the human genome itself, in search of families of non-protein-coding elements. For this purpose we develop a graph theoretic clustering algorithm, akin to the highly successful methods used in elucidating protein sequence family relationships. The algorithm is applied to a highly filtered set of about 700 000 human-rodent evolutionarily conserved regions, not resembling any known coding sequence, which encompasses 3.7% of the human genome. From these, we obtain roughly 12 000 non-singleton clusters, dense in significant sequence similarities. Further analysis of genomic location, evidence of transcription and RNA secondary structure reveals many clusters to be significantly homogeneous in one or more characteristics. This subset of the highly conserved non-protein-coding elements in the human genome thus contains rich family-like structures, which merit in-depth analysis. AVAILABILITY: Supplementary material to this work is available at http://www.soe.ucsc.edu/~jill/dark.html

Entities: Species

Mesh：

Substances：
Untranslated Regions

Year: 2004 PMID： 15262779 DOI： 10.1093/bioinformatics/bth946

Source DB: PubMed Journal: Bioinformatics ISSN： 1367-4803 Impact factor: 6.937

Keyword Cloud
Cited

39 in total

1. Fast and reliable prediction of noncoding RNAs.

Authors: Stefan Washietl; Ivo L Hofacker; Peter F Stadler
Journal: Proc Natl Acad Sci U S A Date: 2005-01-21 Impact factor: 11.205

2. ESPERR: learning strong and weak signals in genomic sequence alignments to identify functional elements.

Authors: James Taylor; Svitlana Tyekucheva; David C King; Ross C Hardison; Webb Miller; Francesca Chiaromonte
Journal: Genome Res Date: 2006-10-19 Impact factor: 9.043

Review 3. The expanding transcriptome: the genome as the 'Book of Sand'.

Authors: Luis M Mendes Soares; Juan Valcárcel
Journal: EMBO J Date: 2006-03-02 Impact factor: 11.598

4. Metrics of sequence constraint overlook regulatory sequences in an exhaustive analysis at phox2b.

Authors: David M McGaughey; Ryan M Vinton; Jimmy Huynh; Amr Al-Saif; Michael A Beer; Andrew S McCallion
Journal: Genome Res Date: 2007-12-10 Impact factor: 9.043

Review 5. Transposable elements and the evolution of regulatory networks.

Authors: Cédric Feschotte
Journal: Nat Rev Genet Date: 2008-05 Impact factor: 53.242

6. Expression of transcribed ultraconserved regions of genome in rat cerebral cortex.

Authors: Suresh L Mehta; Ashutosh Dharap; Raghu Vemuganti
Journal: Neurochem Int Date: 2014-06-20 Impact factor: 3.921

7. Nucleotide bias observed with a short SELEX RNA aptamer library.

Authors: William H Thiel; Thomas Bair; Kristina Wyatt Thiel; Justin P Dassie; William M Rockey; Craig A Howell; Xiuying Y Liu; Adam J Dupuy; Lingyan Huang; Richard Owczarzy; Mark A Behlke; James O McNamara; Paloma H Giangrande
Journal: Nucleic Acid Ther Date: 2011-06-28 Impact factor: 5.486