Literature DB >> 26499213

The identification of cis-regulatory elements: A review from a machine learning perspective.

Yifeng Li1, Chih-Yu Chen2, Alice M Kaye3, Wyeth W Wasserman4.   

Abstract

The majority of the human genome consists of non-coding regions that have been called junk DNA. However, recent studies have unveiled that these regions contain cis-regulatory elements, such as promoters, enhancers, silencers, insulators, etc. These regulatory elements can play crucial roles in controlling gene expressions in specific cell types, conditions, and developmental stages. Disruption to these regions could contribute to phenotype changes. Precisely identifying regulatory elements is key to deciphering the mechanisms underlying transcriptional regulation. Cis-regulatory events are complex processes that involve chromatin accessibility, transcription factor binding, DNA methylation, histone modifications, and the interactions between them. The development of next-generation sequencing techniques has allowed us to capture these genomic features in depth. Applied analysis of genome sequences for clinical genetics has increased the urgency for detecting these regions. However, the complexity of cis-regulatory events and the deluge of sequencing data require accurate and efficient computational approaches, in particular, machine learning techniques. In this review, we describe machine learning approaches for predicting transcription factor binding sites, enhancers, and promoters, primarily driven by next-generation sequencing data. Data sources are provided in order to facilitate testing of novel methods. The purpose of this review is to attract computational experts and data scientists to advance this field. Crown
Copyright © 2015. Published by Elsevier Ireland Ltd. All rights reserved.

Entities:  

Keywords:  Cis-regulatory elements; Data integration; Deep learning; Enhancers; Ensemble learning; Gene regulation; Machine learning; Promoters

Mesh:

Year:  2015        PMID: 26499213     DOI: 10.1016/j.biosystems.2015.10.002

Source DB:  PubMed          Journal:  Biosystems        ISSN: 0303-2647            Impact factor:   1.973


  13 in total

1.  A novel method for improved accuracy of transcription factor binding site prediction.

Authors:  Abdullah M Khamis; Olaa Motwalli; Romina Oliva; Boris R Jankovic; Yulia A Medvedeva; Haitham Ashoor; Magbubah Essack; Xin Gao; Vladimir B Bajic
Journal:  Nucleic Acids Res       Date:  2018-07-06       Impact factor: 16.971

Review 2.  A Review of Computational Methods for Finding Non-Coding RNA Genes.

Authors:  Qaisar Abbas; Syed Mansoor Raza; Azizuddin Ahmed Biyabani; Muhammad Arfan Jaffar
Journal:  Genes (Basel)       Date:  2016-12-03       Impact factor: 4.096

3.  A cross-species approach to identify transcriptional regulators exemplified for Dnajc22 and Hnf4a.

Authors:  A C Aschenbrenner; K Bassler; M Brondolin; L Bonaguro; P Carrera; K Klee; T Ulas; J L Schultze; M Hoch
Journal:  Sci Rep       Date:  2017-06-22       Impact factor: 4.379

4.  Genome-wide prediction of cis-regulatory regions using supervised deep learning methods.

Authors:  Yifeng Li; Wenqiang Shi; Wyeth W Wasserman
Journal:  BMC Bioinformatics       Date:  2018-05-31       Impact factor: 3.169

5.  High-throughput evaluation of T7 promoter variants using biased randomization and DNA barcoding.

Authors:  Ryo Komura; Wataru Aoki; Keisuke Motone; Atsushi Satomura; Mitsuyoshi Ueda
Journal:  PLoS One       Date:  2018-05-07       Impact factor: 3.240

6.  An explainable artificial intelligence approach for decoding the enhancer histone modifications code and identification of novel enhancers in Drosophila.

Authors:  Jareth C Wolfe; Liudmila A Mikheeva; Hani Hagras; Nicolae Radu Zabet
Journal:  Genome Biol       Date:  2021-11-08       Impact factor: 13.583

7.  Genome-wide identification and expression profiling of the COBRA-like genes reveal likely roles in stem strength in rapeseed (Brassica napus L.).

Authors:  Qian Yang; Shan Wang; Hao Chen; Liang You; Fangying Liu; Zhongsong Liu
Journal:  PLoS One       Date:  2021-11-24       Impact factor: 3.240

8.  Supervised promoter recognition: a benchmark framework.

Authors:  Raul I Perez Martell; Alison Ziesel; Hosna Jabbari; Ulrike Stege
Journal:  BMC Bioinformatics       Date:  2022-04-02       Impact factor: 3.169

9.  Identification and expression profiling of proline metabolizing genes in Arabidopsis thaliana and Oryza sativa to reveal their stress-specific transcript alteration.

Authors:  Shatil Arabia; Md Nur Ahad Shah; Asif Ahmed Sami; Ajit Ghosh; Tahmina Islam
Journal:  Physiol Mol Biol Plants       Date:  2021-06-25

10.  Q&A: How do gene regulatory networks control environmental responses in plants?

Authors:  Ying Sun; José R Dinneny
Journal:  BMC Biol       Date:  2018-04-11       Impact factor: 7.431

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.