Literature DB >> 32329779

Prediction of condition-specific regulatory genes using machine learning.

Qi Song1, Jiyoung Lee1, Shamima Akter2, Matthew Rogers3, Ruth Grene1,2, Song Li1,2.   

Abstract

Recent advances in genomic technologies have generated data on large-scale protein-DNA interactions and open chromatin regions for many eukaryotic species. How to identify condition-specific functions of transcription factors using these data has become a major challenge in genomic research. To solve this problem, we have developed a method called ConSReg, which provides a novel approach to integrate regulatory genomic data into predictive machine learning models of key regulatory genes. Using Arabidopsis as a model system, we tested our approach to identify regulatory genes in data sets from single cell gene expression and from abiotic stress treatments. Our results showed that ConSReg accurately predicted transcription factors that regulate differentially expressed genes with an average auROC of 0.84, which is 23.5-25% better than enrichment-based approaches. To further validate the performance of ConSReg, we analyzed an independent data set related to plant nitrogen responses. ConSReg provided better rankings of the correct transcription factors in 61.7% of cases, which is three times better than other plant tools. We applied ConSReg to Arabidopsis single cell RNA-seq data, successfully identifying candidate regulatory genes that control cell wall formation. Our methods provide a new approach to define candidate regulatory genes using integrated genomic data in plants.
© The Author(s) 2020. Published by Oxford University Press on behalf of Nucleic Acids Research.

Entities:  

Mesh:

Substances:

Year:  2020        PMID: 32329779      PMCID: PMC7293043          DOI: 10.1093/nar/gkaa264

Source DB:  PubMed          Journal:  Nucleic Acids Res        ISSN: 0305-1048            Impact factor:   16.971


  103 in total

1.  DeepChrome: deep-learning for predicting gene expression from histone modifications.

Authors:  Ritambhara Singh; Jack Lanchantin; Gabriel Robins; Yanjun Qi
Journal:  Bioinformatics       Date:  2016-09-01       Impact factor: 6.937

2.  MYB107 and MYB9 Homologs Regulate Suberin Deposition in Angiosperms.

Authors:  Justin Lashbrooke; Hagai Cohen; Dorit Levy-Samocha; Oren Tzfadia; Irina Panizel; Viktoria Zeisler; Hassan Massalha; Adi Stern; Livio Trainotti; Lukas Schreiber; Fabrizio Costa; Asaph Aharoni
Journal:  Plant Cell       Date:  2016-09-07       Impact factor: 11.277

3.  Changes in chromatin accessibility between Arabidopsis stem cells and mesophyll cells illuminate cell type-specific transcription factor networks.

Authors:  Paja Sijacic; Marko Bajic; Elizabeth C McKinney; Richard B Meagher; Roger B Deal
Journal:  Plant J       Date:  2018-04       Impact factor: 6.417

4.  Enhanced Maps of Transcription Factor Binding Sites Improve Regulatory Networks Learned from Accessible Chromatin Data.

Authors:  Shubhada R Kulkarni; D Marc Jones; Klaas Vandepoele
Journal:  Plant Physiol       Date:  2019-07-25       Impact factor: 8.340

5.  An Arabidopsis gene regulatory network for secondary cell wall synthesis.

Authors:  M Taylor-Teeples; L Lin; M de Lucas; G Turco; T W Toal; A Gaudinier; N F Young; G M Trabucco; M T Veling; R Lamothe; P P Handakumbura; G Xiong; C Wang; J Corwin; A Tsoukalas; L Zhang; D Ware; M Pauly; D J Kliebenstein; K Dehesh; I Tagkopoulos; G Breton; J L Pruneda-Paz; S E Ahnert; S A Kay; S P Hazen; S M Brady
Journal:  Nature       Date:  2014-12-24       Impact factor: 49.962

6.  Gene regulatory network inference using fused LASSO on multiple data sets.

Authors:  Nooshin Omranian; Jeanne M O Eloundou-Mbebi; Bernd Mueller-Roeber; Zoran Nikoloski
Journal:  Sci Rep       Date:  2016-02-11       Impact factor: 4.379

7.  AGRIS: Arabidopsis gene regulatory information server, an information resource of Arabidopsis cis-regulatory elements and transcription factors.

Authors:  Ramana V Davuluri; Hao Sun; Saranyan K Palaniswamy; Nicole Matthews; Carlos Molina; Mike Kurtz; Erich Grotewold
Journal:  BMC Bioinformatics       Date:  2003-06-23       Impact factor: 3.169

8.  Improved DNase-seq protocol facilitates high resolution mapping of DNase I hypersensitive sites in roots in Arabidopsis thaliana.

Authors:  Jason S Cumbie; Sergei A Filichkin; Molly Megraw
Journal:  Plant Methods       Date:  2015-09-04       Impact factor: 4.993

9.  g:Profiler-a web server for functional interpretation of gene lists (2016 update).

Authors:  Jüri Reimand; Tambet Arak; Priit Adler; Liis Kolberg; Sulev Reisberg; Hedi Peterson; Jaak Vilo
Journal:  Nucleic Acids Res       Date:  2016-04-20       Impact factor: 16.971

10.  ZIFA: Dimensionality reduction for zero-inflated single-cell gene expression analysis.

Authors:  Emma Pierson; Christopher Yau
Journal:  Genome Biol       Date:  2015-11-02       Impact factor: 13.583

View more
  8 in total

1.  PGD: a machine learning-based photosynthetic-related gene detection approach.

Authors:  Yunchuan Wang; Xiuru Dai; Daohong Fu; Pinghua Li; Baijuan Du
Journal:  BMC Bioinformatics       Date:  2022-05-17       Impact factor: 3.307

Review 2.  Time-Based Systems Biology Approaches to Capture and Model Dynamic Gene Regulatory Networks.

Authors:  Jose M Alvarez; Matthew D Brooks; Joseph Swift; Gloria M Coruzzi
Journal:  Annu Rev Plant Biol       Date:  2021-03-05       Impact factor: 28.310

3.  Single Cell RNA-Seq and Machine Learning Reveal Novel Subpopulations in Low-Grade Inflammatory Monocytes With Unique Regulatory Circuits.

Authors:  Jiyoung Lee; Shuo Geng; Song Li; Liwu Li
Journal:  Front Immunol       Date:  2021-02-23       Impact factor: 7.561

4.  Integrated Analysis of Methylomic and Transcriptomic Data to Identify Potential Diagnostic Biomarkers for Major Depressive Disorder.

Authors:  Yinping Xie; Ling Xiao; Lijuan Chen; Yage Zheng; Caixia Zhang; Gaohua Wang
Journal:  Genes (Basel)       Date:  2021-01-27       Impact factor: 4.096

5.  OperonSEQer: A set of machine-learning algorithms with threshold voting for detection of operon pairs using short-read RNA-sequencing data.

Authors:  Raga Krishnakumar; Anne M Ruffing
Journal:  PLoS Comput Biol       Date:  2022-01-05       Impact factor: 4.475

Review 6.  Machine Learning for Plant Stress Modeling: A Perspective towards Hormesis Management.

Authors:  Amanda Kim Rico-Chávez; Jesus Alejandro Franco; Arturo Alfonso Fernandez-Jaramillo; Luis Miguel Contreras-Medina; Ramón Gerardo Guevara-González; Quetzalcoatl Hernandez-Escobedo
Journal:  Plants (Basel)       Date:  2022-04-02

7.  Identification of new marker genes from plant single-cell RNA-seq data using interpretable machine learning methods.

Authors:  Haidong Yan; Jiyoung Lee; Qi Song; Qi Li; John Schiefelbein; Bingyu Zhao; Song Li
Journal:  New Phytol       Date:  2022-03-26       Impact factor: 10.323

8.  GRAND: a database of gene regulatory network models across human conditions.

Authors:  Marouen Ben Guebila; Camila M Lopes-Ramos; Deborah Weighill; Abhijeet Rajendra Sonawane; Rebekka Burkholz; Behrouz Shamsaei; John Platig; Kimberly Glass; Marieke L Kuijjer; John Quackenbush
Journal:  Nucleic Acids Res       Date:  2022-01-07       Impact factor: 16.971

  8 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.