Literature DB >> 35717172

G4Boost: a machine learning-based tool for quadruplex identification and stability prediction.

H Busra Cagirici1, Hikmet Budak2, Taner Z Sen3.   

Abstract

BACKGROUND: G-quadruplexes (G4s), formed within guanine-rich nucleic acids, are secondary structures involved in important biological processes. Although every G4 motif has the potential to form a stable G4 structure, not every G4 motif would, and accurate energy-based methods are needed to assess their structural stability. Here, we present a decision tree-based prediction tool, G4Boost, to identify G4 motifs and predict their secondary structure folding probability and thermodynamic stability based on their sequences, nucleotide compositions, and estimated structural topologies.
RESULTS: G4Boost predicted the quadruplex folding state with an accuracy greater then 93% and an F1-score of 0.96, and the folding energy with an RMSE of 4.28 and R2 of 0.95 only by the means of sequence intrinsic feature. G4Boost was successfully applied and validated to predict the stability of experimentally-determined G4 structures, including for plants and humans.
CONCLUSION: G4Boost outperformed the three machine-learning based prediction tools, DeepG4, Quadron, and G4RNA Screener, in terms of both accuracy and F1-score, and can be highly useful for G4 prediction to understand gene regulation across species including plants and humans.
© 2022. The Author(s).

Entities:  

Keywords:  Energy; G-quadruplex; Humans; Machine learning; Plants; Stability; Topology

Mesh:

Substances:

Year:  2022        PMID: 35717172      PMCID: PMC9206279          DOI: 10.1186/s12859-022-04782-z

Source DB:  PubMed          Journal:  BMC Bioinformatics        ISSN: 1471-2105            Impact factor:   3.307


  60 in total

1.  Mfold web server for nucleic acid folding and hybridization prediction.

Authors:  Michael Zuker
Journal:  Nucleic Acids Res       Date:  2003-07-01       Impact factor: 16.971

2.  Post-transcriptional Regulation of Nkx2-5 by RHAU in Heart Development.

Authors:  Junwei Nie; Mingyang Jiang; Xiaotian Zhang; Hao Tang; Hengwei Jin; Xinyi Huang; Baiyin Yuan; Chenxi Zhang; Janice Ching Lai; Yoshikuni Nagamine; Dejing Pan; Wengong Wang; Zhongzhou Yang
Journal:  Cell Rep       Date:  2015-10-17       Impact factor: 9.423

Review 3.  DNA secondary structures: stability and function of G-quadruplex structures.

Authors:  Matthew L Bochman; Katrin Paeschke; Virginia A Zakian
Journal:  Nat Rev Genet       Date:  2012-10-03       Impact factor: 53.242

4.  How long is too long? Effects of loop size on G-quadruplex stability.

Authors:  Aurore Guédin; Julien Gros; Patrizia Alberti; Jean-Louis Mergny
Journal:  Nucleic Acids Res       Date:  2010-07-26       Impact factor: 16.971

5.  A crystallographic and modelling study of a human telomeric RNA (TERRA) quadruplex.

Authors:  Gavin W Collie; Shozeb M Haider; Stephen Neidle; Gary N Parkinson
Journal:  Nucleic Acids Res       Date:  2010-04-22       Impact factor: 16.971

6.  ViennaRNA Package 2.0.

Authors:  Ronny Lorenz; Stephan H Bernhart; Christian Höner Zu Siederdissen; Hakim Tafer; Christoph Flamm; Peter F Stadler; Ivo L Hofacker
Journal:  Algorithms Mol Biol       Date:  2011-11-24       Impact factor: 1.405

7.  Prevalence of quadruplexes in the human genome.

Authors:  Julian L Huppert; Shankar Balasubramanian
Journal:  Nucleic Acids Res       Date:  2005-05-24       Impact factor: 16.971

Review 8.  Human telomere, oncogenic promoter and 5'-UTR G-quadruplexes: diverse higher order DNA and RNA targets for cancer therapeutics.

Authors:  Dinshaw J Patel; Anh Tuân Phan; Vitaly Kuryavyi
Journal:  Nucleic Acids Res       Date:  2007-10-02       Impact factor: 16.971

9.  RNA G-quadruplexes cause eIF4A-dependent oncogene translation in cancer.

Authors:  Andrew L Wolfe; Kamini Singh; Yi Zhong; Philipp Drewe; Vinagolu K Rajasekhar; Viraj R Sanghvi; Konstantinos J Mavrakis; Man Jiang; Justine E Roderick; Joni Van der Meulen; Jonathan H Schatz; Christina M Rodrigo; Chunying Zhao; Pieter Rondou; Elisa de Stanchina; Julie Teruya-Feldstein; Michelle A Kelliher; Frank Speleman; John A Porco; Jerry Pelletier; Gunnar Rätsch; Hans-Guido Wendel
Journal:  Nature       Date:  2014-07-27       Impact factor: 49.962

10.  Re-evaluation of G-quadruplex propensity with G4Hunter.

Authors:  Amina Bedrat; Laurent Lacroix; Jean-Louis Mergny
Journal:  Nucleic Acids Res       Date:  2016-01-20       Impact factor: 16.971

View more
  1 in total

1.  Editorial: Biology of non-canonical nucleic acids from humans to pathogens.

Authors:  Ilaria Frasson
Journal:  Front Microbiol       Date:  2022-07-18       Impact factor: 6.064

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.