Literature DB >> 29081688

Noncoding Variants Functional Prioritization Methods Based on Predicted Regulatory Factor Binding Sites.

Haoyue Fu1, Xiangde Zhang1.   

Abstract

BACKGROUNDS: With the advent of the post genomic era, the research for the genetic mechanism of the diseases has found to be increasingly depended on the studies of the genes, the gene-networks and gene-protein interaction networks. To explore gene expression and regulation, the researchers have carried out many studies on transcription factors and their binding sites (TFBSs). Based on the large amount of transcription factor binding sites predicting values in the deep learning models, further computation and analysis have been done to reveal the relationship between the gene mutation and the occurrence of the disease. It has been demonstrated that based on the deep learning methods, the performances of the prediction for the functions of the noncoding variants are outperforming than those of the conventional methods. The research on the prediction for functions of Single Nucleotide Polymorphisms (SNPs) is expected to uncover the mechanism of the gene mutation affection on traits and diseases of human beings.
RESULTS: We reviewed the conventional TFBSs identification methods from different perspectives. As for the deep learning methods to predict the TFBSs, we discussed the related problems, such as the raw data preprocessing, the structure design of the deep convolution neural network (CNN) and the model performance measure et al. And then we summarized the techniques that usually used in finding out the functional noncoding variants from de novo sequence.
CONCLUSION: Along with the rapid development of the high-throughout assays, more and more sample data and chromatin features would be conducive to improve the prediction accuracy of the deep convolution neural network for TFBSs identification. Meanwhile, getting more insights into the deep CNN framework itself has been proved useful for both the promotion on model performance and the development for more suitable design to sample data. Based on the feature values predicted by the deep CNN model, the prioritization model for functional noncoding variants would contribute to reveal the affection of gene mutation on the diseases.

Entities:  

Keywords:  Deep convolution neural network; Noncoding variants; Saturated mutagenesis; Single nucleotide polymorphisms; Transcription factor binding sites

Year:  2017        PMID: 29081688      PMCID: PMC5635616          DOI: 10.2174/1389202918666170228143619

Source DB:  PubMed          Journal:  Curr Genomics        ISSN: 1389-2029            Impact factor:   2.236


  111 in total

Review 1.  DNA binding sites: representation and discovery.

Authors:  G D Stormo
Journal:  Bioinformatics       Date:  2000-01       Impact factor: 6.937

2.  ANN-Spec: a method for discovering transcription factor binding sites with improved specificity.

Authors:  C T Workman; G D Stormo
Journal:  Pac Symp Biocomput       Date:  2000

Review 3.  Accessing genetic variation: genotyping single nucleotide polymorphisms.

Authors:  A C Syvänen
Journal:  Nat Rev Genet       Date:  2001-12       Impact factor: 53.242

4.  High-resolution genome-wide in vivo footprinting of diverse transcription factors in human cells.

Authors:  Alan P Boyle; Lingyun Song; Bum-Kyu Lee; Darin London; Damian Keefe; Ewan Birney; Vishwanath R Iyer; Gregory E Crawford; Terrence S Furey
Journal:  Genome Res       Date:  2010-11-24       Impact factor: 9.043

5.  Phylogenetic footprinting of transcription factor binding sites in proteobacterial genomes.

Authors:  L McCue; W Thompson; C Carmack; M P Ryan; J S Liu; V Derbyshire; C E Lawrence
Journal:  Nucleic Acids Res       Date:  2001-02-01       Impact factor: 16.971

Review 6.  Control of cell fate and differentiation by Sry-related high-mobility-group box (Sox) transcription factors.

Authors:  Véronique Lefebvre; Bogdan Dumitriu; Alfredo Penzo-Méndez; Yu Han; Bhattaram Pallavi
Journal:  Int J Biochem Cell Biol       Date:  2007-06-06       Impact factor: 5.085

7.  Architecture of the human regulatory network derived from ENCODE data.

Authors:  Mark B Gerstein; Anshul Kundaje; Manoj Hariharan; Stephen G Landt; Koon-Kiu Yan; Chao Cheng; Xinmeng Jasmine Mu; Ekta Khurana; Joel Rozowsky; Roger Alexander; Renqiang Min; Pedro Alves; Alexej Abyzov; Nick Addleman; Nitin Bhardwaj; Alan P Boyle; Philip Cayting; Alexandra Charos; David Z Chen; Yong Cheng; Declan Clarke; Catharine Eastman; Ghia Euskirchen; Seth Frietze; Yao Fu; Jason Gertz; Fabian Grubert; Arif Harmanci; Preti Jain; Maya Kasowski; Phil Lacroute; Jing Jane Leng; Jin Lian; Hannah Monahan; Henriette O'Geen; Zhengqing Ouyang; E Christopher Partridge; Dorrelyn Patacsil; Florencia Pauli; Debasish Raha; Lucia Ramirez; Timothy E Reddy; Brian Reed; Minyi Shi; Teri Slifer; Jing Wang; Linfeng Wu; Xinqiong Yang; Kevin Y Yip; Gili Zilberman-Schapira; Serafim Batzoglou; Arend Sidow; Peggy J Farnham; Richard M Myers; Sherman M Weissman; Michael Snyder
Journal:  Nature       Date:  2012-09-06       Impact factor: 49.962

8.  A user's guide to the encyclopedia of DNA elements (ENCODE).

Authors: 
Journal:  PLoS Biol       Date:  2011-04-19       Impact factor: 8.029

9.  ENCODE whole-genome data in the UCSC Genome Browser.

Authors:  Kate R Rosenbloom; Timothy R Dreszer; Michael Pheasant; Galt P Barber; Laurence R Meyer; Andy Pohl; Brian J Raney; Ting Wang; Angie S Hinrichs; Ann S Zweig; Pauline A Fujita; Katrina Learned; Brooke Rhead; Kayla E Smith; Robert M Kuhn; Donna Karolchik; David Haussler; W James Kent
Journal:  Nucleic Acids Res       Date:  2009-11-17       Impact factor: 16.971

10.  Evolutionarily dynamic alternative splicing of GPR56 regulates regional cerebral cortical patterning.

Authors:  Byoung-Il Bae; Ian Tietjen; Kutay D Atabay; Gilad D Evrony; Matthew B Johnson; Ebenezer Asare; Peter P Wang; Ayako Y Murayama; Kiho Im; Steven N Lisgo; Lynne Overman; Nenad Šestan; Bernard S Chang; A James Barkovich; P Ellen Grant; Meral Topçu; Jeffrey Politsky; Hideyuki Okano; Xianhua Piao; Christopher A Walsh
Journal:  Science       Date:  2014-02-14       Impact factor: 47.728

View more
  1 in total

Review 1.  Systematics for types and effects of DNA variations.

Authors:  Mauno Vihinen
Journal:  BMC Genomics       Date:  2018-12-28       Impact factor: 3.969

  1 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.