Literature DB >> 25248637

Discriminating between deleterious and neutral non-frameshifting indels based on protein interaction networks and hybrid properties.

Ning Zhang1, Tao Huang, Yu-Dong Cai.   

Abstract

More than ten thousand coding variants are contained in each human genome; however, our knowledge of the way genetic variants underlie phenotypic differences is far from complete. Small insertions and deletions (indels) are one of the most common types of human genetic variants, and indels play a significant role in human inherited disease. To date, we still lack a comprehensive understanding of how indels cause diseases. Therefore, identification and analysis of such deleterious variants is a key challenge and has been of great interest in the current research in genome biology. Increasing numbers of computational methods have been developed for discriminating between deleterious indels and neutral indels. However, most of the existing methods are based on traditional sequential or structural features, which cannot completely explain the association between indels and the resulting induced inherited disease. In this study, we establish a novel method to predict deleterious non-frameshifting indels based on features extracted from both protein interaction networks and traditional hybrid properties. Each indel was coded by 1,246 features. Using the maximum relevance minimum redundancy method and the incremental feature selection method, we obtained an optimal feature set containing 42 features, of which 21 features were derived from protein interaction networks. Based on the optimal feature set, an 88 % accuracy and a 0.76 MCC value were achieved by a Random Forest as evaluated by the Jackknife cross-validation test. This method outperformed existing methods of predicting deleterious indels, and can be applied in practice for deleterious non-frameshifting indel predictions in genome research. The analysis of the optimal features selected in the model revealed that network interactions play more important roles and could be informative for better illustrating an indel's function and disease associations than traditional sequential or structural features. These results could shed some light on the genetic basis of human genetic variations and human inherited diseases.

Entities:  

Mesh:

Year:  2014        PMID: 25248637     DOI: 10.1007/s00438-014-0922-5

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   3.291


  55 in total

1.  A sequence-based approach for predicting protein disordered regions.

Authors:  Tao Huang; Zhi-Song He; Wei-Ren Cui; Yu-Dong Cai; Xiao-He Shi; Le-Le Hu; Kuo-Chen Chou
Journal:  Protein Pept Lett       Date:  2013-03       Impact factor: 1.890

2.  Solving the protein sequence metric problem.

Authors:  William R Atchley; Jieping Zhao; Andrew D Fernandes; Tanja Drüke
Journal:  Proc Natl Acad Sci U S A       Date:  2005-04-25       Impact factor: 11.205

3.  Predicting the functional consequences of non-synonymous DNA sequence variants--evaluation of bioinformatics tools and development of a consensus strategy.

Authors:  Kimon Frousios; Costas S Iliopoulos; Thomas Schlitt; Michael A Simpson
Journal:  Genomics       Date:  2013-07-03       Impact factor: 5.736

4.  Evolution and functional impact of rare coding variation from deep sequencing of human exomes.

Authors:  Jacob A Tennessen; Abigail W Bigham; Timothy D O'Connor; Wenqing Fu; Eimear E Kenny; Simon Gravel; Sean McGee; Ron Do; Xiaoming Liu; Goo Jun; Hyun Min Kang; Daniel Jordan; Suzanne M Leal; Stacey Gabriel; Mark J Rieder; Goncalo Abecasis; David Altshuler; Deborah A Nickerson; Eric Boerwinkle; Shamil Sunyaev; Carlos D Bustamante; Michael J Bamshad; Joshua M Akey
Journal:  Science       Date:  2012-05-17       Impact factor: 47.728

5.  Predicting drug-target interaction networks based on functional groups and biological features.

Authors:  Zhisong He; Jian Zhang; Xiao-He Shi; Le-Le Hu; Xiangyin Kong; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2010-03-11       Impact factor: 3.240

6.  Analysis and prediction of the metabolic stability of proteins based on their sequential features, subcellular locations and interaction networks.

Authors:  Tao Huang; Xiao-He Shi; Ping Wang; Zhisong He; Kai-Yan Feng; Lele Hu; Xiangyin Kong; Yi-Xue Li; Yu-Dong Cai; Kuo-Chen Chou
Journal:  PLoS One       Date:  2010-06-04       Impact factor: 3.240

7.  Predicting the effects of frameshifting indels.

Authors:  Jing Hu; Pauline C Ng
Journal:  Genome Biol       Date:  2012-02-09       Impact factor: 13.583

8.  Relationship between insertion/deletion (indel) frequency of proteins and essentiality.

Authors:  Simon K Chan; Michael Hsing; Fereydoun Hormozdiari; Artem Cherkasov
Journal:  BMC Bioinformatics       Date:  2007-06-28       Impact factor: 3.169

9.  Signal propagation in protein interaction network during colorectal cancer progression.

Authors:  Yang Jiang; Tao Huang; Lei Chen; Yu-Fei Gao; Yudong Cai; Kuo-Chen Chou
Journal:  Biomed Res Int       Date:  2013-03-20       Impact factor: 3.411

10.  The Human Gene Mutation Database: 2008 update.

Authors:  Peter D Stenson; Matthew Mort; Edward V Ball; Katy Howells; Andrew D Phillips; Nick St Thomas; David N Cooper
Journal:  Genome Med       Date:  2009-01-22       Impact factor: 11.117

View more
  14 in total

Review 1.  Computational approaches to study the effects of small genomic variations.

Authors:  Kamil Khafizov; Maxim V Ivanov; Olga V Glazova; Sergei P Kovalenko
Journal:  J Mol Model       Date:  2015-09-08       Impact factor: 1.810

2.  The early detection of asthma based on blood gene expression.

Authors:  Shao-Bin Wang; Tao Huang
Journal:  Mol Biol Rep       Date:  2018-11-12       Impact factor: 2.316

3.  Genetic differences among ethnic groups.

Authors:  Tao Huang; Yang Shu; Yu-Dong Cai
Journal:  BMC Genomics       Date:  2015-12-21       Impact factor: 3.969

4.  Identification of tumor-educated platelet biomarkers of non-small-cell lung cancer.

Authors:  Meiling Sheng; Zhaohui Dong; Yanping Xie
Journal:  Onco Targets Ther       Date:  2018-11-14       Impact factor: 4.147

5.  Identification of the predictive genes for the response of colorectal cancer patients to FOLFOX therapy.

Authors:  Hengjun Lin; Xueke Qiu; Bo Zhang; Jichao Zhang
Journal:  Onco Targets Ther       Date:  2018-09-17       Impact factor: 4.147

6.  Identification and Analysis of Blood Gene Expression Signature for Osteoarthritis With Advanced Feature Selection Methods.

Authors:  Jing Li; Chun-Na Lan; Ying Kong; Song-Shan Feng; Tao Huang
Journal:  Front Genet       Date:  2018-08-30       Impact factor: 4.599

7.  The blood transcriptional signature for active and latent tuberculosis.

Authors:  Min Deng; Xiao-Dong Lv; Zhi-Xian Fang; Xin-Sheng Xie; Wen-Yu Chen
Journal:  Infect Drug Resist       Date:  2019-01-30       Impact factor: 4.003

8.  Pathogenicity and functional impact of non-frameshifting insertion/deletion variation in the human genome.

Authors:  Kymberleigh A Pagel; Danny Antaki; AoJie Lian; Matthew Mort; David N Cooper; Jonathan Sebat; Lilia M Iakoucheva; Sean D Mooney; Predrag Radivojac
Journal:  PLoS Comput Biol       Date:  2019-06-14       Impact factor: 4.475

9.  Cross talk of chromosome instability, CpG island methylator phenotype and mismatch repair in colorectal cancer.

Authors:  Tian-Ming Zhang; Tao Huang; Rong-Fei Wang
Journal:  Oncol Lett       Date:  2018-05-31       Impact factor: 2.967

10.  RFAmyloid: A Web Server for Predicting Amyloid Proteins.

Authors:  Mengting Niu; Yanjuan Li; Chunyu Wang; Ke Han
Journal:  Int J Mol Sci       Date:  2018-07-16       Impact factor: 5.923

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.