
SNPdryad: predicting deleterious non-synonymous human SNPs using only orthologous protein sequences.

Ka-Chun Wong, Zhaolei Zhang.

Abstract

MOTIVATION: Recent advances in genome sequencing have revealed an abundance of non-synonymous polymorphisms among human individuals; consequently, it is of great interest and importance to predict whether such substitutions are functionally neutral or deleterious. The accuracy of such prediction algorithms depends on the quality of the multiple sequence alignment, which is used to infer how well an amino acid substitution is tolerated at a given position. Because orthologous protein sequences were scarce in the past, existing prediction algorithms all include sequences of protein paralogs in the alignment, which can dilute the conservation signal and reduce prediction accuracy. However, we believe that, with a large number of mammalian genomes now sequenced, it is feasible to include only protein orthologs in the alignment and thereby improve prediction performance.
RESULTS: We have developed a novel prediction algorithm, named SNPdryad, which includes only protein orthologs when building a multiple sequence alignment. Among other innovations, SNPdryad uses different conservation scoring schemes and uses Random Forest as its classifier. We tested SNPdryad on several datasets and found that it consistently outperformed other methods on several performance metrics, which we attribute to the exclusion of paralogous sequences. We have also run SNPdryad on the complete human proteome, generating prediction scores for all possible amino acid substitutions.
AVAILABILITY AND IMPLEMENTATION: The algorithm and the prediction results can be accessed from the Web site: http://snps.ccbr.utoronto.ca:8080/SNPdryad/
CONTACT: Zhaolei.Zhang@utoronto.ca
SUPPLEMENTARY INFORMATION: Supplementary data are available at Bioinformatics online.
© The Author 2014. Published by Oxford University Press. All rights reserved. For Permissions, please e-mail: journals.permissions@oup.com.
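
The abstract does not specify which conservation scoring schemes SNPdryad uses, so the following is only an illustrative sketch of the general idea: deriving per-position features from an ortholog-only alignment that a classifier such as Random Forest could consume. The entropy-based score, the function names (`column_conservation`, `substitution_features`), and the toy alignment are all assumptions made for this example and are not taken from the paper.

```python
import math
from collections import Counter

AMINO_ACIDS = "ACDEFGHIKLMNPQRSTVWY"  # the 20 standard residues


def column_conservation(column):
    """Return 1 - normalized Shannon entropy of one alignment column.

    Gaps and non-standard symbols are ignored. This is a generic
    conservation measure, assumed here for illustration only.
    """
    residues = [aa for aa in column if aa in AMINO_ACIDS]
    if not residues:
        return 0.0  # no information at an all-gap column
    total = len(residues)
    counts = Counter(residues)
    entropy = -sum((n / total) * math.log2(n / total) for n in counts.values())
    return 1.0 - entropy / math.log2(len(AMINO_ACIDS))


def substitution_features(alignment, position, mutant):
    """Build a small feature dict for one substitution (hypothetical feature set)."""
    column = [seq[position] for seq in alignment]
    residues = [aa for aa in column if aa in AMINO_ACIDS]
    mutant_freq = residues.count(mutant) / len(residues) if residues else 0.0
    return {
        "conservation": column_conservation(column),
        "mutant_frequency": mutant_freq,
    }


# Toy ortholog-only alignment (made-up sequences, one row per species).
toy_alignment = [
    "MKTAY",
    "MKSAY",
    "MKTGY",
    "MRTAY",
]

if __name__ == "__main__":
    # Substitution to S at the third column, where one ortholog already carries S.
    print(substitution_features(toy_alignment, 2, "S"))
    # Substitution to W at the fully conserved first column.
    print(substitution_features(toy_alignment, 0, "W"))
```

In a sketch like this, a substitution at a fully conserved column to a residue never seen among the orthologs yields high conservation and zero mutant frequency, the kind of signal a downstream classifier could learn to associate with deleterious effects.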

Year:  2014        PMID: 24389653     DOI: 10.1093/bioinformatics/btt769

Source DB:  PubMed          Journal:  Bioinformatics        ISSN: 1367-4803            Impact factor:   6.937

