Literature DB >> 33866367

usDSM: a novel method for deleterious synonymous mutation prediction using undersampling scheme.

Xi Tang1, Tao Zhang2, Na Cheng3, Huadong Wang2, Chun-Hou Zheng2, Junfeng Xia3, Tiejun Zhang4.   

Abstract

Although synonymous mutations do not alter the encoded amino acids, they may impact protein function by interfering with the regulation of RNA splicing or altering transcript splicing. New progress on next-generation sequencing technologies has put the exploration of synonymous mutations at the forefront of precision medicine. Several approaches have been proposed for predicting the deleterious synonymous mutations specifically, but their performance is limited by imbalance of the positive and negative samples. In this study, we firstly expanded the number of samples greatly from various data sources and compared six undersampling strategies to solve the problem of the imbalanced datasets. The results suggested that cluster centroid is the most effective scheme. Secondly, we presented a computational model, undersampling scheme based method for deleterious synonymous mutation (usDSM) prediction, using 14-dimensional biology features and random forest classifier to detect the deleterious synonymous mutation. The results on the test datasets indicated that the proposed usDSM model can attain superior performance in comparison with other state-of-the-art machine learning methods. Lastly, we found that the deep learning model did not play a substantial role in deleterious synonymous mutation prediction through a lot of experiments, although it achieves superior results in other fields. In conclusion, we hope our work will contribute to the future development of computational methods for a more accurate prediction of the deleterious effect of human synonymous mutation. The web server of usDSM is freely accessible at http://usdsm.xialab.info/.
© The Author(s) 2021. Published by Oxford University Press. All rights reserved. For Permissions, please email: journals.permissions@oup.com.

Entities:  

Keywords:  deep learning; deleterious synonymous mutation; machine learning; undersampling scheme

Year:  2021        PMID: 33866367     DOI: 10.1093/bib/bbab123

Source DB:  PubMed          Journal:  Brief Bioinform        ISSN: 1467-5463            Impact factor:   11.622


  3 in total

Review 1.  Synonymous Variants: Necessary Nuance in Our Understanding of Cancer Drivers and Treatment Outcomes.

Authors:  Nayiri M Kaissarian; Douglas Meyer; Chava Kimchi-Sarfaty
Journal:  J Natl Cancer Inst       Date:  2022-08-08       Impact factor: 11.816

2.  Characterization of Synonymous BRCA1:c.132C>T as a Pathogenic Variant.

Authors:  Jun Li; Ping Wang; Cuiyun Zhang; Sile Han; Han Xiao; Zhiyuan Liu; Xiaoyan Wang; Weiling Liu; Bing Wei; Jie Ma; Hongle Li; Yongjun Guo
Journal:  Front Oncol       Date:  2022-01-11       Impact factor: 6.244

3.  When a Synonymous Variant Is Nonsynonymous.

Authors:  Mauno Vihinen
Journal:  Genes (Basel)       Date:  2022-08-19       Impact factor: 4.141

  3 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.