Literature DB >> 26319782

iRSpot-GAEnsC: identifing recombination spots via ensemble classifier and extending the concept of Chou's PseAAC to formulate DNA samples.

Muhammad Kabir1, Maqsood Hayat2.   

Abstract

Meiotic recombination is vital for maintaining the sequence diversity in human genome. Meiosis and recombination are considered the essential phases of cell division. In meiosis, the genome is divided into equal parts for sexual reproduction whereas in recombination, the diverse genomes are combined to form new combination of genetic variations. Recombination process does not occur randomly across the genomes, it targets specific areas called recombination "hotspots" and "coldspots". Owing to huge exploration of polygenetic sequences in data banks, it is impossible to recognize the sequences through conventional methods. Looking at the significance of recombination spots, it is indispensable to develop an accurate, fast, robust, and high-throughput automated computational model. In this model, the numerical descriptors are extracted using two sequence representation schemes namely: dinucleotide composition and trinucleotide composition. The performances of seven classification algorithms were investigated. Finally, the predicted outcomes of individual classifiers are fused to form ensemble classification, which is formed through majority voting and genetic algorithm (GA). The performance of GA-based ensemble model is quite promising compared to individual classifiers and majority voting-based ensemble model. iRSpot-GAEnsC has achieved 84.46 % accuracy. The empirical results revealed that the performance of iRSpot-GAEnsC is not only higher than the examined algorithms but also better than existing methods in the literature developed so far. It is anticipated that the proposed model might be helpful for research community, academia and for drug discovery.

Entities:  

Keywords:  DNA; DNC; PNN; SVM; TNC

Mesh:

Substances:

Year:  2015        PMID: 26319782     DOI: 10.1007/s00438-015-1108-5

Source DB:  PubMed          Journal:  Mol Genet Genomics        ISSN: 1617-4623            Impact factor:   3.291


  85 in total

1.  Using the concept of Chou's pseudo amino acid composition for risk type prediction of human papillomaviruses.

Authors:  Maryam Esmaeili; Hassan Mohabatkar; Sasan Mohsenzadeh
Journal:  J Theor Biol       Date:  2009-12-02       Impact factor: 2.691

2.  Prediction of G-protein-coupled receptor classes in low homology using Chou's pseudo amino acid composition with approximate entropy and hydrophobicity patterns.

Authors:  Q Gu; Y S Ding; T L Zhang
Journal:  Protein Pept Lett       Date:  2010-05       Impact factor: 1.890

3.  TargetFreeze: Identifying Antifreeze Proteins via a Combination of Weights using Sequence Evolutionary Information and Pseudo Amino Acid Composition.

Authors:  Xue He; Ke Han; Jun Hu; Hui Yan; Jing-Yu Yang; Hong-Bin Shen; Dong-Jun Yu
Journal:  J Membr Biol       Date:  2015-06-10       Impact factor: 1.843

4.  iDNA-Methyl: identifying DNA methylation sites via pseudo trinucleotide composition.

Authors:  Zi Liu; Xuan Xiao; Wang-Ren Qiu; Kuo-Chen Chou
Journal:  Anal Biochem       Date:  2015-01-14       Impact factor: 3.365

5.  Predicting subcellular localization of mycobacterial proteins by using Chou's pseudo amino acid composition.

Authors:  Hao Lin; Hui Ding; Feng-Biao Guo; An-Ying Zhang; Jian Huang
Journal:  Protein Pept Lett       Date:  2008       Impact factor: 1.890

6.  Prediction of cell wall lytic enzymes using Chou's amphiphilic pseudo amino acid composition.

Authors:  Hui Ding; Liaofu Luo; Hao Lin
Journal:  Protein Pept Lett       Date:  2009       Impact factor: 1.890

7.  Identification of Heat Shock Protein families and J-protein types by incorporating Dipeptide Composition into Chou's general PseAAC.

Authors:  Saeed Ahmad; Muhammad Kabir; Maqsood Hayat
Journal:  Comput Methods Programs Biomed       Date:  2015-07-22       Impact factor: 5.428

8.  Prediction of G-protein-coupled receptor classes based on the concept of Chou's pseudo amino acid composition: an approach from discrete wavelet transform.

Authors:  Jian-Ding Qiu; Jian-Hua Huang; Ru-Ping Liang; Xiao-Quan Lu
Journal:  Anal Biochem       Date:  2009-04-11       Impact factor: 3.365

9.  Identification of real microRNA precursors with a pseudo structure status composition approach.

Authors:  Bin Liu; Longyun Fang; Fule Liu; Xiaolong Wang; Junjie Chen; Kuo-Chen Chou
Journal:  PLoS One       Date:  2015-03-30       Impact factor: 3.240

10.  Some remarks on protein attribute prediction and pseudo amino acid composition.

Authors:  Kuo-Chen Chou
Journal:  J Theor Biol       Date:  2010-12-17       Impact factor: 2.691

View more
  27 in total

1.  MULTiPly: a novel multi-layer predictor for discovering general and specific types of promoters.

Authors:  Meng Zhang; Fuyi Li; Tatiana T Marquez-Lago; André Leier; Cunshuo Fan; Chee Keong Kwoh; Kuo-Chen Chou; Jiangning Song; Cangzhi Jia
Journal:  Bioinformatics       Date:  2019-09-01       Impact factor: 6.937

Review 2.  Some illuminating remarks on molecular genetics and genomics as well as drug development.

Authors:  Kuo-Chen Chou
Journal:  Mol Genet Genomics       Date:  2020-01-01       Impact factor: 3.291

3.  Prediction of Protein Submitochondrial Locations by Incorporating Dipeptide Composition into Chou's General Pseudo Amino Acid Composition.

Authors:  Khurshid Ahmad; Muhammad Waris; Maqsood Hayat
Journal:  J Membr Biol       Date:  2016-01-08       Impact factor: 1.843

4.  iACP: a sequence-based tool for identifying anticancer peptides.

Authors:  Wei Chen; Hui Ding; Pengmian Feng; Hao Lin; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-03-29

5.  iHyd-PseCp: Identify hydroxyproline and hydroxylysine in proteins by incorporating sequence-coupled effects into general PseAAC.

Authors:  Wang-Ren Qiu; Bi-Qian Sun; Xuan Xiao; Zhao-Chun Xu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-07-12

6.  iCar-PseCp: identify carbonylation sites in proteins by Monte Carlo sampling and incorporating sequence coupled effects into general PseAAC.

Authors:  Jianhua Jia; Zi Liu; Xuan Xiao; Bingxiang Liu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-06-07

7.  iPhos-PseEn: identifying phosphorylation sites in proteins by fusing different pseudo components into an ensemble classifier.

Authors:  Wang-Ren Qiu; Xuan Xiao; Zhao-Chun Xu; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-08-09

8.  iRNA-PseU: Identifying RNA pseudouridine sites.

Authors:  Wei Chen; Hua Tang; Jing Ye; Hao Lin; Kuo-Chen Chou
Journal:  Mol Ther Nucleic Acids       Date:  2016

9.  iRNAm5C-PseDNC: identifying RNA 5-methylcytosine sites by incorporating physical-chemical properties into pseudo dinucleotide composition.

Authors:  Wang-Ren Qiu; Shi-Yu Jiang; Zhao-Chun Xu; Xuan Xiao; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2017-06-20

10.  iROS-gPseKNC: Predicting replication origin sites in DNA by incorporating dinucleotide position-specific propensity into general pseudo nucleotide composition.

Authors:  Xuan Xiao; Han-Xiao Ye; Zi Liu; Jian-Hua Jia; Kuo-Chen Chou
Journal:  Oncotarget       Date:  2016-06-07
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.