Literature DB >> 24091401

Informative SNPs selection based on two-locus and multilocus linkage disequilibrium: criteria of max-correlation and min-redundancy.

Xiong Li1, Bo Liao, Lijun Cai, Zhi Cao, Wen Zhu.   

Abstract

Currently, there are lots of methods to select informative SNPs for haplotype reconstruction. However, there are still some challenges that render them ineffective for large data sets. First, some traditional methods belong to wrappers which are of high computational complexity. Second, some methods ignore linkage disequilibrium that it is hard to interpret selection results. In this study, we innovatively derive optimization criteria by combining two-locus and multilocus LD measure to obtain the criteria of Max-Correlation and Min-Redundancy (MCMR). Then, we use a greedy algorithm to select the candidate set of informative SNPs constrained by the criteria. Finally, we use backward scheme to refine the candidate subset. We separately use small and middle (>1,000 SNPs) data sets to evaluate MCMR in terms of the reconstuction accuracy, the time complexity, and the compactness. Additionally, to demonstrate that MCMR is practical for large data sets, we design a parameter w to adapt to various platforms and introduce another replacement scheme for larger data sets, which sharply narrow down the computational complexity of evaluating the reconstruct ratio. Then, we first apply our method based on haplotype reconstruction for large size (>5,000 SNPs) data sets. The results confirm that MCMR leads to promising improvement in informative SNPs selection and prediction accuracy.

Mesh:

Year:  2013        PMID: 24091401     DOI: 10.1109/TCBB.2013.61

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  6 in total

1.  Informative SNP Selection Based on a Fuzzy Clustering and Improved Binary Particle Swarm Optimization Algorithm.

Authors:  Zejun Li; Li Ang; Wei Shi; Ning Xin; Min Chen; Hua Tang
Journal:  Comput Math Methods Med       Date:  2022-06-16       Impact factor: 2.809

2.  Discovering Genome-Wide Tag SNPs Based on the Mutual Information of the Variants.

Authors:  Abdulkadir Elmas; Tai-Hsien Ou Yang; Xiaodong Wang; Dimitris Anastassiou
Journal:  PLoS One       Date:  2016-12-16       Impact factor: 3.240

3.  Semi-Supervised Maximum Discriminative Local Margin for Gene Selection.

Authors:  Zejun Li; Bo Liao; Lijun Cai; Min Chen; Wenhua Liu
Journal:  Sci Rep       Date:  2018-06-05       Impact factor: 4.379

4.  Predicting Influenza Antigenicity by Matrix Completion With Antigen and Antiserum Similarity.

Authors:  Peng Wang; Wen Zhu; Bo Liao; Lijun Cai; Lihong Peng; Jialiang Yang
Journal:  Front Microbiol       Date:  2018-10-23       Impact factor: 5.640

5.  Improved Pre-miRNAs Identification Through Mutual Information of Pre-miRNA Sequences and Structures.

Authors:  Xiangzheng Fu; Wen Zhu; Lijun Cai; Bo Liao; Lihong Peng; Yifan Chen; Jialiang Yang
Journal:  Front Genet       Date:  2019-02-25       Impact factor: 4.599

6.  Gene function prediction based on combining gene ontology hierarchy with multi-instance multi-label learning.

Authors:  Zejun Li; Bo Liao; Yun Li; Wenhua Liu; Min Chen; Lijun Cai
Journal:  RSC Adv       Date:  2018-08-10       Impact factor: 4.036

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.