Literature DB >> 27429444

A Fast PC Algorithm for High Dimensional Causal Discovery with Multi-Core PCs.

Thuc Duy Le, Tao Hoang, Jiuyong Li, Lin Liu, Huawen Liu, Shu Hu.   

Abstract

Discovering causal relationships from observational data is a crucial problem and it has applications in many research areas. The PC algorithm is the state-of-the-art constraint based method for causal discovery. However, runtime of the PC algorithm, in the worst-case, is exponential to the number of nodes (variables), and thus it is inefficient when being applied to high dimensional data, e.g., gene expression datasets. On another note, the advancement of computer hardware in the last decade has resulted in the widespread availability of multi-core personal computers. There is a significant motivation for designing a parallelized PC algorithm that is suitable for personal computers and does not require end users' parallel computing knowledge beyond their competency in using the PC algorithm. In this paper, we develop parallel-PC, a fast and memory efficient PC algorithm using the parallel computing technique. We apply our method to a range of synthetic and real-world high dimensional datasets. Experimental results on a dataset from the DREAM 5 challenge show that the original PC algorithm could not produce any results after running more than 24 hours; meanwhile, our parallel-PC algorithm managed to finish within around 12 hours with a 4-core CPU computer, and less than six hours with a 8-core CPU computer. Furthermore, we integrate parallel-PC into a causal inference method for inferring miRNA-mRNA regulatory relationships. The experimental results show that parallel-PC helps improve both the efficiency and accuracy of the causal inference algorithm.

Entities:  

Mesh:

Year:  2016        PMID: 27429444     DOI: 10.1109/TCBB.2016.2591526

Source DB:  PubMed          Journal:  IEEE/ACM Trans Comput Biol Bioinform        ISSN: 1545-5963            Impact factor:   3.710


  6 in total

1.  Widespread Dysregulation of Long Noncoding Genes Associated With Fatty Acid Metabolism, Cell Division, and Immune Response Gene Networks in Xenobiotic-exposed Rat Liver.

Authors:  Kritika Karri; David J Waxman
Journal:  Toxicol Sci       Date:  2020-04-01       Impact factor: 4.849

2.  Bivariate Causal Discovery and Its Applications to Gene Expression and Imaging Data Analysis.

Authors:  Rong Jiao; Nan Lin; Zixin Hu; David A Bennett; Li Jin; Momiao Xiong
Journal:  Front Genet       Date:  2018-08-31       Impact factor: 4.599

Review 3.  A review of causal discovery methods for molecular network analysis.

Authors:  Jack Kelly; Carlo Berzuini; Bernard Keavney; Maciej Tomaszewski; Hui Guo
Journal:  Mol Genet Genomic Med       Date:  2022-09-10       Impact factor: 2.473

4.  Biomarker Categorization in Transcriptomic Meta-Analysis by Concordant Patterns With Application to Pan-Cancer Studies.

Authors:  Zhenyao Ye; Hongjie Ke; Shuo Chen; Raul Cruz-Cano; Xin He; Jing Zhang; Joanne Dorgan; Donald K Milton; Tianzhou Ma
Journal:  Front Genet       Date:  2021-07-02       Impact factor: 4.599

5.  Comparison of strategies for scalable causal discovery of latent variable models from mixed data.

Authors:  Vineet K Raghu; Joseph D Ramsey; Alison Morris; Dimitrios V Manatakis; Peter Sprites; Panos K Chrysanthis; Clark Glymour; Panayiotis V Benos
Journal:  Int J Data Sci Anal       Date:  2018-02-06

Review 6.  The Role of Network Science in Glioblastoma.

Authors:  Marta B Lopes; Eduarda P Martins; Susana Vinga; Bruno M Costa
Journal:  Cancers (Basel)       Date:  2021-03-02       Impact factor: 6.639

  6 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.