Literature DB >> 34319533

Facing small and biased data dilemma in drug discovery with enhanced federated learning approaches.

Zhaoping Xiong1,2,3, Ziqiang Cheng1,4, Xinyuan Lin5, Chi Xu5, Xiaohong Liu1,2, Dingyan Wang2,3, Xiaomin Luo2, Yong Zhang5, Hualiang Jiang6,7, Nan Qiao8, Mingyue Zheng9.   

Abstract

Artificial intelligence (AI) models usually require large amounts of high-quality training data, which is in striking contrast to the situation of small and biased data faced by current drug discovery pipelines. The concept of federated learning has been proposed to utilize distributed data from different sources without leaking sensitive information of the data. This emerging decentralized machine learning paradigm is expected to dramatically improve the success rate of AI-powered drug discovery. Here, we simulated the federated learning process with different property and activity datasets from different sources, among which overlapping molecules with high or low biases exist in the recorded values. Beyond the benefit of gaining more data, we also demonstrated that federated training has a regularization effect superior to centralized training on the pooled datasets with high biases. Moreover, different network architectures for clients and aggregation algorithms for coordinators have been compared on the performance of federated learning, where personalized federated learning shows promising results. Our work demonstrates the applicability of federated learning in predicting drug-related properties and highlights its promising role in addressing the small and biased data dilemma in drug discovery.
© 2021. Science China Press and Springer-Verlag GmbH Germany, part of Springer Nature.

Entities:  

Keywords:  FedAMP; Non-IID data; drug discovery; federated learning

Mesh:

Substances:

Year:  2021        PMID: 34319533     DOI: 10.1007/s11427-021-1946-0

Source DB:  PubMed          Journal:  Sci China Life Sci        ISSN: 1674-7305            Impact factor:   6.038


  37 in total

1.  A model for identifying HERG K+ channel blockers.

Authors:  Alex M Aronov; Brian B Goldman
Journal:  Bioorg Med Chem       Date:  2004-05-01       Impact factor: 3.641

Review 2.  Predictive in silico modeling for hERG channel blockers.

Authors:  Alex M Aronov
Journal:  Drug Discov Today       Date:  2005-01-15       Impact factor: 7.851

3.  Comparative study of the structure and interaction of the pore helices of the hERG and Kv1.5 potassium channels in model membranes.

Authors:  Maïwenn Beaugrand; Alexandre A Arnold; Steve Bourgault; Philip T F Williamson; Isabelle Marcotte
Journal:  Eur Biophys J       Date:  2017-03-17       Impact factor: 1.733

4.  Deep Learning-Based Prediction of Drug-Induced Cardiotoxicity.

Authors:  Chuipu Cai; Pengfei Guo; Yadi Zhou; Jingwei Zhou; Qi Wang; Fengxue Zhang; Jiansong Fang; Feixiong Cheng
Journal:  J Chem Inf Model       Date:  2019-02-15       Impact factor: 4.956

5.  A human ether-á-go-go-related (hERG) ion channel atomistic model generated by long supercomputer molecular dynamics simulations and its use in predicting drug cardiotoxicity.

Authors:  Anwar Anwar-Mohamed; Khaled H Barakat; Rakesh Bhat; Sergei Y Noskov; D Lorne Tyrrell; Jack A Tuszynski; Michael Houghton
Journal:  Toxicol Lett       Date:  2014-08-13       Impact factor: 4.372

6.  A systematic profile of clinical inhibitors responsive to EGFR somatic amino acid mutations in lung cancer: implication for the molecular mechanism of drug resistance and sensitivity.

Authors:  Xinghao Ai; Yingjia Sun; Haidong Wang; Shun Lu
Journal:  Amino Acids       Date:  2014-07       Impact factor: 3.520

7.  Quantitative prediction of the arrhythmogenic effects of de novo hERG mutations in computational models of human ventricular tissues.

Authors:  Alan P Benson; Moza Al-Owais; Arun V Holden
Journal:  Eur Biophys J       Date:  2011-01-14       Impact factor: 1.733

8.  Comprehensive assay of kinase catalytic activity reveals features of kinase inhibitor selectivity.

Authors:  Theonie Anastassiadis; Sean W Deacon; Karthik Devarajan; Haiching Ma; Jeffrey R Peterson
Journal:  Nat Biotechnol       Date:  2011-10-30       Impact factor: 54.908

Review 9.  Harnessing big 'omics' data and AI for drug discovery in hepatocellular carcinoma.

Authors:  Bin Chen; Lana Garmire; Diego F Calvisi; Mei-Sze Chua; Robin K Kelley; Xin Chen
Journal:  Nat Rev Gastroenterol Hepatol       Date:  2020-01-03       Impact factor: 46.802

10.  The ChEMBL bioactivity database: an update.

Authors:  A Patrícia Bento; Anna Gaulton; Anne Hersey; Louisa J Bellis; Jon Chambers; Mark Davies; Felix A Krüger; Yvonne Light; Lora Mak; Shaun McGlinchey; Michal Nowotka; George Papadatos; Rita Santos; John P Overington
Journal:  Nucleic Acids Res       Date:  2013-11-07       Impact factor: 16.971

View more
  2 in total

1.  Quorum sensing-based interactions among drugs, microbes, and diseases.

Authors:  Shengbo Wu; Shujuan Yang; Manman Wang; Nan Song; Jie Feng; Hao Wu; Aidong Yang; Chunjiang Liu; Yanni Li; Fei Guo; Jianjun Qiao
Journal:  Sci China Life Sci       Date:  2022-08-04       Impact factor: 10.372

2.  Federated learning of molecular properties with graph neural networks in a heterogeneous setting.

Authors:  Wei Zhu; Jiebo Luo; Andrew D White
Journal:  Patterns (N Y)       Date:  2022-06-02
  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.