Literature DB >> 30908915

Multitask Modeling with Confidence Using Matrix Factorization and Conformal Prediction.

Ulf Norinder1,2, Fredrik Svensson3,4.   

Abstract

Multitask prediction of bioactivities is often faced with challenges relating to the sparsity of data and imbalance between different labels. We propose class conditional (Mondrian) conformal predictors using underlying Macau models as a novel approach for large scale bioactivity prediction. This approach handles both high degrees of missing data and label imbalances while still producing high quality predictive models. When applied to ten assay end points from PubChem, the models generated valid models with an efficiency of 74.0-80.1% at the 80% confidence level with similar performance both for the minority and majority class. Also when deleting progressively larger portions of the available data (0-80%) the performance of the models remained robust with only minor deterioration (reduction in efficiency between 5 and 10%). Compared to using Macau without conformal prediction the method presented here significantly improves the performance on imbalanced data sets.

Mesh:

Year:  2019        PMID: 30908915     DOI: 10.1021/acs.jcim.9b00027

Source DB:  PubMed          Journal:  J Chem Inf Model        ISSN: 1549-9596            Impact factor:   4.956


  2 in total

1.  Recommender Systems in Antiviral Drug Discovery.

Authors:  Ekaterina A Sosnina; Sergey Sosnin; Anastasia A Nikitina; Ivan Nazarov; Dmitry I Osolodkin; Maxim V Fedorov
Journal:  ACS Omega       Date:  2020-06-21

2.  Skin Doctor CP: Conformal Prediction of the Skin Sensitization Potential of Small Organic Molecules.

Authors:  Anke Wilm; Ulf Norinder; M Isabel Agea; Christina de Bruyn Kops; Conrad Stork; Jochen Kühnl; Johannes Kirchmair
Journal:  Chem Res Toxicol       Date:  2020-12-09       Impact factor: 3.739

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.