Literature DB >> 33065027

Leveraging Uncertainty in Machine Learning Accelerates Biological Discovery and Design.

Brian Hie1, Bryan D Bryson2, Bonnie Berger3.   

Abstract

Machine learning that generates biological hypotheses has transformative potential, but most learning algorithms are susceptible to pathological failure when exploring regimes beyond the training data distribution. A solution to address this issue is to quantify prediction uncertainty so that algorithms can gracefully handle novel phenomena that confound standard methods. Here, we demonstrate the broad utility of robust uncertainty prediction in biological discovery. By leveraging Gaussian process-based uncertainty prediction on modern pre-trained features, we train a model on just 72 compounds to make predictions over a 10,833-compound library, identifying and experimentally validating compounds with nanomolar affinity for diverse kinases and whole-cell growth inhibition of Mycobacterium tuberculosis. Uncertainty facilitates a tight iterative loop between computation and experimentation and generalizes across biological domains as diverse as protein engineering and single-cell transcriptomics. More broadly, our work demonstrates that uncertainty should play a key role in the increasing adoption of machine learning algorithms into the experimental lifecycle.
Copyright © 2020 The Authors. Published by Elsevier Inc. All rights reserved.

Entities:  

Keywords:  Gaussian process; Mycobacterium tuberculosis; PknB; active learning; compound-kinase affinity; generative design; machine learning; protein fitness landscape; transcriptomic imputation; uncertainty prediction

Year:  2020        PMID: 33065027     DOI: 10.1016/j.cels.2020.09.007

Source DB:  PubMed          Journal:  Cell Syst        ISSN: 2405-4712            Impact factor:   10.304


  19 in total

Review 1.  Learning Strategies in Protein Directed Evolution.

Authors:  Xavier F Cadet; Jean Christophe Gelly; Aster van Noord; Frédéric Cadet; Carlos G Acevedo-Rocha
Journal:  Methods Mol Biol       Date:  2022

2.  Cluster learning-assisted directed evolution.

Authors:  Yuchi Qiu; Jian Hu; Guo-Wei Wei
Journal:  Nat Comput Sci       Date:  2021-12-09

3.  D-SCRIPT translates genome to phenome with sequence-based, structure-aware, genome-scale predictions of protein-protein interactions.

Authors:  Samuel Sledzieski; Rohit Singh; Lenore Cowen; Bonnie Berger
Journal:  Cell Syst       Date:  2021-10-09       Impact factor: 11.091

4.  Triangulating variation in the population to define mechanisms for precision management of genetic disease.

Authors:  Chao Wang; Frédéric Anglès; William E Balch
Journal:  Structure       Date:  2022-06-16       Impact factor: 5.871

5.  Genome-wide mapping of somatic mutation rates uncovers drivers of cancer.

Authors:  Maxwell A Sherman; Adam U Yaari; Oliver Priebe; Felix Dietlein; Po-Ru Loh; Bonnie Berger
Journal:  Nat Biotechnol       Date:  2022-06-20       Impact factor: 68.164

Review 6.  Deep generative models for peptide design.

Authors:  Fangping Wan; Daphne Kontogiorgos-Heintz; Cesar de la Fuente-Nunez
Journal:  Digit Discov       Date:  2022-03-31

Review 7.  The future of antibiotics begins with discovering new combinations.

Authors:  Meilin Zhu; Megan W Tse; Juliane Weller; Julie Chen; Paul C Blainey
Journal:  Ann N Y Acad Sci       Date:  2021-07-02       Impact factor: 6.499

8.  Learning the protein language: Evolution, structure, and function.

Authors:  Tristan Bepler; Bonnie Berger
Journal:  Cell Syst       Date:  2021-06-16       Impact factor: 11.091

9.  Assigning confidence to molecular property prediction.

Authors:  AkshatKumar Nigam; Robert Pollice; Matthew F D Hurley; Riley J Hickman; Matteo Aldeghi; Naruki Yoshikawa; Seyone Chithrananda; Vincent A Voelz; Alán Aspuru-Guzik
Journal:  Expert Opin Drug Discov       Date:  2021-06-15       Impact factor: 7.050

10.  Gene Module Analysis Reveals Cell-Type Specificity and Potential Target Genes in Autism's Pathogenesis.

Authors:  Guoli Ji; Shuchao Li; Lishan Ye; Jinting Guan
Journal:  Biomedicines       Date:  2021-04-10
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.