| Literature DB >> 28426940 |
Dante A Pertusi1, Gregory O'Donnell2,3, Michelle F Homsher2,3, Kelli Solly2,3, Amita Patel2,3, Shannon L Stahler2,3, Daniel Riley2,3, Michael F Finley2,4, Eleftheria N Finger2,5, Gregory C Adam2,3, Juncai Meng2, David J Bell2,6, Paul D Zuck6,7, Edward M Hudak8, Michael J Weber7, Jennifer E Nothstein3,7, Louis Locco7, Carissa Quinn4,7, Adam Amoss7, Brian Squadroni3,7, Michelle Hartnett4,7, Mee Ra Heo2,6, Tara White8, S Alex May7, Evelyn Boots2, Kenneth Roberts7, Patrick Cocchiarella8, Alex Wolicki2, Anthony Kreamer2,9, Peter S Kutchukian10, Anne Mai Wassermann10, Victor N Uebele2,6, Meir Glick10, Andrew Rusinko1, J Christopher Culberson1.
Abstract
High-throughput screening (HTS) is a widespread method in early drug discovery for identifying promising chemical matter that modulates a target or phenotype of interest. Because HTS campaigns involve screening millions of compounds, it is often desirable to initiate screening with a subset of the full collection. Subsequently, virtual screening methods prioritize likely active compounds in the remaining collection in an iterative process. With this approach, orthogonal virtual screening methods are often applied, necessitating the prioritization of hits from different approaches. Here, we introduce a novel method of fusing these prioritizations and benchmark it prospectively on 17 screening campaigns using virtual screening methods in three descriptor spaces. We found that the fusion approach retrieves 15% to 65% more active chemical series than any single machine-learning method and that appropriately weighting contributions of similarity and machine-learning scoring techniques can increase enrichment by 1% to 19%. We also use fusion scoring to evaluate the tradeoff between screening more chemical matter initially in lieu of replicate samples to prevent false-positives and find that the former option leads to the retrieval of more active chemical series. These results represent guidelines that can increase the rate of identification of promising active compounds in future iterative screens.Entities:
Keywords: chemoinformatics; computational chemistry; statistical analyses; ultra-high-throughput screening
Mesh:
Year: 2017 PMID: 28426940 DOI: 10.1177/2472555217706058
Source DB: PubMed Journal: SLAS Discov ISSN: 2472-5552 Impact factor: 3.341