| Literature DB >> 35511951 |
Brian Goldman1, Steven Kearnes1, Trevor Kramer1, Patrick Riley1, W Patrick Walters1.
Abstract
One application area of computational methods in drug discovery is the automated design of small molecules. Despite the large number of publications describing methods and their application in both retrospective and prospective studies, there is a lack of agreement on terminology and key attributes to distinguish these various systems. We introduce Automated Chemical Design (ACD) Levels to clearly define the level of autonomy along the axes of ideation and decision making. To fully illustrate this framework, we provide literature exemplars and place some notable methods and applications into the levels. The ACD framework provides a common language for describing automated small molecule design systems and enables medicinal chemists to better understand and evaluate such systems.Entities:
Mesh:
Year: 2022 PMID: 35511951 PMCID: PMC9150065 DOI: 10.1021/acs.jmedchem.2c00334
Source DB: PubMed Journal: J Med Chem ISSN: 0022-2623 Impact factor: 8.039
Figure 1Summary graphic for the two dimensions of design automation and the named ACD levels defined in this Perspective. The x-axis represents where the final decision making power lies and the y-axis represents the source of molecular ideas.
Definitions of Words Used to Describe Previous Literature as It Relates to the ACD Framework
| ACD system | A complete system (both human and machine) that has been used to create and test molecules in the real world. |
| components | Algorithms and models that address some of the problems a complete ACD system must address (such as molecular property prediction or decision making under uncertainty). |
| potential ACD system | An assembly of components that would form an ACD system except that it has only been tested in virtual or retrospective ways. |
| approaching an ACD Level X system | A system which almost matches the description of an ACD system at level X, but adds a crucial point of human decision making. |
Figure 2Graphical representation of the difference between standard machine learning and active machine learning.
Figure 3The most notable ACD systems discussed in this paper are grouped into the levels defined in Figure . Systems described as approaching a particular ACD Level in the text are categorized by their actual ACD Level, and no potential ACD systems are listed.
Important Questions to Help Teams Evaluate the Quality of ACD Systems (As Opposed to Categorizing the System into the Appropriate Level)
| How specific is the system to this particular problem vs generally applicable to many problems? |
| Is the system efficient (in terms of time, number of compounds, or money spent) in finding an acceptable molecule? |
| Does the system outperform simple approaches like random selection from the chemical space? |
| How many cycles did the system run autonomously? |
| How complex were the goals given to the machine (for example, optimizing for potency vs balancing multiple competing objectives)? |
| How much choice and guidance does the human exercise? |
| How big was the chemical space of molecules given to the machine? |
| Did the system explore only local modifications to known molecules or larger changes? |
| Did the machine generate a similar space of ideas as medicinal chemists would generate? |
| Were the machine’s ideas generally synthesizable? |