Literature DB >> 35936413

Diving into the Deep End: Machine Learning for the Chemist.

Silvia Imberti1.   

Abstract

Entities:  

Year:  2022        PMID: 35936413      PMCID: PMC9352319          DOI: 10.1021/acsomega.2c04373

Source DB:  PubMed          Journal:  ACS Omega        ISSN: 2470-1343


× No keyword cloud information.
In the past year, ACS Omega has seen a dramatic increase in the number of articles published with an Artificial Intelligence (AI) or Machine Learning, Deep Learning, Neural Networks theme. In 2021, 105 articles were published vs 45 articles published in the previous year, an increase of 133%. This exceptional growth for a fully open access broad scope journal pairs with the growth seen at many other journals in the ACS portfolio. Interestingly, the growth registered in the past 2–3 years is not confined to the journals that specialize in chemical informatics: the Journal of Chemical Information and Modeling, the Journal of Chemical Theory and Computation, and, to some extent, the Journal of Physical Chemistry C. It also encompasses journals in the materials sciences, the physical sciences, measurement science, chemical engineering, and environmental science in the broader ACS portfolio (Figure ).
Figure 1

Top 12 journals at ACS Publications for publishing AI research. Bar graphs represent Web of Science results for the named ACS journals using “artificial intelligence”, “machine learning”, “deep learning”, and “neural networks” as search topics. ACS Omega was launched in 2016. These 12 journals constitute approximately two-thirds of the publications of the entire portfolio in this area.

Looking at the wider publication landscape, the Directory of Open Access Journal lists 65 journal entries for scientific publications that pertain to the topic of AI.[29] Eleven of these were added in the last year alone, and this includes only those journals queried in the computational science category. In addition to these, numerous other open access, broader scope journals also publish work without any perceived evaluation of immediate impact and where existing AI tools have been successfully applied to a variety of chemistry questions. Top 12 journals at ACS Publications for publishing AI research. Bar graphs represent Web of Science results for the named ACS journals using “artificial intelligence”, “machine learning”, “deep learning”, and “neural networks” as search topics. ACS Omega was launched in 2016. These 12 journals constitute approximately two-thirds of the publications of the entire portfolio in this area. Over the past 10 to 15 years, AI, especially deep learning, has effected dramatic technological progress and proven success in areas such as computer vision,[1] speech recognition,[2] natural language processing,[3,4] common sense knowledge,[5,6] strategic reasoning,[7] and robotics.[8] Exceptional results have also been reported in the medical sciences; for example, deep neural networks facilitated accurate diagnosis of skin cancer,[9] and deep learning enabled extraction of new knowledge from old data, enabling accurate prediction of age, gender, smoking status, blood pressure, and heart attack propensity of individuals just by analyzing previously acquired retinal images.[10] But, what about the status of AI and its perception in chemistry? In the ensuing text, as an entrée to the associated Virtual Issue, I will present a brief overview of the perceived usefulness of AI at this time in some fields of the chemical sciences and related areas, based on recently published reviews and perspectives by experts in the area, as well as other resources. A review by Baum et al.[11] charted the growth and distribution of AI-related chemistry publications in the last two decades using the CAS Content Collection, which includes patents as well as research articles. In their paper, they refer to the “Hype Cycle of Emerging Technologies”,[12] and from the data gathered, they determine that AI adoption in life sciences and analytical chemistry has navigated the so-called “peak of inflated expectations” and “trough of disillusionment” and successfully progressed to the “plateau of productivity”. It is common to overestimate the effect of a technology in the short run and underestimate it in the long run (anecdotally known as Amara’s law). For example, despite commercial interests and consequent investments being enormous, to this day, no new drug has yet been synthesized using AI.[13,14] As another (counter) example, Peiretti and Brunel[15], in their Perspective published in 2018, ponder whether organic chemistry and in particular retrosynthesis might be the ideal next application of AI techniques. After all, it “fits perfectly” the definition of AI as a problem with “complex input–output relationships [that] are difficult or impractical to model procedurally”. However, Baum et al.[11] firmly assess that “there are still areas of Chemistry like organic synthetic chemistry where AI is yet to make an impact”. Still, work is in progress, as standardized formats for reporting a chemical synthesis procedure are being developed and even classified (an essential step for scientific fields to exist) in the new taxonomy of digital chemistry or chemputation.[16] At this stage in the discussion, it may be interesting to explore what the drivers are to greater or lesser success in applying AI methods to scientific problems. This point is examined in the Editorial by Jones et al.[17], which accompanies an excellent . In their overview, three main reasons are indicated. Two of them are quite intrinsic to the method (algorithm development and theoretical derivation of descriptors), while one of them is, notably, not specific but resides in the availability of a collection of standardized, high-quality data. A case in point, and unanimously defined as the most spectacular recent success of AI, is the recent prediction of a protein’s three-dimensional structure from its amino acid sequence via AlphaFold.[18,19] The tool was trained on large publicly available databases, such as the RSCB Protein Data Bank,[20] as well as protein sequences of unknown structure. In return, it is somewhat expected, although not guaranteed, that the new information generated is also made available to the public.[21] The AlphaFold code is now publicly available. It can be run with moderate computational resources,[22] and the “most complete” database of predicted 3D structures of human proteins has been shared with the world.[23,24] In the words of EMBL-EBI Director Ewan Birney, this is the most important data set since the mapping of the Human Genome.[25] The success of this AI project seems to rely, among other things, on the availability of an extensive, high-quality, and structured database as well as on its scientific and engineering advancements.[26] This is not a trivial endeavor and one that requires not only great effort and coordination but also ad hoc funding. This is argued for in a recently published perspective by Campo et al. “Now Is the Time to Build a National Data Ecosystem for Materials Science and Chemistry Research Data”:[27]access to data and associated analysis software—cyber infrastructure—is critical to efficiency and productivity in an information-driven economy. In short, the fast development of AI seems to be going hand-in-hand with the availability of open-well structured data in high-quality databases, i.e., where metadata standards are adopted to enable interoperability, according to the “FAIR” data principles.[28] In this collection (https://pubs.acs.org/page/vi/ACSomega_AI), we are proud to present a selection of articles that provide a quick pulse of the growing spectrum of topics (Figure ) that are “experimenting” with AI tools. The topics vary from bioinformatics, chemometrics, and small-molecule design and synthesis to diagnostics in medicine as well as in physical and chemical processes, such as energy materials, metallurgy, wastewater treatment, and, finally, image analysis applied to agriculture. The ever-increasing volume of submissions and the diversity of the themes have encouraged us to start looking at Artificial Intelligence research, not just as an attractive future development but as a technique that is reaching a steady state of application and productivity in several fields.
Figure 2

This AI-generated word cloud is obtained from the selected article titles in this Virtual Issue via MonkeyLearn.

This AI-generated word cloud is obtained from the selected article titles in this Virtual Issue via MonkeyLearn.
  14 in total

1.  Artificial Intelligence in Chemistry: Current Trends and Future Directions.

Authors:  Zachary J Baum; Xiang Yu; Philippe Y Ayala; Yanan Zhao; Steven P Watkins; Qiongqiong Zhou
Journal:  J Chem Inf Model       Date:  2021-07-15       Impact factor: 4.956

2.  Superhuman AI for heads-up no-limit poker: Libratus beats top professionals.

Authors:  Noam Brown; Tuomas Sandholm
Journal:  Science       Date:  2017-12-17       Impact factor: 47.728

3.  Dermatologist-level classification of skin cancer with deep neural networks.

Authors:  Andre Esteva; Brett Kuprel; Roberto A Novoa; Justin Ko; Susan M Swetter; Helen M Blau; Sebastian Thrun
Journal:  Nature       Date:  2017-01-25       Impact factor: 49.962

4.  What's next for AlphaFold and the AI protein-folding revolution.

Authors:  Ewen Callaway
Journal:  Nature       Date:  2022-04       Impact factor: 69.504

5.  Prediction of cardiovascular risk factors from retinal fundus photographs via deep learning.

Authors:  Ryan Poplin; Avinash V Varadarajan; Katy Blumer; Yun Liu; Michael V McConnell; Greg S Corrado; Lily Peng; Dale R Webster
Journal:  Nat Biomed Eng       Date:  2018-02-19       Impact factor: 25.671

Review 6.  Artificial Intelligence: The Future for Organic Chemistry?

Authors:  Franck Peiretti; Jean Michel Brunel
Journal:  ACS Omega       Date:  2018-10-16

7.  Emerging Chemistry & Machine Learning.

Authors:  Christopher W Jones; Wasiu Lawal; Xin Xu
Journal:  JACS Au       Date:  2022-03-11

8.  Now Is the Time to Build a National Data Ecosystem for Materials Science and Chemistry Research Data.

Authors:  Eva M Campo; Sadasivan Shankar; Alexander S Szalay; Robert J Hanisch
Journal:  ACS Omega       Date:  2022-04-13

Review 9.  Deep Learning for Computer Vision: A Brief Review.

Authors:  Athanasios Voulodimos; Nikolaos Doulamis; Anastasios Doulamis; Eftychios Protopapadakis
Journal:  Comput Intell Neurosci       Date:  2018-02-01
View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.