Literature DB >> 28475312

Information Retrieval and Text Mining Technologies for Chemistry.

Martin Krallinger1, Obdulia Rabal2, Anália Lourenço3,4,5, Julen Oyarzabal2, Alfonso Valencia6,7,8.   

Abstract

Efficient access to chemical information contained in scientific literature, patents, technical reports, or the web is a pressing need shared by researchers and patent attorneys from different chemical disciplines. Retrieval of important chemical information in most cases starts with finding relevant documents for a particular chemical compound or family. Targeted retrieval of chemical documents is closely connected to the automatic recognition of chemical entities in the text, which commonly involves the extraction of the entire list of chemicals mentioned in a document, including any associated information. In this Review, we provide a comprehensive and in-depth description of fundamental concepts, technical implementations, and current technologies for meeting these information demands. A strong focus is placed on community challenges addressing systems performance, more particularly CHEMDNER and CHEMDNER patents tasks of BioCreative IV and V, respectively. Considering the growing interest in the construction of automatically annotated chemical knowledge bases that integrate chemical information and biological data, cheminformatics approaches for mapping the extracted chemical names into chemical structures and their subsequent annotation together with text mining applications for linking chemistry with biological information are also presented. Finally, future trends and current challenges are highlighted as a roadmap proposal for research in this emerging field.

Mesh:

Year:  2017        PMID: 28475312     DOI: 10.1021/acs.chemrev.6b00851

Source DB:  PubMed          Journal:  Chem Rev        ISSN: 0009-2665            Impact factor:   60.622


  39 in total

1.  Data Mining Approach for Extraction of Useful Information About Biologically Active Compounds from Publications.

Authors:  Olga A Tarasova; Nadezhda Yu Biziukova; Dmitry A Filimonov; Vladimir V Poroikov; Marc C Nicklaus
Journal:  J Chem Inf Model       Date:  2019-09-10       Impact factor: 4.956

2.  Auto-generating databases of Yield Strength and Grain Size using ChemDataExtractor.

Authors:  Pankaj Kumar; Saurabh Kabra; Jacqueline M Cole
Journal:  Sci Data       Date:  2022-06-09       Impact factor: 6.444

Review 3.  Into the Unknown: How Computation Can Help Explore Uncharted Material Space.

Authors:  Austin M Mroz; Victor Posligua; Andrew Tarzia; Emma H Wolpert; Kim E Jelfs
Journal:  J Am Chem Soc       Date:  2022-10-07       Impact factor: 16.383

4.  LPTK: a linguistic pattern-aware dependency tree kernel approach for the BioCreative VI CHEMPROT task.

Authors:  Neha Warikoo; Yung-Chun Chang; Wen-Lian Hsu
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

5.  Extracting chemical-protein relations using attention-based neural networks.

Authors:  Sijia Liu; Feichen Shen; Ravikumar Komandur Elayavilli; Yanshan Wang; Majid Rastegar-Mojarad; Vipin Chaudhary; Hongfang Liu
Journal:  Database (Oxford)       Date:  2018-01-01       Impact factor: 3.451

6.  Extraction of chemical-protein interactions from the literature using neural networks and narrow instance representation.

Authors:  Rui Antunes; Sérgio Matos
Journal:  Database (Oxford)       Date:  2019-01-01       Impact factor: 3.451

7.  Improving the reliability of chemical manufacturing life cycle inventory constructed using secondary data.

Authors:  David E Meyer; Sarah Cashman; Anthony Gaglione
Journal:  J Ind Ecol       Date:  2021-02-01       Impact factor: 6.946

8.  Auto-generated materials database of Curie and Néel temperatures via semi-supervised relationship extraction.

Authors:  Callum J Court; Jacqueline M Cole
Journal:  Sci Data       Date:  2018-06-19       Impact factor: 6.444

9.  Sachem: a chemical cartridge for high-performance substructure search.

Authors:  Miroslav Kratochvíl; Jiří Vondrášek; Jakub Galgonek
Journal:  J Cheminform       Date:  2018-05-23       Impact factor: 5.514

10.  Linguistic measures of chemical diversity and the "keywords" of molecular collections.

Authors:  Michał Woźniak; Agnieszka Wołos; Urszula Modrzyk; Rafał L Górski; Jan Winkowski; Michał Bajczyk; Sara Szymkuć; Bartosz A Grzybowski; Maciej Eder
Journal:  Sci Rep       Date:  2018-05-15       Impact factor: 4.379

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.