Joe A Townsend, Sam E Adams, Christopher A Waudby, Vanessa K de Souza, Jonathan M Goodman, Peter Murray-Rust.
Abstract
Automatically extracting chemical information from documents is a challenging task, but an essential one for dealing with the vast quantity of data that is available. The task is least difficult for structured documents, such as chemistry department web pages or the output of computational chemistry programs, but requires increasingly sophisticated approaches for less structured documents, such as chemical papers. The identification of key units of information, such as chemical names, makes the extraction of useful information from unstructured documents possible.
Year: 2004 PMID: 15534707 DOI: 10.1039/b411033a
Source DB: PubMed Journal: Org Biomol Chem ISSN: 1477-0520 Impact factor: 3.876