| Literature DB >> 28286573 |
John W Mayfield1, Roger A Sayle1.
Abstract
The symbols for the new IUPAC elements named in November 2016 can introduce subtle ambiguities within cheminformatics software. The ambiguities are described and demonstrated by highlighting inconsistencies between software when handling existing element symbols.Entities:
Keywords: Cheminformatics; Elements; IUPAC; Periodic; SMARTS
Year: 2017 PMID: 28286573 PMCID: PMC5307489 DOI: 10.1186/s13321-017-0196-0
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Fig. 16-(Diacetoxyiodo)-1-tosylindoline Intermediate 30 in US 2016/362375 A1. The intended meaning of Ts is Tosyl, and OAc is Acetoxy
Ambiguous SMARTS for transfermium element symbols officially named since 1997
| Ambiguous SMARTS | Element name | Element SMARTS | Expression meaning | Expression SMARTS |
|---|---|---|---|---|
| [No] | Nobelium | [#102] | Aliphatic nitrogen and aromatic oxygen (logically impossible) | – |
| [Db] | Dubnium | [#105] | Aromatic boron with on explicit bond (possible on fragment matching) | [D&b] or [bD] |
| [Bh] | Bohrium | [#107] | Aliphatic boron with at least one implicit hydrogen | [B&h] or [hB] |
| [Hs] | Hassium | [#108] | Aromatic sulfur with one explicit hydrogen | [H&s] or [sH] |
| [Ds] | Darmstadtium | [#110] | Aromatic sulfur with one explicit bond (possible on fragment matching) | [D&s] or [sD] |
| [Cn] | Copernicium | [#112] | Aliphatic Carbon and aromatic nitrogen (logically impossible) | – |
| [Nh] | Nihonium | [#113] | Nitrogen with at least one implicit hydrogen | [N&h] or [hN] |
The meaning in SMARTS changes between cheminformatics toolkits and are either interpreted as matching specific elements or as expressions. Unambiguous SMARTS are provided for each of these cases