| Literature DB >> 16026614 |
Peter Murray-Rust1, John B O Mitchell, Henry S Rzepa.
Abstract
The current methods of publishing chemical information in bioscience articles are analysed. Using 3 papers as use-cases, it is shown that conventional methods using human procedures, including cut-and-paste are time-consuming and introduce errors. The meaning of chemical terms and the identity of compounds is often ambiguous. valuable experimental data such as spectra and computational results are almost always omitted. We describe an Open XML architecture at proof-of-concept which addresses these concerns. Compounds are identified through explicit connection tables or links to persistent Open resources such as PubChem. It is argued that if publishers adopt these tools and protocols, then the quality and quantity of chemical information available to bioscientists will increase and the authors, publishers and readers will find the process cost-effective.Entities:
Mesh:
Year: 2005 PMID: 16026614 PMCID: PMC1187874 DOI: 10.1186/1471-2105-6-180
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Isomers of Glutamic Acid
| Name(s) | |
| 611 | glutamic acid |
| 33032 | L-glutamic acid |
| 23327 | D-glutamic acid |
Figure 1Tautomers of Hydroxypyridine
Examples of approaches to chemical Identification.
| Type of information | ||
| <p><cml:molecule> <cml:identifier convention="iupac:inchi">1/C2H4O/c1-2-3/h2H,1H3</cml:identifier> <cml:identifier convention="pubchem:CID">177</cml:identifier> </cml:molecule> has a ...</p> | precise, redundant | |
| <p><cml:molecule><cml:identifier convention="pubchem:CID">2336</cm l:identifier></cml:molecule> is a ...</p> | precise | |
| <p><cml:molecule title="glycine"><cml:identifier convention="iupac:inchi">1/C2H5NO 2/c3-1-2(4)5/h1,3H2,(H,4,5)</cml: identifier></cml:molecule> is a ...</p> | hydrogens mobile | |
| calculations on | <p><cml:molecule title="g><cml:identifier convention="pubchem:CID">InChI=1/C2H5NO2/c3-1-2(4)5/h1H3,3H2</cml: identifier></cml:molecule> is a ...</p> | hydrogens precise |
| ... a monosaccharide transporter... | <p>a <cml:molecule title="monosaccharide"><cml:ident ifier convention="iupac:carbohydrate">2 -Carb-2</cml:identifier></cml:mol ecule> transporter ...</p> | Data |
Figure 2Abbreviations used in reference 13.
Identification of Small-molecules noted in Ref. 14
| Not found directly in PubChem. Located in supplier's catalog. Synonym from that found in PubChem | |||
| 2MeSADP | 2-methylthio-ADP | [121990] | |
| 5-HT | 5-hydroxytryptamine | 5202 | |
| CCh | carbacholine | 521353 | |
| CP-55940 | (-)-3-[2-hydroxy-4-(1,1-dimeth ylheptyl)-phenyl]-4-[3-hydroxyp ropyl]cyclohexan-1-ol | 104895 | IUPAC: 5-(1,1-dimethylheptyl)-2-[5-hydroxy-2-(3-hydroxyprop yl)cyclohexyl]-phenol |
| CysNO | S-nitrosocysteine | 39933 | |
| CysNOGly | S-nitroso-cysteinyl-glycine | Text search on PubChem found wrong compound. Not found in major supplier | |
| DAMGO | [D-Ala2, N-Me-Phe4, Gly5-ol]-enkephalin | 104742 | |
| DPCPX | 8-cyclopentyl-1,3-dipropylxanthi ne | 1320 | |
| Glu-CysNO | L-?-glutamyl-S-nitrosocysteine | Identity unresolved | |
| GSH | glutathione | 745 | |
| GSNO | S-nitrosoglutathione | 104858 | |
| LPA | lysophosphatidic acid | 3987 | |
| NA | noradrenaline | 951 | PubChem CID covers both racemic and d-enantiomer |
| NO | nitric oxide | 84878 | PubChem also lists 945 (with incorrect formula HNO) as nitric oxide |
| NOBF4 | nitrosodium tetrafluoroborate | 151929 | Paper has a typographical error for "nitrosonium". Structure in PubChem is wrong (formula should be NO+BF4-, not H2NO+.BF4-) |
| SNAP | S-nitroso-N-acetyl-D,L-penicill amine | 5231 | PubChem does not list stereochemistry |
| RSNO | S-nitrosothiol | Appears to be a generic compound (R-S-N=O) | |
| SNP | sodium nitroprusside | 26256 |
Figure 3A linear text-based description of experimental detail and data taken from Ref. 14.
Figure 4OSCAR output from the text-based description in Ref 14.
Figure 5OSCAR generated spectrum of analytical information reported in Ref 14.
Figure 6Structure diagram reported in Ref 15.
Figure 7Data-flow illustrating the use of XML.
Figure 8A Biocheminformatics Cycle.