| Literature DB >> 22870956 |
Weerapong Phadungsukanan1, Markus Kraft, Joe A Townsend, Peter Murray-Rust.
Abstract
: This paper introduces a subdomain chemistry format for storing computational chemistry data called CompChem. It has been developed based on the design, concepts and methodologies of Chemical Markup Language (CML) by adding computational chemistry semantics on top of the CML Schema. The format allows a wide range of ab initio quantum chemistry calculations of individual molecules to be stored. These calculations include, for example, single point energy calculation, molecular geometry optimization, and vibrational frequency analysis. The paper also describes the supporting infrastructure, such as processing software, dictionaries, validation tools and database repositories. In addition, some of the challenges and difficulties in developing common computational chemistry dictionaries are discussed. The uses of CompChem are illustrated by two practical applications.Entities:
Year: 2012 PMID: 22870956 PMCID: PMC3434037 DOI: 10.1186/1758-2946-4-15
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Rules of CompChem
| - A jobList module element MUST have an id attribute the value of which MUST be unique within the module specifying the compchem convention. | |
| | - A jobList module element MUST contain at least one job module child element. |
| | - A jobList module element SHOULD have a title attribute the value of which MUST be a non-empty string specifying a human-readable title for the module. |
| | - A jobList module element MAY contain more than one child element in any namespace. |
| - A job module element MUST contain exactly one initialization module child element. | |
| | - A job module element MAY contain zero or more calculation module child elements. |
| | - A job module element MAY contain no more than one finalization module child element. |
| | - A job module element MAY contain no more than one environment module element. |
| | - The order of the calculation module elements in a job module MUST represent the order of the calculation steps but there is no restriction on the order of other child element types. |
| | - If a calculation module element is present, a finalization module element MUST also be present as a child of a job module element. |
| | - A job module element SHOULD have a title attribute, the value of which MUST be a non-empty string specifying a human-readable title for the module. |
| | - A job module element MAY also contain other child elements in any namespace. |
| - An initialization module element MUST NOT contain more than one | |
| | - An initialization module element MUST NOT contain more than one |
| | - An initialization module element MAY contain any number of user defined module element. |
| | - An initialization module element MUST contain at least one child of molecule, |
| | - An initialization module element MAY contain more than one child element in any namespace but MUST NOT contain a property child element or a |
| | - A job module element MAY also contain other child elements in any namespace. |
| - An initialization module element MUST NOT contain more than one | |
| | - An initialization module element MUST NOT contain more than one |
| | - An initialization module element MAY contain any number of user defined module element. |
| | - An initialization module element MUST contain at least one child of molecule, |
| | - An initialization module element MAY contain more than one child element in any namespace but MUST NOT contain a property child element or a |
| - A calculation module element MUST NOT contain more than one molecule child element. The molecule MUST specify a convention using the convention attribute and the convention SHOULD be one of the RECOMMENDED molecular conventions. | |
| | - A calculation module element MUST NOT contain more than one |
| | - A calculation module element MUST NOT contain more than one |
| | - A calculation module element MAY contain any number of user defined module elements. |
| | - A calculation module element MUST contain at least one child of molecule, |
| | - A calculation module element MAY contain more than one child element in any namespace. |
| - A finalization module element MUST NOT contain more than one molecule child element. The molecule MUST specify a convention using the convention attribute and the convention SHOULD be one of the RECOMMENDED molecular conventions. | |
| | - A finalization module element MUST NOT contain more than one |
| | - A finalization module element MAY contain any number of user defined module elements. |
| | - A finalization module element MUST contain at least one molecule child, |
| | - A finalization module element MAY contain more than one child element in any namespace but MUST NOT contain a parameter child element or a |
| - An environment module element MUST NOT contain more than one | |
| | - Any environment property element MUST be a child of a |
| | - An environment module element MAY contain more than one child element in any namespace including any number of user defined module elements. However, CompChem can only understand a particular set of concepts. |
| | - An environment module MUST contain at least one child of |
| - An environment module element MAY contain more than one child element in any namespace but MUST NOT contain a parameter child element or a |
The keywords “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in RFC 2119 [37].
Figure 3The structure of CML for storing computational chemistry output: (1) CompChem convention declaration, (2) CML convention namespace, (3) a jobList module, (4) a job module, (5) an initialization module, (6) Molecular convention declaration, (7) a basis set parameter specified by cc:basis dictionary reference, (8) a Gaussian specific parameter declared in Gaussian dictionary, (9) a finalization module, (10) si:none for dimensionless units, (11) CML identifier.
| - A | |
| | - The data type of |
| | - A |
| | - A |
| - An | |
| | - The data type of |
| | - An |
| | - The size of |
| | - The |
| - A | |
| | - The data type of |
| | - A |
| | - The dimension of a |
| | - The |
| - A | |
| | - A |
| - A |
The keywords “MUST”, “MUST NOT”, “REQUIRED”, “SHALL”, “SHALL NOT”, “SHOULD”, “SHOULD NOT”, “RECOMMENDED”, “MAY”, and “OPTIONAL” in this document are to be interpreted as described in RFC 2119 [37].
Figure 5MolHub - data repository for computational quantum chemistry.
Figure 6A snippet of TiO data in CompChem format consisting of two job modules.