| Literature DB >> 26089747 |
Saulius Gražulis1, Andrius Merkys2, Antanas Vaitkus3, Mykolas Okulič-Kazarinas2.
Abstract
Crystallographic investigations deliver high-accuracy information about positions of atoms in crystal unit cells. For chemists, however, the structure of a molecule is most often of interest. The structure must thus be reconstructed from crystallographic files using symmetry information and chemical properties of atoms. Most existing algorithms faithfully reconstruct separate molecules but not the overall stoichiometry of the complex present in a crystal. Here, an algorithm that can reconstruct stoichiometrically correct multimolecular ensembles is described. This algorithm uses only the crystal symmetry information for determining molecule numbers and their stoichiometric ratios. The algorithm can be used by chemists and crystallographers as a standalone implementation for investigating above-molecular ensembles or as a function implemented in graphical crystal analysis software. The greatest envisaged benefit of the algorithm, however, is for the users of large crystallographic and chemical databases, since it will permit database maintainers to generate stoichiometrically correct chemical representations of crystal structures automatically and to match them against chemical databases, enabling multidisciplinary searches across multiple databases.Entities:
Keywords: molecular structure; multimolecular ensembles
Year: 2015 PMID: 26089747 PMCID: PMC4453171 DOI: 10.1107/S1600576714025904
Source DB: PubMed Journal: J Appl Crystallogr ISSN: 0021-8898 Impact factor: 3.304
Figure 1Reconstructions of molecular moieties using the simple algorithm. (a) The asymmetric unit of the crystal provided by the COD 2231955 (Wei, 2011 ▶) entry. (b) Molecules reconstructed using the cif_molecule implementation. Image generated using Jmol (Hanson, 2010 ▶) and POV-Ray (Persistence of Vision, 2004 ▶).
Figure 2Reconstructions of stoichiometrically correct molecular ensemble from the COD 2231955 entry. (a) Molecular cluster reconstructed using the stoichiometry-preserving algorithm. (b) The structural formula automatically generated from the reconstructed molecular file. Chemical bond and charge comprehension was performed using the JUMBO converters suite (Murray-Rust, 1997 ▶), and then the SVG file was automatically generated using the Open Babel package (O’Boyle, Guha et al., 2011 ▶).
The cited run times and their standard deviations were computed from five successive runs for 200 randomly selected COD structures on an unloaded Dell Precision T3500 computer with 6GB RAM and the Intel Xeon CPU W3565 running at 3.2GHz, under Ubuntu 12.04 LTS GNU/Linux 32 bit OS, using the distribution’s default Perl interpreter (version 5.14.2), the gcc compiler (version 4.6.3) and cod-tools revision 2749.
| Program |
|
|
|---|---|---|
| Run time | 4 min 53s | 5 min 05s |
| Program |
|
|
|---|---|---|
| Run time | 6 min 55s | 5 min 27s |