| Literature DB >> 23113939 |
Qian Zhang1, Wei Zhang, Youyong Li, Junmei Wang, Liling Zhang, Tingjun Hou.
Abstract
Assigning bond orders is a necessary and essential step for characterizing a chemical structure correctly in force field based simulations. Several methods have been developed to do this. They all have advantages but with limitations too. Here, an automatic algorithm for assigning chemical connectivity and bond order regardless of hydrogen for organic molecules is provided, and only three dimensional coordinates and element identities are needed for our algorithm. The algorithm uses hard rules, length rules and conjugation rules to fix the structures. The hard rules determine bond orders based on the basic chemical rules; the length rules determine bond order by the length between two atoms based on a set of predefined values for different bond types; the conjugation rules determine bond orders by using the length information derived from the previous rule, the bond angles and some small structural patterns. The algorithm is extensively evaluated in three datasets, and achieves good accuracy of predictions for all the datasets. Finally, the limitation and future improvement of the algorithm are discussed.Entities:
Year: 2012 PMID: 23113939 PMCID: PMC3557220 DOI: 10.1186/1758-2946-4-26
Source DB: PubMed Journal: J Cheminform ISSN: 1758-2946 Impact factor: 5.514
Figure 1The flow chart of the algorithm.
The performance of different algorithms on the first data set
| Baber | 7 | 41.2% |
| Meng | 5 | 29.4% |
| Sayle | 17 | 100% |
| Labute | 14 | 82.4% |
| Our | 16 | 94.1% |
Figure 2The representation of the ligand in 2trm in different orientations.
The nine failed structures predicted by our algorithm
| 1bzm | ||
| 1cps | ||
| 1nnb | ||
| 1rne | ||
| 3dfr | ||
| 3fx2 | ||
| 5tim | ||
| 5tln | ||
| 8xia |
Figure 3Four failed small molecules with the bond types beyond the rules in our algorithm (the unfixed bonds are colored in red).
Figure 4The definition of φ. S is the plane that line a1 and line a2 forms, line F is the normal line of plane S, and φ is the angle between line F and line a3.
Figure 5Patterns of atom C with the connection number of 3 (the dashed bond means that its order is unsure).
The reference bond lengths (Å)
| C-C | 1.49 | 1.33 |
| C-N | 1.43 | 1.29 |
| C-O | 1.41 | 1.28 |
| C-S | 1.80 | 1.72 |
| N-N | 1.38 | 1.29 |
| N-O | 1.39 | 1.24 |
L1 represents the length of single bond; L2 represents the length of double bond.
Return numbers based on small structural patterns
| | | ||||
| Bond types are 5 and 6 | | | | ||
| | | ||||
| | | | |||
| | | | | ||
| | | | | ||
| | | | |||
| Others | others |
The dashed bond means this bond has already been fixed. The bold atom/bond means atoms/bonds around it are on the same plane. Conditions are taken into account from up to down. The plain bonds represents bonds to be fixed. Some fixed bonds are not given in the structures.