| Literature DB >> 28341861 |
Longguang Liao1, Yu-Jun Zhao1,2, Zexian Cao3, Xiao-Bao Yang4,5.
Abstract
An effective indexing scheme for clusters that enables fast structure comparison and congruence check is desperately desirable in the field of mathematics, artificial intelligence, materials science, etc. Here we introduce the concept of minimum vertex-type sequence for the indexing of clusters on square lattice, which contains a series of integers each labeling the vertex type of an atom. The minimum vertex-type sequence is orientation independent, and it builds a one-to-one correspondence with the cluster. By using minimum vertex-type sequence for structural comparison and congruence check, only one type of data is involved, and the largest amount of data to be compared is n pairs, n is the cluster size. In comparison with traditional coordinate-based methods and distance-matrix methods, the minimum vertex-type sequence indexing scheme has many other remarkable advantages. Furthermore, this indexing scheme can be easily generalized to clusters on other high-symmetry lattices. Our work can facilitate cluster indexing and searching in various situations, it may inspire the search of other practical indexing schemes for handling clusters of large sizes.Entities:
Year: 2017 PMID: 28341861 PMCID: PMC5428236 DOI: 10.1038/s41598-017-00398-z
Source DB: PubMed Journal: Sci Rep ISSN: 2045-2322 Impact factor: 4.379
Figure 1The fifteen possible vertex types appearing in a cluster on square lattice, as labeled by integers 1–15. The black stones denote the atoms of concern, while the white stones indicate the existence of possible nearest neighbors.
Figure 2The different orientations of a given structure for 4-atom cluster under the action of group D4 for square lattice and the corresponding vertex-type sequences. The minimum vertex-type sequence is (1,5,8,4).
Comparison of the four structural description schemes for clusters on high-symmetry lattice.
| Method | Information | Orientation independence | Labeling of atoms | Number of data types | Complexity of data sampling | Data size |
|---|---|---|---|---|---|---|
| CB | coordinates | No | N/A | 1 | easy |
|
| DM | distances | Yes | Ambiguous | 1 | easy | ( |
| BS | Bond orientations | Yes | definite | 3 | hard |
|
| MVTS | Vertex types | Yes | definite | 1 | easy |
|
CB: coordinate-based method; DM: distance-matrix method; BS: Balaban & von Sheleyer’s technique; MVTS: minimum vertex-type sequence.
a m denotes the dimension of the system concerned.
bIf a structure is formed only from the main chain, the size of the descriptive data is (n − 1); if a structure is branched, the data size is then greater than (n − 1).
Figure 3The two configurations for cluster of 7 atoms corresponding to the minimum vertex-type sequence (1,5,8,2,10,5,9) (a), and (1,5,8,6,1,5,9) (b), respectively.