| Literature DB >> 32082439 |
Yosuke Kanda1, Hitoshi Fujii2, Tamio Oguchi1,2.
Abstract
A sparse model for quantifying energy difference between zinc-blende and rock-salt crystal structures in octet elemental and binary materials is constructed by using the linearly independent descriptor-generation method and exhaustive search, following the previous work by Ghiringhelli et al. [Phys Rev Lett. 2015;114:105503]. The obtained simplest model includes only atomic radius information of constituent atoms and its physical meaning is interpreted in relation to van Arkel-Ketelaar's triangle for classifying chemical bonding in binary compounds.Entities:
Keywords: 404 Materials informatics / Genomics; Sparse modeling; binary compounds; chemical bonding; machine learning
Year: 2019 PMID: 32082439 PMCID: PMC7006824 DOI: 10.1080/14686996.2019.1697858
Source DB: PubMed Journal: Sci Technol Adv Mater ISSN: 1468-6996 Impact factor: 8.090
Figure 1.Measure of predictivity for the best models with descriptor space 1 (DS1), 2 (DS2), and 3 (DS3) as a function of the number of descriptors obtained by the exhaustive search.
Figure 2.Regression performance of Model 1. (a) predicted and DFT data. (b) predicted and DFT data for each semiconductor. ID corresponds to that in Table A1.
Target data for regression taken from Ghiringhelli et al. [5]. is total energy difference (in eV/atom) between zinc-blende and rock-salt structures of 82 elementary and binary systems .
| ID | ID | ID | |||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|
| 0 | Li | F | 28 | Si | Si | 0.275 | 56 | Rb | I | ||
| 1 | Li | Cl | 29 | Si | Ge | 0.264 | 57 | Sr | O | ||
| 2 | Li | Br | 30 | Si | Sn | 0.136 | 58 | Sr | S | ||
| 3 | Li | I | 31 | K | F | 59 | Sr | Se | |||
| 4 | Be | O | 0.430 | 32 | K | Cl | 60 | Sr | Te | ||
| 5 | Be | S | 0.506 | 33 | K | Br | 61 | Ag | F | ||
| 6 | Be | Se | 0.495 | 34 | K | I | 62 | Ag | Cl | ||
| 7 | Be | Te | 0.466 | 35 | Ca | O | 63 | Ag | Br | ||
| 8 | B | N | 1.713 | 36 | Ca | S | 64 | Ag | I | 0.037 | |
| 9 | B | P | 1.020 | 37 | Ca | Se | 65 | Cd | O | ||
| 10 | B | As | 0.879 | 38 | Ca | Te | 66 | Cd | S | 0.070 | |
| 11 | B | Sb | 0.581 | 39 | Cu | F | 67 | Cd | Se | 0.083 | |
| 12 | C | C | 2.638 | 40 | Cu | Cl | 0.156 | 68 | Cd | Te | 0.113 |
| 13 | C | Si | 0.668 | 41 | Cu | Br | 0.152 | 69 | In | N | 0.150 |
| 14 | C | Ge | 0.808 | 42 | Cu | I | 0.203 | 70 | In | P | 0.170 |
| 15 | C | Sn | 0.450 | 43 | Zn | O | 0.102 | 71 | In | As | 0.122 |
| 16 | Na | F | 44 | Zn | S | 0.275 | 72 | In | Sb | 0.080 | |
| 17 | Na | Cl | 45 | Zn | Se | 0.259 | 73 | Sn | Sn | 0.016 | |
| 18 | Na | Br | 46 | Zn | Te | 0.241 | 74 | Cs | F | ||
| 19 | Na | I | 47 | Ga | N | 0.433 | 75 | Cs | Cl | ||
| 20 | Mg | O | 48 | Ga | P | 0.341 | 76 | Cs | Br | ||
| 21 | Mg | S | 49 | Ga | As | 0.271 | 77 | Cs | I | ||
| 22 | Mg | Se | 50 | Ga | Sb | 0.158 | 78 | Ba | O | ||
| 23 | Mg | Te | 51 | Ge | Ge | 0.202 | 79 | Ba | S | ||
| 24 | Al | N | 0.072 | 52 | Ge | Sn | 0.087 | 80 | Ba | Se | |
| 25 | Al | P | 0.219 | 53 | Rb | F | 81 | Ba | Te | ||
| 26 | Al | As | 0.212 | 54 | Rb | Cl | |||||
| 27 | Al | Sb | 0.150 | 55 | Rb | Br |
Figure 3.Regression performance of Model 2. (a) predicted and DFT data. (b) predicted and DFT data for each semiconductor. ID corresponds to that in Table A1.
Regression performance of models obtained in the present and the previous works. , , , AIC, MAE, and MaxAE are the number of descriptors, decision coefficient [16], measure of predictivity (Equation 4) [12], Akaike information criterion [17,18], mean absolute error, and maximum absolute error, respectively. Models in the previous work are given in the text.
| Present | Previous work [ | ||||
|---|---|---|---|---|---|
| Criterion | Model 1 | Model 2 | Model A | Model B | Model C |
| 3 | 2 | 1 | 2 | 3 | |
| 0.913 | 0.876 | 0.883 | 0.929 | 0.957 | |
| 0.902 | 0.866 | 0.867 | 0.918 | 0.946 | |
| AIC | |||||
| MAE (eV) | 0.102 | 0.118 | 0.121 | 0.097 | 0.071 |
| MaxAE (eV) | 0.457 | 0.460 | 0.400 | 0.349 | 0.301 |
Relations between atomic radius and stable structure derived from Model 2 (Equation 7). is defined in Equation 1.
| Atomic radius | Stable structure | |
|---|---|---|
| Rock salt | ||
| and | Zinc blende | |
Relations between electronegativity, stable structure, and chemical bond, derived from Table 2 and Equation 6.
| Electronegativity | Stable structure | Chemical bond | |
|---|---|---|---|
| Rock salt | Ionic | ||
| and | Zinc blende | Covalent | |
Figure 4.Total energy difference map in a triable of the sum and difference of atomic radius of the constituent atoms given by Model 2 (Equation 7). Red-colored (blue-colored) dots form an area where zinc-blende (rock-salt) structure is stable and covalent (ionic) bonding is realized. An area with no dots corresponds to the region where training data are not included, possibly indicating a metallic bonding region.
Basic descriptors. , , , and are atomic radius, ionization potential, electron affinity, and electronegativity, respectively. , , and are the radius at maximum probability amplitude of , , and orbitals, respectively.
| Atom | |||||||
|---|---|---|---|---|---|---|---|
| Li | 1.67 | 5.392 | 0.6180 | 0.98 | 1.652 | 1.995 | 6.930 |
| Be | 1.12 | 9.322 | 1.57 | 1.078 | 1.211 | 2.877 | |
| B | 0.87 | 8.298 | 0.2770 | 2.04 | 0.805 | 0.826 | 1.946 |
| C | 0.67 | 11.260 | 1.2629 | 2.55 | 0.644 | 0.630 | 1.631 |
| N | 0.56 | 14.534 | 3.04 | 0.539 | 0.511 | 1.540 | |
| O | 0.48 | 13.618 | 1.4611 | 3.44 | 0.462 | 0.427 | 2.219 |
| F | 0.42 | 17.422 | 3.3990 | 3.98 | 0.406 | 0.371 | 1.428 |
| Na | 1.90 | 5.139 | 0.5479 | 0.93 | 1.715 | 2.597 | 6.566 |
| Mg | 1.45 | 7.646 | 1.31 | 1.330 | 1.897 | 3.171 | |
| Al | 1.18 | 5.986 | 0.4410 | 1.61 | 1.092 | 1.393 | 1.939 |
| Si | 1.11 | 8.151 | 1.3850 | 1.90 | 0.938 | 1.134 | 1.890 |
| P | 0.98 | 10.486 | 0.7465 | 2.19 | 0.826 | 0.966 | 1.771 |
| S | 0.88 | 10.360 | 2.0771 | 2.58 | 0.742 | 0.847 | 2.366 |
| Cl | 0.79 | 12.967 | 3.6170 | 3.16 | 0.679 | 0.756 | 1.666 |
| K | 2.43 | 4.341 | 0.5015 | 0.82 | 2.128 | 2.443 | 1.785 |
| Ca | 1.94 | 6.113 | 1.00 | 1.757 | 2.324 | 0.679 | |
| Cu | 1.45 | 7.726 | 1.2280 | 1.90 | 1.197 | 1.680 | 2.576 |
| Zn | 1.42 | 9.394 | 1.65 | 1.099 | 1.547 | 2.254 | |
| Ga | 1.36 | 5.999 | 0.3000 | 1.81 | 0.994 | 1.330 | 2.163 |
| Ge | 1.25 | 7.899 | 1.2000 | 2.01 | 0.917 | 1.162 | 2.373 |
| As | 1.14 | 9.810 | 0.8100 | 2.18 | 0.847 | 1.043 | 2.023 |
| Se | 1.03 | 9.752 | 2.0207 | 2.55 | 0.798 | 0.952 | 2.177 |
| Br | 0.94 | 11.814 | 3.3650 | 2.96 | 0.749 | 0.882 | 1.869 |
| Rb | 2.65 | 4.177 | 0.4859 | 0.82 | 2.240 | 3.199 | 1.960 |
| Sr | 2.19 | 5.695 | 0.95 | 1.911 | 2.548 | 1.204 | |
| Ag | 1.65 | 7.576 | 1.3020 | 1.93 | 1.316 | 1.883 | 2.968 |
| Cd | 1.61 | 8.993 | 1.69 | 1.232 | 1.736 | 2.604 | |
| In | 1.56 | 5.786 | 0.3000 | 1.78 | 1.134 | 1.498 | 3.108 |
| Sn | 1.45 | 7.344 | 1.2000 | 1.96 | 1.057 | 1.344 | 2.030 |
| Sb | 1.33 | 8.641 | 1.0700 | 2.05 | 1.001 | 1.232 | 2.065 |
| Te | 1.23 | 9.009 | 1.9708 | 2.10 | 0.945 | 1.141 | 1.827 |
| I | 1.15 | 10.451 | 3.0591 | 2.66 | 0.896 | 1.071 | 1.722 |
| Cs | 2.98 | 3.894 | 0.4716 | 0.79 | 2.464 | 3.164 | 1.974 |
| Ba | 2.53 | 5.212 | 0.89 | 2.149 | 2.632 | 1.351 |
aRef. [14]:,bRef. [29]:,cRef. [30]:,dRef. [15]:,eRef. [5].
Descriptor space 1 (DS1): 86 descriptors up to second order of atomic radius, ionization potential, electron affinity, and electronegativity.
| Order | Descriptor |
|---|---|
| 1 | |
| 2 | |
Descriptor space 2 (DS2): 24 descriptors up to fourth order of atomic radius.
| Order | Descriptors |
|---|---|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
Descriptor space 3 (DS3): 24 descriptors up to fourth order of electronegativity.
| Order | Descriptors |
|---|---|
| 1 | |
| 2 | |
| 3 | |
| 4 | |
Results of model selection in descriptor space 1 (DS1) by exhaustive search. Models are ranked by score and listed only top 3.
| Ranking | ||||
|---|---|---|---|---|
| 1 | 1 | 0.658 | 0.595 | 4.12 |
| 2 | 0.592 | 0.522 | 3.56 | |
| 3 | 0.546 | 0.481 | 86.63 | |
| 2 | 1 | 0.823 | 0.782 | |
| 2 | 0.771 | 0.728 | ||
| 3 | 0.767 | 0.716 | ||
| 3 | 1 | 0.913 | 0.902 | 0.59 |
| + 6.15 | ||||
| 2 | 0.911 | 0.899 | 0.12 | |
| + 5.77 | ||||
| 3 | 0.904 | 0.892 | 0.10 | |
| + 5.89 | ||||
| 4 | 1 | 0.934 | 0.921 | 1.86 |
| 2 | 0.933 | 0.921 | 0.04 | |
| 3 | 0.930 | 0.920 | 0.13 | |
| + 0.22 | ||||
| 5 | 1 | 0.945 | 0.936 | 0.22 |
| + 0.22 | ||||
| + 0.21 | ||||
| 2 | 0.946 | 0.936 | ||
| + 0.05 | ||||
| 3 | 0.944 | 0.935 | 0.22 | |
| + 0.22 | ||||
| + 0.21 |
Results of model selection in descriptor space 2 (DS2) by exhaustive search. Models are ranked by score and listed only top 3.
| Ranking | ||||
|---|---|---|---|---|
| 1 | 1 | 0.711 | 0.682 | 8.08 |
| 2 | 0.698 | 0.652 | 5.70 | |
| 3 | 0.658 | 0.595 | 4.12 | |
| 2 | 1 | 0.876 | 0.866 | 6.87 |
| 2 | 0.863 | 0.844 | 5.16 | |
| 3 | 0.844 | 0.833 | ||
| 3 | 1 | 0.903 | 0.893 | 0.08 |
| 2 | 0.902 | 0.893 | 5.97 | |
| 3 | 0.902 | 0.892 | 5.85 | |
| 4 | 1 | 0.903 | 0.892 | 0.02 |
| 2 | 0.903 | 0.892 | 5.99 | |
| 3 | 0.903 | 0.892 | 0.07 | |
| 5 | 1 | 0.905 | 0.892 | |
| + 5.83 | ||||
| 2 | 0.904 | 0.892 | ||
| 3 | 0.905 | 0.891 | ||
| + 0.003 |
Results of model selection in descriptor space 3 (DS3) by exhaustive search. Models are ranked by score and listed only top 3.
| Ranking | ||||
|---|---|---|---|---|
| 1 | 1 | 0.375 | 0.338 | |
| 2 | 0.375 | 0.336 | ||
| 3 | 0.336 | 0.294 | ||
| 2 | 1 | 0.572 | 0.498 | 0.72 |
| 2 | 0.565 | 0.482 | 0.08 | |
| 3 | 0.546 | 0.478 | ||
| + 3.31 | ||||
| 3 | 1 | 0.708 | 0.654 | 0.07 |
| 2 | 0.708 | 0.646 | 0.02 | |
| 3 | 0.697 | 0.630 | 0.02 | |
| 4 | 1 | 0.728 | 0.676 | 0.05 |
| + 0.01 | ||||
| 2 | 0.725 | 0.673 | 0.11 | |
| + 0.01 | ||||
| 3 | 0.718 | 0.662 | 0.32 | |
| + 0.01 | ||||
| 5 | 1 | 0.754 | 0.714 | 1.20 |
| + 0.03 | ||||
| 2 | 0.752 | 0.709 | 1.08 | |
| + 0.19 | ||||
| 3 | 0.753 | 0.709 | 2.33 | |