| Literature DB >> 35136085 |
Katsumi Hagita1, Takahiro Murashima2, Masao Ogino3, Manabu Omiya4, Kenji Ono5, Tetsuo Deguchi6, Hiroshi Jinnai7, Toshihiro Kawakatsu2.
Abstract
To effectively archive configuration data during molecular dynamics (MD) simulations of polymer systems, we present an efficient compression method with good numerical accuracy that preserves the topology of ring-linear polymer blends. To compress the fraction of floating-point data, we used the Jointed Hierarchical Precision Compression Number - Data Format (JHPCN-DF) method to apply zero padding for the tailing fraction bits, which did not affect the numerical accuracy, then compressed the data with Huffman coding. We also provided a dataset of well-equilibrated configurations of MD simulations for ring-linear polymer blends with various lengths of linear and ring polymers, including ring complexes composed of multiple rings such as polycatenane. We executed 109 MD steps to obtain 150 equilibrated configurations. The combination of JHPCN-DF and SZ compression achieved the best compression ratio for all cases. Therefore, the proposed method enables efficient archiving of MD trajectories. Moreover, the publicly available dataset of ring-linear polymer blends can be employed for studies of mathematical methods, including topology analysis and data compression, as well as MD simulations.Entities:
Year: 2022 PMID: 35136085 PMCID: PMC8825841 DOI: 10.1038/s41597-022-01138-3
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Parameter conditions.
| Item | Values |
|---|---|
| Type of ring complex | single, bonded-two-ring, bonded-three-ring, poly [2]catenane, poly [3]catenane |
| Number of beads in a ring polymer ( | 80, 120, 160 |
| Number of beads in a linear chain ( | 10, 20, 40, 80, 160 |
| Ring fraction | 0.05, 0.1 |
Fig. 1Schematics of single ring, bonded-rings, poly-catenanes, and ring-linear mixture. The snapshot of the ring-linear mixture with primitive path (PP)[53] presentations for Nring = Nlinear = 160 with ring fraction 0.1 was rendered by OVITO[60]. In (f), ring polymers and linear chains are shown in red and green, respectively. The ends of linear chains are shown in blue.
Fig. 2Required number of bits for visualization and analysis.
Fig. 3Example application of separated binary files created within the JHPCN-DF. In this example, the required number of bits was 24 bits and 41 bits for visualization and analysis, respectively. 64 bits of double-precision data were split into three 64-bit recordings: [24 bits + 0-padding (40 bits)], [0-padding (24 bits) + 17 bits + 0-padding (23 bits)], and [0-padding (41 bits) + 23 bits]. Huffman cording reduced the total size of the original 64 bits to less than 64 bits.
Single-precision binary recording: compressed file size [bytes] and confusion matrix of topology judgments.
| Specified error level | Compressed size [bytes] | Compression ratio | Confusion matrix | Error ratio |
|---|---|---|---|---|
| Lossless | 1,058,081,560 | — | — | 0 (ground truth) |
| 0.1 | 568,736,963 480,696,320 480,542,720 | 53.75% 45.43% 45.41% | [[180,543, 6,745],[6,864, 1,374,717,848]] | 9.898e-06 |
| 0.01 | 697,258,978 679,249,920 668,119,040 | 65.89% 64.42% 63.14% | [[186,525, 763],[726, 1,374,723,986]] | 1.083e-06 |
| 0.001 | 828,550,734 1,056,235,520 805,140,480 | 78.31% 99.83% 76.09% | [[187,216, 72],[83, 1,374,724,629]] | 1.127e-07 |
| 0.0001 | 985,218,102 1,058,918,400 978,309,120 | 93.12% 100.08% 92.46% | [[187,283, 5],[10, 1,374,724,702]] | 1.091e-08 |
Regarding the compressed size and compression ratio, the upper, middle, and lower records correspond to the results of JHPCN-DF, SZ compression, and both JHPCN-DF and SZ compression, respectively.
Double-precision binary recording: compressed file size [bytes] and confusion matrix of topology judgments.
| Specified error level | Compressed size [bytes] | Compression ratio | Confusion matrix | Error ratio |
|---|---|---|---|---|
| Lossless | 1,460,836,817 | — | — | 0 (ground truth) |
| 0.00001 | 1,133,326,989 1,147,484,160 1,111,470,080 | 77.58% 78.55% 76.08% | [[187,290, 0],[0, 1,374,724,710]] | 0 |
| 0.000001 | 1,219,899,212 1,213,665,280 1,187,983,360 | 83.51% 83.08% 81.32% | [[187,290, 0],[0, 1,374,724,710]] | 0 |
| 0.0000001 | 1,265,078,931 1,256,069,120 1,238,814,720 | 86.60% 85.98% 84.80% | [[187,290, 0],[0, 1,374,724,710]] | 0 |
| Single precision | 1,058,081,560 | 72.43% | [[187,289, 1],[0, 1,374,724,710]] | 7.273 e-10 |
Regarding the compressed size and compression ratio, the upper, middle, and lower records correspond to the results of JHPCN-DF, SZ compression, and both JHPCN-DF and SZ compression, respectively.
Single-precision binary recording: Nlinear-dependence of the error ratio of topology judgments.
| Specified error level | |||||
|---|---|---|---|---|---|
| 0.1 | 1.020e-05 | 9.678e-06 | 9.475e-06 | 9.210e-06 | 9.898e-06 |
| 0.01 | 1.099e-06 | 1.096e-06 | 1.026e-06 | 1.003e-06 | 1.105e-06 |
| 0.001 | 1.113e-07 | 1.240e-07 | 1.296e-07 | 6.764e-08 | 6.764e-08 |
| 0.0001 | 9.864e-09 | 1.409e-08 | 1.127e-08 | 1.127e-08 | 0 |
Double-precision binary recording: Nlinear-dependence of the error ratio of topology judgments.
| Specified error level | |||||
|---|---|---|---|---|---|
| 0.00001 | 0 | 0 | 0 | 0 | 0 |
| 0.000001 | 0 | 0 | 0 | 0 | 0 |
| 0.0000001 | 0 | 0 | 0 | 0 | 0 |
| Single precision | 0 | 0 | 5.637e-09 | 0 | 0 |
| Measurement(s) | equilibrated configurations of ring-linear polymer blends |
| Technology Type(s) | molecular dynamics simulation |
| Factor Type(s) | length of linear and ring polymer |