Ming Zhao, Shuo-Tsung Chen, Shu-Yi Tu.
Abstract
Due to the rapid development of sensor technology and the popularity of the Internet, the volume of digital information transmitted has skyrocketed, and acquiring and disseminating that information has become easier. This study investigates audio security with data compression for private data transmission over the Internet or from MEMS (micro-electro-mechanical systems) audio sensor digital microphones. Imperceptibility, embedding capacity, and robustness are the three main requirements for audio information-hiding techniques. To meet them, this study proposes a high-quality audio information-hiding technique in the wavelet domain. Because the wavelet domain provides a useful and robust platform for audio information hiding, the method hides information in multiple coefficients of the discrete wavelet transform (DWT). To keep the concealment imperceptible, a mathematical model combines signal-to-noise ratio (SNR) with quantization embedding for these coefficients. Amplitude-thresholding compression is also incorporated into this model. Finally, the matrix-type Lagrange principle plays an essential role in solving the model, so as to reduce the network transmission load while protecting personal copyright or private information. In the experimental results, optimizing the SNR nearly maintained the original quality of the embedded audio. Moreover, the proposed method shows good robustness against common attacks.
Keywords: DWT; MEMS; compression; digital information; optimization; sensor
Year: 2022 PMID: 36081009 PMCID: PMC9460818 DOI: 10.3390/s22176548
Source DB: PubMed Journal: Sensors (Basel) ISSN: 1424-8220 Impact factor: 3.847
Figure 1. The block diagram of the proposed algorithm.
Embedding capacity and SNR.
| Method | Number of Consecutive Coefficients in DWT Level 8 | Embedding Capacity | Averaged SNR (dB), Dance | Averaged SNR (dB), Love Song | Averaged SNR (dB), Folklore | Averaged SNR (dB), Symphony |
|---|---|---|---|---|---|---|
| Reference [ | | 1000 | 35.8 | 33.4 | 27.9 | 26.3 |
| | | 500 | 37.7 | 33.5 | 28.6 | 26.2 |
| Reference [ | | 1000 | 24.3 | 25.4 | 23.2 | 22.3 |
| | | 500 | 24.1 | 26.0 | 23.6 | 22.9 |
| Proposed | | 1000 | 38.3 | 35.6 | 28.7 | 27.5 |
| | | 500 | 37.1 | 41.3 | 34.5 | 33.2 |
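The trade-off between embedding and SNR in the table above can be illustrated with a minimal sketch. It assumes a plain quantization-index-modulation rule with step `Q` applied to individual coefficients. It is not the paper's optimized multi-coefficient model, and the coefficient values, payload bits and function names are hypothetical.

```python
import math


def snr_db(original, modified):
    """SNR in dB: signal energy over the energy of the embedding distortion."""
    signal = sum(x * x for x in original)
    noise = sum((x - y) ** 2 for x, y in zip(original, modified))
    if noise == 0:
        return math.inf
    if signal == 0:
        return -math.inf
    return 10.0 * math.log10(signal / noise)


def embed_bit(coeff, bit, q):
    """Move coeff to the 3/4 point (bit 1) or 1/4 point (bit 0) of its q-wide cell."""
    cell_start = math.floor(coeff / q) * q
    return cell_start + (0.75 if bit else 0.25) * q


def extract_bit(coeff, q):
    """Read the bit back from the position of coeff inside its q-wide cell."""
    return 1 if (coeff % q) >= 0.5 * q else 0


# Hypothetical coefficient values and payload, for illustration only.
coeffs = [812.0, -130.0, 455.5, 97.25]
bits = [1, 0, 1, 1]
Q = 100.0

marked = [embed_bit(c, b, Q) for c, b in zip(coeffs, bits)]
recovered = [extract_bit(c, Q) for c in marked]
print(marked, recovered, f"{snr_db(coeffs, marked):.2f} dB")
```

In this sketch a smaller `Q` moves each coefficient less and so raises the SNR, at the cost of a smaller margin before a perturbation flips the extracted bit.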
BER of Testing Re-sampling.
| Audio Type | | Dance | | | Folklore | | | Love Song | | | Symphony | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Re-Sampling Rate (kHz) | | 22.05 | 11.025 | 8 | 22.05 | 11.025 | 8 | 22.05 | 11.025 | 8 | 22.05 | 11.025 | 8 |
| Reference | mean | 8.32 | 13.31 | 14.01 | 0.74 | 4.22 | 4.36 | 5.74 | 3.08 | 2.39 | 0.78 | 4.52 | 4.76 |
| | SD | 0.40 | 0.43 | 0.41 | 0.23 | 0.28 | 0.26 | 0.16 | 0.15 | 0.16 | 0.16 | 0.26 | 0.28 |
| | mean | 2.36 | 8.01 | 8.01 | 0.20 | 1.24 | 1.26 | 0.72 | 1.05 | 1.06 | 0.32 | 1.29 | 1.29 |
| | SD | 0.25 | 0.38 | 0.36 | 0.18 | 0.21 | 0.19 | 0.10 | 0.12 | 0.12 | 0.13 | 0.21 | 0.21 |
| Reference | mean | 9.14 | 15.26 | 15.31 | 0.74 | 4.22 | 4.36 | 5.74 | 3.08 | 2.39 | 0.78 | 4.52 | 4.76 |
| | SD | 0.41 | 0.43 | 0.42 | 0.19 | 0.24 | 0.27 | 0.17 | 0.14 | 0.13 | 0.14 | 0.27 | 0.25 |
| | mean | 2.17 | 8.03 | 8.04 | 0.23 | 1.21 | 1.31 | 0.62 | 1.21 | 1.02 | 0.35 | 1.27 | 1.28 |
| | SD | 0.21 | 0.39 | 0.37 | 0.15 | 0.20 | 0.19 | 0.11 | 0.12 | 0.11 | 0.13 | 0.22 | 0.21 |
| Proposed method | mean | 8.25 | 14.42 | 0.87 | 0.82 | 4.65 | 4.35 | 4.87 | 3.29 | 0.68 | 0.66 | 1.28 | 1.28 |
| | SD | 0.39 | 0.4 | 0.16 | 0.18 | 0.22 | 0.21 | 0.15 | 0.16 | 0.09 | 0.14 | 0.21 | 0.21 |
| | mean | 2.1 | 8.16 | 0.26 | 0.23 | 1.42 | 1.25 | 1.34 | 1.45 | 0.57 | 0.53 | 1.24 | 1.23 |
| | SD | 0.23 | 0.38 | 0.14 | 0.16 | 0.19 | 0.15 | 0.13 | 0.11 | 0.08 | 0.12 | 0.20 | 0.19 |
BER of Testing Low-pass Filtering.
| Audio Type | | Love Song | | Symphony | | Dance | | Folklore | |
|---|---|---|---|---|---|---|---|---|---|
| Cutoff Frequency | | 3 kHz | 5 kHz | 3 kHz | 5 kHz | 3 kHz | 5 kHz | 3 kHz | 5 kHz |
| Reference [ | mean | 24.18 | 25.82 | 27.58 | 8.68 | 33.62 | 21.52 | 33.62 | 15.72 |
| | SD | 0.28 | 0.22 | 0.29 | 0.20 | 0.35 | 0.29 | 0.34 | 0.21 |
| | mean | 23.82 | 23.48 | 27.55 | 8.41 | 33.25 | 21.28 | 33.02 | 13.84 |
| | SD | 0.27 | 0.19 | 0.28 | 0.20 | 0.36 | 0.27 | 0.34 | 0.19 |
| Reference [ | mean | 26.18 | 25.82 | 27.53 | 8.68 | 33.62 | 21.52 | 33.62 | 15.72 |
| | SD | 0.29 | 0.21 | 0.28 | 0.19 | 0.35 | 0.27 | 0.33 | 0.19 |
| | mean | 25.82 | 24.81 | 27.54 | 8.41 | 33.02 | 20.87 | 33.02 | 11.84 |
| | SD | 0.29 | 0.21 | 0.25 | 0.21 | 0.34 | 0.28 | 0.33 | 0.17 |
| Proposed method | mean | 22.84 | 23.63 | 27.85 | 8.38 | 32.28 | 20.03 | 31.82 | 13.32 |
| | SD | 0.27 | 0.19 | 0.27 | 0.18 | 0.35 | 0.26 | 0.29 | 0.15 |
| | mean | 21.42 | 23.63 | 27.54 | 8.25 | 30.39 | 20.02 | 32.50 | 13.15 |
| | SD | 0.25 | 0.18 | 0.24 | 0.19 | 0.33 | 0.25 | 0.30 | 0.16 |
BER of Testing MP3 compression.
| Audio Type | | Love Song | | | | Symphony | | | | Dance | | | | Folklore | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Bit Rate (kbps) | | 128 | 112 | 96 | 80 | 128 | 112 | 96 | 80 | 128 | 112 | 96 | 80 | 128 | 112 | 96 | 80 |
| Reference | mean | 0.16 | 1.38 | 2.09 | 2.72 | 0.35 | 1.45 | 2.44 | 3.17 | 0.74 | 2.11 | 2.12 | 3.02 | 0.36 | 1.48 | 2.42 | 3.12 |
| | SD | 0.11 | 0.11 | 0.13 | 0.15 | 0.12 | 0.13 | 0.14 | 0.17 | 0.15 | 0.18 | 0.18 | 0.21 | 0.12 | 0.15 | 0.14 | 0.16 |
| | mean | 0.09 | 0.11 | 1.41 | 2.53 | 0.14 | 0.15 | 2.29 | 3.84 | 0.11 | 0.15 | 1.02 | 3.0 | 0.15 | 0.15 | 2.40 | 3.93 |
| | SD | 0.10 | 0.09 | 0.12 | 0.15 | 0.10 | 0.10 | 0.13 | 0.16 | 0.12 | 0.13 | 0.17 | 0.23 | 0.11 | 0.10 | 0.13 | 0.17 |
| Reference | mean | 0.15 | 1.32 | 2.13 | 2.73 | 0.27 | 1.45 | 2.44 | 3.17 | 0.75 | 2.13 | 2.16 | 3.02 | 0.36 | 1.46 | 2.43 | 3.14 |
| | SD | 0.11 | 0.13 | 0.13 | 0.14 | 0.13 | 0.14 | 0.15 | 0.15 | 0.14 | 0.18 | 0.19 | 0.20 | 0.13 | 0.13 | 0.14 | 0.15 |
| | mean | 0.09 | 0.11 | 1.42 | 2.53 | 0.14 | 0.17 | 2.32 | 3.74 | 0.11 | 0.16 | 1.02 | 3.0 | 0.15 | 0.13 | 2.40 | 3.02 |
| | SD | 0.10 | 0.10 | 0.11 | 0.15 | 0.12 | 0.09 | 0.13 | 0.15 | 0.13 | 0.15 | 0.15 | 0.20 | 0.12 | 0.08 | 0.13 | 0.15 |
| Proposed method | mean | 0.75 | 2.67 | 2.91 | 3.31 | 0.18 | 0.15 | 2.29 | 3.92 | 0.83 | 2.46 | 2.54 | 2.62 | 0.45 | 2.13 | 2.64 | 3.25 |
| | SD | 0.13 | 0.14 | 0.14 | 0.15 | 0.14 | 0.08 | 0.12 | 0.15 | 0.16 | 0.18 | 0.19 | 0.19 | 0.14 | 0.12 | 0.12 | 0.16 |
| | mean | 0.69 | 2.23 | 2.24 | 2.28 | 0.17 | 0.12 | 1.93 | 2.09 | 0.15 | 0.13 | 2.48 | 2.49 | 0.39 | 1.94 | 1.95 | 1.94 |
| | SD | 0.12 | 0.14 | 0.13 | 0.13 | 0.13 | 0.06 | 0.09 | 0.12 | 0.15 | 0.15 | 0.16 | 0.16 | 0.13 | 0.13 | 0.09 | 0.10 |
BER of Testing Amplitude Scaling.
| Audio Type | Love Song | | | | Symphony | | | | Dance | | | | Folklore | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Amplitude Modification Factor | 0.5 | 0.8 | 1.1 | 1.2 | 0.5 | 0.8 | 1.1 | 1.2 | 0.5 | 0.8 | 1.1 | 1.2 | 0.5 | 0.8 | 1.1 | 1.2 |
| Reference | 47.25 | 45.55 | 41.40 | 43.85 | 48.00 | 38.72 | 23.63 | 24.54 | 43.12 | 41.40 | 40.15 | 40.84 | 45.90 | 43.52 | 42.54 | 42.86 |
| | 43.82 | 40.63 | 40.84 | 41.25 | 45.22 | 32.04 | 23.15 | 23.56 | 42.33 | 41.02 | 39.56 | 40.16 | 42.52 | 41.86 | 41.35 | 41.24 |
| Reference | 40.02 | 32.15 | 31.18 | 33.65 | 38.06 | 31.22 | 28.13 | 28.55 | 38.92 | 31.41 | 32.10 | 34.24 | 39.82 | 33.12 | 32.74 | 32.62 |
| | 38.22 | 30.63 | 30.84 | 31.25 | 35.22 | 32.04 | 23.15 | 23.56 | 40.02 | 31.11 | 30.51 | 30.46 | 32.42 | 26.81 | 24.75 | 24.26 |
| Proposed method | 2.03 | 1.15 | 1.08 | 1.13 | 1.65 | 0.97 | 1.43 | 1.45 | 2.85 | 1.76 | 1.85 | 2.06 | 1.67 | 1.31 | 0.93 | 1.32 |
| | 0.97 | 0.86 | 0.84 | 0.92 | 1.14 | 0.88 | 0.92 | 0.98 | 2.04 | 1.56 | 0.98 | 1.93 | 1.05 | 0.86 | 0.83 | 0.85 |
BER of Testing Time Scaling.
| Audio Type | | Love Song | | | | Symphony | | | | Dance | | | | Folklore | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Time-Scaling (%) | | −5 | −2 | 2 | 5 | −5 | −2 | 2 | 5 | −5 | −2 | 2 | 5 | −5 | −2 | 2 | 5 |
| Reference | mean | 47.11 | 42.91 | 43.67 | 46.32 | 42.74 | 37.82 | 46.42 | 46.19 | 45.18 | 40.21 | 46.58 | 47.98 | 43.18 | 39.12 | 46.35 | 47.43 |
| | SD | 0.10 | 0.09 | 0.08 | 0.09 | 0.11 | 0.11 | 0.09 | 0.10 | 0.12 | 0.12 | 0.11 | 0.13 | 0.12 | 0.12 | 0.10 | 0.12 |
| | mean | 47.04 | 40.23 | 45.11 | 46.58 | 43.11 | 36.64 | 46.24 | 46.86 | 44.37 | 39.91 | 44.92 | 47.98 | 43.03 | 38.62 | 46.53 | 47.54 |
| | SD | 0.08 | 0.07 | 0.07 | 0.08 | 0.09 | 0.10 | 0.09 | 0.10 | 0.12 | 0.11 | 0.11 | 0.12 | 0.13 | 0.12 | 0.09 | 0.10 |
| Reference | mean | 48.24 | 45.03 | 41.13 | 42.62 | 42.24 | 40.73 | 43.62 | 45.21 | 46.29 | 42.07 | 44.98 | 45.18 | 44.15 | 40.22 | 45.39 | 47.37 |
| | SD | 0.09 | 0.08 | 0.07 | 0.08 | 0.09 | 0.10 | 0.08 | 0.09 | 0.13 | 0.13 | 0.12 | 0.14 | 0.13 | 0.12 | 0.11 | 0.11 |
| | mean | 46.12 | 41.25 | 44.01 | 44.52 | 42.01 | 38.34 | 45.27 | 45.89 | 45.27 | 40.91 | 45.02 | 46.13 | 42.53 | 39.24 | 45.63 | 45.58 |
| | SD | 0.08 | 0.08 | 0.08 | 0.07 | 0.10 | 0.09 | 0.09 | 0.09 | 0.12 | 0.13 | 0.11 | 0.14 | 0.13 | 0.11 | 0.10 | 0.09 |
| Proposed method | mean | 47.23 | 42.05 | 43.53 | 45.15 | 42.32 | 37.64 | 45.18 | 46.21 | 45.35 | 40.42 | 46.24 | 46.47 | 43.18 | 38.93 | 46.41 | 47.13 |
| | SD | 0.07 | 0.07 | 0.06 | 0.07 | 0.08 | 0.09 | 0.09 | 0.10 | 0.11 | 0.10 | 0.10 | 0.13 | 0.12 | 0.11 | 0.08 | 0.09 |
| | mean | 46.43 | 40.08 | 44.37 | 46.54 | 43.06 | 36.83 | 46.32 | 46.25 | 44.14 | 39.65 | 44.78 | 47.95 | 42.25 | 38.26 | 46.42 | 46.37 |
| | SD | 0.07 | 0.05 | 0.05 | 0.06 | 0.08 | 0.09 | 0.07 | 0.08 | 0.11 | 0.11 | 0.10 | 0.12 | 0.10 | 0.10 | 0.08 | 0.08 |
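For readers who want to reproduce figures of the kind reported in the BER tables, the following is a minimal sketch of how a bit error rate and its mean and SD over repeated trials could be computed. It assumes BER is expressed as a percentage of mismatched bits and that SD is the sample standard deviation. The source states neither, and the function names and bit strings are hypothetical.

```python
import statistics


def bit_error_rate(embedded, extracted):
    """Percentage of positions where the extracted bit differs from the embedded bit."""
    if len(embedded) != len(extracted):
        raise ValueError("bit sequences must have the same length")
    if not embedded:
        raise ValueError("bit sequences must not be empty")
    errors = sum(1 for a, b in zip(embedded, extracted) if a != b)
    return 100.0 * errors / len(embedded)


def summarize(ber_values):
    """Mean and sample standard deviation of BER over several trials."""
    return statistics.mean(ber_values), statistics.stdev(ber_values)


# Hypothetical payload and the bits read back after an attack, for illustration only.
sent = [1, 0, 1, 1, 0, 0, 1, 0]
received = [1, 0, 0, 1, 0, 1, 1, 0]
print(bit_error_rate(sent, received))
print(summarize([2.0, 4.0, 6.0]))
```

On this reading, a BER near 50 means the extracted bits are no better than guessing, which is a useful reference point for the time-scaling results.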
Figure 2. Comparison among the original audio, compressed audio, and decompressed audio in 1 and 100 audio samples with a threshold value of 500 and with/without embedding private information. (a) Original audio. (b) Compressed audio with threshold value of 500. (c) Recovering the compressed audio in (b). (d) Compressed audio with threshold value of 500 and embedding private information of embedding strength Q = 1000. (e) Recovering the compressed audio in (d).
Table 7. Relationship between CR and SNR with and without embedding private information (N).
| Threshold ε | Q | CR | SNR before Decompression |
|---|---|---|---|
| 0.1 | 1 | 1.0016 | 36.2503 |
| | 100 | 1.0173 | 38.9726 |
| | 500 | 1.0905 | 31.7549 |
| | 1000 | 1.1900 | 28.0046 |
| | 2048 | 1.4187 | 23.0792 |
| | 4096 | 1.8970 | 17.4999 |
| 10 | 1 | 1.0028 | 35.2693 |
| | 100 | 1.0173 | 38.9726 |
| | 500 | 1.0905 | 31.7549 |
| | 1000 | 1.1900 | 28.0046 |
| | 2048 | 1.4187 | 23.0792 |
| | 4096 | 1.8970 | 17.4999 |
| 100 | 1 | 1.0314 | 34.1682 |
| | 100 | 1.0173 | 38.9726 |
| | 500 | 1.0905 | 31.7549 |
| | 1000 | 1.1900 | 28.0046 |
| | 2048 | 1.4187 | 23.0792 |
| | 4096 | 1.8970 | 17.4999 |
| 500 | 1 | 1.1953 | 25.4036 |
| | 100 | 1.1415 | 29.6514 |
| | 500 | 1.0905 | 31.7549 |
| | 1000 | 1.1900 | 28.0046 |
| | 2048 | 1.4187 | 23.0792 |
| | 4096 | 1.8970 | 17.4999 |
| 1000 | 1 | 1.4308 | 22.1784 |
| | 100 | 1.4115 | 25.8566 |
| | 500 | 1.3092 | 27.0426 |
| | 1000 | 1.1900 | 28.0046 |
| | 2048 | 1.4187 | 23.0792 |
| | 4096 | 1.8970 | 17.4999 |
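The interplay between a threshold, the compression ratio (CR) and the SNR can be explored with a toy sketch. It assumes that amplitude thresholding means zeroing samples whose magnitude is below ε, and it uses the zlib-compressed size as a stand-in for the coder, since the paper's own CR definition and coder are not given here. The test signal and all function names are hypothetical, so the numbers it prints are not comparable to the table.

```python
import math
import struct
import zlib


def threshold(samples, eps):
    """Zero every sample whose magnitude is below eps (assumed thresholding rule)."""
    return [s if abs(s) >= eps else 0 for s in samples]


def compression_ratio(original, processed):
    """Raw 16-bit size of the original over the zlib size of the processed samples."""
    raw_size = 2 * len(original)
    packed = struct.pack(f"<{len(processed)}h", *processed)
    return raw_size / len(zlib.compress(packed, 9))


def snr_db(original, modified):
    """SNR in dB of the processed samples relative to the original."""
    signal = sum(x * x for x in original)
    noise = sum((x - y) ** 2 for x, y in zip(original, modified))
    if noise == 0:
        return math.inf
    if signal == 0:
        return -math.inf
    return 10.0 * math.log10(signal / noise)


# Hypothetical 16-bit test signal: a decaying sine, for illustration only.
audio = [int(8000 * math.exp(-n / 400) * math.sin(2 * math.pi * n / 50))
         for n in range(2000)]

for eps in (1, 100, 500, 1000):
    kept = threshold(audio, eps)
    print(eps, f"{compression_ratio(audio, kept):.3f}", f"{snr_db(audio, kept):.2f}")
```

Raising the threshold zeroes a superset of the samples zeroed at a lower threshold, so in this sketch the SNR can only fall as ε grows.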
Figure 3. Relationship between CR and SNR obtained by changing the threshold ε. Different markers are used, with green and blue kept fixed, to include all the Q values in Table 7.