| Literature DB >> 35310596 |
Abstract
With the development of the Internet of Things, many industries have been on the train of the information age, and digital audio technology is also constantly developing. Music retrieval has gradually become a research hotspot in the music industry. Among them, the auxiliary recognition of music characteristics is also a particularly important Task. Music retrieval is mainly to manually extract music signals, but now the music signal extraction technology has encountered a bottleneck. The article uses Internet and artificial intelligence technology to design an SNN music feature recognition model to identify and classify music features. The research results of the article show (1) statistic graphs of the main melody and accompanying melody of different music. The absolute value of the main melody and accompanying melody mainly fluctuates in the range of 0-7, and the proportion of the main melody can reach 36%. The accompanying melody can reach 17%. After the absolute value of the interval reaches 13, the interval ratio of the main melody and the accompanying melody tends to be stable, maintaining between 0.6 and 0.9, and the melody interval ratio value completely coincides; the main melody in the interval variable is X. (1) The relative difference value in the interval of -X(16) fluctuates greatly. After the absolute value of the interval reaches 17, the interval ratio of the main melody and the accompanying melody tends to be stable, maintaining between 0.01 and 0.04 and the main melody. The value of the difference is always higher than the accompanying melody. (2) When the number of feature maps is 24∗5, the recognition result is the most accurate, MAP recognition result can reach 78.8, and the recognition result of precision@ is 79.2; when the feature map size is 5∗5, the recognition result is the most accurate, MAP recognition result can reach 78.9, the recognition result of precision@ is 79.2, and the recognition result of HAM2 (%) is 78.6. The detection accuracy of the SNN music recognition model proposed in the article is the highest. When the number of bits is 64, the detection accuracy of the SNN detection model is 59.2%, and the detection accuracy of the improved SNN music recognition model is 79.3%, which is better than the detection rate of ITQ music recognition model of 17.9%, which is 61.4% higher. The experimental data further shows that the detection efficiency of the ITQ music recognition model is the highest. (3) The SNN music recognition model proposed in the article has the highest detection accuracy, regardless of whether it is in a noisy or no-noise music environment, with an accuracy rate of 97.97% and a detection accuracy value of 0.88, which is 5 types of music. The highest one among the recognition models, the ITQ music recognition model, has the lowest detection accuracy, with a detection accuracy of 67.47% in the absence of noise and a detection accuracy of 70.23% in the presence of noise. Although there is a certain noise removal technology, it can suppress noise interference to a certain extent, but cannot accurately describe music information, and the detection accuracy rate is also low.Entities:
Mesh:
Year: 2022 PMID: 35310596 PMCID: PMC8933112 DOI: 10.1155/2022/3733818
Source DB: PubMed Journal: Comput Intell Neurosci
Figure 1Framework diagram of music feature recognition.
Figure 2Functional structure diagram.
Tone-emotion mapping relationship.
| Musical emotion type | Tonal characteristics |
|---|---|
| Hate class | The tone is sharp, rough, and bright |
| Depression | The tone is deep, pure, simple, monotonous bass, dim, plain, and hollow |
| Calm meditation class | The tone is soft, pure, and simple |
| Desire | The tone is deep, plain, hollow, pure, and simple |
| Pastoral style | The tone is soft, pure, and simple |
| Perceptual | The tone is soft, sweet and soft, rich, gorgeous, pleasant, and nasal |
| Active class | The tone is bright, rich, and gorgeous |
| Awesome | The tone is bright, sharp, and brilliant |
Algorithm definition table.
| Definition | Publicity | |
|---|---|---|
| Interval statistics | Statistics of the melody interval are carried out on each track of the music [ |
|
| Classification algorithm | According to the different characteristics of the interval distribution, the main track and the accompanying track are distinguished [ | Rhythm=( |
Figure 3Interval statistics of different audio tracks.
Figure 4Relative difference distribution of interval statistics.
Recognition results of different number of feature maps.
| Quantity | MAP (%) | Precision@500 (%) | HAM2 (%) |
|---|---|---|---|
| 8 | 74.7 | 76.3 | 74.9 |
| 16 | 74.5 | 77.1 | 76.2 |
| 24 | 78.8 | 79.2 | 79.6 |
| 32 | 77.6 | 78.9 | 78.1 |
| 48 | 77.3 | 77.5 | 76.6 |
| 64 | 74.8 | 76.6 | 75.7 |
Recognition results of different feature map sizes.
| Quantity | MAP (%) | Precision@500 (%) | HAM2 (%) |
|---|---|---|---|
| 4 | 78.7 | 79.2 | 78.8 |
| 5 | 78.9 | 79.2 | 78.6 |
| 6 | 78.8 | 79.2 | 79.6 |
| 7 | 78.7 | 78.9 | 78.6 |
| 8 | 77.6 | 77.9 | 77.7 |
| 9 | 76.4 | 77.2 | 76.8 |
| 10 | 76.8 | 77.5 | 77.1 |
| 11 | 75.7 | 76.6 | 76.3 |
| 12 | 75.8 | 75.9 | 76.1 |
| 13 | 75.2 | 75.6 | 75.7 |
| 14 | 74.1 | 75.7 | 75.8 |
Figure 5Statistics of recognition results.
Figure 6Statistics of recognition results.
Average mean precision of different number of bits.
| Method | 16 bits | 24 bits | 32 bits | 48 bits | 64 bits |
|---|---|---|---|---|---|
| SNN music recognition model | 55.2 | 56.6 | 55.8 | 58.1 | 59.2 |
| Improved SNN music recognition model | 76.7 | 77.9 | 78.3 | 78.9 | 79.3 |
| CNNH music recognition model | 46.5 | 52.1 | 52.1 | 53.2 | 53.3 |
| KSH music recognition model | 30.3 | 33.7 | 34.7 | 35.6 | 36.5 |
| ITQ music recognition model | 16.2 | 16.9 | 17.3 | 17.5 | 17.9 |
Figure 7Average mean precision statistics.
Evaluation criteria table.
| Index | Metrics | Formula |
|---|---|---|
| Accuracy | The accuracy measurement standard refers to the ratio of the number of correct music types to the number of all music types [ | Precision=(hits |
| Recall rate | The recall rate standard refers to the proportion of the theoretically largest number of hits that recognize musical characteristics [ | Recall=(hits |
|
| The |
|
Music classification and detection object table.
| Music type number | Noisy | No noise |
|---|---|---|
| 1 | 10 | 30 |
| 2 | 10 | 30 |
| 3 | 20 | 40 |
| 4 | 10 | 40 |
| 5 | 20 | 50 |
| 6 | 20 | 50 |
No-noise recognition results.
| Model | Accuracy (%) | Accuracy (%) | Recall rate (%) |
|
|---|---|---|---|---|
| SNN music recognition model | 95.71 | 96.83 | 96.74 | 95.82 |
| Improved SNN music recognition model | 97.97 | 98.10 | 97.89 | 98.00 |
| CNNH music recognition model | 80.21 | 82.31 | 83.24 | 84.51 |
| KSH music recognition model | 72.14 | 73.46 | 90.26 | 91.32 |
| ITQ music recognition model | 67.47 | 68.24 | 66.76 | 67.12 |
Noisy recognition results.
| Model | Accuracy (%) | Accuracy (%) | Recall rate (%) |
|
|---|---|---|---|---|
| SNN music recognition model | 92.12 | 91.83 | 91.64 | 91.28 |
| Improved SNN music recognition model | 93.91 | 94.21 | 94.74 | 94.62 |
| CNNH music recognition model | 75.32 | 77.23 | 76.34 | 77.21 |
| KSH music recognition model | 68.62 | 69.24 | 69.12 | 68.24 |
| ITQ music recognition model | 70.23 | 71.22 | 74.21 | 72.45 |
Figure 8Comparison of uninteresting music classification and detection accuracy.
Figure 9Comparison of dry music classification and detection accuracy.