| Literature DB >> 26868955 |
Martin Kirchberger1, Frank A Russo2.
Abstract
Dynamic range compression serves different purposes in the music and hearing-aid industries. In the music industry, it is used to make music louder and more attractive to normal-hearing listeners. In the hearing-aid industry, it is used to map the variable dynamic range of acoustic signals to the reduced dynamic range of hearing-impaired listeners. Hence, hearing-aided listeners will typically receive a dual dose of compression when listening to recorded music. The present study involved an acoustic analysis of dynamic range across a cross section of recorded music as well as a perceptual study comparing the efficacy of different compression schemes. The acoustic analysis revealed that the dynamic range of samples from popular genres, such as rock or rap, was generally smaller than the dynamic range of samples from classical genres, such as opera and orchestra. By comparison, the dynamic range of speech, based on recordings of monologues in quiet, was larger than the dynamic range of all music genres tested. The perceptual study compared the effect of the prescription rule NAL-NL2 with a semicompressive and a linear scheme. Music subjected to linear processing had the highest ratings for dynamics and quality, followed by the semicompressive and the NAL-NL2 setting. These findings advise against NAL-NL2 as a prescription rule for recorded music and recommend linear settings.Entities:
Keywords: compression; dynamic range; hearing aids; hearing loss; music genre
Mesh:
Year: 2016 PMID: 26868955 PMCID: PMC4753356 DOI: 10.1177/2331216516630549
Source DB: PubMed Journal: Trends Hear ISSN: 2331-2165 Impact factor: 3.293
Peer-Reviewed Research Exploring the Effect of Different Compression Settings on Music Perception in Hearing-Impaired Listeners.
| Reference | No. of stimuli/genres | Methods/conditions | Results |
|---|---|---|---|
| 4/piano, orchestra, pop, lied | Playback to one ear only. Conditions include all combinations of four different CRs (0.25, 0.5, 2, 4) and 3 different number of bands from (1, 4, 16) plus an uncompressed condition as benchmark. | Highest pleasantness ratings for linear amplification. | |
| 4/orchestra, chamber music, pop (not specified which genre contributed two stimuli) | Experiment 1: Conditions included 4 AT/RT combinations: 1/40 10/400 1/4000 100/4000. Gain shape was defined by NAL-R rule. CR was fixed at 2.1:1 for all channels and subjects. Experiment II: Conditions were 4 combinations of AT/RT/CT/CR: 1/40/20/2.1, 1/40/50/3, 1/40/50/2.1, 1/4000/20/2.1 | Highest preference for longer RT, smaller CR, and lower CT. | |
| 2/classic instrumental, lied | Comparison of peak clipping, compression limiting, and WDRC for 80 dB SPL (loud) signals. | WDRC preferred over peak clipping and limiting. | |
| Arehart, Kates, & Anderson (2011) | 3/orchestra, jazz instrumental, single voice | WDRC with 18 bands in 4 combinations of CR/RT: 10/10 ms, 10/70 ms, 2/70 ms, 2/200 ms | Lower CR and longer RT were preferred. All compressive settings were rated worse than the uncompressive setting (linear gain shape NAL-R). |
| 3/classic, jazz, rock | Comparison of ADRO and WDRC. ADRO was less compressive than WDRC. | ADRO preferred to WDRC. However, the experimental setup does not allow to attribute preference only to the linearity but also to further differences such as gain shape. | |
| Moore et al. (2011) | 2/jazz, classic | Comparison of slow (AT: 50 ms, RT: 3 s) and fast compression speeds (AT: 10 ms, RT: 100 ms) at 3 input levels (dB SPL): 50, 65, and 80. | Longer time constants were preferred for classic at 65 dB SPL and 80 dB SPL and for jazz at 80 dB SPL. |
| 2/classic, rock | Three levels of music industry compression (no; mild, CT: −8 dBFS; heavy: −20 dBFS) combined with 3 levels of HA compression (linear; fast, RT: 50 ms; slow RT: 1000 ms) with either 3 or 18 channels. | WDRC generally least preferred. For classic, linear processing and slow WDRC was equally preferred. For rock, linear was preferred over slow WDRC. The effect of HA WDRC was more important than music-industry compression limiting for music preference. |
Note. CR = compression ratio; AT = attack time; RT = release time; CT = compression thresholds; WDRC = wide dynamic range compression (multiband dynamic compression); ADRO = adaptive dynamic range compression; SPL = sound pressure level; HA = hearing aid.
Additional Criteria for Each Genre That the Songs and Stimuli (45-s Segments) Had to Meet Beyond the iTunes Genre Classification.
| Genre | Additional criteria |
|---|---|
| Chamber | Instrumentation: string quartets. |
| Choir | Stimulus contains only vocal elements and no accompanying instruments. |
| Opera | Stimulus contains vocal and instrumental elements. |
| Orchestra | Only nonvocal excerpts accepted. |
| Piano | Only solo piano accepted. |
| Jazz | Stimulus contains drums, bass, piano, and a lead brass instrument. |
| Pop | Stimulus contains vocal and instrumental elements. |
| Rap | Stimulus contains rapped vocal elements and instrumental elements. |
| Rock | Stimulus contains vocals and a distorted guitar. |
| Schlager | Stimulus contains vocals. |
Figure 1.Dynamic range of speech and 10 different music genres. The lines represent percentiles in dB (SPL) across frequency in kHz (99th: upper dashed line, 90th: upper solid line, 65th: thick line, 30th: lower solid line, and 10th: lower dashed line).
SPL = sound pressure level.
Figure 2.Dynamic range comparison between different music genres and speech across frequency. The dynamics are calculated as the difference between the 99th and 30th percentiles according to the IEC 60118-15 standard.
SPL = sound pressure level.
Characteristics of the Test Participants: Audiometric Data (dB HL), Age (Years), Hearing Aid Experience/HAE (Years), Music Experience/ME (Range From −3: Low to 4: High), and Coupling (Dome: o. = Open, cl. = Closed, po. = Power; Rec./receiver: s = Standard, p = Power).
| Left ear, frequency (kHz) | Right ear, frequency (kHz) | Sex | Age | HAE | ME | Cpl | ||||||||||||||||
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| 0.1 | 0.25 | 0.5 | 1 | 2 | 4 | 6 | 8 | 0.1 | 0.25 | 0.5 | 1 | 2 | 4 | 6 | 8 | Dome | Rec. | |||||
| P1 | 30 | 30 | 35 | 60 | 60 | 70 | 115 | 105 | 30 | 35 | 35 | 45 | 60 | 70 | 90 | 80 | m | 80 | 11 | 2.5 | cl. | s |
| P2 | 20 | 10 | 5 | 20 | 65 | 75 | 85 | 100 | 25 | 20 | 10 | 5 | 40 | 70 | 90 | 90 | f | 64 | >10 | 1 | o. | s |
| P3 | 10 | 10 | 5 | 25 | 45 | 60 | 85 | 100 | 15 | 10 | 15 | 10 | 55 | 90 | 100 | 100 | m | 72 | 7 | −1.5 | o. | s |
| P4 | 5 | 10 | 20 | 55 | 65 | 70 | 80 | 75 | 10 | 10 | 15 | 30 | 75 | 70 | 90 | 80 | m | 74 | 12 | −2 | cl. | s |
| P5 | 40 | 45 | 45 | 65 | 70 | 80 | 90 | 80 | 35 | 35 | 40 | 50 | 60 | 80 | 80 | 75 | m | 67 | >10 | 1.5 | po. | s |
| P6 | 25 | 40 | 40 | 75 | 75 | 100 | 105 | 105 | 15 | 20 | 30 | 35 | 45 | 65 | 75 | 85 | m | 73 | 8.5 | −1 | cl. | s |
| P7 | 10 | 20 | 55 | 70 | 75 | 85 | 95 | 90 | 10 | 10 | 20 | 50 | 65 | 95 | 100 | 100 | m | 72 | 6 | −2.5 | cl. | p |
| P8 | 15 | 15 | 25 | 50 | 55 | 75 | 100 | 95 | 15 | 15 | 20 | 25 | 70 | 85 | 100 | 105 | m | 70 | 0.2 | −2 | o. | s |
| P9 | 20 | 15 | 5 | 30 | 45 | 90 | 105 | 95 | 10 | 10 | 10 | 45 | 40 | 100 | 100 | 100 | m | 70 | 10 | −3 | o. | p |
| P10 | 25 | 30 | 55 | 70 | 75 | 75 | 80 | 85 | 25 | 25 | 35 | 50 | 70 | 70 | 75 | 75 | m | 65 | 8 | 0 | cl. | s |
| P11 | 15 | 30 | 55 | 75 | 75 | 80 | 85 | 90 | 40 | 45 | 50 | 65 | 75 | 85 | 100 | 85 | m | 73 | 16 | −2.5 | cl. | p |
| P12 | 25 | 40 | 65 | 90 | 75 | 75 | 105 | 121 | 30 | 40 | 45 | 60 | 90 | 75 | 90 | 121 | m | 68 | 50 | −3 | po. | p |
| P13 | 20 | 30 | 35 | 35 | 50 | 60 | 70 | 65 | 30 | 30 | 35 | 40 | 40 | 65 | 85 | 80 | m | 69 | 9 | −2.5 | po. | s |
| P14 | 15 | 15 | 10 | 55 | 60 | 65 | 90 | 95 | 15 | 10 | 10 | 15 | 55 | 65 | 80 | 95 | m | 76 | 12 | 0 | cl. | s |
| P15 | 40 | 50 | 55 | 80 | 85 | 95 | 100 | NT | 20 | 25 | 40 | 40 | 55 | 80 | NT | 85 | m | 71 | 5 | 0.5 | cl. | p |
| P16 | 30 | 30 | 65 | 80 | 80 | 85 | 90 | 85 | 25 | 25 | 35 | 40 | 55 | 80 | 90 | 90 | m | 75 | 21 | 1.5 | po. | p |
| P17 | 10 | 5 | 10 | 45 | 65 | 80 | 105 | 85 | 5 | 10 | 10 | 15 | 65 | 80 | 90 | 75 | f | 55 | 7 | −2 | cl. | s |
| P18 | 35 | 55 | 60 | 65 | 65 | 65 | 80 | 65 | 45 | 50 | 55 | 55 | 60 | 55 | 65 | 65 | f | 66 | 25 | 0 | cl. | s |
| P19 | 55 | 60 | 55 | 55 | 55 | 55 | 65 | 70 | 35 | 55 | 55 | 55 | 60 | 70 | 70 | 65 | m | 73 | 10 | −2 | po. | s |
| P20 | 35 | 60 | 55 | 65 | 60 | 75 | 105 | 106 | 15 | 40 | 50 | 50 | 60 | 60 | 60 | 100 | m | 68 | 30 | −2 | po. | p |
| P21 | 60 | 70 | 80 | 85 | 75 | 75 | 80 | 80 | 55 | 60 | 60 | 65 | 85 | 75 | 80 | 85 | m | 66 | 25 | −2 | po. | p |
| P22 | 20 | 40 | 45 | 60 | 55 | 60 | 65 | 60 | 20 | 25 | 35 | 50 | 55 | 60 | 65 | 65 | m | 67 | 13 | 2 | cl. | p |
| P23 | 35 | 55 | 55 | 65 | 70 | 75 | 80 | 80 | 40 | 40 | 55 | 50 | 60 | 75 | 75 | 85 | m | 77 | 30 | 0.5 | po. | s |
| P24 | 50 | 60 | 50 | 65 | 60 | 65 | 80 | 75 | 35 | 40 | 35 | 40 | 55 | 50 | 60 | 70 | f | 78 | 3 | −2 | po. | s |
| P25 | 35 | 55 | 55 | 65 | 75 | 80 | 90 | 105 | 55 | 60 | 60 | 60 | 70 | 85 | 100 | 105 | m | 76 | >10 | −1.5 | po. | p |
| P26 | 30 | 60 | 70 | 55 | 65 | 70 | 75 | 85 | 30 | 35 | 60 | 70 | 65 | 80 | 85 | 105 | m | 67 | 37 | 1 | po. | p |
| P27 | 25 | 45 | 45 | 50 | 70 | 60 | 60 | 55 | 20 | 30 | 40 | 50 | 50 | 65 | 55 | 60 | m | 48 | 7 | −2 | cl. | s |
| P28 | 15 | 35 | 45 | 70 | 60 | 65 | 70 | 95 | 20 | 20 | 40 | 60 | 75 | 70 | 65 | 85 | m | 70 | 16 | 1.5 | po. | p |
| P29 | 30 | 55 | 60 | 55 | 45 | 45 | 50 | 70 | 30 | 40 | 45 | 55 | 50 | 40 | 50 | 65 | f | 69 | 10 | −3 | po. | s |
| P30 | 25 | 45 | 65 | 70 | 60 | 70 | 75 | 85 | 30 | 30 | 45 | 65 | 70 | 70 | 75 | 70 | m | 51 | >10 | 4 | cShell | s |
| P31 | 40 | 45 | 55 | 50 | 55 | 60 | 65 | 90 | 45 | 45 | 45 | 55 | 60 | 55 | 80 | 80 | f | 72 | 28 | 0.5 | po. | p |
Note. NT indicates a hearing threshold that was not tested.
Figure 3.Schematic gain curves within a compression band for the linear, semicompressive, and compressive condition. The compression ratio of the semicompressive condition (CR = Δgain/Δinput) is half the compression ratio of the compressive condition (CR = 2 * Δgain/Δinput).
Figure 4.Example loudness curves of the linear, semicompressive, and compressive version (here for participant 29 and stimulus 3).
Figure 5.Dynamic range of the test sample in Experiment 2 (left column) and the audio corpus in Experiment 1 (right column) for the genres choir, opera, orchestra, pop, and schlager. The lines represent percentiles in dB (SPL) across frequency in kHz (99th: upper dashed line, 90th: upper solid line, 65th: thick line, 30th: lower solid line, and 10th: lower dashed line).
Details About the 20 Music Segments of Experiment 2 (Music Taste: Participant’s Average Music Taste Ratings [−1: Do Not Like; 1: Like]).
| Genre | Title | Artist | Length (s) | Release (year) | Music taste |
|---|---|---|---|---|---|
| Choir | Op. 113 No. 5—Frauenchöre Kanons—Wille, wille, will | Brahms | 9.9 | 2003 | 0.31 |
| Op. 29—Zwei Motetten—Es ist das Heil uns kommen her | Brahms | 12.2 | 2003 | 0.17 | |
| Jube Domine, for soloists & douple chorus in C major | Mendelssohn | 10.0 | 2002 | 0.17 | |
| Op. 69 No. 1—1 Herr nun lassest du deinen Diener in Frieden fahren | Mendelssohn | 15.1 | 2002 | 0.28 | |
| Opera | Carmen—Act 1—Attention! Chut! Attention! Taisons-Nous! | Bizet | 11.2 | 2003 | 0.21 |
| Don Giovanni—Act 1—Udisti? Qualche bella | Mozart | 10.1 | 2007 | 0.41 | |
| Boris Godunov—Act 2—Uf! Tyazheló! Day Dukh Perevedú | Mussorgsky | 16.0 | 2002 | −0.21 | |
| Rosenkavalier—Act 3—Haben euer Gnaden noch weitere Befehle | Strauss | 16.3 | 2001 | 0.14 | |
| Orchestra | Symphonie No. 9 in C-major KV 73—Molto allegro | Mozart | 10.7 | 2002 | 0.52 |
| Symphony No. 1 in E-flat major—Molto Allegro | Mozart | 8.8 | 2002 | 0.79 | |
| Symphonie No. 8—Tempo di Menuetto | Beethoven | 10.0 | 2002 | 0.69 | |
| Symphonic Poem Op. 16—Ma Vlast Hakon Jarl | Smetana | 11.0 | 2007 | 0.31 | |
| Pop | Downtown | Gareth Gates | 9.2 | 2002 | 0.17 |
| Tears and rain | James Blunt | 11.3 | 2004 | −0.07 | |
| Shape of my heart | Backstreet Boys | 12.5 | 2000 | 0.31 | |
| Rock with you | Michael Jackson | 8.4 | 2003 | 0.21 | |
| Schlager | Kleine Schönheitsfehler | Sylvia Martens | 12.6 | 2011 | 0.38 |
| Sternenfänger | Leonard | 10.9 | 2011 | 0.21 | |
| Tränen der Liebe | Peter Rubin | 10.3 | 2011 | 0.00 | |
| Der Mann ist das Problem | Udo Jürgens | 14.7 | 2014 | 0.17 |
Figure 6.Example screen of the main test.
Figure 7.Ratings and standard error for the data on dynamics, quality, and loudness in the linear, semicompressive, and compressive condition.