| Literature DB >> 25101013 |
Alessandro Soranzo1, Massimo Grassi2.
Abstract
PSYCHOACOUSTICS is a new MATLAB toolbox which implements three classic adaptive procedures for auditory threshold estimation. The first includes those of the Staircase family (method of limits, simple up-down and transformed up-down); the second is the Parameter Estimation by Sequential Testing (PEST); and the third is the Maximum Likelihood Procedure (MLP). The toolbox comes with more than twenty built-in experiments each provided with the recommended (default) parameters. However, if desired, these parameters can be modified through an intuitive and user friendly graphical interface and stored for future use (no programming skills are required). Finally, PSYCHOACOUSTICS is very flexible as it comes with several signal generators and can be easily extended for any experiment.Entities:
Keywords: auditory perception; matlab toolbox; maximum likelihood estimation; pest; psychoacoustics; staircase
Year: 2014 PMID: 25101013 PMCID: PMC4104800 DOI: 10.3389/fpsyg.2014.00712
Source DB: PubMed Journal: Front Psychol ISSN: 1664-1078
Figure 1Hypothetical results of a 3 The dotted curve interpolating the subject's data points is the psychometric function.
Figure 2Hypothetical threshold tracking with the method of limits (left) and with the simple up-down procedure (right). The plus sign represents the correct responses whereas asterisk represents the incorrect responses. Note that the threshold trackings are identical up to trial n. 9. Both trackings start with a stimulus level of 6.
Figure 3Hypothetical threshold tracking with the transformed up-down procedure. The plus sign represents the correct responses whereas asterisk represents the incorrect responses. The starting stimulus level is 6. The total number of reversals is 12. The first four reversals are performed with a step size of 1 and the successive eight are performed with a step size of 0.5. Note how the transformed rule lengthens the threshold tracking in comparison with the method of limits or the simple up-down procedure (see Figure 2).
Figure 4Hypothetical threshold tracking with PEST. The plus sign represents the correct responses whereas asterisk represents the incorrect responses. The starting stimulus level is 6. W is set to 1 and step size is initially equal to 2 and it is halved twice during the block.
Figure 5Hypothetical threshold tracking with MLP. The plus sign represents the correct responses whereas the asterisk represents the incorrect responses. The starting stimulus level is 6. Note how in the first trials MLP literally “jumps” between very different stimuli levels.
Figure 6Screenshot of the IntensityDiscriminationPureTone.m file.
Figure 7Screenshot of the file IntensityDiscriminatioPureTone.m after the frequency and the tone duration have been changed.
| Absolute threshold for a 500-ms pure tone of 1-kHz. The tone is gated on and off with two raised cosine ramps of 10-ms. | |
| A 20-ms, 1-kHz pure tone (the signal) is presented immediately before (i.e., no silent gap) a band of bandpass noise of 300-ms (400–1600 Hz). All sounds are onset and offset gated by means of two raised cosine onset and offset ramps of 10-ms. The subject has to detect the tone (in yes/no task) or to tell which interval has the tone. | |
| A 20-ms, 1-kHz pure tone (the signal) is presented immediately after (i.e., no silent gap) band of bandpass noise of 300-ms (400–1600 Hz). All sounds are onset and offset gated by means of two raised cosine onset and offset gates of 10-ms. The subject has to detect the tone (in yes/no task) or to tell which interval has the tone. | |
| A 20-ms, 1-kHz sine tone (the signal) is presented in the temporal center of a band of bandpass noise of 300-ms (400–1600 Hz). All sounds are onset and offset gated by means of two raised cosine ramps of 10-ms. The subject has to detect the tone (in yes/no task) or to tell which interval has the tone. | |
| Pitch discrimination threshold for a 250-ms, 1-kHz pure tone. The subject has to tell the highest pitch tone. Onset and offset of tones are gated on and off with two 10-ms raised cosine ramps. See Kidd et al. ( | |
| Intensity discrimination threshold for a 1-kHz, 250-ms pure tone. The subject has to tell the loudest tone. The onset and offset of the tones are gated with two 10-ms raised cosine ramps. The standard is −30-dB attenuated in level. See Kidd et al. ( | |
| Duration discrimination for a 1-kHz, 250-ms pure tone. The subject has to tell the longest tone. The tone has raised cosine onset and offset gates of 10-ms. See Kidd et al. ( | |
| Pulse-train discrimination. The standard stimulus consists of six 20-ms pulses of a 1-kHz tone. These pulses are arranged in three pairs, with 40-ms of silence between members of a pair and 120 ms between pairs. The temporal structure of the variable sequence is varied by increasing the separation between members of each pair, with a corresponding decrease in the between-pair time and, thus, a constant interval between the first tones in each of the successive pairs. Thus, the first, third, and fifth tones are fixed in time, while the onsets of the second, fourth, and sixth tones are delayed by varying amounts. See Kidd et al. ( | |
| Subjects listen for one member of a sequence of nine tones with frequencies ranging from 300 to 3000-Hz. A different, randomly selected series of nine tones is presented on each trial. The task is to detect the presence of the fifth tone in the sequence. The tone is absent in the standard. The duration of all tones except the fifth, or target tone, is 40-ms. All tones have 2.5-ms raised cosine onset and offset gates. The test is made more difficult by reducing the duration of the target tone. See Kidd et al. ( | |
| Temporal order for tones. The task is to discriminate the order in which two equal-duration pure tones of 550 and 710-Hz are presented. The duration of the two tones is varied according to listener performance. Tones are presented without a gap between them and are preceded and followed, without gaps, by 100-ms “leader” and “trailer” tones at 625-Hz. The onset and offset of the tones are gated with two 10-ms raised cosine ramps. See Kidd et al. ( | |
| Sinusoidal Amplitude Modulation (SAM) noise discrimination. A 500-ms Gaussian noise is sinusoidally amplitude modulated at 8-Hz. The depth of the modulation is expressed as 20log(m), where m is a modulation index that ranges from 0.0 (no modulation) to 1.0 (full modulation). The subject has to detect the modulation (in yes/no task) or to tell which interval has the modulated noise. Modulated and unmodulated stimuli are equated for total RMS power. Noises have two 10-ms raised cosine ramps at onset and offset. The threshold is the modulation depth (in dB). See Kidd et al. ( | |
| Sinusoidal Amplitude Modulation (SAM) noise discrimination. A 500-ms Gaussian noise is sinusoidally amplitude modulated at 20-Hz. The depth of the modulation is expressed as 20log(m), where m is a modulation index that ranges from 0.0 (no modulation) to 1.0 (full modulation). The subject has to detect the modulation (in yes/no task) or to tell which interval has the modulated noise. Modulated and unmodulated stimuli are equated for total RMS power. Noises have two 10-ms raised cosine ramps at onset and offset. The threshold is the modulation depth (in dB). See Kidd et al. ( | |
| Sinusoidal Amplitude Modulation (SAM) noise discrimination. A 500-ms Gaussian noise is sinusoidally amplitude modulated at 60-Hz. The depth of the modulation is expressed as 20log(m), where m is a modulation index that ranges from 0.0 (no modulation) to 1.0 (full modulation). The subject has to detect The modulation (in yes/no task) or to tell which interval has the modulated noise. Modulated and unmodulated stimuli are equated for total RMS power. Noises have two 10-ms raised cosine ramps at onset and offset. The threshold is the modulation depth (in dB). See Kidd et al. ( | |
| Sinusoidal Amplitude Modulation (SAM) noise discrimination. A 500-ms Gaussian noise is sinusoidally amplitude modulated at 200-Hz. The depth of the modulation is expressed as 20log(m), where m is a modulation index that ranges from 0.0 (no modulation) to 1.0 (full modulation). The subject has to detect the modulation (in yes/no task) or to tell which interval has the modulated noise. Modulated and unmodulated stimuli are equated for total RMS power. Noises have two 10-ms raised cosine ramps at onset and offset. The threshold is the modulation depth (in dB). See Kidd et al. ( | |
| Ripple noise discrimination. A 500-ms digital Gaussian noise is lowpass filtered at 3000-Hz. Sinusoidal ripples are created by adding the noise to itself with a 5-ms delay. The delayed noise is attenuated by a variable amount. The standard is always a 500-ms broadband noise with the same bandpass filtering as the “rippled” samples, but with a uniform power spectrum. Standard and variable are equalized to average RMS power. The threshold is the attenuation (in dB) of the delayed noise. See Kidd et al. ( | |
| Gap detection. A band of 750-ms gaussian noise has a gap in its temporal center. Gap duration is varied according to the listener performance. The noise has 0.5-ms cosine ramps at the beginning and end of the gap. In nI-nAFC tasks, the standard is always a 750-ms broadband noise with no gap whereas the variable contains the gap. See Kidd et al. ( | |
| Gap-duration discrimination. The standard is a 750-ms Gaussian noise with a silent gap of 40-ms placed at its temporal center. The variable has a variable gap duration and the length of the gap is changed as a function of the subject performance. All noises have a 0.5-ms cosine ramp at onset and offset. See Kidd et al. ( | |
| PitchDiscriminationComplexTone | Pitch discrimination threshold for a 250-ms complex tone. The tone has four harmonics ( |
| IntensityDiscriminationComplexTone | Intensity discrimination threshold for a 250-ms complex tone. The tone has four harmonics ( |
| Intensity discrimination threshold for a 250-ms white noise. The subject has to tell the loudest noise. The onset and offset of the noises are gated with two 10-ms raised cosine ramps. The standard is −30-dB attenuated in level. | |
| Duration discrimination for a 250-ms complex tone. The tone has four harmonics ( | |
| Duration discrimination for 250-ms white noise. The subject has to tell the longest noise. The noise has raised cosine onset and offset gates of 10-ms. | |
| Profile Analysis. In this experiment the subject listens to three complex tones. Two are identical (the standards). They have five harmonics all at the same amplitude ( | |
| Melody mistuning detection. The major diatonic equitempered scale is played (starting do, do4 = 261.6-Hz). The sol note has a variable pitch. The subject has to tell whether the scale is in tune or out of tune (in yes/no task) or to tell the out of tune scale (in nAFC task). Notes are 500-ms complex tones of five harmonics. All tones are gated on and off with two raised cosine ramps of 10-ms. The threshold is estimated in cents. To convert the threshold in hertz: threshold = 261.6*2 ∧ ((700 + |
.
| Method of limits | 50 | −1.03 |
| Simple up-down | 50 | −1.03 |
| Transformed up-down | 70.7 | 0.32 |
| PEST | 75 | 0.60 |
| MLP | 72.8 | 0.45 |