Literature DB >> 34046902

The left dorsal stream causally mediates the tone labeling in absolute pitch.

Lars Rogenmoser1, Andra Arnicane2, Lutz Jäncke2,3, Stefan Elmer2.   

Abstract

Absolute pitch (AP) refers to the ability to effortlessly identify given pitches without any reference. Correlative evidence suggests that the left posterior dorsolateral prefrontal cortex (DLPFC) is responsible for the process underlying pitch labeling in AP. Here, we measured the sight-reading performance of right-handed AP possessors and matched controls under cathodal and sham transcranial direct current stimulation of the left DLPFC. The participants were instructed to report notations as accurately and as fast as possible by playing with their right hand on a piano. The notations were simultaneously presented with distracting auditory stimuli that either matched or mismatched them in different semitone degrees. Unlike the controls, AP possessors revealed an interference effect in that they responded slower in mismatching conditions than in the matching one. Under cathodal stimulation, this interference effect disappeared. These findings confirm that the pitch-labeling process underlying AP occurs automatically and is largely nonsuppressible when triggered by tone exposure. The improvement of the AP possessors' sight-reading performances in response to the suppression of the left DLPFC using cathodal stimulation confirms a causal relationship between this brain structure and pitch labeling.
© 2020 The Authors. Annals of the New York Academy of Sciences published by Wiley Periodicals LLC on behalf of New York Academy of Sciences.

Entities:  

Keywords:  Stroop; cathodal tDCS; interference; musicians; sight-reading

Mesh:

Year:  2021        PMID: 34046902      PMCID: PMC8518498          DOI: 10.1111/nyas.14616

Source DB:  PubMed          Journal:  Ann N Y Acad Sci        ISSN: 0077-8923            Impact factor:   5.691


Introduction

Absolute pitch (AP) is the ability to effortlessly identify the chroma of a tone without the aid of any reference pitch. , This ability is sparsely distributed in the population (<1%), yet it bears phylogenetic and ontogenetic significance. There is considerable scientific consensus on its acquisition, namely that AP emerges from an interplay of a certain genetic predisposition and specific environmental inputs and learning factors (i.e., music engagement and language exposure) that operate within a sensitive period during childhood development. , , , , , , , By contrast, lesser consensus exists on the exact mechanisms and involved brain structures driving AP. One brain structure frequently reported to contribute to AP is the planum temporale (PT), , , , , , , a region that covers the superior temporal plane posterior to the Heschl's gyrus and is involved in language and higher auditory functions. , , , In AP possessors, the PT is likely responsible for a higher resolution in the pitch perception of categories; in other words, the encoding of tones within narrower long‐term stored categories. , , , Neuroimaging studies have found an increased leftward asymmetry of the PT, , , possibly underlying this so‐called categorical pitch perception in AP. , In the right brain hemisphere, this structure, together with a network comprising the Heschl's gyrus and some nonauditory structures, was found to have an impact on AP perception. Another brain structure reported to be related to AP is the left posterior dorsolateral prefrontal cortex (DLPFC). This area drives conditional associative learning and memory. , , , , , In the context of AP, the posterior DLPFC may be responsible for the process underlying the association between categorized pitches and verbal labels or other abstract or sensorimotor codes; , , in other words, the pitch‐labeling process. Neuroimaging studies revealed that AP possessors exhibit comparatively thinner cortical thickness there and selectively recruit this particular brain region during mere tone listening. In the same study, musicians without AP recruited this region while performing a interval‐labeling task. However, they additionally recruited the right inferior frontal gyrus (IFG), an area involved in working memory (WM) retrieval. , Conversely, AP possessors did not show this additional activation while performing the same task. This lack of IFG involvement suggests that AP underlies an automatic pitch‐labeling process, functioning without the use of WM resources. A series of experiments using electroencephalography (EEG) confirmed this interpretation and provided corroborative evidence for the specificity of the pitch‐labeling process underlying AP. In auditory oddball or labeling tasks, AP possessors displayed absent, reduced, or accelerated specific electrophysiological responses pertaining to the P3 complex, , , , , , reflecting a more efficient and parsimonious tone processing in AP. Further event‐related EEG components representing hierarchically higher cognitive processes were found to be specific and linked to the labeling performance of AP possessors. The purpose of the present study was to investigate the causal role of the left DLPFC in the pitch‐labeling process underlying AP by applying a cathodal‐sham transcranial direct current stimulation (tDCS) protocol. Cathodal tDCS suppresses cortical excitability of the targeted region, diminishing its underlying function. Given that the pitch‐labeling process in genuine AP occurs automatically and thus is rather nonsuppressible, , , , , , , , a modulation in the pitch‐labeling performance as a result of cathodal stimulation of the left DLPFC would reveal its causal impact on AP. During cathodal and sham stimulation, participants with AP and matched control participants without AP were instructed to sight‐read, reporting the presenting notations as accurately and fast as possible by playing with their right hand on a piano. Sight‐reading, the practice of reading and immediately performing notations on an instrument or by singing, is an activity that musicians with and without AP easily master. Simultaneously, auditory stimuli were presented during the task that either matched with the notations or mismatched with them. However, these tones were irrelevant to the task. This experimental setup corresponded to a so‐called Stroop paradigm. , The Stroop paradigm measures the interference in performances resulting from conflicting asymmetrical processes, namely between overlearned automatically running and more effortful processes. The classical Stroop experiments revealed the robust finding of worsened color‐naming performance when participants were challenged to name depicted colors of color names semantically standing for different colors. , In the case of the classical Stroop task, the interference is due to the fact that reading is virtually overlearned among literate people, making the execution of the less familiar practice, namely color naming, demanding when simultaneously suppressing the decoding of target words. Meanwhile, this Stroop principle was extended to capture interference effects across multiple domains, including music cognition, , , but also to verify the authenticity of conditions, such as synesthesia , , and AP, , , both of which are characterized by nonsuppressible uncommon additional experiences that are inaccessible to outsiders. In accordance with the Stroop paradigm, , we expected that only AP possessors reveal an interference in the mismatching trials, resulting from the distraction in performing the actual sight‐reading task owing to the unique pitch‐labeling process automatically triggered by tone exposure. Thus, this interference is expected to be reflected as a performance drop in the sight‐reading activity. Furthermore, we expected this drop to diminish when suppressing the left DLPFC using cathodal stimulation. Should the automatic pitch‐labeling process be driven by the left DLPFC, then suppressing its cortical excitability is expected to diminish its function, leading to less interference with the sight‐reading activity and, therefore, to an improvement in performing this task.

Material and methods

Participants

A sample of 36 healthy musicians participated in this study, of which half were AP possessors (13 females) and the others were non‐AP (NAP) control participants (12 females). All participants were right‐handed, as determined using the Annett Handedness Inventory and the Edinburgh Handedness Inventory. The two samples were comparable regarding age (t 34 = 1.86, P = 0.07, d = 0.62), the distribution of the sexes (χ 2 1 = 0.13, P = 0.72), general cognitive capability (t 34 = 0.2, P = 0.84, d = 0.10) as measured by a standard German intelligence screening test (“Kurztest für allgemeine Intelligenz”), and musical aptitude (tonal: t 34 < 0.01, P = 1, d < 0.01; rhythm: t 34 = 0.74, P = 0.46, d = 0.25; total: t 33.23 = 0.09, P = 0.71, d = 0.13) as evaluated using the Advanced Measures of Music Audition test. Both AP and NAP participants commenced their musical training at a comparable age range (t 34 = 1.90, P = 0.07, d = 0.63) and trained for a comparable number of years (t 34 = 1.09, P = 0.29, d = 0.47). Six participants of the AP sample and seven of the NAP sample professionally played the piano as their first instrument. This distribution (binary categorization of a “pianist” versus a “non‐pianist”) did not differ between the two samples (χ 2 1 = 0.12, P = 0.72). However, all participants were skilled at playing the piano, as this instrument was taught during music education as part of their professional degree program. The values on characteristics and musical background are reported in Table 1. All participants gave written informed consent to a protocol that was approved by the Cantonal Institutional Review Board of Zurich.
Table 1

Characteristics and data on the musical background of the two samples

APNAP
Age (years)27.83 (10.14)33.56 (8.23)
Cognitive capability (IQ scores)123.33 (11.25)122.05 (13.20)
Age at commencement of musical practice (years)5.36 (1.55)6.67 (2.47)
Duration of musical training (years)22.47 (9.95)26.61 (7.3)
Advanced Measures of Music Audition test (raw scores)65.33 (7.34)64.33 (8.56)
Tonal (raw scores)31.89 (4.27)31.89 (4.33)
Rhythm (raw scores)33.44 (3.42)32.44 (4.57)

Listed are the means with the standard deviations in parentheses. All independent‐samples t‐test calculated for each variable revealed the values of P > 0.05. AP, absolute pitch; NAP, nonabsolute pitch (each N = 18).

Characteristics and data on the musical background of the two samples Listed are the means with the standard deviations in parentheses. All independent‐samples t‐test calculated for each variable revealed the values of P > 0.05. AP, absolute pitch; NAP, nonabsolute pitch (each N = 18).

AP verification

AP was confirmed using an established pitch‐labeling test previously used in multiple studies on AP. , , , In this test, the participants were instructed to immediately write down the accordant tonal label of corresponding sine tones (A4 tuned at 440 Hz) presented to them. Hundred and eight tones covering three octaves from A3 to A5 were presented in a pseudorandomized order. Each tone was presented three times, the same tones were never presented successively, and each tone had a duration of 1 second. The interstimulus interval was 4 s and filled with Brownian noise. Accuracy was evaluated by summing the number of correct responses. However, the participants were not asked to identify the octaves of the presented tones. AP possessors performed (% correct responses) considerably better (mean correct: 69.5, SD = 21.5) than NAP participants (mean correct: 9.36, SD = 6.9; t 20.46 = 11.29, P < 3.04 × 10−10, d = 3.59). NAP participants did not perform better than chance level (8.3%; t 17 = 0.65, P = 0.53). The individual scores are depicted in Figure 1.
Figure 1

The pitch‐labeling performance. Depicted are the individual scores (%) achieved by the participants with absolute pitch (AP, blue dots; N = 18) and participants without absolute pitch (NAP, red dots; N = 18) from the pitch‐labeling test. The plotted values are the means with standard deviations. The dotted line represents the baseline at 8.3%.

The pitch‐labeling performance. Depicted are the individual scores (%) achieved by the participants with absolute pitch (AP, blue dots; N = 18) and participants without absolute pitch (NAP, red dots; N = 18) from the pitch‐labeling test. The plotted values are the means with standard deviations. The dotted line represents the baseline at 8.3%.

Experimental task and stimulus material

The involvement of the left DLPFC in AP ability was investigated by letting both participant samples perform a modified Stroop task under a tDCS protocol. In this task, the participants were exposed to a stream of bimodal musical stimuli, comprising visually presented notations in combination with auditorily presented tones. The participants were instructed to sight‐read and report the presenting notations as accurately and fast as possible by playing an electronic piano (Yamaha Electronic Piano, P‐60S) with their right hand. During the experiment, the participants positioned their right hand over the piano keys, covering the C4 scale in order to be able to respond promptly. The piano did not deliver auditory feedback but recorded the response behavior. The bimodal musical stimuli were randomly presented in different matching conditions. In half of the trials, the tones correctly corresponded to the notations (congruent), whereas in the other half, they did not (incongruent). The incongruent trials mismatched in six different conditions, deviating between tones and notations in ±1, ±2, and ±3 semitones. Each of the six incongruent conditions had an occurrence probability of 0.083. The set of presented notations comprised the C scale of the 12 subsequent notes ranging from C4 to B4. The set of presented auditory stimuli were piano tones corresponding to the particular tones of the C scale (C4 tuned at 262 Hz), including the extension of the three successive “deviating” semitones at the scale edges in both directions (A3, B3‐flat, B3, and C5, C5‐sharp, D5). The tones were professionally recorded with an acoustic piano (the University of Iowa Electronic Music Studios, http://theremin.music.uiowa.edu/MISpiano.html) and were trimmed later, resulting in lengths of 500 milliseconds. In the task, the notations were presented 1000 ms longer than the tones, resulting in a duration of 1500 milliseconds. Afterward, pink noise followed for a duration of 500 milliseconds. Per trial, responses were allowed and recorded for a duration of <2 s after stimulus onset. The intertrial interval varied randomly between 100 and 200 milliseconds. The tones were delivered via Sennheiser HD 205 headphones at the sound pressure level of 75 dB, and the notations were shown in the center of a PC monitor mounted on top of the electronic piano. The procedure of one trial is illustrated in Figure 2. Stimulus presentation, as well as behavior collection (via Musical Instrument Digital Interface), was controlled by the Presentation software (Neurobehavioral System, Version 18.2).
Figure 2

A schematic representation of the task. Each trial began with a bimodal stimulus that lasted for 500 ms (A), comprising a notation (e.g., G) presented on a monitor and a piano tone presented via headphones. The piano tone was either congruent (e.g., G) or incongruent (e.g., G#) with the notation. The visual counterpart lasted for an additional 1000 ms (B), followed by pink noise for a duration of 500 milliseconds. The next trial followed after a duration jittered between 100 and 200 milliseconds (D). The participants were instructed to report the notation as quickly and accurately as possible by sight‐reading, specifically by playing with their right hand on a piano. Responses were allowed and recorded for a duration of <2 s after stimulus onset (A–C).

A schematic representation of the task. Each trial began with a bimodal stimulus that lasted for 500 ms (A), comprising a notation (e.g., G) presented on a monitor and a piano tone presented via headphones. The piano tone was either congruent (e.g., G) or incongruent (e.g., G#) with the notation. The visual counterpart lasted for an additional 1000 ms (B), followed by pink noise for a duration of 500 milliseconds. The next trial followed after a duration jittered between 100 and 200 milliseconds (D). The participants were instructed to report the notation as quickly and accurately as possible by sight‐reading, specifically by playing with their right hand on a piano. Responses were allowed and recorded for a duration of <2 s after stimulus onset (A–C).

The tDCS protocol

After the participants performed a practice block of 20 trials, the tDCS equipment was applied to them. The participants underwent two subsequent experimental blocks, namely one with the tDCS technique turned on, inhibiting the left DLPFC (cathodal stimulation), and one with it turned off (sham stimulation). The order of the blocks was randomized across participants, and the participants were kept unaware of the respective stimulation condition to avoid confounding effects of expectation and order. Each block lasted 10 min and consisted of 288 trials, of which half were congruent and the other half incongruent. Regarding the incongruent trials, each combination (12 notations paired with tones mismatching in six particular degrees) was presented twice, resulting in 144 trials in total. Regarding the congruent trials, each combination (12 notations paired with matching tones) was presented 12 times, also resulting in 144 trials in total. The current intensity of 1.5 mA was transferred by a saline‐soaked pair of surface sponge electrodes and delivered by a battery‐driven constant current stimulator (NeuroConn GmnH, Ilmenau, Germany). The sponges were stitched to an electroencephalogram cap based on the international 10–20 system to ensure the same placement for all participants. This specific current intensity was chosen based on documentation on the time course of the tDCS aftereffect and on previous tDCS studies investigating the DLPFC. , For cathodal stimulation, the current was applied for 9 min, including fade‐in/out phases of 10 s, respectively. The stimulation and the task were initiated simultaneously. In the sham condition, stimulation was applied for 30 s, including fade‐in/out phases of 10 s, respectively, so that the subjects experienced the initial tingling associated with the onset of stimulation. Between the two blocks, a break of 30 min was held to ensure an unaffected sham session. The left DLPFC was targeted by placing the (cathodal) electrode with a surface of 4 × 8 cm into a sponge on the scalp at the coordinate F3 according to the international 10–20 system. This localization method was successfully conducted in previous studies, , , , , , and was confirmed as an appropriate approach by neuronavigational techniques. As a reference, an (anodal) electrode with a larger surface of 10 × 10 cm was placed on the right parietal area, fixating the corners of the sponge at the coordinates Cz, C4, POZ, and P8 according to the international 10–20 system. This larger surface size was used to minimize current density over the parietal cortex.

Data analysis

For each participant, the median reaction times (RT) and accuracy scores were obtained from each matching and stimulation condition. During the entire study, only 13 missing responses occurred, which were omitted. The RT and accuracy values were imported into the SPSS® software (IBM SPSS Statistics, IBM Corp., Armonk, NY) for statistical analyses. The accuracy scores were not further subjected to inferential statistics due to an obvious ceiling effect (Table 2). Given that the values of the RTs were largely not normally distributed (Table 3), as assessed by the Shapiro–Wilk test, nonparametric procedures were performed to determine the effects of interest. Values outside of the interquartile range (IQR) ± the IQR multiplied by 1.5 were considered outliers. For each group (i.e., AP and NAP) and stimulation condition (i.e., sham and cathodal), Friedman tests with matching as within‐factor (i.e., seven levels: 0, ±1, ±2, and ±3 semitones) were run. Significant results were followed up with pairwise comparisons using Wilcoxon signed‐rank tests. The Bonferroni procedure was applied to correct for multiple comparisons (corrected α′ < 0.05/21 = 0.0024). The impact of the interaction between the within‐factors “stimulation” and “matching” on the RT was estimated using generalized estimating equations (GEE) with gamma as distribution and log as link function together with an unstructured working correlation matrix. Effect size measures were calculated, namely the Cohen's d for the t‐tests, Kendall's coefficient of concordance (W) for the Friedman tests, and the rank‐biserial correlation coefficient (r) for the Wilcoxon signed‐rank tests.
Table 2

Accuracy scores achieved in the modified Stroop task

APNAP
MatchingShamCathodalShamCathodal
−30.99 (0.03)0.98 (0.05)0.98 (0.04)0.97 (0.05)
−20.98 (0.03)1.00 (0.01)1.00 (0.01)0.98 (0.03)
−10.99 (0.02)0.99 (0.01)0.99 (0.02)0.99 (0.02)
00.99 (0.02)0.99 (0.02)0.99 (0.01)0.99 (0.02)
10.99 (0.02)0.98 (0.04)0.97 (0.04)0.98 (0.02)
20.98 (0.05)0.97 (0.07)0.99 (0.02)0.99 (0.02)
30.99 (0.02)0.99 (0.03)0.99 (0.02)0.97 (0.05)

The mean accuracy score (correct responses/total trials) is listed for each condition for both samples (AP, absolute pitch; NAP, nonabsolute pitch; each N = 18). The standard deviations are listed in parentheses. Matching units are in semitones. Negative numbers indicate “flat” (i.e., lower) deviations, and positive numbers indicate “sharp” (i.e., higher) deviations.

Table 3

Reaction times achieved in the modified Stroop task

APNAP
MatchingShamCathodalShamCathodal
−3655 (632−756)*** 668 (615−725)** 648 (601−751)656 (616−718)
−2669 (618−725)*** 661 (608−709)*** 653 (598−746)655 (607−716)
−1667 (620−763)*** 662 (614−719)** 641 (617−732)* 660 (614−736)
0651 (601−701)*** 655 (597−694)*** 629 (597−744)* 653 (607−718)
1662 (622−730)*** 662 (624−740)** 631 (595−755)* 654 (602−723)
2663 (614−739)*** 661 (601−730)** 640 (608−721)* 654 (616−726)
3666 (632−752)*** 665 (611−768)** 644 (609−768)* 656 (626−709)

The median reaction time (ms) is listed for each condition for both samples (AP, absolute pitch; NAP, non‐absolute pitch, each N = 18). The lower and upper quartiles are listed in parentheses. Matching units are in semitones. Negative numbers indicate “flat” (i.e., lower) deviations, and positive numbers indicate “sharp” (i.e., higher) deviations.

* P < 0.05, ** P < 0.01, and *** P < 0.001, as revealed by the Shapiro–Wilk test.

Accuracy scores achieved in the modified Stroop task The mean accuracy score (correct responses/total trials) is listed for each condition for both samples (AP, absolute pitch; NAP, nonabsolute pitch; each N = 18). The standard deviations are listed in parentheses. Matching units are in semitones. Negative numbers indicate “flat” (i.e., lower) deviations, and positive numbers indicate “sharp” (i.e., higher) deviations. Reaction times achieved in the modified Stroop task The median reaction time (ms) is listed for each condition for both samples (AP, absolute pitch; NAP, non‐absolute pitch, each N = 18). The lower and upper quartiles are listed in parentheses. Matching units are in semitones. Negative numbers indicate “flat” (i.e., lower) deviations, and positive numbers indicate “sharp” (i.e., higher) deviations. * P < 0.05, ** P < 0.01, and *** P < 0.001, as revealed by the Shapiro–Wilk test.

Results

The accuracy scores and RT values achieved at each condition by both samples are listed in Tables 2 and 3. Three AP participants were identified as outliers, of which two stood out systematically across all matching and stimulation conditions. In the NAP sample, one outlier was detected. Overall, the outliers did not skew the results. The RT values, including outliers, are depicted in Figures 3 and 4.
Figure 3

Reaction times achieved by AP possessors in the modified Stroop task. The plotted values are the medians together with the IQRs for each matching and stimulation condition. The colored triangles represent outliers. Each color (i.e., blue, red, and green) refers to a single participant. Negative semitones indicate “flat” (i.e., lower) deviations, and positive semitones indicate “sharp” (i.e., higher) deviations. Bonferroni‐adjusted * P < 0.05; (*) P = 0.054 without outliers, otherwise P value is <0.05.

Figure 4

Reaction times achieved by the control participants in the modified Stroop task. Plotted are the medians together with the IQRs for each matching and stimulation condition. The triangle represents an outlier. Negative semitones indicate “flat” (i.e., lower) deviations, and positive semitones indicate “sharp” (i.e., higher) deviations.

Reaction times achieved by AP possessors in the modified Stroop task. The plotted values are the medians together with the IQRs for each matching and stimulation condition. The colored triangles represent outliers. Each color (i.e., blue, red, and green) refers to a single participant. Negative semitones indicate “flat” (i.e., lower) deviations, and positive semitones indicate “sharp” (i.e., higher) deviations. Bonferroni‐adjusted * P < 0.05; (*) P = 0.054 without outliers, otherwise P value is <0.05. Reaction times achieved by the control participants in the modified Stroop task. Plotted are the medians together with the IQRs for each matching and stimulation condition. The triangle represents an outlier. Negative semitones indicate “flat” (i.e., lower) deviations, and positive semitones indicate “sharp” (i.e., higher) deviations. In the AP sample, the Friedman test revealed a significant effect of matching in the sham (χ 2 6 = 20.44, P = 0.002, W = 0.19) but not in the cathodal condition (χ 2 6 = 11.63, P = 0.07, W = 0.11). This effect remained after removing the outliers (sham: χ 2 6 = 18.95, P = 0.004, W = 0.21; cathodal: χ 2 6 = 11.84, P = 0.07, W = 0.13). In the sham condition, post hoc Wilcoxon signed‐rank tests revealed significant differences in two congruent‐incongruent pairs (−2: z = −3.42, Bonferroni‐adjusted P = 0.013, r = −0.57; −1: z = −3.47, Bonferroni‐adjusted P = 0.012, r = −0.59) and in one pair after removing the outliers (−1: z = −3.12, Bonferroni‐adjusted P = 0.037, r = −0.57). The GEE analysis revealed a significant effect of the interaction between “matching” and “stimulation” on the RT (Wald χ 2 6 = 16.95, P = 0.009) even after removing the outliers (Wald χ 2 6 = 27.71, P < 0.001). In the NAP sample, the Friedman test did not reveal any significant effects of matching on the RT (sham: χ 2 6 = 2.17, P = 0.903, W = 0.02; cathodal: χ 2 6 = 6.34, P = 0.386, W = 0.06) even after removing the outlier (sham: χ 2 6 = 4.15, P = 0.657, W = 0.04; cathodal: χ 2 6 = 5.01, P = 0.543, W = 0.05). The GEE analysis did not reveal any interaction effect between “matching” and “stimulation” on the RT (Wald χ 2 6 = 1.246, P = 0.975) even after removing the outlier (Wald χ 2 6 = 0.82, P = 0.992).

Discussion

In this study, we investigated the causal role of the left DLPFC in the pitch‐labeling process underlying AP by using a customized Stroop task in the context of a cathodal‐sham tDCS protocol. Whereas the previous studies on AP using Stroop tasks recorded the participants’ responses vocally , , or per button click, our experimental setup allowed a more natural behavior, namely the responding by performing on an actual piano, constituting a highly familiar situation incorporated in a musician's everyday life.

Stroop and tDCS findings

The nearly perfect accuracy scores achieved in the modified Stroop task by both possessors and nonpossessors of AP ensure that the notations have properly been internalized and that the sight‐reading activity has conscientiously been executed. This compliance appears nontrivial due to the large range and variation of stimulus–response commands provided during the task. Whereas the previous Stroop studies on AP included only a handful of stimuli with two matching conditions (i.e., congruent and incongruent), , , , , our task explored an entire scale (i.e., 12 notations and 12 particular piano key responses) with matching and mismatching conditions, systematically covering three double‐sided levels of deviation (±1, ±2, and ±3 semitones). Despite the participants’ high performance, an interference was still detectable under the sham condition in the RT of AP possessors. This interference was reflected in that AP possessors responded slower to notations with mismatching tones than to notations with matching tones. This effect was centered within narrow deviations in the flat direction (i.e., −2 and −1). This finding suggests that the impact of distraction on pitch labeling increases as a function of the difficulty in pitch categorization. Furthermore, this finding implies a certain asymmetrical proneness that may underlie AP, mirroring, in this case, the observation of usual “undershooting” when singing out‐of‐tune. Consistent with this presumption are findings showing that some aging AP possessors undergo a distortion in pitch perception that are mostly biased toward one direction. , Under cathodal stimulation, the interference effect shown by the AP possessors disappeared. More precisely, the suppression of cortical excitability of the left DLPFC led to a better sight‐reading performance in AP possessors, suggesting less interference and thus that at least some of the irrelevant tones became more suppressible for them. By contrast, the control participants showed no variation in the RT as a function of the matching condition. In line with previous Stroop and other interference studies on AP, , , , , , , , these results confirm that the pitch‐labeling process is largely nonsuppressible when triggered by tone exposure, interfering with the conflicting task of sight‐reading. The fact that our matched control musician sample without AP did not show this interference pattern further suggests that automaticity in the context of pitch labeling is unique for AP. The modulation of the task performance as a function of the tDCS stimulation of the left DLPFC indicated, at least to some extent, a causal relationship between this specific brain region and the pitch‐labeling process. However, other brain structures may likely be involved in this process as well or may perhaps even be indispensable in this regard. Correlative evidence strongly suggests that the PT is another crucial brain structure for AP. , , , , , , The PT is anatomically connected via the arcuate fasciculus with the DLPFC and, in AP possessors, the left PT functionally interacts with the left DLPFC already at rest. However, the PT was rather reported to specifically be responsible for early AP‐related encoding processes, such as categorical pitch perception. , , , But, its causal contribution to AP as a whole or to pitch labeling as a subprocess has not been established yet. A few lesion cases of AP possessors have been documented, but in their entirety so far, they have been inconclusive in this regard. A few patients were able to retain their AP ability after undergoing left or right temporal lobectomy. , , , Others either lost their AP ability or underwent a severe “sharp”‐aligned distortion after left or right hemispheric strokes. , In order to advance our understanding of the mechanisms and the involved brain structures driving AP, future studies using techniques in the field of neuromodulation should be conducted, particularly undertaking the PT next for a more systematic investigation on its causal impact on AP.

Pitch labeling: a case for the dorsal stream function

Previous studies on AP yielded results that are not readily reconcilable with the so‐called dual‐stream models of auditory cortical processing. These models propose that auditory cortical processing pathways are organized dually, namely ventrally (i.e., the ventral or “what” stream) and dorsally (i.e., the dorsal or “where” stream). , Whereas the ventral stream processes the identification of nonspatial auditory properties, the dorsal processing stream integrates spatial sensorimotor information, ultimately associating auditory properties with spatial codes and motor commands. , , In relation to AP, some research has identified the ventral processing stream within the temporal lobe, , while others have loosely assigned the dorsal stream function to the left DLPFC and related them to the pitch‐labeling process. , , , , In two previous studies from our research group, we provided evidence for a correlation between the pitch‐labeling performance and the functional and structural connectivity within the left dorsal pathway in AP possessors. , Here, we extended these findings, providing causal evidence of the left DLPFC on the pitch labeling. On this basis, we argue more strongly that the pitch‐labeling process underlies the dorsal stream function. Our first argument concerns the dorsal location of the assigned brain structure itself that not only drives the pitch‐labeling process in AP, as revealed by the present findings and previous studies, , , , but also bears an associative–integrative function in NAP musicians, nonmusicians, and even monkeys while learning or performing certain association tasks. , , , , , , Further in line with the dorsal stream function concerns the “where” dimension of pitch labeling. In the human mind, the linkage between space and pitch is profoundly incorporated, apparent not only in our musical notations based on a vertical mapping system but also in our usage of the word “height” to describe both space and pitch. Intercultural research revealed that pitch labels are internally represented in a systematic spatial order and that the availability of space–pitch mapping may even be of prelinguistic nature. , , , , In some rare cases (e.g., pitch–space synesthesia), this order may even reach explicitness, consisting of particularly unique pitch–location pairs. , Finally, in line with the dorsal stream function is the coactivation of motor commands during pitch labeling. There is evidence that AP possessors not only rely on verbal information during pitch labeling but also on sensorimotor codes (e.g., specific vocalization or fingering unambiguously coupled to specific tone responses). Consistent with this framework and our findings, AP possessors show an interference when vocally imitating mistuned tones and a stronger left hemispheric activation during the processing of auditory feedback for vocal motor control.

Conclusions

By applying a cathodal‐sham tDCS protocol, we provided, for the first time, causal evidence that the left DLPFC drives the pitch‐labeling process underlying AP. Furthermore, the findings yielded from our customized piano‐playing Stroop task support automaticity as a unique feature of AP, confirming a unique pitch‐processing mode virtually nonreliant on cognitive load. Altogether, these findings substantiate previous functional studies showing that AP possessors selectively recruit the left DLPFC during tone exposure and label tones without relying on WM resources, as discussed with reference to reduced or absent P3 responses and lack of activation in the IFG. , , , , , ,

Competing interests

The authors declare no competing interests.
  89 in total

Review 1.  The planum temporale as a computational hub.

Authors:  Timothy D Griffiths; Jason D Warren
Journal:  Trends Neurosci       Date:  2002-07       Impact factor: 13.837

2.  Reading music modifies spatial mapping in pianists.

Authors:  Lauren Stewart; Vincent Walsh; Uta Frith
Journal:  Percept Psychophys       Date:  2004-02

Review 3.  Half a century of research on the Stroop effect: an integrative review.

Authors:  C M MacLeod
Journal:  Psychol Bull       Date:  1991-03       Impact factor: 17.737

4.  Sound-colour synaesthesia: to what extent does it use cross-modal mechanisms common to us all?

Authors:  Jamie Ward; Brett Huckstep; Elias Tsakanikos
Journal:  Cortex       Date:  2006-02       Impact factor: 4.027

5.  Absolute pitch: an approach for identification of genetic and nongenetic components.

Authors:  S Baharloo; P A Johnston; S K Service; J Gitschier; N B Freimer
Journal:  Am J Hum Genet       Date:  1998-02       Impact factor: 11.025

6.  Intact absolute pitch ability after left temporal lobectomy.

Authors:  R J Zatorre
Journal:  Cortex       Date:  1989-12       Impact factor: 4.027

7.  The sound-free SMARC effect: The spatial-musical association of response codes using only sound imagery.

Authors:  Qi Jiang; Atsunori Ariga
Journal:  Psychon Bull Rev       Date:  2020-10

8.  Musical space synesthesia: automatic, explicit and conceptual connections between musical stimuli and space.

Authors:  Lilach Akiva-Kabiri; Omer Linkovski; Limor Gertner; Avishai Henik
Journal:  Conscious Cogn       Date:  2014-07-12

9.  The importance of the fibre tracts connecting the planum temporale in absolute pitch possessors.

Authors:  Anja Burkhard; Jürgen Hänggi; Stefan Elmer; Lutz Jäncke
Journal:  Neuroimage       Date:  2020-01-28       Impact factor: 6.556

10.  Anodal transcranial direct current stimulation of the prefrontal cortex enhances complex verbal associative thought.

Authors:  Carlo Cerruti; Gottfried Schlaug
Journal:  J Cogn Neurosci       Date:  2009-10       Impact factor: 3.225

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.