Literature DB >> 34917714

Music genre neuroimaging dataset.

Tomoya Nakai1,2,3, Naoko Koide-Majima1,4, Shinji Nishimoto1,4,5.   

Abstract

This dataset includes functional magnetic resonance imaging (fMRI) data collected while five subjects extensively listened to 540 music pieces from 10 music genres over the course of 3 days. Behavioral data are also available. Data are separated into training and test samples to facilitate the application of machine learning algorithms. Test stimuli were repeated four times and can be used to evaluate the signal to noise ratio of brain activity. Using this dataset, both neuroimaging and machine learning researchers can test multiple algorithms regarding the prediction performance of brain activity induced by various music stimuli. Although two previous studies have used this dataset, there remains much room to apply different acoustic models. This dataset can contribute to integration of the fields of auditory neuroscience and machine learning.
© 2021 The Authors. Published by Elsevier Inc.

Entities:  

Keywords:  Brain activity; Machine learning; Music genre recognition; fMRI

Year:  2021        PMID: 34917714      PMCID: PMC8666334          DOI: 10.1016/j.dib.2021.107675

Source DB:  PubMed          Journal:  Data Brief        ISSN: 2352-3409


Specifications Table

Value of the Data

Brain activity data with music genre labels to allow for supervised machine learning of music information. Neuroimaging and machine learning researchers can test multiple algorithms regarding the prediction and decoding performances of brain activity induced by various music stimuli. The large amount of data for each participant allows researchers to construct participant-wise modeling of brain activity. The test dataset with multiple repetitions of identical stimuli can be used to evaluate the signal to noise ratio of brain activity. Behavioral assessment of a genre recognition task allows for a comparison of categorical judgments based on behavior and brain activity. Formatting in the Brain Imaging Data Structure (BIDS) allows researchers to efficiently process the dataset.

Data Description

All raw brain data are available in OpenNeuro (https://openneuro.org/datasets/ds003720). The brain data are organized based on the BIDS [1], which makes it easier for researchers to preprocess and analyze brain data. The dataset includes anatomical magnetic resonance imaging (MRI) and functional MRI (fMRI) data. The behavioral data, presentation software script, and stimuli preprocessing scripts are available in the Open Science Framework (OSF) (https://osf.io/2zwkt/). The original music stimuli are available at (http://marsyas.info/downloads/datasets.html). These original stimuli are preprocessed for the experiments, as described in the Materials and Methods section. Fig. 1 illustrates the experimental design and stimuli. Table 1 reports the age, sex, and number of fMRI runs and samples for each participant. Our experiment is based on a small-N design (five subjects). We obtained 3 hours of neuroimaging data for each participant, allowing researchers to construct participant-wise modeling of brain activity. Although this dataset has been used in two previous studies [2,3], there remains much room to apply different acoustic models. The original music stimuli have been widely used in previous music information retrieval studies (see [4] for a review). Therefore, various hand-crafted models and neural network models can be tested using this dataset.
Fig. 1

Experimental design. (A) Subjects passively listened to 540 music pieces from 10 music genres. (B) The dataset is separated into training and test samples. Training samples were measured while the sequence of training music clips was presented once. Test samples were measured while the sequence of test music clips was presented four times. Modified from [3].

Table 1

Age, sex, and number of fMRI runs and samples for each participant.

sub-001sub-002sub-003sub-004sub-005
Age3033233025
SexMFMMF
Music experience (years)121541210
Total fMRI runs1818181818
Training samples (volumes)48004800480048004800
Test samples (volumes) (with four repetitions)600600600600600
Experimental design. (A) Subjects passively listened to 540 music pieces from 10 music genres. (B) The dataset is separated into training and test samples. Training samples were measured while the sequence of training music clips was presented once. Test samples were measured while the sequence of test music clips was presented four times. Modified from [3]. Age, sex, and number of fMRI runs and samples for each participant.

Experimental Design, Materials and Methods

Subjects

Five healthy subjects took part in the MRI and behavioral experiments (referred to as sub-001 to sub-005; aged between 23 and 33 years old; 3 males; music experience between 4 and 15 years). The subjects were all right-handed, with normal hearing, normal or corrected to normal vision, and no history of psychological or neurological disorders. No participant had absolute pitch. A questionnaire was used to assess the number of years that subjects trained on their primary instrument; this was used as the index of musical experience.

Stimuli and task

We obtained the original 1,000 music pieces from the GTZAN music genre dataset (http://marsyas.info/downloads/datasets.html) [5]. This dataset consists of 10 music genres (blues, classical, country, disco, hip-hop, jazz, metal, pop, reggae, and rock). From the original music pieces, we randomly selected 54 music pieces (30 s, 22,050 Hz) from each music genre. A total of 540 music pieces were thus used in the current study. A 15 s music clip was randomly extracted from each music piece. We applied 2 s of fade-in and fade-out effects for each music clip. All music clips were normalized for their root mean square. The fMRI experiment consisted of 12 training runs and 6 test runs (18 runs in total). Each run lasted 10 min and consisted of 40 music clips. At the beginning of each run, 15 s of dummy images were acquired to allow for the equilibration of the MRI signal; this should be omitted from further analysis. During the dummy scanning, the last music clip of the previous run was presented in the training runs, while the last clip of the same run was presented in the test runs. A total of 480 music clips were used for the training runs, and the left-out 60 music clips were used for the test runs. In each test run, a set of 10 music clips (150 s in total) was presented four times in the same order. There was no repetition in the training runs. During scanning, subjects were instructed to fixate on a crosshair at the center of the screen and to listen to the music stimuli through MRI-compatible insert earphones (Model S14, Sensimetrics). This type of earphone has been widely used in previous MRI studies to attenuate scanner noise [7]. After each 10 min run, the subjects were asked to describe their physical condition. A 1–2 min break was allowed if they felt fatigued or sleepy. At the end of the experiment of each day, the subjects were asked whether they fell asleep during the fMRI experiment. According to their self-reports, no subjects slept during the experiments. The MRI experiment was performed over the course of three days, with six fMRI runs performed each day. Presentation software (Neurobehavioral Systems, Albany, CA, USA) was used to control the stimulus presentation and the collection of behavioral data. The reference anatomical scans were also obtained on the first day.

MRI data acquisition

We used a 3.0 T MRI scanner (TIM Trio; Siemens, Erlangen, Germany). A 32-channel head coil was used. All experiments were performed at the Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology (NICT), Osaka, Japan. For functional scanning, we scanned 68 interleaved axial slices with a thickness of 2.0 mm without a gap using a T2*-weighted gradient echo multi-band echo-planar imaging (MB-EPI) sequence [6] (repetition time (TR) = 1500 ms, echo time (TE) = 30 ms, flip angle (FA) = 62°, field of view (FOV) = 192 × 192 mm2, voxel size = 2 × 2 × 2 mm3, MB factor = 4). We obtained 410 volumes (615 s) for each run. We also used a magnetization prepared rapid acquisition gradient echo sequence (MPRAGE) to acquire high-resolution T1-weighted images of the whole brain (TR = 2530 ms, TE = 3.26 ms, FA = 9°, FOV = 256 × 256 mm2, voxel size = 1 × 1 × 1 mm3).

Behavioral experiment

To compare brain activation patterns and behavioral performance, we asked the subjects to participate in additional behavioral experiments. The behavioral experiments were conducted in a soundproof room. Subjects were first instructed to listen to three original music clips (30 s) for each music genre as a reference. These music clips were selected randomly from the 460 music pieces which were not used in the MRI experiment. The subjects were provided the information on the correct music genres in this training session. The subjects then listened to the 60 music clips which had been used as the test dataset in the MRI experiment. The subjects selected the music genre of the target music clip by filling in 1 of 10 cells on the answer sheet displayed on a laptop computer screen. The music clips were presented through closed headphones. Music clips were presented in the same order as in the MRI experiments. Subjects were allowed to listen to each music clip only once.

Ethics Statement

We obtained informed consent from all subjects before their participation. The current experiment was approved by the ethics and safety committees of the NICT in Osaka and Tokyo, Japan (protocol number: 194104).

CRediT Author Statement

Tomoya Nakai: Conceptualization, Methodology, Software, Investigation, Software, Visualization, Data curation, Formal analysis, Writing-Original draft; Naoko Koide-Majima: Conceptualization, Investigation, Visualization, Writing-Reviewing, and Editing; Shinji Nishimoto: Writing-Reviewing and Editing, Supervision, Funding acquisition.

Declaration of Competing Interest

The authors declare that they have no known competing financial interests or personal relationships that could have appeared to influence the work reported in this paper.
SubjectNeuroscience: Cognitive
Specific subject areaNeuroimaging of human subjects listening to music stimuli from 10 different genres
Type of dataImages
How data were acquired3T Siemens MRI (TIM Trio), 32-channel head coil. Presentation software
Data formatRaw
Parameters for data collectionAll subjects are right-handed, with normal hearing, normal or corrected to normal vision, and no history of psychological or neurological disorders.
Description of data collectionFive subjects underwent a 3-day MRI experiment and a one-day behavioral experiment. Subjects listened to 540 music pieces from 10 different music genres.
Data source locationInstitution: Center for Information and Neural Networks (CiNet), National Institute of Information and Communications Technology (NICT)City/Town/Region: OsakaCountry: Japan
Data accessibility(1) Repository name: OpenNeuro, Data identification number: 10.18112/openneuro.ds003720.v1.0.0, Direct URL to data: https://openneuro.org/datasets/ds003720/versions/1.0.0(2) Repository name: OSF, Data identification number: 10.17605/OSF.IO/2ZWKT, Direct URL to data: https://osf.io/2zwkt/
Related research article(1) Nakai, T., Koide‐Majima, N., & Nishimoto, S. (2021). Correspondence of categorical and feature‐based representations of music in the human brain. Brain and behavior11(1), e01936.(2) Nakai T, Koide-Majima N, Nishimoto S (2018) Encoding and decoding of music genre representations in the human brain. In: Proceedings of the IEEE International Conference on Systems, Man, and Cybernetics.
  4 in total

1.  Distinct Cortical Pathways for Music and Speech Revealed by Hypothesis-Free Voxel Decomposition.

Authors:  Nancy G Kanwisher; Josh H McDermott; Sam Norman-Haignere
Journal:  Neuron       Date:  2015-12-16       Impact factor: 17.173

2.  Multiband multislice GE-EPI at 7 tesla, with 16-fold acceleration using partial parallel imaging with application to high spatial and temporal whole-brain fMRI.

Authors:  Steen Moeller; Essa Yacoub; Cheryl A Olman; Edward Auerbach; John Strupp; Noam Harel; Kâmil Uğurbil
Journal:  Magn Reson Med       Date:  2010-05       Impact factor: 4.668

3.  The brain imaging data structure, a format for organizing and describing outputs of neuroimaging experiments.

Authors:  Krzysztof J Gorgolewski; Tibor Auer; Vince D Calhoun; R Cameron Craddock; Samir Das; Eugene P Duff; Guillaume Flandin; Satrajit S Ghosh; Tristan Glatard; Yaroslav O Halchenko; Daniel A Handwerker; Michael Hanke; David Keator; Xiangrui Li; Zachary Michael; Camille Maumet; B Nolan Nichols; Thomas E Nichols; John Pellman; Jean-Baptiste Poline; Ariel Rokem; Gunnar Schaefer; Vanessa Sochat; William Triplett; Jessica A Turner; Gaël Varoquaux; Russell A Poldrack
Journal:  Sci Data       Date:  2016-06-21       Impact factor: 6.444

4.  Correspondence of categorical and feature-based representations of music in the human brain.

Authors:  Tomoya Nakai; Naoko Koide-Majima; Shinji Nishimoto
Journal:  Brain Behav       Date:  2020-11-08       Impact factor: 2.708

  4 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.