| Literature DB >> 21927602 |
Francisco Pereira1, Greg Detre, Matthew Botvinick.
Abstract
Recent work has shown that it is possible to take brain images acquired during viewing of a scene and reconstruct an approximation of the scene from those images. Here we show that it is also possible to generate text about the mental content reflected in brain images. We began with images collected as participants read names of concrete items (e.g., "Apartment'') while also seeing line drawings of the item named. We built a model of the mental semantic representation of concrete concepts from text data and learned to map aspects of such representation to patterns of activation in the corresponding brain image. In order to validate this mapping, without accessing information about the items viewed for left-out individual brain images, we were able to generate from each one a collection of semantically pertinent words (e.g., "door," "window" for "Apartment''). Furthermore, we show that the ability to generate such words allows us to perform a classification task and thus validate our method quantitatively.Entities:
Keywords: classification; fMRI; multivariate; semantic categories; topic models
Year: 2011 PMID: 21927602 PMCID: PMC3159951 DOI: 10.3389/fnhum.2011.00072
Source DB: PubMed Journal: Front Hum Neurosci ISSN: 1662-5161 Impact factor: 3.169
Figure 1The approach we follow to generate text (bottom) parallels that used in Naselaris et al. (.
The top 10 most probable words according to each topic in the 40 topic model used in Figure .
| Topic | Top 10 words |
|---|---|
| 1 | Plant fruit seed grow leaf flower tree sugar produce species |
| 2 | Color green light red white blue skin pigment black eye |
| 3 | Light drink lamp wine beer bottle water produce valve pipe |
| 4 | Drug chemical acid opium cocaine alcohol substance produce form reaction |
| 5 | School university student child education college degree state train Unite |
| 6 | Animal species cat wolf breed hunt dog male wild human |
| 7 | Water metal form temperature carbon process air element iron salt |
| 8 | Vehicle wheel gear car aircraft passenger speed drive truck design |
| 9 | Market party state country price government political trade people economic |
| 10 | Water ice rock river surface form sea ocean wind soil |
| 11 | Species bird egg fish insect female ant live feed bee |
| 12 | Language book write art century form story character word publish |
| 13 | War military force army weapon service submarine soviet world train |
| 14 | Blood cause disease patient treatment infection health risk increase pain |
| 15 | Church bishop pope catholic priest roman soap cardinal religious time |
| 16 | Cell muscle body brain form tissue human organism bone animal |
| 17 | Ship fish boat water vessel sail design build ski bridge |
| 18 | Iron blade steel handle head cut hair metal tool nail |
| 19 | Film image camera digital shotgun movie lens magazine rifle gun |
| 20 | Wear horse woman clothe saddle century dress fashion ride trail |
| 21 | Law state court legal police crime person act Unite criminal |
| 22 | Smoke chocolate light tobacco sign speed cigaret cigar state traffic |
| 23 | key lock switch machine needle tube bicycle type knit design |
| 24 | Card record information service company product datum process program credit |
| 25 | State cross head salute plate model symbol portrait scale circus |
| 26 | Love sexual god woman people pyramid death sex religion evil |
| 27 | Coin gold silver issue currency stamp state dollar value bank |
| 28 | Game play player ball team sport rule football hit league |
| 29 | Fuel engine gas energy power oil hydrogen heat rocket produce |
| 30 | Woman marriage god word christian child term jesus family gender |
| 31 | Fiber sheep wool cotton fabric weave hamlet pig produce silk |
| 32 | City build house store street town state home road bus |
| 33 | Tea tooth pearl kite shoe culture wear tattoo jewelry form |
| 34 | Earth sun star planet moon solar time orbit day comet |
| 35 | Material wood paint build wall structure construction design size window |
| 36 | Human social study people culture theory individual nature behavior term |
| 37 | Power station train signal line locomotive radio steam electric frequency |
| 38 | Food diamond cook meat bread coffee sauce chicken kitchen eat |
| 39 | Measure scale angle [formula theory object unit energy line property |
| 40 | Music instrument play string band bass sound note player guitar |
Figure 2(A) Topic probabilities for the Wikipedia articles about the 60 concepts for which we have fMRI data. Each concept belongs to one of 12 semantic categories, and concepts are grouped by category (five animals, five insects, etc.). Note that the category structure visible is due to how we sorted the columns for display; the model is trained in an unsupervised manner and knows nothing about category structure. Note also that there are topics that are not probable for any of the concepts, which happens because they are used for other concepts in the 3500 concept corpus. Below this are the top 10 most probable words in the probability distributions associated with three of the topics. (B) The decomposition of the brain image for “House” into a weighted combination of topic basis images. The weights allow us to combine the corresponding topic word distributions into an overall word distribution (top 10 words shown).
Figure 3The inset under each article shows the top words from the corresponding brain-derived distribution [10 which are present in the article (black) and 10 which are not (gray)]. Each word of the two articles is colored to reflect the ratio papartment (word)/phammer(word) between the probabilities assigned to it by the brain-derived distributions for concepts “apartment” and “hammer” (red means higher probability under “apartment,” blue under “hammer,” gray means the word is not considered by the text model).
Figure 4Classification procedure for the “apartment” and “hammer” example images.
Average number of voxels selected from each cortical AAL ROI over all leave-two-out folds.
| AAL ROI | P1 | P2 | P3 | P4 | P5 | P6 | P7 | P8 | P9 |
|---|---|---|---|---|---|---|---|---|---|
| Angular_L | 3 | 3 | 7 | 2 | 2 | 7 | 8 | 6 | 1 |
| Angular_R | 2 | 5 | 7 | 2 | 3 | 4 | 3 | 11 | 15 |
| Calcarine_L | 47 | 56 | 41 | 34 | 63 | 29 | 42 | 53 | 19 |
| Calcarine_R | 38 | 47 | 65 | 50 | 33 | 27 | 25 | 32 | 27 |
| Caudate_L | 0 | 0 | 0 | 0 | 1 | 7 | 7 | 1 | 4 |
| Caudate_R | 0 | 0 | 0 | 0 | 0 | 7 | 5 | 0 | 4 |
| Cingulum_ant_L | 0 | 4 | 0 | 0 | 7 | 4 | 1 | 4 | 2 |
| Cingulum_ant_R | 0 | 0 | 0 | 2 | 1 | 2 | 0 | 0 | 1 |
| Cingulum_mid_L | 4 | 2 | 2 | 3 | 6 | 2 | 2 | 3 | 6 |
| Cingulum_mid_R | 0 | 3 | 1 | 0 | 4 | 4 | 4 | 1 | 5 |
| Cuneus_L | 0 | 6 | 9 | 26 | 9 | 6 | 7 | 7 | 11 |
| Cuneus_R | 17 | 20 | 23 | 18 | 9 | 22 | 5 | 10 | 14 |
| Frontal_inf_oper_L | 7 | 6 | 3 | 15 | 0 | 6 | 9 | 5 | 1 |
| Frontal_inf_oper_R | 0 | 2 | 1 | 4 | 1 | 7 | 2 | 6 | 10 |
| Frontal_inf_orb_L | 2 | 5 | 1 | 3 | 1 | 9 | 5 | 4 | 8 |
| Frontal_inf_orb_R | 0 | 3 | 1 | 2 | 11 | 3 | 1 | 1 | 4 |
| Frontal_inf_tri_L | 2 | 22 | 16 | 23 | 9 | 3 | 9 | 6 | 7 |
| Frontal_inf_tri_R | 0 | 4 | 1 | 12 | 14 | 19 | 6 | 2 | 12 |
| Frontal_mid_L | 3 | 4 | 7 | 3 | 16 | 23 | 13 | 21 | 13 |
| Frontal_mid_R | 1 | 1 | 11 | 5 | 9 | 21 | 7 | 2 | 12 |
| Frontal_sup_L | 3 | 15 | 6 | 15 | 15 | 20 | 7 | 14 | 12 |
| Frontal_sup_medial_L | 1 | 4 | 0 | 0 | 12 | 8 | 4 | 7 | 16 |
| Frontal_sup_medial_R | 0 | 1 | 1 | 1 | 3 | 15 | 1 | 4 | 10 |
| Frontal_sup_R | 0 | 0 | 5 | 16 | 17 | 15 | 4 | 2 | 11 |
| Fusiform_L | 74 | 50 | 38 | 39 | 50 | 24 | 46 | 35 | 39 |
| Fusiform_R | 94 | 40 | 46 | 44 | 57 | 53 | 54 | 25 | 28 |
| Hippocampus_L | 0 | 0 | 1 | 0 | 3 | 7 | 9 | 0 | 0 |
| Hippocampus_R | 0 | 3 | 2 | 1 | 2 | 1 | 10 | 3 | 0 |
| Lingual_L | 29 | 30 | 32 | 33 | 34 | 36 | 33 | 38 | 9 |
| Lingual_R | 24 | 51 | 36 | 24 | 40 | 28 | 40 | 46 | 17 |
| Occipital_inf_L | 36 | 39 | 27 | 9 | 24 | 24 | 16 | 34 | 22 |
| Occipital_inf_R | 11 | 13 | 22 | 5 | 16 | 30 | 15 | 8 | 21 |
| Occipital_mid_L | 130 | 67 | 118 | 63 | 67 | 72 | 57 | 94 | 68 |
| Occipital_mid_R | 73 | 58 | 54 | 42 | 22 | 17 | 54 | 34 | 37 |
| Occipital_sup_L | 15 | 9 | 7 | 49 | 19 | 7 | 12 | 14 | 32 |
| Occipital_sup_R | 29 | 23 | 46 | 15 | 17 | 27 | 19 | 20 | 31 |
| Parahippocampal_L | 2 | 2 | 1 | 1 | 1 | 6 | 2 | 6 | 1 |
| Parahippocampal_R | 4 | 1 | 4 | 1 | 12 | 6 | 7 | 2 | 0 |
| Parietal_inf_L | 21 | 23 | 27 | 37 | 13 | 18 | 27 | 25 | 28 |
| Parietal_inf_R | 2 | 1 | 0 | 12 | 4 | 2 | 3 | 8 | 8 |
| Parietal_sup_L | 19 | 25 | 4 | 19 | 21 | 15 | 12 | 26 | 13 |
| Parietal_sup_R | 17 | 36 | 6 | 19 | 8 | 2 | 12 | 18 | 28 |
| Postcentral_L | 30 | 8 | 39 | 20 | 6 | 17 | 17 | 10 | 4 |
| Postcentral_R | 0 | 3 | 4 | 6 | 3 | 15 | 4 | 0 | 14 |
| Precentral_L | 6 | 21 | 3 | 46 | 7 | 8 | 33 | 18 | 7 |
| Precentral_R | 0 | 1 | 3 | 14 | 2 | 13 | 8 | 1 | 8 |
| Precuneus_L | 1 | 17 | 53 | 9 | 23 | 7 | 18 | 10 | 21 |
| Precuneus_R | 21 | 12 | 27 | 27 | 20 | 3 | 16 | 2 | 16 |
| Supp_motor_Area_L | 3 | 1 | 0 | 37 | 3 | 9 | 2 | 4 | 0 |
| Supp_motor_Area_R | 0 | 5 | 2 | 10 | 2 | 12 | 4 | 4 | 2 |
| Supramarginal_L | 9 | 4 | 12 | 34 | 2 | 8 | 11 | 23 | 10 |
| Supramarginal_R | 0 | 6 | 0 | 14 | 0 | 9 | 5 | 3 | 4 |
| Temporal_inf_L | 13 | 18 | 5 | 5 | 10 | 5 | 12 | 27 | 11 |
| Temporal_inf_R | 41 | 16 | 10 | 5 | 19 | 10 | 10 | 10 | 21 |
| Temporal_mid_L | 29 | 29 | 12 | 7 | 30 | 30 | 31 | 39 | 28 |
| Temporal_mid_R | 24 | 21 | 9 | 8 | 37 | 18 | 14 | 13 | 17 |
| Temporal_sup_L | 0 | 1 | 2 | 4 | 5 | 7 | 14 | 17 | 14 |
| Temporal_sup_R | 0 | 3 | 1 | 1 | 3 | 6 | 9 | 12 | 3 |
| Not_labeled | 50 | 67 | 65 | 65 | 50 | 73 | 80 | 94 | 65 |
Figure 5Average classification accuracy across models using 10 to 100 topics, for each of 9 subjects (chance level is 0.5); the accuracy is broken down into classification of concept pairs where concepts are in different categories (“Between”) and pairs where the category is the same (“Within”). Error bars are across numbers of topics, chance level is 0.5.
Figure 6. Right: Similarity between the topic probability representations predicted from the brain images for each pair of concepts, when they were being used as the test set, for the two best subjects with the highest classification accuracy. These were obtained using a 40 topic model, the general pattern is similar for the other subjects. Note that the representations for concepts in the same category are similar when obtained from brain images and that this is also the case when those representations are derived from text.