| Literature DB >> 25309356 |
Puiu F Balan1, Annelies Gerits1, Wim Vanduffel2.
Abstract
The exponential growth in publications represents a major challenge for researchers. Many scientific domains, including neuroscience, are not yet fully engaged in exploiting large bodies of publications. In this paper, we promote the idea to partially automate the processing of scientific documents, specifically using text mining (TM), to efficiently review big corpora of publications. The "cognitive advantage" given by TM is mainly related to the automatic extraction of relevant trends from corpora of literature, otherwise impossible to analyze in short periods of time. Specifically, the benefits of TM are increased speed, quality and reproducibility of text processing, boosted by rapid updates of the results. First, we selected a set of TM-tools that allow user-friendly approaches of the scientific literature, and which could serve as a guide for researchers willing to incorporate TM in their work. Second, we used these TM-tools to obtain basic insights into the relevant literature on cognitive rehabilitation (CR) and cognitive enhancement (CE) using transcranial magnetic stimulation (TMS). TM readily extracted the diversity of TMS applications in CR and CE from vast corpora of publications, automatically retrieving trends already described in published reviews. TMS emerged as one of the important non-invasive tools that can both improve cognitive and motor functions in numerous neurological diseases and induce modulations/enhancements of many fundamental brain functions. TM also revealed trends in big corpora of publications by extracting occurrence frequency and relationships of particular subtopics. Moreover, we showed that CR and CE share research topics, both aiming to increase the brain's capacity to process information, thus supporting their integration in a larger perspective. Methodologically, despite limitations of a simple user-friendly approach, TM served well the reviewing process.Entities:
Keywords: cognitive; enhancement; rehabilitation; text mining; transcranial magnetic stimulation
Year: 2014 PMID: 25309356 PMCID: PMC4176459 DOI: 10.3389/fnsys.2014.00182
Source DB: PubMed Journal: Front Syst Neurosci ISSN: 1662-5137
TM-tools II presented in the order (indicated by numbers) of their use.
Gray scale qualitatively codes their weights in TM (the darker, the higher the importance). Each subgroup (1–4) includes kernels of TM-tools (circled red) and “targeted processing” (red text) that have the highest importance for our study. INPUT: query terms (T) = query using combinations of terms; query sentences (S) − query using free text or questions; predefined = fixed user-predefined input consisting of lists of terms or publications; filters = tuning the search using supplementary terms, constrains regarding the type/part of the publication to be processed etc. OUTPUT: Statistic about authors, journals, papers per year, topics; corpus (C) = retrieving sets of relevant papers; terms (T) = extracting lists of frequent and relevant terms and their relationships; sentences (S) = extracting relevant sentences and paper summaries. Targeted processing = processing from the TM-tool repertoire used for this review. The last 7 columns show topics whose corpora of publications were studied with TM (gray scale codes the weight of the approach). References: MEDSUM (Bridges-Webb, .
Number of publications retrieved from PubMed using search queries defined previously.
| TMS | 8386 | 0.909 | 1018 |
| CR | 4220 | 0.896 | 914 |
| CE | 1235 | 0.895 | 312 |
| TMS-DIAG | 212 | 0.994 | 68 |
| TMS-RES | 360 | 0.994 | 93 |
| TMS-fMRI | 875 | 0.906 | 211 |
PBRK, the probability of correct ranking of a random positive-negative pair of publications determined with Medline Ranker.
Figure 1Qualitative approximation of a larger context for CR- and CE-TMS. Topics are represented by ellipses with CR-TMS (filled blue) and CE-TMS (red) representing intersections of topics.
Figure 2Number of publication per year (2014. Results for topics represented by small numbers of publications are multiplied by 10 (×10). We show also results for conjunctions of topics: (CR&CE-TMS) (A); (CR&CE) (B); (TMS-DIAG&RES) (C).
Figure 3Co-occurrence matrix built with PubMatrix. The matrix (B) represents the decimal logarithm of the number of publications retrieved from PubMed using queries combining all possible conjunctions of pairs of terms (e.g., TMS and neurorehabilitation), which label the lines and the rows of the matrix. Panel (A) represents the first line in the matrix, showing co-occurrences involving the term TMS. Different colors (see the left color-coding bar) represent different powers of 10. The color of the text marks terms associated dominantly with: TMS (black); CR-TMS (red); CE-TMS (green); TMS-RES (blue); TMS-DIAG (purple).
Figure 4Number of common publications (in decimal logarithmic scale) for pairs of corpora representing topics labeling the rows and the columns of the matrix. Different shades of gray (see the color-coding bar) represent different powers of 10.
KWIC examples for CR-TMS.
| Treatment | 160 | 63 | 97 | 25 | 11 | 11 | 16 | 0 | TMS | 5 | 17 | 53 | 12 | 10 |
| Depression | 113 | 37 | 76 | 8 | 11 | 18 | 0 | 0 | TMS | 0 | 22 | 13 | 14 | 27 |
| Alzheimer | 100 | 55 | 45 | 20 | 13 | 19 | 3 | 0 | TMS | 2 | 12 | 10 | 5 | 16 |
| Therapy | 98 | 58 | 40 | 25 | 4 | 3 | 26 | 0 | TMS | 5 | 3 | 6 | 22 | 4 |
| fMRI | 79 | 57 | 22 | 14 | 6 | 19 | 18 | 0 | TMS | 0 | 3 | 8 | 0 | 11 |
| Stroke | 50 | 35 | 15 | 9 | 5 | 11 | 10 | 0 | TMS | 0 | 4 | 3 | 4 | 4 |
| Disorder | 38 | 27 | 11 | 9 | 3 | 2 | 13 | 0 | TMS | 0 | 0 | 6 | 4 | 1 |
| Recovery | 34 | 28 | 6 | 15 | 6 | 4 | 3 | 0 | TMS | 0 | 2 | 0 | 0 | 4 |
| Schizophrenia | 15 | 12 | 3 | 5 | 1 | 2 | 4 | 0 | TMS | 0 | 1 | 0 | 0 | 2 |
| Parkinson | 14 | 6 | 8 | 2 | 4 | 0 | 0 | 0 | TMS | 0 | 5 | 1 | 1 | 1 |
| Improvement | 13 | 5 | 8 | 3 | 0 | 1 | 1 | 0 | TMS | 0 | 0 | 2 | 1 | 5 |
| Neurorehabilitation | 20 | 14 | 6 | 7 | 0 | 1 | 6 | 0 | TMS | 0 | 2 | 1 | 1 | 2 |
Columns show the number of co-occurrences of the context word (left) at different positions left/right (L5-L1/R1-R5) in the sentence relative to the query keyword (KW) and their total (LT/RT).
KWIC examples for CE-TMS.
| Induce | 29 | 6 | 23 | 2 | 1 | 3 | 0 | 0 | TMS | 3 | 15 | 1 | 2 | 2 |
| Performance | 24 | 12 | 12 | 5 | 4 | 3 | 0 | 0 | TMS | 0 | 1 | 7 | 2 | 2 |
| EEG | 18 | 5 | 13 | 2 | 0 | 1 | 2 | 0 | TMS | 0 | 11 | 1 | 1 | 0 |
| Paired-pulse | 17 | 16 | 1 | 0 | 1 | 2 | 0 | 13 | TMS | 0 | 0 | 0 | 0 | 1 |
| Cognitive | 16 | 4 | 12 | 2 | 0 | 2 | 0 | 0 | TMS | 1 | 2 | 6 | 2 | 1 |
| Memory | 13 | 9 | 4 | 2 | 3 | 3 | 1 | 0 | TMS | 0 | 1 | 1 | 2 | 0 |
| Stimulation | 13 | 5 | 8 | 1 | 2 | 0 | 2 | 0 | TMS | 2 | 0 | 3 | 2 | 1 |
| fMRI | 9 | 7 | 2 | 1 | 1 | 3 | 2 | 0 | TMS | 0 | 2 | 0 | 0 | 0 |
| Enhance | 8 | 3 | 5 | 1 | 1 | 1 | 0 | 0 | TMS | 1 | 2 | 0 | 2 | 0 |
| Facilitate | 7 | 1 | 6 | 1 | 0 | 0 | 0 | 0 | TMS | 1 | 1 | 0 | 3 | 1 |
| Improve | 6 | 1 | 5 | 0 | 0 | 1 | 0 | 0 | TMS | 1 | 3 | 0 | 0 | 1 |
| Perception | 5 | 3 | 2 | 0 | 2 | 1 | 0 | 0 | TMS | 0 | 0 | 0 | 0 | 2 |
Example sentences retrieved with BioRAT using a specific rule.
| rTMS | for | – | treatment | of | depression | “… the first cases report of using rTMS for the treatment of depression … ” |
| TMS | for | – | treatment | of | obsessive compulsive disorder | “… TMS for the treatment of obsessive compulsive disorder … ” |
“< >” delimitate a block of the rule; LOOKUP, looking for a term included in a class ([TMS], [CR-TMS-effects] or [mental disabilities]); MACRO WORD, any word; “?”, the block is optional.