| Literature DB >> 35479111 |
Alessio Giacomel1, Daniel Martins1, Matteo Frigo2,3, Federico Turkheimer1, Steven C R Williams1, Ottavia Dipasquale1, Mattia Veronese1,4.
Abstract
The integration of neuroimaging and transcriptomics data, Imaging Transcriptomics, is becoming increasingly popular but standardized workflows for its implementation are still lacking. We describe the Imaging Transcriptomics toolbox, a new package that implements a full imaging transcriptomics pipeline using a user-friendly, command line interface. This toolbox allows the user to identify patterns of gene expression which correlates with a specific neuroimaging phenotype and perform gene set enrichment analyses to inform the biological interpretation of the findings using up-to-date methods. For complete details on the use and execution of this protocol, please refer to Martins et al. (2021).Entities:
Keywords: Bioinformatics; Computer sciences; Health Sciences; Neuroscience; Sequence analysis
Mesh:
Year: 2022 PMID: 35479111 PMCID: PMC9036395 DOI: 10.1016/j.xpro.2022.101315
Source DB: PubMed Journal: STAR Protoc ISSN: 2666-1667
Figure 1Example of the structure of an output folder
The folder includes tabular files with the results of both the correlation analysis and the GSEA analysis, plots for the variance explained by each component in case of a PLS analysis and enrichment plots for the results from the GSEA analysis.
Figure 3Example of tabular file containing the results of gene ranking according to alignment with neuroimaging phenotype
The toolbox outputs a tabular file containing: i) gene ID, ranked according to strength of correlation; ii) z-score of gene weight in PLS component (or Spearman’s coefficient in case of mass-univariate correlation analysis); and iii) uncorrected and FDR corrected p values for each gene.
Figure 2Example of variance plots produced in case of a PLS analysis
(A) Cumulative variance explained by different PLS models with increasing number of components; (B) Individual variance explained by each of the first 15 components.
Figure 4Example of tabular file with the results from the GSEA analysis
The tabular file contains data about the enrichment score (ES), normalized enrichment score (NES), uncorrected p value (p_val), FDR corrected p value (fdr), number of genes in the gene set term (geneset_size), number of matched genes from the correlation results (matched_size), label of the matched genes (matched_genes) and ledge genes (ledge_genes) for each of the terms included in a certain gene set.
Figure 5Example of an enrichment plot from the GSEA analysis
The analysis produces a plot for each term of the gene set used. The top portion of the plot shows the running enrichment score (ES) for the gene set as the analysis walks down the ranked list. The score at the peak of the plot (the score furthest from 0.0) is the ES for the gene set. The middle portion of the plot shows where the members of the gene set appear in the ranked list of genes. The bottom portion of the plot shows the value of the ranking metric as you move down the list of ranked genes. The ranking metric measures a gene’s correlation with a phenotype. The value of the ranking metric goes from positive to negative as you move down the ranked list.
| REAGENT or RESOURCE | SOURCE | IDENTIFIER |
|---|---|---|
| Gene expression data from human brain tissues | Allen Human Brain Atlas (AHBA) | |
| Single cell RNA-seq data | ( | NA |
| MATLAB | ( | |
| FMRIB Software Library (FSL) | ( | |
| Anaconda | Anaconda Inc. | |
| Abagen toolbox | ( | |
| Alleninf | ( | |
| ENIGMA Toolbox | ( | |
| Netneurotools python library | Network Neuroscience Lab, Brain Imaging Centre, McGill University | |
| Gseapy python library | ( | |
| The Imaging Transcriptomics Toolbox | ( | |
| PET template | ( | |