Literature DB >> 32392297

BIOMEX: an interactive workflow for (single cell) omics data interpretation and visualization.

Federico Taverna¹, Jermaine Goveia¹, Tobias K Karakach¹, Shawez Khan¹, Katerina Rohlenova¹, Lucas Treps¹, Abhishek Subramanian¹, Luc Schoonjans¹, Mieke Dewerchin¹, Guy Eelen¹, Peter Carmeliet¹.

Abstract

The amount of biological data, generated with (single cell) omics technologies, is rapidly increasing, thereby exacerbating bottlenecks in the data analysis and interpretation of omics experiments. Data mining platforms that facilitate non-bioinformatician experimental scientists to analyze a wide range of experimental designs and data types can alleviate such bottlenecks, aiding in the exploration of (newly generated or publicly available) omics datasets. Here, we present BIOMEX, a browser-based software, designed to facilitate the Biological Interpretation Of Multi-omics EXperiments by bench scientists. BIOMEX integrates state-of-the-art statistical tools and field-tested algorithms into a flexible but well-defined workflow that accommodates metabolomics, transcriptomics, proteomics, mass cytometry and single cell data from different platforms and organisms. The BIOMEX workflow is accompanied by a manual and video tutorials that provide the necessary background to navigate the interface and get acquainted with the employed methods. BIOMEX guides the user through omics-tailored analyses, such as data pretreatment and normalization, dimensionality reduction, differential and enrichment analysis, pathway mapping, clustering, marker analysis, trajectory inference, meta-analysis and others. BIOMEX is fully interactive, allowing users to easily change parameters and generate customized plots exportable as high-quality publication-ready figures. BIOMEX is open source and freely available at https://www.vibcancer.be/software-tools/biomex.

Entities: Chemical Disease Gene Species

Year: 2020 PMID： 32392297 PMCID： PMC7319461 DOI： 10.1093/nar/gkaa332

Source DB: PubMed Journal: Nucleic Acids Res ISSN： 0305-1048 Impact factor: 16.971

INTRODUCTION

The recent growth of unbiased high-throughput sequencing and profiling technologies has revolutionized the generation and analysis of biological data (1). The commoditization, exponential growth and increased throughput of these technologies (2) has helped the community to develop breakthroughs in bioanalytical research. For example, using next generation sequencing technologies, it is possible to analyze whole genome and transcriptome sequences within an hour (3). In addition, using mass spectrometry, thousands of proteins and metabolites can be measured simultaneously (4,5). Until recently, traditional profiling methods could be applied only to ‘bulk’ samples homogenized from whole tissue or organ extracts. With the advent of single cell genomics, transcriptomics and proteomics profiling technologies, individual cell contents can now be measured (6) allowing the characteristics of individual cells to be studied (7,8), further increasing the volume of data to analyze. Availability of such large and complex datasets introduces multiple challenges. Computational challenges relate to the handling, processing and analysis of the data; new bioinformatics tools are continuously developed in tandem with technological advances. Biological challenges stem from the need to understand the biological significance of the information in the data and require in-depth knowledge of the biological question. Consequently, detailed biological interpretation of omics data requires a synthesis of domain-knowledge and computational skills, which continues to inspire interdisciplinary projects between researchers with complementary expertise. Various tools exist to meet the challenges related to analyzing omics data, and can be loosely defined as workflow tools (e.g. Galaxy (9), Taverna (10)), pre-processing tools (e.g. XCMS Online (11), MaxQuant (12)), specialized tools and more broad data analysis platforms. Easy-to-use data analysis platforms that allow experimental scientists to autonomously analyze omics data have been highly successful as solutions to bridge the gap between data generation and interpretation (e.g. Perseus (13), EXPANDER (14), InstantClue (15), MetaboAnalyst (16)). However, currently available data analysis platforms mostly focus on analyses of bulk omics data types, and in many cases they are not tailored to support the pretreatment and analysis of a wide range of omics experiments within the same interactive framework (e.g. RNA-sequencing, gene expression microarrays, metabolomics, proteomics), making the unified analysis of all these varieties of omics data challenging. Moreover, these tools do not scale, nor provide a structured data mining workflow to explore single cell datasets (e.g. single cell RNA-sequencing and mass cytometry). Here we present BIOMEX, a data mining software developed for the Biological Interpretation Of Multi-omics EXperiments. BIOMEX integrates a range of publicly available algorithms and field-tested data analysis approaches into a well-defined and guided stepwise workflow that accommodates a wide variety of experimental designs and multi-omics data, including metabolomics, transcriptomics, proteomics, single cell RNA-sequencing and mass cytometry experiments (Figure 1). The software is capable of handling large-scale data such as single cell omics experiments, scaling from tens to hundreds of thousands of cells.

Figure 1.

The BIOMEX workflow. The workflow guides the user through distinct analysis steps. The data (and metadata) need to be uploaded, and each uploaded dataset must be annotated with the relevant information (i.e. the omics data type, technology, feature identifier, etc.). In the processing step, the data is cleaned and consistency checks are performed to verify that the data was uploaded in the correct format. After the feature identifiers are mapped to feature names, the data is filtered and normalized in the pretreatment step. Depending on the omics data type, the data can also be imputed or batch corrected. Once the data is pretreated, it is ready for downstream analysis. The different analyses available in BIOMEX can be divided into five categories: (i) quantification analyses quantify the abundance level of the features in the data (and metadata); (ii) exploratory analyses assist in understanding the underlying structure of the data; (iii) pairwise analyses reveal the functional differences between groups; (iv) meta-analysis combines results from different studies in a singular, unique and robust result and (v) auxiliary analyses (e.g. machine learning and survival analysis). Ultimately, all the analyses can be saved in a self-contained folder that can be shared between scientists and results can be customized and exported either as tables or high quality publication-ready figures. Abbreviations: TP, true positive, FN, false negative, TN, true negative, FP, false positive. Note: The term ‘features’ is used to indicate genes, metabolites and proteins.

OVERVIEW

Functional requirements and design rationale

BIOMEX is designed to allow non-bioinformatician experimental scientists to perform interactive data mining of bulk and single cell omics datasets. We therefore defined the following functional requirements for the software: To accommodate multi-omics data across select biological species and computational platforms. To allow users to interactively analyze complex experimental designs using state-of-the-art and field-tested algorithms. To facilitate the re-use of publicly available data. To provide a flexible, well-defined data analysis workflow. To provide self-contained, non-technical background information to aid the meaningful use of each analysis module. To enable the generation of highly-customizable publication-ready plots and figures. BIOMEX is implemented in the open source R programming language (https://cran.r-project.org/); the majority of algorithms required for biological data mining are available through open source R packages. BIOMEX integrates these algorithms and packages into a workflow, in which parameters can be interactively tuned using the Shiny web framework (https://shiny.rstudio.com/, the full list of packages used in BIOMEX is available in Supplementary Table 1) (18). Together, BIOMEX creates a workflow that allows users to iteratively fine-tune complex analyses in order to facilitate detailed biological interpretation through interactive visualizations.

Manual and video tutorials

A comprehensive web manual that describes all functionalities, data formats, parameters and analyses related to the workflow is provided within the BIOMEX software. The manual guides the user through the step-by-step procedure required to execute the workflow and is complemented with video tutorials that introduce users to all interface elements and software functionalities.

DATA IMPORT: DATA AND METADATA MATRIX

BIOMEX requires two files for each experiment, the data and metadata matrix.

Data matrix

The data matrix contains typical omics (i.e. transcriptomics, metabolomics, proteomics, mass cytometry) output in a text (.txt) or comma separated values (.csv) format. The data file is organized such that the first column contains feature identifiers (i.e. genes, metabolites, proteins), while the first row contains descriptors (i.e. sample or cell IDs). The data matrix can be uploaded as unprocessed gene expression values (raw read counts, unique molecular identifier counts for single cell RNA-sequencing, non-log transformed intensities for microarrays) or absolute abundances for metabolomics and proteomics data. Alternatively, BIOMEX accepts preprocessed data (e.g. filtered and normalized, batch corrected, etc.). BIOMEX automatically checks the uniformity and compatibility of the data, while also dealing with irrelevant (empty) observations and features. For transcriptomics and proteomics, the feature (gene or protein) identifiers are mapped to feature names to allow downstream interpretation of the results and further analyses.

Metadata matrix

The second required file is the metadata file (.txt or .csv format) that contains all the auxiliary information about the experimental design (variables). The metadata file is organized such that the first column contains descriptors matching with the data file, while the first row contains the variables (e.g. factors, numeric, etc.). This file can be modified interactively within the software.

Example datasets

We provide example datasets (the data and metadata matrices) that are readily available to be uploaded into the software. These example datasets encompass all the omics data types supported by BIOMEX, and they include case studies (described below) available directly from the software via the ‘Case studies’ section.

DATA ANALYSIS: ANALYSIS MODULES

The BIOMEX workflow contains 10 data analysis modules, which are briefly described below.

Module 1: Data pretreatment

After data upload, the data is filtered to remove low quality features, normalized, and, if necessary, corrected for unwanted technical variation (e.g. batch effects) (19–22). Algorithms for quality filtering, normalization and regression are often omics-type specific: BIOMEX automatically suggests applicable algorithms and parameter settings depending on the type of data being analyzed. For example, single cell RNA-sequencing data can be corrected for batch effects by using the mutual nearest neighbor (MNN) method (22). The output of this module is a clean/corrected data matrix that can be used for downstream analysis.

Module 2: Feature engineering

Complementing unbiased and automated methods, domain knowledge can be used to craft new features from the existing features in the data to estimate biological variation (e.g. from gene expression to pathway activity). During this (optional) step, BIOMEX employs gene set variation analysis (23) (GSVA) to convert the features-by-observations data matrix (output of module 1) into an engineered sets-by-observations data matrix. The newly created engineered data can then be used to perform downstream analysis, including differential analysis. BIOMEX includes the KEGG sets by default, but users can also upload custom sets. Alternatively, feature engineered data, created with independent methods, can be directly uploaded to BIOMEX and subsequently used in downstream analysis.

Module 3: Visualization of trends

Feature magnitudes and trends are visualized in bar plots, box plots, violin plots and density kernel estimation plots. These plots are grouped based on the information present in the metadata. The uploaded metadata can also be explored through pie charts and horizontal bar plots.

Module 4: Unsupervised analyses

Unsupervised analysis aims to unbiasedly detect patterns in the data. For dimensionality reduction and visualization, BIOMEX includes Principal Component Analysis (24) (PCA, flashPCA package (25)), t-distributed Stochastic Neighbor Embedding (26) (t-SNE, Rtsne package) and Uniform Manifold Approximation and Projection (27) (UMAP, umap package). Also, BIOMEX supports K-means, hierarchical and graph-based clustering (Seurat (21) and FlowSOM (28) packages). The output of hierarchical clustering can be visualized via dendrograms, and the associated uncertainty can be assessed using multi-scale bootstrap resampling (pvclust package (29)). BIOMEX provides interactive heatmaps (heatmaply package (30)) to visualize inherent associations between groups or clusters.

Module 5: Supervised analyses

Supervised pairwise analyses are used to explore quantitative differences in expression or abundance levels between groups (differential analysis). BIOMEX uses linear models (limma and MAST packages (31,32)) to describe the relationship between expression levels of features between two groups. This enables handling of complex experimental designs, and allows including covariates in the modeling process. The magnitude of differential expression (log fold change) and the P-values are provided for each feature, together with the false discovery rate adjusted P-values calculated with the Benjamini-Hochberg method (33). Volcano plots are used to visually represent the differential analysis results. As an extension of pair-wise differential analysis, BIOMEX includes marker analysis that can be used to detect key discriminating features between multiple groups (or clusters in single cell data). This analysis consists of a two-step intra-dataset meta-analysis approach. First, BIOMEX performs a differential analysis for each group against all the other groups separately and filters out features that are not consistently differentially expressed (34). Subsequently, marker features are ranked using a product-based meta-analysis (median-, sum- or P-value-based meta-analysis can be used to rank features) (35). Functional analysis of omics data can be performed in BIOMEX using several tools. These include Gene Set Enrichment Analysis (36) (GSEA, clusterProfiler (37) package) used to perform the competitive set enrichment analysis, and rotation gene set tests (ROAST) (38) to perform self-contained set enrichment analysis. Results of such analyses can be either displayed as a waterfall plot or a horizontal bar plot. These analyses provide functional information regarding, for example, pathways or biological processes that may be deregulated in a select set of conditions. The default setting in the software is to use the (metabolic) KEGG pathway sets (39), but this can be changed by the user to include other pathways or biological processes. BIOMEX uses the KEGG pathways to map features in the data (genes, proteins, metabolites) to well defined and constructed pathways using the pathview (40) package. The pathway visualizations are interactive and can be customized by the user to incorporate a priori biological insight (e.g. irrelevant isoforms can be manually excluded).

Module 6: Single cell specific algorithms

Single cell data can be used to infer differentiation trajectories using computational methods. BIOMEX includes Monocle (41) and SCORPIUS (42) to infer branched and linear cell trajectories, respectively. BIOMEX also uses locally estimated scatterplot smoothing (LOESS) regression to subsequently model the dynamic behavior of features in pseudotime. As a second single cell-specific approach, BIOMEX includes scmap (43) to project cluster identities from a reference dataset to another non-clustered dataset by calculating the similarities between cells of the non-clustered dataset and the cluster centroids in the reference dataset.

Module 7: Survival analysis

Survival analysis is implemented in BIOMEX in order to link omics data to a disease outcome. For example, using The Cancer Genome Atlas (TCGA) (44) and other resources, it is possible to infer the effect of deregulation of a given gene to a treatment outcome or patient survival. BIOMEX uses the Kaplan–Meier (45) test to generate the survival functions, and the logrank test (46) to assess the significance of those survival functions (survival package).

Module 8: Machine learning

Machine learning is a set of approaches that can model the relationship between a set of variables (features) and instances (observations) based on a given training dataset. BIOMEX includes the ranger (47) implementation of the random forest model to perform classification and regression tasks (48). Recursive feature elimination (RFE) (49) is used as the feature selection method of choice to select the most predictive features. The machine learning pipeline is based on the caret package and includes cross-validation strategies to assess the predictive performance of the model (50).

Module 9: Meta-analysis of bulk omics data

Integrative data analysis approaches have been successfully used to analyze multiple datasets simultaneously to compare the results of independent experiments (51,52). With the availability of added-value databases, publicly available preprocessed data can be easily accessed by scientists and used to perform meta-analyses. In a meta-analysis, (i) a pair-wise differential analysis is performed for each dataset independently; (ii) we rank the features in each dataset by a metric (e.g. fold change) and (iii) we combine the rank numbers for all features using a product-based (or median-, sum- and P-value-based) meta-analysis approach. As a result, we obtain a ranked list of features, which are consistently differentially expressed across all the selected comparisons (i.e. differential analyses) in different datasets. The results can be visually explored through violin plots.

Module 10: Single cell meta-analysis

Meta-analysis can also be performed by measuring the similarity between clusters (53). This analysis, developed specifically for single cell omics data, assesses the conservation of cell phenotypes between different tissues, organs, studies, conditions, etc. BIOMEX performs the cluster similarity analysis by combining the results obtained during the marker set analysis. Similarity between the clusters present in the marker set results are calculated using the pairwise Jaccard similarity coefficients (54) for all clusters against all other clusters. The output of this analysis is a similarity score matrix, which describes quantitatively how each cluster is similar to other clusters. PCA is applied to the pairwise Jaccard similarity coefficient matrix to visually represent the similarity between clusters.

DATA EXPORT: PLOTS AND TABLES

All the plots and tables (plotly, ggplot2 (55), DT packages) can be fully customized and exported in a variety of high quality formats (e.g. vectorized image format). BIOMEX saves all parameters and results in a self-contained folder, which can be shared between users and loaded into BIOMEX, improving the reproducibility of analyses.

CASE STUDIES

To showcase the analysis modules implemented in BIOMEX, we provide two case studies. A step-by-step tutorial on how to reproduce the results obtained in both case studies is available in the manual, which includes all the parameters used to perform the analyses and to generate the plots.

Bulk data: exploration of the TCGA cholangiocarcinoma dataset

To provide an illustrative example on how bulk data can be analyzed, we explored a publicly available TCGA dataset (TCGA-CHOL) on cholangiocarcinoma (CCA), a cancer from the bile duct that represents the second most commonly diagnosed primary liver tumor (56). Even when diagnosed at an early stage, CCA is a very aggressive malignancy with poor patient outcome and limited treatment opportunities (56). According to their anatomical location, CCAs are classified as intrahepatic, hilar-perihilar and distal, which represent respectively 88.2%, 5.9% and 5.9% of the patients in this analysis (Figure 2A). Dimensionality reduction (PCA) and correlation heatmap analyses showed that biopsies from normal tissue have a clearly distinct transcriptomic signature compared to intrahepatic CCA resections (Figure 2B, C). Although there is a strong inter tumor sample heterogeneity between patients (Figure 2C), we aimed at determining transcriptomic similarities that could be involved in overall CCA pathogenesis. Enrichment analysis of normal versus intrahepatic CCA samples indicated that cell cycle and extracellular matrix (ECM)-receptor interaction gene sets were the most upregulated (Figure 2D). Consistently, differential gene expression analysis showed that several key mitotic checkpoints (e.g. CDK1, E2F1, CDC45, SFN) and mitotic spindle assembly/control genes (e.g. CDC20, CDC25, TUBB3), as well as numerous genes encoding laminin, integrin, collagen and ECM-secreted proteins (e.g. SPP1, COMP, TNC) were upregulated in the tumor samples (Figure 2E–G and not shown). To explore whether this signature was conserved across the other classes of CCA, we performed a meta-analysis of normal versus tumor samples from intrahepatic, hilar-perihilar and distal CCAs. We identified two genes, namely CEACAM5 and AFAP1-AS1, ranking in the top 2 in a product-rank meta-analysis (Figure 2H). Interestingly, CEACAM5 (carcinoembryonic antigen, CEA) is a well-established prognostic marker in CCA (57), while the long non-coding RNA AFAP1-AS1 has been linked to metastasis (a process requiring complex ECM remodeling) and cancer cell proliferation in CCA. Hence, the meta-analysis results further supported the importance of cell proliferation and ECM-cell adhesion in CCA.

Figure 2.

Cholangiocarcinoma TCGA data analysis results. (A) Overall histological type percentage of the TCGA cholangiocarcinoma dataset. (B) PCA of normal and intrahepatic tumor samples. (C) Clustered heatmap based on the correlation of normal and intrahepatic tumor samples. (D) Competitive enrichment analysis of normal versus intrahepatic tumor samples using the gene sets related to the ‘Environmental Information Processing’ and ‘Cellular Process’ KEGG pathway maps. The upregulated gene sets are shown in red, the downregulated gene sets are shown in blue. Note: There are two enriched KEGG gene sets related to Hippo signaling, indicated separately in the figure. Hippo signaling (1): KEGG Hippo signaling pathway; Hippo signaling (2): KEGG Hippo signaling pathway—multiple species. (E) Differential analysis of normal versus intrahepatic tumor samples shown in a volcano plot. The significantly different genes (P < 0.05) are shown in blue, the non-significant genes are shown in grey. (F) Barplot visualization of SPP1 expression in normal and intrahepatic tumor samples. The error bar represents the standard error. (G) Boxplot visualization of SFN expression in normal and intrahepatic tumor samples. The box represents the range between the first quartile (Q1) and the third quartile (Q3), the horizontal line represents the median, the whiskers represent the interquartile ranges (IQR, 1.5 × IQR below Q1 and 1.5 × IQR above Q3). (H) Meta-analysis of intrahepatic, hilar-perihilar and distal tumor types. Each violin plot represents the differential analysis of normal versus the corresponding tumor type. The top 2 most consistently upregulated genes (CEACAM5 and AFAP1-AS1) are highlighted. (I) Survival analysis based on MMP11 gene expression of intrahepatic tumor samples. All the results shown in the figure can be directly explored in the BIOMEX ‘Case studies’ section. The parameters used to generate these plots can be found in the manual.

Single cell data: re-analysis of the endothelial cell atlas dataset

To provide an illustrative example on how single cell data can be analyzed, we selected heart, liver and lung endothelial cells (ECs) from the recently published murine EC atlas scRNA-seq dataset (59). We intended to showcase a logical sequence of analyses that a BIOMEX user can employ to (re-)analyze single cell data. ECs line the lumen of blood vessels and are known to be heterogeneous along the vascular tree. Consistently, dimensionality reduction and visualization using PCA and t-SNE indicated that ECs from the lung, heart and liver vascular beds have a distinct transcriptional profile (Figure 3A). To explore heterogeneity of ECs within a single vascular bed, we performed dimensionality reduction using UMAP to visualize the subclusters as they were detected in the EC atlas (59) (Figure 3B). Next, we performed rank-product based marker set analysis, and visualized the top 5 marker genes for each cluster using a heatmap (Figure 3C). Marker genes were consistent with previously described markers of (sublineages of) arterial, capillary, venous, lymphatic and proliferating ECs (59). Quantification of the number of cells per cluster showed that capillary ECs constitute the majority of the liver single cell population (Figure 3D). Further, unbiased linear trajectory inference reconstructed a phenotypic continuum of arterial, capillary and venous phenotypes, consistent with the known anatomical topography of liver ECs (Figure 3E). To explore whether the cluster signatures are conserved across tissues, we performed a similar analysis for the lung and heart ECs (not shown), and subsequently performed a Jaccard similarity analysis. This analysis revealed that marker genes of arterial, capillary and venous phenotypes are conserved across vascular beds (Figure 3F). Finally, we used scmap to project liver ECs from an independent reference dataset (Tabula Muris dataset (8)), onto the cluster identified in the EC atlas liver ECs (Figure 3G).

Figure 3.

Endothelial cell atlas data analysis results. (A) t-SNE plot of ECs from three murine tissues (heart, liver, lung). (B) UMAP of liver tissue showing the endothelial cell clusters as described in the EC atlas. (C) Clustered heatmap showing the top 5 marker genes for each cluster. Colors represent row-wise scaled gene expression with a mean of 0 and a standard deviation of 1 (Z scores). (D) Number of cells for each cluster in liver ECs. (E) Differentiation trajectory of the classic EC phenotypes (arteries, capillaries, veins) in liver. (F) PCA on the pairwise Jaccard similarity coefficients between the top 50 marker genes of the classic EC phenotypes (arteries, capillaries and veins) in heart, lung and liver. (G) Sankey diagram showing the scmap cluster projection of the EC atlas liver data on the Tabula Muris EC liver data. All the results shown in the figure can be directly explored in the BIOMEX ‘Case studies’ section. The parameters used to generate these plots can be found in the manual.

CONCLUSION

To facilitate bench scientists in solving the computational problems arising from omics experiments, we designed and developed BIOMEX, a data mining software for the Biological Interpretation Of Multi-omics EXperiments. BIOMEX aims to alleviate the data-analysis-to-interpretation bottlenecks, lowering the barriers needed to extract the biological information embedded in omics measurements. With its user-friendly, highly interactive web-like interface, users can address complex biological questions by using advanced computational tools and fine-tune the analyses in real time. In addition, its design is unconstrained and allows multi-omics data to be simultaneously uploaded and analyzed into one unified framework, providing a well-defined workflow to analyze, interpret and visualize large-scale data such as single cell measurements. BIOMEX also aids the exploration of datasets generated from publicly available profiling efforts (e.g. Tabula Muris (8), Human Cell Atlas (60), The Cancer Genome Atlas (44)), repositories (e.g. ArrayExpress (61), Gene Expression Omnibus (62)) and added-value databases (e.g. EndoDB (63)). Furthermore, it facilitates the shareability of results, reproducibility of analyses and execution of meta-analyses between different experiments. Due to its convenient user interface and comprehensive manual, BIOMEX could also be used as a didactical tool to introduce researchers to the field of biological data science. To further promote detailed data mining of (single cell) omics datasets accessible to non-bioinformatician experimental scientists, we made BIOMEX freely available for Windows and Linux at https://www.vibcancer.be/software-tools/biomex. The source code is deposited at https://bitbucket.org/ftaverna/biomex. Click here for additional data file.

54 in total

1. Controlling the false discovery rate in behavior genetics research.

Authors: Y Benjamini; D Drai; G Elmer; N Kafkafi; I Golani
Journal: Behav Brain Res Date: 2001-11-01 Impact factor: 3.332

Review 2. The logrank test.

Authors: J Martin Bland; Douglas G Altman
Journal: BMJ Date: 2004-05-01

3. XCMS Online: a web-based platform to process untargeted metabolomic data.

Authors: Ralf Tautenhahn; Gary J Patti; Duane Rinehart; Gary Siuzdak
Journal: Anal Chem Date: 2012-05-10 Impact factor: 6.986

4. MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide protein quantification.

Authors: Jürgen Cox; Matthias Mann
Journal: Nat Biotechnol Date: 2008-11-30 Impact factor: 54.908

5. Understanding survival analysis: Kaplan-Meier estimate.

Authors: Manish Kumar Goel; Pardeep Khanna; Jugal Kishore
Journal: Int J Ayurveda Res Date: 2010-10

6. ArrayExpress update--simplifying data submissions.

Authors: Nikolay Kolesnikov; Emma Hastings; Maria Keays; Olga Melnichuk; Y Amy Tang; Eleanor Williams; Miroslaw Dylag; Natalja Kurbatova; Marco Brandizi; Tony Burdett; Karyn Megy; Ekaterina Pilicheva; Gabriella Rustici; Andrew Tikhonov; Helen Parkinson; Robert Petryszak; Ugis Sarkans; Alvis Brazma
Journal: Nucleic Acids Res Date: 2014-10-31 Impact factor: 16.971

7. Fast principal component analysis of large-scale genome-wide data.

Authors: Gad Abraham; Michael Inouye
Journal: PLoS One Date: 2014-04-09 Impact factor: 3.240

8. Cholangiocarcinoma‑associated genes identified by integrative analysis of gene expression data.

Authors: Wei Zhong; Lianzhi Dai; Jing Liu; Song Zhou
Journal: Mol Med Rep Date: 2018-02-12 Impact factor: 2.952

9. The Human Cell Atlas.

Authors: Aviv Regev; Sarah A Teichmann; Eric S Lander; Ido Amit; Christophe Benoist; Ewan Birney; Bernd Bodenmiller; Peter Campbell; Piero Carninci; Menna Clatworthy; Hans Clevers; Bart Deplancke; Ian Dunham; James Eberwine; Roland Eils; Wolfgang Enard; Andrew Farmer; Lars Fugger; Berthold Göttgens; Nir Hacohen; Muzlifah Haniffa; Martin Hemberg; Seung Kim; Paul Klenerman; Arnold Kriegstein; Ed Lein; Sten Linnarsson; Emma Lundberg; Joakim Lundeberg; Partha Majumder; John C Marioni; Miriam Merad; Musa Mhlanga; Martijn Nawijn; Mihai Netea; Garry Nolan; Dana Pe'er; Anthony Phillipakis; Chris P Ponting; Stephen Quake; Wolf Reik; Orit Rozenblatt-Rosen; Joshua Sanes; Rahul Satija; Ton N Schumacher; Alex Shalek; Ehud Shapiro; Padmanee Sharma; Jay W Shin; Oliver Stegle; Michael Stratton; Michael J T Stubbington; Fabian J Theis; Matthias Uhlen; Alexander van Oudenaarden; Allon Wagner; Fiona Watt; Jonathan Weissman; Barbara Wold; Ramnik Xavier; Nir Yosef
Journal: Elife Date: 2017-12-05 Impact factor: 8.140

10. Identification of cell types in a mouse brain single-cell atlas using low sampling coverage.

Authors: Aparna Bhaduri; Tomasz J Nowakowski; Alex A Pollen; Arnold R Kriegstein
Journal: BMC Biol Date: 2018-10-11 Impact factor: 7.431

16 in total

1. Protocols for endothelial cell isolation from mouse tissues: small intestine, colon, heart, and liver.

Authors: Liliana Sokol; Vincent Geldhof; Melissa García-Caballero; Nadine V Conchinha; Sébastien J Dumas; Elda Meta; Laure-Anne Teuwen; Koen Veys; Rongyuan Chen; Lucas Treps; Mila Borri; Pauline de Zeeuw; Kim D Falkenberg; Charlotte Dubois; Magdalena Parys; Laura P M H de Rooij; Jermaine Goveia; Katerina Rohlenova; Luc Schoonjans; Mieke Dewerchin; Guy Eelen; Xuri Li; Joanna Kalucka; Peter Carmeliet
Journal: STAR Protoc Date: 2021-05-01

2. Metabolic View on Human Healthspan: A Lipidome-Wide Association Study.

Authors: Justin Carrard; Hector Gallart-Ayala; Denis Infanger; Tony Teav; Jonathan Wagner; Raphael Knaier; Flora Colledge; Lukas Streese; Karsten Königstein; Timo Hinrichs; Henner Hanssen; Julijana Ivanisevic; Arno Schmidt-Trucksäss
Journal: Metabolites Date: 2021-04-30

Review 3. Optimization of metabolomic data processing using NOREVA.

Authors: Jianbo Fu; Ying Zhang; Yunxia Wang; Hongning Zhang; Jin Liu; Jing Tang; Qingxia Yang; Huaicheng Sun; Wenqi Qiu; Yinghui Ma; Zhaorong Li; Mingyue Zheng; Feng Zhu
Journal: Nat Protoc Date: 2021-12-24 Impact factor: 13.491

4. Identification of vascular cues contributing to cancer cell stemness and function.

Authors: Saran Kumar; Libat Bar-Lev; Husni Sharife; Myriam Grunewald; Maxim Mogilevsky; Tamar Licht; Jermaine Goveia; Federico Taverna; Iddo Paldor; Peter Carmeliet; Eli Keshet
Journal: Angiogenesis Date: 2022-02-03 Impact factor: 10.658

5. Peroxisomal Multifunctional Protein 2 Deficiency Perturbs Lipid Homeostasis in the Retina and Causes Visual Dysfunction in Mice.

Authors: Yannick Das; Daniëlle Swinkels; Sai Kocherlakota; Stefan Vinckier; Frédéric M Vaz; Eric Wever; Antoine H C van Kampen; Bokkyoo Jun; Khanh V Do; Lieve Moons; Nicolas G Bazan; Paul P Van Veldhoven; Myriam Baes
Journal: Front Cell Dev Biol Date: 2021-02-02

6. Effects of the Novel PFKFB3 Inhibitor KAN0438757 on Colorectal Cancer Cells and Its Systemic Toxicity Evaluation In Vivo.

Authors: Tiago De Oliveira; Tina Goldhardt; Marcus Edelmann; Torben Rogge; Karsten Rauch; Nikola Dobrinov Kyuchukov; Kerstin Menck; Annalen Bleckman; Joanna Kalucka; Shawez Khan; Jochen Gaedcke; Martin Haubrock; Tim Beissbarth; Hanibal Bohnenberger; Mélanie Planque; Sarah-Maria Fendt; Lutz Ackermann; Michael Ghadimi; Lena-Christin Conradi
Journal: Cancers (Basel) Date: 2021-02-28 Impact factor: 6.639

7. Integrative Omics Analysis Unravels Microvascular Inflammation-Related Pathways in Kidney Allograft Biopsies.

Authors: Claire Tinel; Baptiste Lamarthée; Jasper Callemeyn; Elisabet Van Loon; Virginia Sauvaget; Lise Morin; Laïla Aouni; Marion Rabant; Wilfried Gwinner; Pierre Marquet; Maarten Naesens; Dany Anglicheau
Journal: Front Immunol Date: 2021-11-02 Impact factor: 7.561

8. Rapid Identification of the Tumor-Specific Reactive TIL Repertoire via Combined Detection of CD137, TNF, and IFNγ, Following Recognition of Autologous Tumor-Antigens.

Authors: Arianna Draghi; Christopher Aled Chamberlain; Shawez Khan; Krisztian Papp; Martin Lauss; Samuele Soraggi; Haja Dominike Radic; Mario Presti; Katja Harbst; Aishwarya Gokuldass; Anders Kverneland; Morten Nielsen; Marie Christine Wulff Westergaard; Mads Hald Andersen; Istvan Csabai; Göran Jönsson; Zoltan Szallasi; Inge Marie Svane; Marco Donia
Journal: Front Immunol Date: 2021-10-11 Impact factor: 7.561

9. Protocols for endothelial cell isolation from mouse tissues: kidney, spleen, and testis.

Authors: Sébastien J Dumas; Elda Meta; Nadine V Conchinha; Liliana Sokol; Rongyuan Chen; Mila Borri; Laure-Anne Teuwen; Koen Veys; Melissa García-Caballero; Vincent Geldhof; Lucas Treps; Pauline de Zeeuw; Kim D Falkenberg; Charlotte Dubois; Magdalena Parys; Laura P M H de Rooij; Katerina Rohlenova; Jermaine Goveia; Luc Schoonjans; Mieke Dewerchin; Guy Eelen; Xuri Li; Joanna Kalucka; Peter Carmeliet
Journal: STAR Protoc Date: 2021-07-28

10. Protocols for endothelial cell isolation from mouse tissues: brain, choroid, lung, and muscle.

Authors: Nadine V Conchinha; Liliana Sokol; Laure-Anne Teuwen; Koen Veys; Sébastien J Dumas; Elda Meta; Melissa García-Caballero; Vincent Geldhof; Rongyuan Chen; Lucas Treps; Mila Borri; Pauline de Zeeuw; Kim D Falkenberg; Charlotte Dubois; Magdalena Parys; Laura P M H de Rooij; Katerina Rohlenova; Jermaine Goveia; Luc Schoonjans; Mieke Dewerchin; Guy Eelen; Xuri Li; Joanna Kalucka; Peter Carmeliet
Journal: STAR Protoc Date: 2021-09-14