Camila M Lopes-Ramos, Joseph N Paulson, Cho-Yi Chen, Marieke L Kuijjer, Maud Fagny, John Platig, Abhijeet R Sonawane, Dawn L DeMeo, John Quackenbush, Kimberly Glass.
Abstract
BACKGROUND: Cell lines are an indispensable tool in biomedical research and are often used as surrogates for tissues. Although important cellular and transcriptomic differences between cell lines and tissues are recognized, a systematic overview of the differences between the regulatory processes of a cell line and those of its tissue of origin has not been conducted. The RNA-Seq data generated by the GTEx project is the first available data resource in which it is possible to perform a large-scale transcriptional and regulatory network analysis comparing cell lines with their tissues of origin.
Keywords: Fibroblast cell lines; GTEx; Lymphoblastoid cell lines; Regulatory networks; Transcriptome
Year: 2017 PMID: 28899340 PMCID: PMC5596945 DOI: 10.1186/s12864-017-4111-x
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Fig. 1. Pathways are differentially expressed between cell lines and their tissues of origin. (a) Number of differentially expressed genes (absolute log2 fold change > 2 and FDR < 0.05) using voom on paired samples. (b) GSEA results for the 15 pathways most significantly differentially expressed between each cell line and its tissue of origin, reported on the log10(FDR) significance scale, with one group in red and the other in blue. (c) Pathways enriched for at least two group comparisons (FDR < 0.05). The pathways differentially expressed between the tissues that are also differentially expressed between the cell lines (preserved pathways) are highlighted in red and blue. Pathways over-expressed in both cell lines compared to their tissues of origin are highlighted in yellow. Rows are ordered by hierarchical clustering of the enrichment significance values, log10(FDR). To represent the FDR significance in the heatmap, the color was saturated at 1.1 × 10^-4. The exact reported FDR can be found in Additional file 2.
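The gene-selection rule in panel a of Fig. 1 (absolute log2 fold change > 2 and FDR < 0.05) can be illustrated with a minimal Python sketch. The gene names and values below are hypothetical, and the function `select_de_genes` is an illustrative helper, not part of voom or of the authors' pipeline; it assumes the fold changes and FDR values have already been computed elsewhere.

```python
from typing import Dict, List, Tuple

# Hypothetical per-gene results: gene -> (log2 fold change, FDR).
# These values are invented for illustration only.
toy_results: Dict[str, Tuple[float, float]] = {
    "geneA": (3.2, 0.001),    # passes both cutoffs
    "geneB": (-2.5, 0.03),    # passes both cutoffs (negative direction)
    "geneC": (2.4, 0.20),     # fails the FDR cutoff
    "geneD": (0.8, 0.0001),   # fails the fold-change cutoff
    "geneE": (2.0, 0.01),     # fails: the fold-change cutoff is strict (> 2)
}


def select_de_genes(
    results: Dict[str, Tuple[float, float]],
    lfc_cutoff: float = 2.0,
    fdr_cutoff: float = 0.05,
) -> List[str]:
    """Return genes with |log2FC| > lfc_cutoff and FDR < fdr_cutoff."""
    return [
        gene
        for gene, (lfc, fdr) in results.items()
        if abs(lfc) > lfc_cutoff and fdr < fdr_cutoff
    ]


selected = select_de_genes(toy_results)
print(selected)
```

Both inequalities are strict, matching the wording of the caption, so a gene sitting exactly on a cutoff is excluded.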
Fig. 2. Transcription factors differentially targeting genes in cell lines and their tissues of origin. (a) Illustration of the TF out-degree difference between each cell line and its tissue of origin. Positive values indicate higher targeting in cell lines, and negative values indicate higher targeting in tissues. (b) Function of the TFs with the largest difference in out-degree comparing the LCL-vs-blood and fibroblast-vs-skin regulatory networks. The complete table with references and differential expression results is shown in Additional file 8.
Fig. 3. Cell cycle pathway genes are less strongly targeted by TFs in cell lines. (a) Group-specific gene regulatory networks were generated using PANDA. The illustrations represent subnetworks of the 1000 edges with the highest edge weight difference between a cell line and its tissue of origin around the cell cycle genes. The color indicates the edge weight strength between the TF and target gene (the edges shown have a weight greater than 2 in at least one network). (b) Illustration of the gene in-degree difference between each cell line and its tissue of origin. Positive values indicate higher targeting in cell lines, and negative values indicate higher targeting in tissues. (c) Boxplot of the gene in-degree differences for the genes in the KEGG cell cycle pathway and for genes not in this pathway (significance measured using a t-test). The lower in-degree difference indicates that the genes in the cell cycle pathway are less strongly targeted by TFs in the cell line compared to its tissue of origin.
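The TF out-degree and gene in-degree differences described in the captions of Fig. 2 and Fig. 3 can be sketched as follows. This is a minimal illustration under a stated assumption: degree is taken here to be the sum of edge weights leaving a TF (out-degree) or entering a gene (in-degree), which the captions themselves do not spell out. The networks, TF names, and gene names are hypothetical toy data, not PANDA output.

```python
from typing import Dict, Tuple

Edge = Tuple[str, str]  # (transcription factor, target gene)


def out_degree(network: Dict[Edge, float]) -> Dict[str, float]:
    """Sum the weights of all edges leaving each TF (assumed definition)."""
    degree: Dict[str, float] = {}
    for (tf, _gene), weight in network.items():
        degree[tf] = degree.get(tf, 0.0) + weight
    return degree


def in_degree(network: Dict[Edge, float]) -> Dict[str, float]:
    """Sum the weights of all edges entering each gene (assumed definition)."""
    degree: Dict[str, float] = {}
    for (_tf, gene), weight in network.items():
        degree[gene] = degree.get(gene, 0.0) + weight
    return degree


def degree_difference(
    cell_line: Dict[str, float], tissue: Dict[str, float]
) -> Dict[str, float]:
    """Cell line minus tissue; positive means higher targeting in the cell line."""
    nodes = set(cell_line) | set(tissue)
    return {n: cell_line.get(n, 0.0) - tissue.get(n, 0.0) for n in nodes}


# Hypothetical toy networks with invented edge weights.
cell_line_net = {("TF_A", "gene1"): 1.5, ("TF_A", "gene2"): 0.5, ("TF_B", "gene1"): -0.5}
tissue_net = {("TF_A", "gene1"): 0.5, ("TF_A", "gene2"): 0.5, ("TF_B", "gene1"): 2.0}

tf_diff = degree_difference(out_degree(cell_line_net), out_degree(tissue_net))
gene_diff = degree_difference(in_degree(cell_line_net), in_degree(tissue_net))
print(tf_diff)
print(gene_diff)
```

In this toy example `gene1` receives a negative in-degree difference, which under the sign convention of the captions corresponds to higher targeting in the tissue than in the cell line.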
Fig. 4. SMAD5 is differentially regulating cell cycle pathway genes. (a) Spearman correlation between the log2 fold change in gene expression (LCL-blood difference) of KEGG cell cycle pathway genes and the differential targeting they receive by the TF SMAD5. Red: evidence of SMAD5 ChIP-Seq binding in the promoter of the gene; black: no evidence of SMAD5 binding. The negative correlation observed indicates that the cell cycle genes are more highly expressed but less targeted by SMAD5 in LCL compared to blood. (b) Boxplot of Spearman correlation coefficients between SMAD5 expression levels and expression levels of all genes, and between SMAD5 expression levels and the expression levels of cell cycle genes with SMAD5 ChIP-Seq binding evidence, for LCL and blood samples. The difference in magnitude between LCL and blood was tested using a Wilcoxon rank-sum test. (c) Visualization of the correlation between TF and cell cycle gene expression for interactions that have ChIP-Seq binding evidence. More positively correlated associations are shown in red, more negatively correlated associations in blue, and correlations near zero in gray.
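The Spearman correlation used in panel a of Fig. 4 is the Pearson correlation of the ranks of the two variables. The following is a minimal pure-Python sketch of that statistic, using average ranks for ties. The input vectors are hypothetical values standing in for per-gene log2 fold changes and differential targeting scores; they are not taken from the study.

```python
import math
from typing import List, Sequence


def average_ranks(values: Sequence[float]) -> List[float]:
    """Rank values from 1..n, giving tied values the mean of their ranks."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of values tied with position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        mean_rank = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = mean_rank
        i = j + 1
    return ranks


def pearson(x: Sequence[float], y: Sequence[float]) -> float:
    """Pearson correlation coefficient of two equal-length sequences."""
    n = len(x)
    mean_x, mean_y = sum(x) / n, sum(y) / n
    cov = sum((a - mean_x) * (b - mean_y) for a, b in zip(x, y))
    var_x = sum((a - mean_x) ** 2 for a in x)
    var_y = sum((b - mean_y) ** 2 for b in y)
    return cov / math.sqrt(var_x * var_y)


def spearman(x: Sequence[float], y: Sequence[float]) -> float:
    """Spearman correlation: Pearson correlation computed on ranks."""
    return pearson(average_ranks(x), average_ranks(y))


# Hypothetical per-gene values, invented for illustration.
log2_fold_change = [2.1, 1.4, 0.3, 3.0, 0.9]
differential_targeting = [-1.0, -0.2, 0.8, -1.6, 0.1]

rho = spearman(log2_fold_change, differential_targeting)
print(rho)
```

The toy vectors are constructed so that the gene with the largest fold change has the lowest differential targeting and so on down the list, which gives a perfectly monotone decreasing relationship and the same sign of correlation that the caption reports for SMAD5.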