Literature DB >> 26484102

Transcriptomics profiling of human SGBS adipogenesis.

Mafalda Galhardo1, Lasse Sinkkonen1, Philipp Berninger2, Jake Lin3, Thomas Sauter1, Merja Heinäniemi4.   

Abstract

Obesity is an ever-growing epidemic where tissue homeostasis is influenced by the differentiation of adipocytes that function in lipid metabolism, endocrine and inflammatory processes. While this differentiation process has been well-characterized in mice, limited data is available from human cells. Applying microarray expression profiling in the human SGBS pre-adipocyte cell line, we identified genes with differential expression during differentiation in combination with constraint-based modeling of metabolic pathway activity. Here we describe the experimental design and quality controls in detail for the gene expression and related results published by Galhardo et al. in Nucleic Acids Research 2014 associated with the data uploaded to NCBI Gene Expression Omnibus (GSE41352).

Entities:  

Keywords:  Adipocyte; Differentiation; Metabolism

Year:  2014        PMID: 26484102      PMCID: PMC4535456          DOI: 10.1016/j.gdata.2014.07.004

Source DB:  PubMed          Journal:  Genom Data        ISSN: 2213-5960


Direct link to deposited data

http://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE41352.

Experimental design, materials and methods

Cell differentiation and experimental design

Gene expression levels during adipocyte differentiation were obtained by stimulating the SGBS pre-adipocyte cell line with a mix of differentiation inducing compounds and collecting RNA samples at 0, 4, 8 and 12 h and on days 1, 3 and 12 of adipogenesis for hybridization on Illumina HT-12 microarrays. Triplicate samples were prepared following the differentiation protocol modified from [2] (exception is 12 h time point that has only duplicate samples) as shown in Table 1. SGBS cells differentiate within 10–12 days as determined by microscopic analysis (Oil red O staining). At this time point the cells are filled with small sized lipid droplets and are most responsive, whereas at later time points (20 days) the lipid droplets fuse and cells are less active (personal communication, Dr. Martin Wabitsch).
Table 1

Microarray sample description from the SGBS pre-adipocyte differentiation experiment (GSE41578). GEO sample identifiers are presented for the 20 samples prepared, as well as their differentiation time point and replicate number.

Sample nameGSM identifierTitleTimeReplicate
Sample 1GSM1015366SGBS_day0_10 h1
Sample 2GSM1015367SGBS_day0_20 h2
Sample 3GSM1015368SGBS_day0_30 h3
Sample 4GSM1015369SGBS_4h_14 h1
Sample 5GSM1015370SGBS_4h_24 h2
Sample 6GSM1015371SGBS_4h_34 h3
Sample 7GSM1015372SGBS_8h_18 h1
Sample 8GSM1015373SGBS_8h_28 h2
Sample 9GSM1015374SGBS_8h_38 h3
Sample 10GSM1015375SGBS_12h_112 h1
Sample 11GSM1015376SGBS_12h_212 h2
Sample 12GSM1015377SGBS_day1_1Day 11
Sample 13GSM1015378SGBS_day1_2Day 12
Sample 14GSM1015379SGBS_day1_3Day 13
Sample 15GSM1015380SGBS_day3_1Day 31
Sample 16GSM1015381SGBS_day3_2Day 32
Sample 17GSM1015382SGBS_day3_3Day 33
Sample 18GSM1015383SGBS_day12_1Day 121
Sample 19GSM1015384SGBS_day12_2Day 122
Sample 20GSM1015385SGBS_day12_3Day 123
Specifically, SGBS cells were cultured in Dulbecco's modified Eagle's medium (DMEM)/Nutrient Mix F12 (Gibco) containing 8 mg/L biotin, 4 mg/L pantothenate, 0.1 mg/mg streptomycin and 100 U/mL penicillin (OF medium) supplemented with 10% FBS in a humidified 95% air/5% CO2 incubator. The cells were seeded into 10 cm plates, which were coated with a solution of 10 μL/mL fibronectin and 0.05% gelatine in phosphate-buffered saline. Confluent cells were cultured in serum-free OF medium for 2 days followed by stimulation to differentiate with OF media supplemented with 0.01 mg/mL human transferrin, 200 nM T3, 100 nM cortisol, 20 nM insulin, 500 μM IBMX and 100 nM rosiglitazone (Cayman Chemicals). After day 4, the differentiating cells were kept in OF media supplemented with 0.01 mg/mL human transferrin, 100 nM cortisol and 20 nM insulin.

Gene expression analysis

Total RNA was extracted using TriSure (Bioline). 1 mL of TriSure was added per a confluent 10 cm dish to lyse the cells. RNA was extracted with 200 μL chloroform and precipitated from the aqueous phase with 400 μL isopropanol by incubating at − 20 °C overnight. The longer isopropanol incubation allowed the precipitation of microRNAs and other small RNAs from the same samples. The total RNA samples were processed according to the manufacturer instructions to prepare cDNA that was hybridized on microarrays (Turku Centre for Biotechnology, Microarray and Sequencing Facility, Turku, Finland). Total RNA integrity was confirmed using an Agilent 2100 Bioanalyzer (Agilent Technologies, Santa Clara, CA, USA).

Data processing and normalization

The raw data files were processed and quality controlled using the R/Bioconductor lumi package. Raw and normalized expression values are available via GEO (GSE41352). Control probe data was included and used to background correct the signal values with the lumiB “bgAdjust” method. We provide this data and sample data in a format that is directly compatible with the lumi analysis package through our web resource at http://systemsbiology.uni.lu/idare.html. The data was then transformed with the “vst” method and normalized with robust spline normalization (rsn) method. The probe intensity value distribution and sample relation are plotted in Fig. 1, Fig. 2, with sample naming described in Table 1. No outliers were detected based on data value range at this step and the samples clustered according to the biological sample group. The code that can be used to download processed data from GEO or to process them from the files that we provide through our website is available (see Additional Data File 1).
Fig. 1

Probe intensity plots for the 20 SGBS differentiation samples in GSE41578. A) Box plots of raw probe intensities. B) Box plots of normalized probe intensities indicate the absence of outliers and comparable data mean intensities.

Fig. 2

Hierarchical clustering of the SGBS differentiation microarray samples. The dendrogram shows high similarity between replicates and grouping based on differentiation time progression.

Statistical analysis

The negative probe signals were used to filter non-expressed genes. Only genes that had a detection p-value < 0.05 within all samples of at least one time point were selected for statistical analysis, resulting in a total of 12 756 detected probes. The statistical analysis was performed using the R/Bioconductor limma package. The F-test was used to assess significance of overall dynamic response over the differentiation while a two-tailed t-test was performed to compare specific time points to day 0 undifferentiated cells. In both analyses Benjamini–Hochberg adjusted p-value < 0.01 was considered statistically significant. In total, 1936 Refseq transcripts changed their expression more than 2-fold up or down during the differentiation time series. The code that can be used to filter non-expressed genes and to perform the statistical analysis is available (see Additional Data File 1). Several of these genes were metabolic genes, represented by 2-fold more differentially expressed genes compared to other gene categories with similar numbers of genes (extracted from the GO Online SQL Environment, as of 12th of August 2013: cell projection, envelope, locomotion and receptor activity).

Analysis of metabolic genes in Recon1

The annotation data from Recon1 was obtained and checked against the current EntrezGene and Refseq annotations (hg19 Refseq; Feb 02 2012). The reaction to gene mappings was updated with current gene IDs (see Table S1). Withdrawn IDs and pseudogenes present a difficulty in the Recon1 annotation. As there were only few such genes (see Table S1), they were left out from visualizations and assigned expression level 0 in modeling. LPIN1 was missing and due to its central role in adipocytes, it was added to the triacylglycerol pathway reaction catalyzed by Phosphatidic Acid Phosphatase (PPAP). The expression profiles of metabolic genes (from Recon 1 [3]) or TFs (from [4]) were clustered for visualization using self-organizing maps (GEDI software [5]) and AutoSOME [6] as instructed in the tool documentation. The settings to reproduce the results presented in [1] were the following: GEDI grid size was adjusted based on input gene number and settings were tuned in order to minimize data missing grid points (gene density map) (see Table S2). AutoSOME GUI was used following description in the manual without data filtering. Clustering was done for columns (samples) on “precision” mode, with the “Fuzzy Cluster Network” option and network visualization with Cytoscape [7]. Enriched pathways of the human metabolic reconstruction [3] were determined using a hypergeometric test. A consistent version of the generic human metabolic model Recon1 [3] was used as modeling platform for prediction of network activity distributions. The Recon1 model was downloaded from the BiGG database [8] (04.11.11) and the consistent version was derived using the function “reduceModel” from the COBRA toolbox 2.0 [9], which resulted in the exclusion of 1273 reactions (34%) of the initial model (Table S3). To include the microarray data as soft-constraints for reaction activity prediction, the probes were mapped to Entrez Gene IDs. First, continuous log2 normalized expression values for the probes were discretized into three categories: lowly expressed (− 1), moderately expressed (0) and highly expressed (1) based on the mean expression ± 0.5 ∗ standard deviation cutoffs across all arrays. Then, one unique discretized value per gene was selected taking the rounded discretized mean of all probes for a gene. Each gene was then assigned to the Recon1 reaction based on gene–protein-reaction associations.

Discussion

Here we describe a time series dataset of human SGBS pre-adipocyte differentiation. This dataset is comprised of whole transcriptome gene expression profiling data derived using the Illumina BeadArrays. We demonstrated differential expression that was particularly prevalent among metabolic genes. Moreover, discretization of the metabolic gene expression levels allowed using them as soft-constrains for metabolic activity modeling. Further, this dataset is part of a GEO SuperSeries (GSE41578) and we have used it in combination with next-generation sequencing data and microRNA expression profiles to associate putative regulators to the metabolic genes in [1]. To further analyze the data in an integrative manner, we introduced gene metanodes and the web portal IDARE (Integrated Data Nodes or Regulation) in [1] for interactive data exploration of various data types within the metabolic network context, available at http://systemsbiology.uni.lu/idare.html, including a detailed user guide. The data could be similarly analyzed to interrogate the regulation of other pathways. Results from the data have increased our understanding of human adipogenesis. The following are the supplementary data related to this article.

Table S1

Reaction to gene mapping to current gene IDS.

Table S2

Setting for GEDI visualization.

Table S3

Blocked reactions in Recon1.

Supplementary material 1

The script for transforming and normalizing Illumina datasets.

Supplementary material 2

The script to perform statistical analysis of gene expression.
Specifications [standardized info for the reader]; where applicable, please follow the Ontology for Biomedical Investigations: http://obi-ontology.org/page/Main_Page
Organism/cell line/tissueHuman/SGBS pre-adipocyte/adipose tissue
SexMale
Sequencer or array typeIllumina HumanHT-12V3.0 expression beadchip
Data formatRaw and analyzed data
Experimental factorsTime point of differentiation to adipocytes. Cells were cultured 2 days in serum-free OF medium prior to differentiation
Experimental featuresTime series of differentiation (20 samples, 7 time points in duplicate or triplicate). SGBS pre-adipocyte cells originate from patient with SGB syndrome. See Wabitsch M. et al. Int J Obes Relat Metab Disord. 2001 for more details on differentiation protocol and origin of cells.
ConsentSee Wabitsch M. et al. Int J Obes Relat Metab Disord. 2001 [2]
Sample source locationSee Wabitsch M. et al. Int J Obes Relat Metab Disord. 2001 [2]
  9 in total

1.  Gene Expression Dynamics Inspector (GEDI): for integrative analysis of expression profiles.

Authors:  Gabriel S Eichler; Sui Huang; Donald E Ingber
Journal:  Bioinformatics       Date:  2003-11-22       Impact factor: 6.937

2.  Global reconstruction of the human metabolic network based on genomic and bibliomic data.

Authors:  Natalie C Duarte; Scott A Becker; Neema Jamshidi; Ines Thiele; Monica L Mo; Thuy D Vo; Rohith Srivas; Bernhard Ø Palsson
Journal:  Proc Natl Acad Sci U S A       Date:  2007-01-31       Impact factor: 11.205

3.  Quantitative prediction of cellular metabolism with constraint-based models: the COBRA Toolbox v2.0.

Authors:  Jan Schellenberger; Richard Que; Ronan M T Fleming; Ines Thiele; Jeffrey D Orth; Adam M Feist; Daniel C Zielinski; Aarash Bordbar; Nathan E Lewis; Sorena Rahmanian; Joseph Kang; Daniel R Hyduke; Bernhard Ø Palsson
Journal:  Nat Protoc       Date:  2011-08-04       Impact factor: 13.491

4.  AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number.

Authors:  Aaron M Newman; James B Cooper
Journal:  BMC Bioinformatics       Date:  2010-03-04       Impact factor: 3.169

5.  Characterization of a human preadipocyte cell strain with high capacity for adipose differentiation.

Authors:  M Wabitsch; R E Brenner; I Melzner; M Braun; P Möller; E Heinze; K M Debatin; H Hauner
Journal:  Int J Obes Relat Metab Disord       Date:  2001-01

6.  BiGG: a Biochemical Genetic and Genomic knowledgebase of large scale metabolic reconstructions.

Authors:  Jan Schellenberger; Junyoung O Park; Tom M Conrad; Bernhard Ø Palsson
Journal:  BMC Bioinformatics       Date:  2010-04-29       Impact factor: 3.169

7.  Cytoscape 2.8: new features for data integration and network visualization.

Authors:  Michael E Smoot; Keiichiro Ono; Johannes Ruscheinski; Peng-Liang Wang; Trey Ideker
Journal:  Bioinformatics       Date:  2010-12-12       Impact factor: 6.937

8.  Integrated analysis of transcript-level regulation of metabolism reveals disease-relevant nodes of the human metabolic network.

Authors:  Mafalda Galhardo; Lasse Sinkkonen; Philipp Berninger; Jake Lin; Thomas Sauter; Merja Heinäniemi
Journal:  Nucleic Acids Res       Date:  2013-11-05       Impact factor: 16.971

9.  Gene-pair expression signatures reveal lineage control.

Authors:  Merja Heinäniemi; Matti Nykter; Roger Kramer; Anke Wienecke-Baldacchino; Lasse Sinkkonen; Joseph Xu Zhou; Richard Kreisberg; Stuart A Kauffman; Sui Huang; Ilya Shmulevich
Journal:  Nat Methods       Date:  2013-04-21       Impact factor: 28.547

  9 in total
  2 in total

Review 1.  20 Years with SGBS cells - a versatile in vitro model of human adipocyte biology.

Authors:  Daniel Tews; Rolf E Brenner; Reiner Siebert; Klaus-Michael Debatin; Pamela Fischer-Posovszky; Martin Wabitsch
Journal:  Int J Obes (Lond)       Date:  2022-08-19       Impact factor: 5.551

2.  A new human adipocyte model with PTEN haploinsufficiency.

Authors:  Franziska Kässner; Anna Kirstein; Norman Händel; Gordian L Schmid; Kathrin Landgraf; Antje Berthold; Astrid Tannert; Michael Schaefer; Martin Wabitsch; Wieland Kiess; Antje Körner; Antje Garten
Journal:  Adipocyte       Date:  2020-12       Impact factor: 4.534

  2 in total

北京卡尤迪生物科技股份有限公司 © 2022-2023.