Literature DB >> 21427694

Differential coexpression analysis of obesity-associated networks in human subcutaneous adipose tissue.

A J Walley1, P Jacobson, M Falchi, L Bottolo, J C Andersson, E Petretto, A Bonnefond, E Vaillant, C Lecoeur, V Vatin, M Jernas, D Balding, M Petteni, Y S Park, T Aitman, S Richardson, L Sjostrom, L M S Carlsson, P Froguel.   

Abstract

OBJECTIVE: To use a unique obesity-discordant sib-pair study design to combine differential expression analysis, expression quantitative trait loci (eQTLs) mapping and a coexpression regulatory network approach in subcutaneous human adipose tissue to identify genes relevant to the obese state. STUDY
DESIGN: Genome-wide transcript expression in subcutaneous human adipose tissue was measured using Affymetrix U133 Plus 2.0 microarrays (Affymetrix, Santa Clara, CA, USA), and genome-wide genotyping data was obtained using an Applied Biosystems (Applied Biosystems; Life Technologies, Carlsbad, CA, USA) SNPlex linkage panel.
SUBJECTS: A total of 154 Swedish families ascertained through an obese proband (body mass index (BMI) >30 kg m(-2)) with a discordant sibling (BMI>10 kg m(-2) less than proband).
RESULTS: Approximately one-third of the transcripts were differentially expressed between lean and obese siblings. The cellular adhesion molecules (CAMs) KEGG grouping contained the largest number of differentially expressed genes under cis-acting genetic control. By using a novel approach to contrast CAMs coexpression networks between lean and obese siblings, a subset of differentially regulated genes was identified, with the previously GWAS obesity-associated neuronal growth regulator 1 (NEGR1) as a central hub. Independent analysis using mouse data demonstrated that this finding of NEGR1 is conserved across species.
CONCLUSION: Our data suggest that in addition to its reported role in the brain, NEGR1 is also expressed in subcutaneous adipose tissue and acts as a central 'hub' in an obesity-related transcript network.

Entities:  

Mesh:

Substances:

Year:  2011        PMID: 21427694      PMCID: PMC3160485          DOI: 10.1038/ijo.2011.22

Source DB:  PubMed          Journal:  Int J Obes (Lond)        ISSN: 0307-0565            Impact factor:   5.095


Introduction

Obesity, commonly defined as a body mass index (BMI) > 30 kg/m2, has steadily risen in prevalence globally, a trend that could lead to over a billion people being obese by 2030 1. Obesity is already a major public health problem, resulting in increased morbidity and mortality 2 and different hypotheses have been suggested to account for this 3. Genome-wide linkage analysis alone has identified many genomic regions linked to obesity but replication has been problematic 4. More recently, common low-penetrant variants associated with obesity have been identified in genome-wide association studies (GWAS) 5-8. Additionally, rare copy number variants 9 have also been implicated in the causality of obesity. All of these approaches rely on the correlation between genomic variation and either obesity status or an obesity-related quantitative phenotype, e.g. BMI. Gene expression levels reflect the combined effects of a wide range of genomic modifications including point mutations, structural variants and epigenetic changes. Abundance of any specific mRNA is therefore likely to more closely reflect the overall genomic effects than each type of variation separately. This is especially true for those changes having a direct effect on the transcription levels, although alterations in protein structure and function might also have a feedback effect on transcriptional activity 10. Environmental effects are also likely to be indirectly captured by transcript levels, as recently shown in leukocyte gene expression studies among three Moroccan sub-populations where at least 37% of the differentially expressed transcripts were not explainable by genetic and methylation differences11. Therefore, the assessment of genome-wide gene expression provides a snapshot of underlying cellular processes and their environmental and genomic influences. Since the transcript levels are strongly modulated by polymorphisms in regulatory regions, they can be powerfully mapped by correlating gene expression with genetic data. The regions identified by such correlations, named expression quantitative trait loci (eQTLs), directly pinpoint the functional link between variants in the genome and their biological effect. For this reason, eQTL analysis has been suggested as a means to identify genetic variants involved in the susceptibility to complex diseases and to fill the gap between disease associations identified by GWA and the mechanism by which they contribute to the disease 12, 13. The choice of tissue is central to a gene expression study, as the expression profile is context dependent and differs between tissues 14. In addition, within the same tissue, eQTLs can be specific to the cellular differentiation state 15. Subcutaneous adipose tissue (SAT) is the tissue of choice to investigate common human obesity because it displays obesity-related changes in gene expression 16, it has clear endocrine organ characteristics 17, and samples can be obtained from large numbers of human subjects. Altered expression of a number of genes implicated with obesity and the metabolic syndrome has been reported in studies of SAT from obese subjects, including CD36 18 and PFKFB3 19. Instead of analyzing each transcript independently from the others, novel approaches can exploit the interactions among transcripts to identify gene networks. They delineate the complex interrelationships occurring amongst gene transcription levels which can be correlated with phenotypic and genomic data for the identification of relevant biological pathways 12. Measurement of gene expression in multiple tissues in mice has allowed the delineation of a gene network enriched for genes involved in the inflammatory response and macrophage activation that is highly correlated with obesity-related phenotypes 20. A similar overlapping network has been identified in human SAT 21. Our study takes advantage of the SibPair cohort, which consists of154 families (n=732) identified by having an obese proband (BMI>30kg/m2) with a BMI-discordant sibling (BMI difference of at least 10kg/m2) 22. SAT and blood samples were available from the siblings and peripheral blood from all subjects. These unique discordant families allowed a combined approach for the identification of genes and pathways involved in obesity. Using a relatively small sample, we have combined eQTL mapping, differential-expression analysis, and a novel differential co-expression network approach in sib-pairs to identify biologically-relevant transcriptional modules and their key regulators to provide insights into the pathogenesis of obesity.

Materials and Methods

Participants and study design

The study cohort was 154 nuclear families (732 subjects) ascertained via an extremely BMI-discordant sib-pair (difference >= 10kg/m2)22. Average family size was 4.75. SAT samples were available from the siblings and peripheral blood from all subjects. Median BMI (1st-3rd quartiles) was 27.2 (23.0–33.2), range 16.9-57.8. Median age (1st-3rd quartiles) was 45 years (36–63). Informed written consent was obtained from all participants. This study was approved by the ethics committee of Gothenburg University.

Nucleic acid isolation

Genomic DNA was isolated from whole blood using the QIAamp DNA Blood Maxi Kit (Qiagen, Hilden, Germany) according to the manufacturers' recommendations. Subcutaneous adipose tissue biopsies were immediately frozen in liquid nitrogen and RNA was extracted using the Qiagen RNeasy Lipid Tissue kit.

Linkage Genotyping

The SNPlex™ System Linkage Mapping Set (http://www.appliedbiosystems.com) was used, comprising 3 922 SNPs, of which ~75% are in clusters, distributed across 95 probe pools. Allelic discrimination was performed using an Applied Biosystems 3730xl DNA Analyzer and GeneMapper3.7 software. Pedcheck 23 was used to detect Mendelian inconsistency. Genetic markers giving rise to tight double recombinants were identified with MERLIN 24 and treated as missing data.

Gene expression measurement

Gene expression was measured using the Affymetrix Human Genome U133+2.0 array (Affymetrix, Santa Clara, CA). In brief, RNA was reverse transcribed into cDNA and biotin-labelled cRNA was prepared by in vitro transcription (Enzo Diagnostics Inc, Farmingdale, NY). After hybridisation, the arrays were scanned using the Affymetrix GeneArray GCS3000 scanner and visualised using GeneChip Operating Software (GCOS, Affymetrix). Gene expression levels were normalised using the Robust Multiarray Average (RMA) method 25.

RT-PCR gene expression analysis

Adipose tissue biopsies were obtained from subcutaneous fat depots of two French volunteers, as previously described 26. For each sample, 1μg of total RNA was transcribed into cDNA using the cDNA Archive Kit (Applied Biosystems) or Random Primed First Strand Synthesis (Applied Biosystems). 4μl of a 1/10th dilution of each resulting cDNA was used in a 20μl reaction, including 10μl of TaqMan gene expression mastermix (Applied Biosystems) and 1μl of the appropriate assay (Applied Biosystems). Quantitative RT-PCR analyses were performed using ABI 7900 HT SDS2.3 software and each sample was run in triplicate. NEGR1 expression levels were obtained relative to three housekeeping genes (ACTB, TOP1 and POLR2A). The cDNA sample content was normalized by subtracting the number of copies of the mean of three housekeeping genes from the number of copies of the target gene (ΔCt = Ct of target gene – Ct of housekeeping genes). Expression was calculated using the formula 100×2-ΔCt.

Linkage analysis

After quality control, 149 families were considered suitable for analysis. We selected the subset of transcripts having a unique position or specificity > 70% in the genome (n=27 904 transcripts) using SCAMPA (http://web.bioinformatics.ic.ac.uk/scampa). Linkage was evaluated using MERLIN-REGRESS24. Although robust to misspecification, MERLIN-REGRESS requires the population trait's mean, variance, and heritability. Population parameters were estimated using the variance component model implemented in SOLAR 27. Since the variance components analysis requires a normal distribution for the trait, we applied a Box-Cox transformation to each transcript level 28. Gene expression values falling outside the mean ±3 SDs were excluded from the analysis. Age and sex were included as covariates in the SOLAR analyses. To identify cis-eQTLs, a window of 2.5cM left and right of each transcript position was used. Given this map resolution there are 1 483 transcripts which have no marker within 5cM, therefore a subset of 26 421 transcripts was analysed. All 27 904 transcripts were included in the trans-eQTL analysis. Linkage disequilibrium among the SNPs was modelled by specifying in MERLIN-REGRESS to treat as a “super-locus” all SNPs for which the observed pairwise r2>0.1 29. All P values were calculated from LOD-scores, then corrected for multiple testing by the FDR procedure 30.

Assessing the significance of trans-eQTLs

To determine the empirical significance of trans-eQTLs, the approach of Emilsson 21 was used. Linkage analysis of the 27 904 transcripts was repeated using ten genome-wide datasets simulated by gene dropping under the null hypothesis of no linkage. The top-hit trans-eQTL for every transcript was extracted from each of the ten genome-wide analyses, giving a distribution of 279 040 LOD-scores that was used to assess empirical P values for the trans-eQTLs observed in the original data. For the detection of hotspots of trans-regulation, we are interested in the probability for different signals, each of them genome-wide significant, to randomly arise at the same location. Hidden underlying correlation structure between the IBD at a genetic location and the transcription levels might influence the occurrence of false coincident linkages. The 5% LOD-score observed in the simulated dataset was used as threshold for the genome-wide significance of each analysed transcript in our data. The number of coincident linkages was then recorded at each marker location. Applying the same procedure to the simulated dataset, we obtained the distribution of coincident linkages under the null hypothesis of no-linkage. We used this distribution to assess empirical P values for the size of the observed coincident linkages. Finally, multiple test correction was assessed using the FDR procedure 30.

Differential expression

Log-transformed expression levels for the whole set of 54 675 transcripts were corrected for age and sex and 119 pairs of extreme sibs were selected. The Limma package was used to identify significant genes that were over- or under-expressed 31. Linear and robust regressions were performed separately, before applying the Empirical Bayes shrinkage method, obtaining similar results. Paired design was taken into account and specified accordingly. Correction for multiple testing was performed using Storey's FDR procedure 32 on the P values of the shrunk test statistics.

Differential co-expression analysis

Diseases can often result from the dysregulation of a gene network 33. Differential co-expression analysis 34 35 might help in identifying those genes within the network that lead to the disruption of the regulatory mechanisms. We propose a novel approach of testing the difference between gene networks in two groups. Firstly, we built obese and lean relevance networks with correlation matrices calculated using Kendall's tau correlation 36 in order to robustify the analysis. Then we contrasted the two networks calculating the differences between the transcript-transcript correlation matrices. Significant difference were evaluated using permutation tests 37 with different resample schemes chosen according to the two samples dependencies. Empirical P values were computed as the proportion of the differences observed in the permuted data sets that were equal or greater than what was observed in the original data set. An FDR thresholding procedure 32 was applied to the empirical P values to highlight the most significant differences. Our approach, although similar in spirit to other methods that look at differences in coexpression networks between different conditions/or case control groups (for a review see 38), is new in many respects. Firstly, through a model-free permutation test, we test directly whether the observed correlations differences are significant so we are not considering differences in the graph's topology 39. Secondly, simply changing the sampling scheme for the permutation test, we can accommodate different levels of dependence between the groups. Thirdly, we do not consider just strong (positive or negative) correlations or strong differences using ad hoc thresholding 40. Selection of what is relevant is obtained by applying the FDR procedure. Finally, the network module is defined as the connected component after FDR calculation, avoiding the ad hoc metric distances required in cluster algorithms 40, 41

Identification of obesity-related biological pathways

At 10% FDR level we selected those differentially expressed transcripts for which cis-eQTLs were also identified. Enrichment of KEGG pathways was assessed with DAVID. Using all differentially expressed transcripts belonging to identified KEGG pathways and the same sub-sample selected by the Limma analysis, we applied the differential co-expression analysis approach at 10% FDR level. To take into account the paired design, we randomly relabelled the data within each pair in each permuted data set. We tested whether the number of connections observed for the analysed genes was larger than that expected under the null hypothesis of these genes being randomly connected 42. We also contrasted 1 000 relevance networks between obese and lean subjects generated using randomly-selected transcripts. The maximum number of connections was recorded for each simulation to evaluate the empirical significance for the most connected genes in the original dataset.

Validation of the differential co-expressed network in mouse

To validate the differential co-expression network identified in human SAT, we used adipose tissue gene expression data that were available from a mouse F2 intercross, although this was from white adipose tissue rather than pure subcutaneous adipose tissue 43. The first and third quartiles of mouse weight were used to select the most obese and most lean mice (n=144). Orthologous genes were identified using Ensembl Biomart (build 37) 44. For comparison, the differential co-expression analysis in humans was re-evaluated using the subset of genes also present in the mouse dataset. To assess the empirical significance of the difference observed between relevance networks, we applied the differential co-expression analysis approach at 10% FDR level. Assuming the independence of the two samples, in each permuted data set the pooled sample was randomly split preserving the original sample size of the two groups. Statistical assessment was carried out to determine whether any gene showed a number of connections in both the human and mouse differential co-expression networks higher than expected under the assumption of independence. Assuming that the number of connections in each network follows a Poisson distribution, we simulated 1 000 000 times a sample of n paired observations from two independent Poisson, with n equal to the number of genes used to build the two networks. In each simulation we calculated the proportion of connections for the same gene in both networks and we recorded the highest joint proportion which, under the null hypothesis, corresponds to the product of the two marginal distributions. Finally, the empirical distribution of the highest joint proportion was used to evaluate the empirical P value for each pair of significant genes identified in both the human and mouse difference relevance networks.

Correlation of NEGR1 Gene expression in human SAT and hypothalamus

In order to investigate the possibility of correlation between expression of NEGR1 in adipose tissue and in the hypothalamus, a publicly available dataset was used (NCBI GEO accession number GSE3526) 45. This study analysed gene expression in different normal tissues from ten healthy donors using the Affymetrix Human Genome U133 Plus 2.0 Array. Genome-wide expression levels in hypothalamus were available for eight subjects. For three subjects, expression levels were also available for adipose tissue. We assessed NEGR1 correlation in expression levels between the two tissues, using the genome-wide data to generate a null distribution of no association. An empirical P value was derived using one million permutations.

Results

Differentially expressed transcripts

We determined which transcripts were differentially expressed between obese and lean subjects. The results are reported in Table 1. Obesity showed a global effect on genome-wide gene expression. A majority (55%) of the differentially expressed transcripts were up-regulated in lean subjects. DAVID/KEGG analysis of the differentially expressed transcripts did not identify significant enrichment for any obvious obesity-related pathway.
Table 1

Numbers (and percentage) of differentially expressed transcripts between lean and obese subjects identified using the Limma package at different FDR levels using the linear regression option. Number (and percentage) of upregulated transcripts in obese subjects is also provided.

FDR levelTranscripts differentially expressed (%)Upregulated in obese (%)
5%12 621 (23)6 179 (49)
10%16 454 (30)7 478 (45)
20%23 251 (43)9 679 (42)

Detection of cis-eQTLs

Given the inter-SNP map distances, we defined a cis-eQTL signal for each transcript as the maximum LOD-score obtained within 2.5cM 5′ or 3′ of each transcript position in the genome. There were 26 421 transcripts with a SNP marker within 5cM. Median (1st - 3rd quartile) heritability was 0.19 (0.05 – 0.34). The maximum LOD-score was detected at a median (1st - 3rd quartile) distance from the centre of the transcript of 1.5 cM (0.8 - 2.1cM). We identified 1 063 (4%) eQTLs at 10% FDR level. The twenty cis-eQTLs with the highest LOD-scores are shown in Table 2. As expected, cis-eQTLs were detected for those expression traits with a heritability score of zero or close to zero but traits with higher heritability also had higher LOD-scores.
Table 2

Top twenty human cis-eQTLs across the whole genome in descending order of LOD-score. Chromosomal position is based upon NCBI build 37 of the human genome.

Affymetrix IDGene SymbolGene NameCh rLO DPosition (Mb)Distance (Mb)
219629_at FAM118A family with sequence similarity 118, member A2210.943.590.51
228425_at LOC654433 hypothetical LOC654433210.4113.700.02
203815_at GSTT1 glutathione S-transferase theta 12210.323.550.84
222424_s_at NUCKS1 nuclear casein kinase and cyclin-dependent kinase substrate 119.0203.130.84
219269_at HMBOX1 homeobox containing 189.028.820.06
226707_at NAPRT1 nicotinate phosphoribosyltransferase domain containing 188.1144.830.10
214012_at ERAP1 endoplasmic reticulum aminopeptidase 157.994.881.26
209255_at KLHDC10 kelch domain containing 1077.7128.491.04
203337_x_at ITGB1BP1 integrin beta 1 binding protein 127.58.101.37
224097_s_at F11R F11 receptor17.4158.520.74
203814_s_at NQO2 NAD(P)H dehydrogenase, quinone 267.32.840.12
224828_at CPEB4 cytoplasmic polyadenylation element binding protein 457.3173.270.01
226736_at CHURC1 churchill domain containing 1147.366.221.76
225278_at PRKAB2 protein kinase, AMP-activated, beta 2 non-catalytic subunit17.1143.211.89
224904_at PDPR pyruvate dehydrogenase phosphatase regulatory subunit167.168.010.71
203096_s_at RAPGEF2 Rap guanine nucleotide exchange factor (GEF) 246.5160.550.09
219027_s_at MYO9A myosin IXA156.572.031.98
227678_at XRCC6BP1 XRCC6 binding protein 1126.555.621.01
229238_at C17orf97 chromosome 17 open reading frame 97176.51.571.30
210947_s_at MSH3 mutS homolog 3 (E. coli)56.578.711.39

Detection of trans-eQTLs

For each transcript we recorded the maximum peak LOD-score located on a chromosome different to the chromosome where the transcript was located. Using simulations (see Methods) with a 10% FDR, we identified 50 significant trans-eQTLs distributed across 12 chromosomes (see Table 3). Although most trans-eQTLs were not significant after multiple testing correction, we noted that trans-linkage signals for many transcripts were concentrated in the 1p13.3-q23.3 region. The empirical probability of observing coincident linkage was tested by simulating under the null hypothesis of no linkage. In the simulations, when a false positive was detected for a transcript, a number of correlated transcripts also showed a linkage peak in the same region, as expected. Using the empirical probability of coincident linkages through the genome, we determined a significant clustering of 374 transcripts in the 1p13.3-q23.3 region at 10% FDR.
Table 3

List of human trans-eQTLs across the whole genome in chromosomal position order and descending order of LOD-score where trans-eQTLs have the same position. Chromosomal position is based upon NCBI build 37 of the human genome.

Affymetrix IDGene SymbolGene NameCh rLO DPosition (Mb)
204440_at CD83 CD83 molecule113.453.22
227610_at TSPAN11 Tetraspanin 11113.153.22
204973_at GJB1 Gap junction protein, beta 118.653.22
205203_at PLD1 Phospholipase D1, phosphatidylcholine-specific111.9109.84
208631_s_at HADHA Hydroxyacyl-Coenzyme A dehydrogenase/3-ketoacyl-Coenzyme A thiolase/enoyl-Coenzyme A hydratase (trifunctional protein), alpha subunit113.4113.22
217169_at IGHA1 Immunoglobulin heavy constant alpha 118.8115.34
225279_s_at C3ORF17 Chromosome 3 open reading frame 1719.2120.23
200874_s_at NOL5A Nucleolar protein 5A114.8143.21
222735_at TMEM38B Transmembrane protein 38B18.9148.37
224755_at TM9SF3 Transmembrane 9 superfamily member 3111.7157.05
200894_s_at FKBP4 FK506 binding protein 4115.1169.80
222425_s_at POLDIP2 Polymerase (DNA-directed), delta interacting protein 218.9203.13
223193_x_at C3ORF28 Chromosome 3 open reading frame 2819.9212.88
217959_s_at TRAPPC4 Trafficking protein particle complex 418.3212.88
228451_at TSSK3 Testis-specific serine kinase 319.1227.81
202000_at NDUFA6 NADH dehydrogenase (ubiquinone) 1 alpha subcomplex, 619.6228.89
227607_at STAMBPL1 STAM binding protein-like 1210.138.12
228426_at CLEC2D C-type lectin domain family 2, member D210.4113.70

Biological pathways involved in obesity

Given the set of 1 063 cis-eQTLs, pathway enrichment analysis using DAVID 46, 47 identified the KEGG insulin signalling pathway as the most significantly enriched (P=1.6×10−2). The proportion of differentially expressed genes in this pathway did not differ from that observed in the whole dataset. No significant enrichment was observed for the small number of trans-regulated genes identified in this study. For the hotspot of trans-regulators on chromosome 1p13.3-q23.3 the proportion of obesity-related transcripts was again not different from what would be expected at random. Significant enrichment was observed for genes in the apoptosis pathway (P=5.5×10−3) but no obvious obesity candidates were present in this very large region. To identify obesity-related networks that include transcripts under genetic control, we focused on 425 transcripts that for an FDR of 10% were both differentially expressed between lean and obese subjects and under cis-acting regulation. Using an EASE 48 score threshold of 0.1 in DAVID to rank categories of genes, only the Cell Adhesion Molecule (CAMs) KEGG functional grouping was highlighted, which in our dataset contains 160 transcripts (corresponding to 76 genes), eight of them (corresponding to seven genes) under cis-regulation. Relevance gene networks were constructed separately in obese and lean subjects using these 160 transcripts and the empirical significance of the observed differences in co-expression among pairs of transcripts in the two networks evaluated by permutations (see Methods). The lean and obese relevance networks and their contrast are shown in Figure 1. Table 4 lists the CAMs genes and their number of connections in the contrasted network, i.e. the number of significantly different correlations (FDR < 10%) of each gene with the remaining genes between the two groups. The neuronal growth regulator 1 (NEGR1) gene was the most connected gene with nine edges, while four significant connections were observed for HLA-DQB2, three for ALCAM, and two for HLA-DQA2, ITGAM, and CD86. We tested whether the number of connections observed for these genes was larger than that expected by chance. Using as the null distribution a Poisson random variable with mean equal to the average connectivity in the network, the 9 connections observed for NEGR1 were considered as a rare event (P=4.4×10−10). Having four, three, and two connections in this dataset corresponds to P values of 2.7×10−4, 0.002, and 0.02, respectively.
Figure 1

Differential co-expression analysis of CAM gene expression in human subcutaneous adipose tissue

Differentially co-expressed network of the CAMs functional grouping resulting from the contrast between obese (A) and lean (B) networks at FDR 10%. Red and blue edges represent negative and positive correlations respectively. For simplicity, gene names are only shown for the external nodes in (A) and (B).

Table 4

Genes showing significantly different co-expression between lean and obese human SAT CAMs networks at FDR 10% in descending order of their number of connections

Gene SymbolGene NameConnectio ns
NEGR1 neuronal growth regulator 19
HLA-DQB2 major histocompatibility complex, class II, DQ beta 24
ALCAM activated leukocyte cell adhesion molecule3
CD86 CD86 antigen2
HLA-DQA2 major histocompatibility complex, class II, DQ alpha 22
ITGA9 integrin, alpha 92
ITGAM integrin, alpha M (complement component 3 receptor 3 subunit)2
SELP selectin P (granule membrane protein 140kDa, antigen CD62)2
CADM1 cell adhesion molecule 11
CD99 CD99 antigen1
GLG1 golgi apparatus protein 11
HLA-DMB major histocompatibility complex, class II, DM beta1
HLA-DPA1 major histocompatibility complex, class II, DP alpha 11
HLA-DRB5 major histocompatibility complex, class II, DR beta 51
ITGA8 integrin, alpha 81
ITGAL integrin, alpha L1
ITGB1 integrin, beta 11
ITGB2 integrin, beta 21
NLGN1 neuroligin 11
NRCAM neuronal cell adhesion molecule1
NRXN1 neurexin 11
PTPRM protein tyrosine phosphatase, receptor type, M1
PVRL3 poliovirus receptor-related 31
SIGLEC1 sialic acid binding Ig-like lectin 1, sialoadhesin1
We also evaluated the empirical significance of the connectivity observed for these genes by contrasting relevance networks (between obese and lean subjects) randomly generated by using the same number of transcripts and recording the gene with highest connectivity in each simulated dataset. Out of 1 000 replicates, sporadic differences were observed between the obese and lean correlation matrices, as expected, but none of them showed a similar number of differences with respect to the original dataset. In no cases did a sample size of 160 transcripts contain a gene with nine edges. Marginal significance was observed for HLA-DQB2 (P=0.028).

Validation of the CAMs network in mouse

From the whole set of 76 genes belonging to the human CAMs pathway, 57 orthologous genes were present in a mouse dataset 43, corresponding to 66 mouse transcripts. To assess the importance of the NEGR1 gene in both humans and mice, we first restricted the set of CAMs genes in the human data to those which were also present in the mouse, resulting in a set of 115 human transcripts. Contrast of the co-expression networks were carried out in human and mouse and significant results filtered using a 10% FDR level. Table 5 shows the list of significantly connected genes from the mouse analysis, highlighting that NEGR1 is highly connected in the contrasted mouse network as well. The mouse differential relevance network contained an overall larger number of connections, probably reflecting higher intra-group homogeneity and reduced environmental noise in this dataset. We ordered each gene with respect to the observed joint connectivity in both networks. The empirical significance of its rank was assessed through simulations under the null hypothesis of networks' independence (see Methods). Only NEGR1 showed a significant departure from this assumption (P=2.1×10−5) indicating that this gene is integral to both the human and mouse networks.
Table 5

Genes showing significantly different co-expression by contrasting the mouse SAT CAMs networks between mice in the first and third quartile of the weight distribution at 10% FDR level in descending order of their number of connections.

Gene symbolGene nameConnectio ns
SDC2 syndecan 225
CLDN11 claudin 1123
CDH2 cadherin 222
PTPRF protein tyrosine phosphatase, receptor type, F21
NEGR1 neuronal growth regulator 120
NRXN1 neurexin I19
F11R F11 receptor18
NLGN1 Neuroligin 117
HLA-DRB5 major histocompatibility complex, class II, DR beta 516
ITGB1 integrin beta 116
JAM2 junction adhesion molecule 216
PTPRM protein tyrosine phosphatase, receptor type, M14
SIGLEC1 sialic acid binding Ig-like lectin 1, sialoadhesin13
VCAN versican13
ALCAM activated leukocyte cell adhesion molecule12
ITGB8 integrin beta 812
NRXN3 neurexin 312
CLDN15 claudin 1511
ICAM1 intercellular adhesion molecule10
JAM3 junction adhesion molecule 310
SPN sialophorin10
CADM1 cell adhesion molecule 19
PTPRC protein tyrosine phosphatase, receptor type, C9
CLDN5 claudin 58
ITGA8 integrin alpha 88
ITGAL integrin alpha L8
ITGAM integrin alpha M8
ITGAV integrin alpha V8
SELPLG selectin, platelet (p-selectin) ligand8
CD274 CD274 antigen7
CD86 CD86 antigen7
CLDN19 claudin 197
HLA-DQB2 major histocompatibility complex, class II, DQ beta 27
ITGA9 integrin alpha 97
MPZL1 myelin protein zero-like 17
NCAM1 neural cell adhesion molecule 17
CD2 CD2 antigen6
HLA-DOA major histocompatibility complex, class II, DO alpha6
SELP selectin, platelet6
VCAM1 vascular cell adhesion molecule 16
CD4 CD4 antigen5
CLDN9 claudin 95
NEO1 neogenin5
PVRL3 poliovirus receptor-related 35
GLG1 golgi apparatus protein 14
PVRL2 poliovirus receptor-related 24
NCAM2 neural cell adhesion molecule 23
NRCAM neuron-glia-CAM-related cell adhesion molecule3
CD34 CD34 antigen2
ITGB2 integrin beta 22
NLGN3 neuroligin 32
CD28 CD28 antigen1
CD8B CD8b antigen1
PDCD1 programmed cell death 11

Expression of the NEGR1 transcript in SAT

The NEGR1 gene, central to the contrasted co-expression network, is expressed at high levels in brain 7. Using quantitative real-time PCR we demonstrated that NEGR1 is also expressed in SAT (as well as heart and skeletal muscle) using a commercially-available tissue panel and two independent unrelated human SAT samples (Figure 2).
Figure 2

NEGR1 expression levels in human tissues

Relative expression of the NEGR1 transcript in human subcutaneous adipose tissue (SAT) compared to expression in other human tissues from a commercially-available multiple tissue panel.

Correlation of NEGR1 expression in human SAT and hypothalamus

NEGR1 expression levels were significantly correlated between adipose and hypothalamus tissues (r = 0.99; P value = 0.020). This places NEGR1 in the top 3% of the most correlated transcripts genome-wide. We also assessed the empirical significance of our finding using genome-wide expression data in the two tissues to generate a null distribution of no association (empirical P value = 0.022).

Discussion

An important goal of systems biology is the identification of biological pathways and genetic networks underlying complex human diseases. We studied genome-wide gene expression in SAT and its genetically determined variation in families ascertained through sib-pairs discordant for obesity. The expression of about 30% of all genes was significantly altered in the obese state, confirming a broad effect of obesity on SAT gene expression 21. Linkage analysis identified a large number of significant eQTL, most of them localized in cis, and a lesser number of trans-acting signals perhaps due to reduced power of detection. Gene Ontology and pathway analyses of the cis-regulated genes demonstrated that they were enriched for genes involved in the insulin signalling pathway. The identification of genetic regulation of the insulin pathway is intriguing as it may indicate a role for SAT in glucose homeostasis and identify its contribution to the development of polygenic type 2 diabetes 49. Clear identification of genetic regulation of this pathway in SAT suggests that exploration of the regulated genes may give valuable insights into the fact that only a minority of obese subjects develop T2D, and those that do, typically have insulin resistance, metabolic syndrome and insulin secretion defects 50. No significant biological clustering was observed for the small number of trans-regulated genes identified in this study. A group of 374 transcripts suggested the presence of a significant hotspot for trans-regulation on chromosome 1p13.3-q23.3 and it may be of note that this overlaps the well-replicated T2D linkage locus of 1q21-q25 51. Significant enrichment was observed for genes in the apoptosis pathway but no obvious candidates could be identified in this very large region. While differential expression analysis can identify those genes and pathways with a causal or reactive role in obesity, genetic analysis can highlight which of them are under genetic control and therefore likely to be “functionally” transcribed in SAT cells 52. Therefore, whereas differential expression may be a result of the “obesogenic environment”, those biological pathways enriched for differentially expressed and genetically controlled genes are more likely to have a causative role in the development of obesity. The subset of cis-regulated transcripts which were also differentially expressed between lean and obese subjects suggested a possible role for genes belonging to the CAMs functional grouping. Contrasting CAMs co-expression networks between lean and obese subjects identified a subset of genes whose pattern of co-expression was significantly associated with the obese state. We found NEGR1 as the central highly-connected gene of this subset and replicated this observation using a mouse expression dataset, thus validating its central role for this pathway. In the context of disease, the topology of gene networks is often determined by key genes showing a high degree of connectivity. Indeed, highly connected genes are likely to encode essential genes 53 which are often evolutionarily conserved 54. Genes showing an intermediate number of connections have been shown to be more likely to harbour inherited mutations for common diseases 55. Whereas NEGR1 was the most connected gene in the contrasted network, it showed intermediate connectivity within each group-specific co-expression network, thus supporting its possible role as a disease gene. The GiANT consortium meta-analysis of obesity GWA studies reported that genetic markers near the NEGR1 gene are associated with obesity 7. The NEGR1 protein is a member of the immunoglobulin superfamily, is highly expressed in the hypothalamus 56 where it appears to modulate synapse number in neurons 57 and this makes it a good functional candidate for obesity 58, especially when considering obesity as a disorder having a neurobehavioral origin 59. Our findings demonstrate that NEGR1 is expressed in human SAT where it appears to be central to the network of the most differentially-expressed set of functionally-related genes between lean and obese subjects. Using publicly-available data 60 we observed high correlation in the expression levels of NEGR1 between human subcutaneous adipose and hypothalamus tissues. These results suggest a similar expression pattern for NEGR1 across tissues. Thus, transcriptional regulation of NEGR1 might not be restricted to neural development and might involve additional mechanisms shared by other tissues. In addition to NEGR1, other genes in the CAMs network have been previously shown to be over-expressed in SAT. In a study of BMI-discordant identical twins 61 the up-regulation of inflammatory and cytoskeleton pathways and down-regulation of energy metabolism and cell differentiation pathways was clearly demonstrated. Specifically, an over-expression of MHC Class II transcripts in obese subjects was reported and these are present in our CAMs relevance network. This further supports the utility of our approach and suggests that other genes within the identified obesity subset of CAMs genes might be good candidates for further investigation. In summary, we have identified a subset of genes that are both differentially-expressed between lean and obese subjects and are under cis-regulation, and so are very good candidates to investigate further for the presence of gene variants regulating their expression and thus contributing to obesity. We have applied a novel differential co-expression analysis strategy to identify NEGR1 as a gene central to the CAMs network in the obese state and confirmed this finding in a different species.
  58 in total

1.  Expression of the IgLON cell adhesion molecules Kilon and OBCAM in hypothalamic magnocellular neurons.

Authors:  S Miyata; N Funatsu; W Matsunaga; T Kiyohara; Y Sokawa; S Maekawa
Journal:  J Comp Neurol       Date:  2000-08-14       Impact factor: 3.215

2.  Genetics of gene expression and its effect on disease.

Authors:  Valur Emilsson; Gudmar Thorleifsson; Bin Zhang; Amy S Leonardson; Florian Zink; Jun Zhu; Sonia Carlson; Agnar Helgason; G Bragi Walters; Steinunn Gunnarsdottir; Magali Mouy; Valgerdur Steinthorsdottir; Gudrun H Eiriksdottir; Gyda Bjornsdottir; Inga Reynisdottir; Daniel Gudbjartsson; Anna Helgadottir; Aslaug Jonasdottir; Adalbjorg Jonasdottir; Unnur Styrkarsdottir; Solveig Gretarsdottir; Kristinn P Magnusson; Hreinn Stefansson; Ragnheidur Fossdal; Kristleifur Kristjansson; Hjortur G Gislason; Tryggvi Stefansson; Bjorn G Leifsson; Unnur Thorsteinsdottir; John R Lamb; Jeffrey R Gulcher; Marc L Reitman; Augustine Kong; Eric E Schadt; Kari Stefansson
Journal:  Nature       Date:  2008-03-16       Impact factor: 49.962

3.  A new highly penetrant form of obesity due to deletions on chromosome 16p11.2.

Authors:  R G Walters; S Jacquemont; A Valsesia; A J de Smith; D Martinet; J Andersson; M Falchi; F Chen; J Andrieux; S Lobbens; B Delobel; F Stutzmann; J S El-Sayed Moustafa; J-C Chèvre; C Lecoeur; V Vatin; S Bouquillon; J L Buxton; O Boute; M Holder-Espinasse; J-M Cuisset; M-P Lemaitre; A-E Ambresin; A Brioschi; M Gaillard; V Giusti; F Fellmann; A Ferrarini; N Hadjikhani; D Campion; A Guilmatre; A Goldenberg; N Calmels; J-L Mandel; C Le Caignec; A David; B Isidor; M-P Cordier; S Dupuis-Girod; A Labalme; D Sanlaville; M Béri-Dexheimer; P Jonveaux; B Leheup; K Ounap; E G Bochukova; E Henning; J Keogh; R J Ellis; K D Macdermot; M M van Haelst; C Vincent-Delorme; G Plessis; R Touraine; A Philippe; V Malan; M Mathieu-Dramard; J Chiesa; B Blaumeiser; R F Kooy; R Caiazzo; M Pigeyre; B Balkau; R Sladek; S Bergmann; V Mooser; D Waterworth; A Reymond; P Vollenweider; G Waeber; A Kurg; P Palta; T Esko; A Metspalu; M Nelis; P Elliott; A-L Hartikainen; M I McCarthy; L Peltonen; L Carlsson; P Jacobson; L Sjöström; N Huang; M E Hurles; S O'Rahilly; I S Farooqi; K Männik; M-R Jarvelin; F Pattou; D Meyre; A J Walley; L J M Coin; A I F Blakemore; P Froguel; J S Beckmann
Journal:  Nature       Date:  2010-02-04       Impact factor: 49.962

4.  Obesity-induced inflammatory changes in adipose tissue.

Authors:  Kathryn E Wellen; Gokhan S Hotamisligil
Journal:  J Clin Invest       Date:  2003-12       Impact factor: 14.808

5.  Inflammation is associated with a decrease of lipogenic factors in omental fat in women.

Authors:  Odile Poulain-Godefroy; Cécile Lecoeur; François Pattou; Gema Frühbeck; Philippe Froguel
Journal:  Am J Physiol Regul Integr Comp Physiol       Date:  2008-04-30       Impact factor: 3.619

Review 6.  Global burden of obesity in 2005 and projections to 2030.

Authors:  T Kelly; W Yang; C-S Chen; K Reynolds; J He
Journal:  Int J Obes (Lond)       Date:  2008-07-08       Impact factor: 5.095

7.  Meta-analysis of genome-wide linkage studies in BMI and obesity.

Authors:  Catherine L Saunders; Benedetta D Chiodini; Pak Sham; Cathryn M Lewis; Victor Abkevich; Adebowale A Adeyemo; Mariza de Andrade; Rector Arya; Gerald S Berenson; John Blangero; Michael Boehnke; Ingrid B Borecki; Yvon C Chagnon; Wei Chen; Anthony G Comuzzie; Hong-Wen Deng; Ravindranath Duggirala; Mary F Feitosa; Philippe Froguel; Robert L Hanson; Johannes Hebebrand; Patricia Huezo-Dias; Ahmed H Kissebah; Weidong Li; Amy Luke; Lisa J Martin; Matthew Nash; Miina Ohman; Lyle J Palmer; Leena Peltonen; Markus Perola; R Arlen Price; Susan Redline; Sathanur R Srinivasan; Michael P Stern; Steven Stone; Heather Stringham; Stephen Turner; Cisca Wijmenga; David A Collier
Journal:  Obesity (Silver Spring)       Date:  2007-09       Impact factor: 5.002

8.  Expression quantitative trait loci are highly sensitive to cellular differentiation state.

Authors:  Alice Gerrits; Yang Li; Bruno M Tesson; Leonid V Bystrykh; Ellen Weersing; Albertina Ausema; Bert Dontje; Xusheng Wang; Rainer Breitling; Ritsert C Jansen; Gerald de Haan
Journal:  PLoS Genet       Date:  2009-10-16       Impact factor: 5.917

9.  Genome wide association (GWA) study for early onset extreme obesity supports the role of fat mass and obesity associated gene (FTO) variants.

Authors:  Anke Hinney; Thuy Trang Nguyen; André Scherag; Susann Friedel; Günter Brönner; Timo Dirk Müller; Harald Grallert; Thomas Illig; H-Erich Wichmann; Winfried Rief; Helmut Schäfer; Johannes Hebebrand
Journal:  PLoS One       Date:  2007-12-26       Impact factor: 3.240

10.  Similarities and differences in genome-wide expression data of six organisms.

Authors:  Sven Bergmann; Jan Ihmels; Naama Barkai
Journal:  PLoS Biol       Date:  2003-12-15       Impact factor: 8.029

View more
  26 in total

1.  The discordant method: a novel approach for differential correlation.

Authors:  Charlotte Siska; Russell Bowler; Katerina Kechris
Journal:  Bioinformatics       Date:  2015-10-31       Impact factor: 6.937

2.  Novel association approach for variable number tandem repeats (VNTRs) identifies DOCK5 as a susceptibility gene for severe obesity.

Authors:  Julia S El-Sayed Moustafa; Hariklia Eleftherohorinou; Adam J de Smith; Johanna C Andersson-Assarsson; Alexessander Couto Alves; Eleni Hadjigeorgiou; Robin G Walters; Julian E Asher; Leonardo Bottolo; Jessica L Buxton; Rob Sladek; David Meyre; Christian Dina; Sophie Visvikis-Siest; Peter Jacobson; Lars Sjöström; Lena M S Carlsson; Andrew Walley; Mario Falchi; Philippe Froguel; Alexandra I F Blakemore; Lachlan J M Coin
Journal:  Hum Mol Genet       Date:  2012-05-16       Impact factor: 6.150

3.  Gene expression profile of subcutaneous adipose tissue in BMI-discordant monozygotic twin pairs unravels molecular and clinical changes associated with sub-types of obesity.

Authors:  M Muniandy; S Heinonen; H Yki-Järvinen; A Hakkarainen; J Lundbom; N Lundbom; J Kaprio; A Rissanen; M Ollikainen; K H Pietiläinen
Journal:  Int J Obes (Lond)       Date:  2017-04-25       Impact factor: 5.095

4.  New animal models reveal that coenzyme Q2 (Coq2) and placenta-specific 8 (Plac8) are candidate genes for the onset of type 2 diabetes associated with obesity in rats.

Authors:  Daiki Sasaki; Jun Kotoh; Risa Watadani; Kozo Matsumoto
Journal:  Mamm Genome       Date:  2015-08-22       Impact factor: 2.957

Review 5.  Human genetic variation and the gut microbiome in disease.

Authors:  Andrew Brantley Hall; Andrew C Tolonen; Ramnik J Xavier
Journal:  Nat Rev Genet       Date:  2017-08-21       Impact factor: 53.242

6.  Placental NEGR1 DNA methylation is associated with BMI and neurodevelopment in preschool-age children.

Authors:  E Breton; V Gagné-Ouellet; K Thibeault; R Guérin; Rj Van Lieshout; P Perron; Mf Hivert; L Bouchard
Journal:  Epigenetics       Date:  2019-09-23       Impact factor: 4.528

7.  Functional relevance of genes implicated by obesity genome-wide association study signals for human adipocyte biology.

Authors:  F Bernhard; K Landgraf; N Klöting; A Berthold; P Büttner; D Friebe; W Kiess; P Kovacs; M Blüher; A Körner
Journal:  Diabetologia       Date:  2012-11-16       Impact factor: 10.122

8.  Fusing Gene Interaction to Improve Disease Discrimination on Classification Analysis.

Authors:  Ji-Gang Zhang; Jian Li; Wenlong Tang; Hong-Wen Deng
Journal:  Adv Genet Eng       Date:  2012-02-09

9.  MetaDCN: meta-analysis framework for differential co-expression network detection with an application in breast cancer.

Authors:  Li Zhu; Ying Ding; Cho-Yi Chen; Lin Wang; Zhiguang Huo; SungHwan Kim; Christos Sotiriou; Steffi Oesterreich; George C Tseng
Journal:  Bioinformatics       Date:  2017-04-15       Impact factor: 6.937

10.  Functional inactivation of the genome-wide association study obesity gene neuronal growth regulator 1 in mice causes a body mass phenotype.

Authors:  Angela W S Lee; Heidi Hengstler; Kathrin Schwald; Mauricio Berriel-Diaz; Desirée Loreth; Matthias Kirsch; Oliver Kretz; Carola A Haas; Martin Hrabě de Angelis; Stephan Herzig; Thomas Brümmendorf; Martin Klingenspor; Fritz G Rathjen; Jan Rozman; George Nicholson; Roger D Cox; Michael K E Schäfer
Journal:  PLoS One       Date:  2012-07-23       Impact factor: 3.240

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.