| Literature DB >> 21299892 |
Clinton C Mason1, Robert L Hanson, Vicky Ossowski, Li Bian, Leslie J Baier, Jonathan Krakoff, Clifton Bogardus.
Abstract
BACKGROUND: Many human diseases and phenotypes are related to RNA expression, levels of which are influenced by a wide spectrum of genetic and exposure-related factors. In a large genome-wide study of muscle tissue expression, we found that some genes exhibited a bimodal distribution of RNA expression, in contrast to what is usually assumed in studies of a single healthy tissue. As bimodality has classically been considered a hallmark of genetic control, we assessed the genome-wide prevalence, cause, and association of this phenomenon with diabetes-related phenotypes in skeletal muscle tissue from 225 healthy Pima Indians using exon array expression chips.Entities:
Mesh:
Substances:
Year: 2011 PMID: 21299892 PMCID: PMC3044673 DOI: 10.1186/1471-2164-12-98
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Characteristics of 225 individuals with a baseline muscle biopsy.
| Characteristic | Mean | Min | Max | N |
|---|---|---|---|---|
| 29.6 | 18.1 | 49.7 | 225 (152 M, 73 F) | |
| 33.4 | 19.1 | 55.0 | 225 (152 M, 73 F) | |
| 31.8 | 9.4 | 47.3 | 223 (150 M, 73 F) | |
| 0.435 | 0.178 | 0.952 | 182 (123 M, 59 F) | |
| 288.9 | 89.9 | 864.3 | 79 (59 M, 20 F) | |
a EMBS = estimated metabolic body size
b in full heritage, normal glucose tolerant individuals with covariates
Figure 1Identification of genes with bimodal expression. Muscle samples from 225 individuals were processed in groups that could be identified for distinct batch effects influencing the expression levels. Bimodality was assessed in groups of microarrays that were homogeneous for such bias. A variety of constraints were applied to ensure bimodal genes identified were genuine, including replication. Also illustrated is how bimodality may be artifactually (incorrectly) inferred due to heterogeneity of batch effects on expression levels.
Number (prevalence) of bimodal genes as determined by compounding model selection criteria in the two different groups of microarrays.
| (17,881 genes analyzed) | ||
|---|---|---|
| Group A | Group B | |
| 430 (2.40%) | 40 (0.224%) | |
| 408 (2.28%) | 36 (0.201%) | |
| 33 (0.185%) | 33 (0.185%) | |
| 19 (0.106%) | 21 (0.117%) | |
Columns represent the two groups of microarrays from which the bimodal genes were determined: Group A - 71 microarray chips all from the same lot, time span, and fluidics station; Group B - 47 microarray chips from two consecutive lots and from the same time span and fluidics station. The final row represents the total number of interesting bimodal genes for muscle expression in this population as estimated by the various chip groupings. A total of 17,881 genes were analyzed for potential bimodality.
a p-value from a likelihood ratio test with 6 d.f.
b calculated as proportion of misclassified area to unity
c points in each component tallied as number to the left and right of the two components' intersection
d from assessment in Group A
Summary of 32 genes (not associated with gender) which met group criteria for bimodality.
| Transcript number | Chromosome | Gene symbol | Group A | Group B | All chips | p-value | Meets |
|---|---|---|---|---|---|---|---|
| 3618333 | 15 | 1.16E-06 | 1.35E-14 | 0.995 | A,All | ||
| 3103494 | 8 | 1.76E-06 | 0.382 | A | |||
| 2418570 | 1 | 3.97E-06 | 4.17E-19 | 0.954 | A,All | ||
| 3822074 | 19 | 2.26E-04 | 0.137 | A | |||
| 3682182 | 16 | 2.36E-04 | 1.86E-04 | 0.953 | A,All | ||
| 3590275 | 15 | 4.42E-04 | 0.724 | A | |||
| 3680953 | 16 | 5.34E-04 | 9.34E-08 | 0.594 | A,All | ||
| 3393257 | 11 | 5.67E-04 | 0.547 | A | |||
| 3963990 | 22 | 8.75E-04 | 0.403 | A | |||
| 3404660 | 12 | 8.85E-04 | 0.417 | A | |||
| 3737274 | 17 | 9.33E-04 | 0.106 | A | |||
| 3824153 | 19 | 2.39E-06 | 0.367 | B | |||
| 3471427 | 12 | 5.88E-06 | 0.144 | B | |||
| 3867223 | 19 | 2.80E-05 | 0.841 | B | |||
| 2948887 | 6 | 5.15E-05 | 9.33E-04 | 0.929 | B,All | ||
| 3146433 | 8 | 1.52E-04 | 1.90E-16 | 0.103 | B,All | ||
| 3402697 | 12 | 1.53E-04 | 0.773 | B | |||
| 3845782 | 19 | 1.69E-04 | 0.090 | B | |||
| 2324616 | 1 | 1.87E-04 | 1.63E-05 | 0.062 | B,All | ||
| 2431031 | 1 | 2.32E-04 | 3.59E-04 | 0.202 | B,All | ||
| 3373392 | 11 | 4.63E-04 | 0.581 | B | |||
| 2344888 | 1 | 5.60E-04 | 0.611 | B | |||
| 3851055 | 19 | 5.98E-04 | 0.699 | B | |||
| 2383726 | 1 | 9.45E-04 | 0.078 | B | |||
P-value for bimodality is shown for each group of microarray chips when all criteria for bimodality were met. Blank cells indicate that at least one bimodal criterion for the given gene and group was not met. Genes identified as bimodal in all groups are shown in bold.
Figure 2Histograms of the eight confirmed bimodal genes. Shown is the distribution of RNA expression values for the entire data set of n = 225. The 8 genes shown above correspond to: glutathione S-transferase mu 1 (GSTM1), major histocompatibility complex, class II, DR beta 1 (HLA-DRB1), endoplasmic reticulum aminopeptidase 2 (ERAP2), major histocompatibility complex, class II, DR beta 5 (HLA-DRB5), monoamine oxidase A (MAOA), actinin, alpha 3 (ACTN3), nuclear receptor subfamily 4, group A, member 2 (NR4A2), and threonine synthase-like 2 (THNSL2).
Figure 3Two-way Manhattan plot of significance for bimodality in both of the independent groups. Shown are the -log10 p-values for bimodality for each of the 17,881 genes from trimmed Group A (blue circles, n = 65) and from trimmed Group B (red circles, n = 43). Genes with a combined Fisher's FDR < 0.05 are labeled (see Additional files 3 and 4).
Differences (p-value) in diabetes related traits in the 8 confirmed bimodal genes for those in the high compared with those in the low RNA expression mode.
| Gene | ||||
|---|---|---|---|---|
| -0.408 (p = 0.67) | -0.88 (p = 0.28) | 0.015 (p = 0.46) | -1.73 (p = 0.96) | |
| 2.04 (p = 0.09) | 1.15 (p = 0.32) | -0.0033 (p = 0.89) | 31.6 (p = 0.49) | |
| -0.122 (p = 0.9) | 0.472 (p = 0.58) | -0.0033 (p = 0.86) | 54.4 (p = 0.11) | |
| 2.19 (p = 0.072) | 1.22 (p = 0.29) | -0.0095 (p = 0.69) | 31.6 (p = 0.49) | |
| 0.949 (p = 0.29) | -0.0001 (p = 1.0) | -44.1 (p = 0.17) | ||
| 0.71 (p = 0.46) | 0.284 (p = 0.72) | 53.5 (p = 0.14) | ||
| -0.576 (p = 0.57) | -0.836 (p = 0.36) | -0.010 (p = 0.63) | -2.05 (p = 0.96) | |
| 0.695 (p = 0.47) | 0.671 (p = 0.48) | 0.0062 (p = 0.81) | 2.23 (p = 0.97) | |
The number of individuals measured for each phenotype varied (n).
a adjusted for age, gender, sibship, and heritage; BMI = body mass index (n = 225), %fat = percent body fat (n = 223)
b adjusted for age, gender, sibship, heritage, and % body fat; Mlow = insulin sensitivity (n = 182)
c adjusted for age, gender, sibship, heritage, % body fat, and Mlow; AIR = acute insulin response (n = 79 normal glucose tolerant individuals with full heritage and covariates)
Associations of 32 bimodal genes with cis-determining SNPs and reported CNV.
| Bimodality | Gene | SNP association | Kappa | # of SNPs | |
|---|---|---|---|---|---|
| 0.04 | 0.17 | 86 | No | ||
| Yes | 0.26 | 0.00 | 36 | No | |
| 0.22 | 0.09 | 13 | No | ||
| 0.035 | 0.12 | 12 | No | ||
| 0.0037 | 0.17 | 79 | No | ||
| 0.023 | 0.02 | 24 | Yes | ||
| 0.084 | 0.01 | 61 | No | ||
| 0.088 | 0.09 | 63 | Yes | ||
| 0.022 | 0.01 | 41 | Yes | ||
| 0.061 | 0.00 | 52 | No | ||
| 0.17 | 0.10 | 19 | Yes | ||
| 0.000097 | 0.15 | 36 | No | ||
| 0.048 | 0.04 | 47 | No | ||
| 0.048 | 0.14 | 22 | Yes | ||
| 0.0026 | 0.25 | 76 | Yes | ||
| 0.049 | 0.23 | 49 | No | ||
| 0.012 | 0.11 | 100 | No | ||
| 0.019 | 0.04 | 32 | No | ||
| 0.065 | 0.12 | 55 | Yes | ||
a Highest SNP association shown. All other significant SNPs are listed in Additional file 5. Apparent cis-acting eQTL are identified in bold.
b Copy number variation (CNV) as identified in the UCSC Genome Browser.
Further validation of genes having bimodal expression (FDR < 0.05) in Pima Indians with bimodal expression of those genes in other populations.
| Gene symbol | Pima | GSE1485 | GSE13070 | GSE5086 | GSE1485 | GSE13070 | GSE5086 | |
|---|---|---|---|---|---|---|---|---|
| Genes with no Association of Expression and Gender | ||||||||
| 1 | 3.54E-44 | 0.042 | 0.532 | 0.405 | 204550_x_at | 215333_x_at | 215333_x_at | |
| 5 | 8.76E-18 | 0.038 | 0.00768 | 219759_at | 227462_at | 227462_at | ||
| 6 | 8.33E-15 | 0.859 | 0.515 | 204670_x_at | 208306_x_at | 208306_x_at | ||
| 11 | 3.41E-10 | - | 206891_at | 206891_at | ||||
| X | 3.97E-10 | - | 204388_s_at | 204389_at | ||||
| 6 | 1.20E-09 | - | - | - | ||||
| 2 | 3.21E-08 | - | 0.468 | 216248_s_at | 204622_x_at | |||
| 6 | 2.02E-07 | 0.471 | 0.607 | 216526_x_at | 216526_x_at | 216526_x_at | ||
| 1 | 1.76E-06 | - | 0.324 | 0.043 | 235763_at | 235763_at | ||
| 9 | 6.63E-06 | - | 0.690 | 0.347 | 207978_s_at | 207978_s_at | ||
| 1 | 1.13E-05 | - | 0.987 | 0.204 | 219865_at | 219865_at | ||
| 2 | 1.42E-05 | - | 0.022 | 0.315 | 219044_at | 239949_at | ||
| 16 | 3.73E-05 | - | 0.721 | 0.045 | 208480_s_at | 208480_s_at | ||
| 19 | 5.60E-05 | - | 0.230 | 0.094 | 221711_s_at | 221711_s_at | ||
| Genes with Association of Expression and Gender | ||||||||
| Y | 7.86E-54 | 205000_at | 205000_at | 205000_at | ||||
| Y | 3.91E-49 | 204409_s_at | 204409_s_at | 204409_s_at | ||||
| Y | 7.50E-45 | 201909_at | 201909_at | 201909_at | ||||
| Y | 1.15E-42 | 0.00269 | 211149_at | 211149_at | 211149_at | |||
| Y | 1.46E-32 | 206624_at | 228492_at | 228492_at | ||||
| Y | 6.01E-22 | - | 230760_at | 230760_at | ||||
| Y | 9.51E-21 | - | 0.0324 | 0.00508 | 214131_at | 223646_s_at | ||
| 7 | 1.94E-14 | - | 0.111 | AFFX-HSAC07/X0 | AFFX-HSAC07/X0 | |||
| Y | 1.48E-11 | - | 0.194 | 0.798 | 224293_at | 224293_at | ||
| Y | 9.99E-08 | 0.998 | 0.0101 | 208650_s_at | 216379_x_at | 266_s_at | ||
| 14 | 1.23E-06 | - | 0.910 | 203946_s_at | 203945_at | |||
| 3 | 2.10E-06 | - | 0.0353 | 0.961 | 229764_at | 229764_at | ||
a Present study in skeletal muscle tissue of non-diabetic Pima Indians. P-value from Fisher's meta-analysis of trimmed Group A (n = 65) and trimmed Group B (n = 43) for bimodality of RNA expression; shown are all genes having an FDR < 0.05. See additional files 3 and 4.
b Bimodality of RNA expression assessed in lymphoblastoid cells from CEPH Utah residents (n = 193). Not all genes were represented on the chip or had excessive intra-individual variation (-). ID shown is affy_id. P-values < 0.001 from a likelihood ratio test with 6 d.f. shown in bold. GSE1485 expression data provided to GEO as part of unrelated study [28].
c Bimodality of RNA expression assessed in abdominal muscle from patients undergoing gastrointestinal surgery (n = 59). Not all genes were represented on the chip (-). ID shown is affy_id. P-values < 0.001 from a likelihood ratio test with 6 d.f. shown in bold. GSE13070 expression data provided to GEO as part of unrelated study [14].
d Bimodality of RNA expression assessed in skeletal muscle from subjects with a broad range of insulin sensitivity/resistance (n = 62). Not all genes were represented on the chip (-), ID shown is affy_id. P-values < 0.001 from a likelihood ratio test with 6 d.f. shown in bold. GSE5086 expression data provided to GEO as part of unrelated study [15].