| Literature DB >> 21609452 |
Hiroko K Solvang1, Ole Christian Lingjærde, Arnoldo Frigessi, Anne-Lise Børresen-Dale, Vessela N Kristensen.
Abstract
BACKGROUND: Elucidating the exact relationship between gene copy number and expression would enable identification of regulatory mechanisms of abnormal gene expression and biological pathways of regulation. Most current approaches either depend on linear correlation or on nonparametric tests of association that are insensitive to the exact shape of the relationship. Based on knowledge of enzyme kinetics and gene regulation, we would expect the functional shape of the relationship to be gene dependent and to be related to the gene regulatory mechanisms involved. Here, we propose a statistical approach to investigate and distinguish between linear and nonlinear dependences between DNA copy number alteration and mRNA expression.Entities:
Mesh:
Year: 2011 PMID: 21609452 PMCID: PMC3128865 DOI: 10.1186/1471-2105-12-197
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Figure 1In-cis correlations between CGH an mRNA. Number of significant correlated parts were in about 13 K loci (FDR 5%). Horizontal axis indicates chromosomes and vertical axis indicates Spearman correlation values (significant positive (red), negative (green) and uncorrelated (black)).
Figure 2Identified genomic locations by linear/quadratic models.
Figure 3Estimated predictions for NDRG1 and ERBB2 obtained by quadratic model. Scatter plots are on vertical axis for non-logarithmic mRNA data vs. horizontal axis for non-logarithmic PCF logR. Estimated predictions and the 95% confidential interval are represented by solid and dashed lines, respectively.
Figure 4Box-plots for the observations of NDRG1 and ERBB2 and predictions by linear and quadratic models. The plots are illustrated for six regions to indicate the degree of dispersion of each particular region. Solid and dash-dotted lines indicate the predications by linear and quadratic models, respectively.
Summary for pathway analyses
| models | Top Canonical Pathways | -log(B_H p-value) | Ratio |
|---|---|---|---|
| Linear | Estrogen receptor signaling | 6.95e-01 | 15/135 (0.111) |
| Pyrimidine Metabolism | 3.31e-01 | 15/151 (0.099) | |
| Mitochondrial Dysfunction | 3.31e-01 | 13/127 (0.102) | |
| Molecular Mechanisms of Cancer | 1.77e-01 | 25/359 (0.07) | |
| Oxidative Phosphorylation | 1.77-01 | 13/140 (0.093) | |
| Linear | Purine metabolism | 1.1e00 | 23/303 (0.076) |
| DNA Methylation and Transcriptional Repression Signaling | 5.24e-01 | 4/23 (0.174) | |
| Reelin Signaling in Neurons | 5.24e-01 | 8/82 (0.098) | |
| Pyrimidine Metabolism | 5.24e-01 | 12/151 (0.079) | |
| Toll-like Receptor Signaling | 5.24e-01 | 6/51 (0.118) | |
| Quad | P13K/AKT signaling | 1.93e00 | 9/132 (0.068) |
| Neuregulin signaling | 1.49e00 | 7/98 (0.071) | |
| PTEN signaling | 4.97e-01 | 6/120 (0.05) | |
| TR/RXR activation | 2.95e-01 | 5/93 (0.054) | |
| ERK5 Signaling | 2.74e-01 | 4/67 (0.06) | |
| Quad | Cell cycle regulation by BTG family proteins | 1.88e00 | 5/36 (0.139) |
| CTLA4 signaling in cytotoxic T lymphocytes | 1.83e00 | 7/98 (0.071) | |
| Ceramide signaling | 1.48e00 | 6/85 (0.071) | |
| Wnt/ß-catenin signaling | 6.97e-01 | 7/173 (0.04) | |
| IL-2 signaling | 6.97e-01 | 4/59 (0.068) | |
| models | Top Associated Network Functions | Score | |
| Linear | Embryonic Development, Tissue Development, DNA Replication, Recombination, and Repair | 41 | |
| Dermatological Diseases and Conditions, Genetic Disorder, Metabolic Disease | 35 | ||
| Cell Signaling, RNA Post-Transcriptional Modification, Gene Expression | 35 | ||
| Gene Expression, Cellular Compromise, Cardiovascular System Development and Function | 35 | ||
| Cellular Development, Carbohydrate Metabolism, Lipid Metabolism | 33 | ||
| Linear | Cellular assembly and organization, Cellular function and maintenance, DNA replication, Recombination, and Repair | 37 | |
| RNA Post-Transcriptional Modification, Gene Expression, Immunological Disease | 37 | ||
| Embryonic Development, Cell Morphology, Cellular Development | 37 | ||
| Gene Expression, Cell Cycle, Cellular Development | 33 | ||
| Cellular Movement, Nervous System Development and Function, Cellular Assembly and Organization | 27 | ||
| Quad | Molecular Transport, Small Molecule Biochemistry, Cell Cycle | 44 | |
| Gene Expression, Cancer, Gastrointestinal Disease | 41 | ||
| Cell Morphology, Cell-To-Cell Signaling and Interaction, Cellular Assembly and Organization | 26 | ||
| Gene Expression, Endocrine System Disorders, Genetic Disorder | 21 | ||
| Gene Expression, Connective Tissue Disorders, Immunological Disease | 19 | ||
| Quad | Cell Cycle, Carbohydrate Metabolism, Lipid Metabolism | 45 | |
| Organismal Injury and Abnormalities, Cellular Development, Cell Death | 23 | ||
| Post-Translational Modification, Protein Degradation, Protein Synthesis | 21 | ||
| Cellular Assembly and Organization, Nervous System Development and Function, Cell-To-Cell Signaling and Interact | 21 | ||
| Cell-To-Cell Signaling and Interaction, Hematological System Development and Function, Immune Cell Trafficking | 19 | ||
| models | Diseases and disorders | B-H p-value | # Molecules |
| Linear | Cancer | 2.24e-05 - 4.45e-01 | 120 |
| Reproductive system disease | 2.24e-05 - 4.39e-01 | 86 | |
| Development disorder | 6.55 e -02 - 4.39e-01 | 19 | |
| Inflammatory response | 3.77e-01 - 4.39e-01 | 5 | |
| Genetic disorder | 4.39e-01 - 4.39e-01 | 38 | |
| Linear | Cancer | 4,27e-06 - 2.4e-01 | 102 |
| Reproductive system disease | 4.27e-06 - 2.4e-01 | 75 | |
| Infectious disease | 6.9e-03 - 2.4e-01 | 58 | |
| Infection mechanism | 7.62e-03 - 2.4e-01 | 60 | |
| Dermatological Diseases and Conditions | 9.78e-02 - 2.4e-01 | 28 | |
| Quad | Cancer | 6.54e-03 - 1.68e-01 | 73 |
| Reproductive system disease | 6.54e-03 - 1.68e-01 | 34 | |
| Gastrointestinal disease | 1.57e-01 - 1.68e-01 | 9 | |
| Hepatic system disease | 1.57e-01 - 1.68e-01 | 8 | |
| Inflammatory disease | 1.57e-01 - 1.68e-01 | 12 | |
| Quad | Genetic disorder | 1.54e-01 - 2.11e-01 | 33 |
| Neurological disease | 1.54e-01 - 2.11e-01 | 24 | |
| Psychological disorders | 1.54e-01 - 2.11e-01 | 18 | |
| Cancer | 1.54e-01 - 2.11e-01 | 15 | |
| Metabolic Disease | 1.54e-01 - 2.11e-01 | 7 | |
| models | Molecular and Cellular Functions | B-H p-value | # Molecules |
| Linear | Gene expression | 5.23e-03 - 4.45e-01 | 133 |
| Cell death | 1.21e-01 - 4.45e-01 | 50 | |
| Cellular growth and proliferation | 1.21e-01 - 4.39e-01 | 33 | |
| Molecular transport | 3.77e-01 - 4.39e-01 | 52 | |
| Protein trafficking | 3.77e-01 - 4.39e-01 | 24 | |
| Linear | Cellular assembly and organization | 1.47e-05 - 2.4e-01 | 75 |
| Cellular function and maintenance | 1.47e-05 - 2.4e-01 | 44 | |
| Cellular development | 2.56e-02 - 2.4e-01 | 39 | |
| Cellular growth and proliferation | 2.56e-02 - 2.4e-01 | 15 | |
| DNA replication, recombination, and repair | 4.28e-02 - 2.4e-01 | 41 | |
| Quad | Cellular development | 1.57e-01 - 1.68e-01 | 20 |
| Cell death | 1.57e-01 - 1.68e-01 | 25 | |
| Cell cycle | 1.57e-01 - 1.68e-01 | 16 | |
| Cell morphology | 1.57e-01 - 1.68e-01 | 24 | |
| Gene expression | 1.57e-01 - 1.68e-01 | 23 | |
| Quad | Post-translation modification | 1.54e-01 - 2.11e-01 | 19 |
| Cell morphology | 1.54e-01 - 2.11e-01 | 13 | |
| Cellular development | 1.54e-01 - 2.11e-01 | 14 | |
| Carbohydrate metabolism | 1.54e-01 - 2.11e-01 | 9 | |
| Lipid metabolism | 1.54e-01 - 2.11e-01 | 16 | |
Canonical pathways, association network functions, disease and disorders and molecular and cellular functions were analyzed by IPA to assemble gene lists identified by linear/quadratic models for Amp and Del.
Figure 5Line maps of genomic locations for the significant DNA aberrations/mRNA expression dependences. The line maps describe genomic locations for the significant DNA aberrations/mRNA expression dependences found by linear (top) and quadratic (middle) for Amp (a) and Del (b). The specific regions for Amp (a) and Del (b), which we detected in Methods, are illustrated in the bottom panels.
Figure 6RT-PCR Validation. The left panel indicates ERBB2 data and quadratic predictions from array and the right panel indicates that the plots and the quadratic prediction curve of the data are given for the DNA and mRNA tumor expressed from the ERBB2 locus for TaqMan quantitative RT PCR analysis.
Figure 7Amp/Del genomic locations. The vertical axis indicates the percentages of the sample number with PCF filtered values > 0.15 (red) and PCF filtered values < -0.15 (green) divided by the number of samples. The horizontal line at 20% indicates the threshold we used in this analysis. The locations above the threshold line are defined as Amp (red, 12% of all) or Del (green, 11% of all) and were then investigated for linear and nonlinear relationships, respectively.