| Literature DB >> 25887114 |
E Andres Houseman1, Karl T Kelsey2, John K Wiencke3, Carmen J Marsit4.
Abstract
BACKGROUND: The impact of cell-composition effects in analysis of DNA methylation data is now widely appreciated. With the availability of a reference data set consisting of DNA methylation measurements on isolated cell types, it is possible to impute cell proportions and adjust for them, but there is increasing interest in methods that adjust for cell composition effects when reference sets are incomplete or unavailable.Entities:
Mesh:
Year: 2015 PMID: 25887114 PMCID: PMC4392865 DOI: 10.1186/s12859-015-0527-y
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Data sets analyzed
|
|
|
|
|
|
| |
|---|---|---|---|---|---|---|
| Ref. Blood | GSE39981 | 27K | [ | Human leukocyte subtypes purified from whole blood samples. | 73 | [ |
| GSE35069 | 450K | [ | Human leukocyte subtypes purified from whole blood samples. | 54 | [ | |
| Blood | GSE30229* | 27K | [ | Peripheral blood from 92 head and neck squamous cell carcinoma (HNSCC) patients and 92 controls. | 184 | [ |
| GSE19711 | 27K | [ | Whole blood from 131 ovarian cancer cases (drawn pre-treatment) and 274 controls. | 405 | [ | |
| GSE42861 | 450K | [ | Peripheral blood from 354 rheumatoid arthritis patients and 335 controls. | 689 | [ | |
| Breast Tum | GSE32393 | 27K | [ | Breast tumor samples: 91 invasive ductal, 13 invasive lobular, 10 mucinous or medullary; 76 were ER+. | 114 | [ |
| Gastric | GSE30601 | 27K | [ | 203 gastric tumors and 94 matched gastric non-malignant samples. | 297 | [ |
| Liver | GSE60753 | 450K | [ | 34 normal liver tissues, 21 cirrhotic tissues (due to alcoholism), 45 cirrhotic tissues [due to chronic hepatitis B (HBV) or C (HCV) viral infection]. | 100 | [ |
| Artery | GSE46394 | 450K | [ | 15 normal aortic tissues, 15 atherosclerotic lesions, 19 carotid atherosclerotic samples. | 49 | [ |
* For the HNSCC data, age is not available in the GEO submission GSE30229, but was obtained from the authors of the original study.
Figure 1Significance profiles for rheumatoid arthritis case vs. control coefficients. Varying the value of the dimension parameter k results in differing numbers of significant Δ and B coefficients, with stablilization of results for k ≥ 10. Similar plots for the other eight data sets are given in the Additional file 1.
Figure 2Gene-set p-values for KEGG pathways and Δ coefficients. The clustering heatmap shows gene set p-values depicted by color, with data set indicated in the row annotation track. Clustering was achieved by applying a Euclidean metric to – log10 p-values and using Ward’s linkage method. Note that the gene set tests were conducted as exact Mantel-Haenzel tests, stratified by CpG Island status (27K) and by Infinium biochemistry type, relation to CpG Island, and gene region (450K).
Figure 3Sequence of clustering dendrograms from 27K blood reference data set. The dendrograms display the results of clustering the non-intercept columns of the coefficient matrices Δ by applying Euclidean metric and Ward’s linkage method. Note that the intercept (reference) represented whole blood. The sequence shows that differing the values of k results in distinct levels of information with respect to cell lineage. Red dots indicate myeloid lineage (granulocyte and monocyte).
Gene sets analyzed
|
|
|
| |||
|---|---|---|---|---|---|
|
|
|
| |||
| Total CpGs included in analyses | 27578 | 26,486 | 319,264 | ||
| DMR | 500 previously published leukocyte DMRs [27K] [ | 500 | 500 | 417 | |
| PcG | Polycomb targets compiled from four sources [ | 3614 | 3523 | 41,942 | |
| Apoptosis | 04210 | Apoptosis | 157 | 149 | 1126 |
| Bcell | 04662 | B cell receptor signaling pathway | 138 | 136 | 1168 |
| CardiacM | 04260 | Cardiac muscle contraction | 121 | 116 | 967 |
| CellCycle | 04110 | Cell cycle | 250 | 245 | 2028 |
| CyCy | 04060 | Cytokine-cytokine receptor interaction | 437 | 426 | 2088 |
| DorsoV | 04320 | Dorso-ventral axis formation | 38 | 38 | 565 |
| ErbB | 04012 | ErbB signaling pathway | 154 | 148 | 1470 |
| Hemato | 04640 | Hematopoietic cell lineage | 153 | 152 | 779 |
| HepC | 05160 | Hepatitis C | 209 | 205 | 1536 |
| Hhog | 04340 | Hedgehog signaling pathway | 92 | 90 | 1090 |
| JakStat | 04630 | Jak-STAT signaling pathway | 254 | 250 | 1579 |
| MAPK | 04010 | MAPK signaling pathway | 469 | 457 | 4535 |
| mTOR | 04150 | mTOR signaling pathway | 92 | 88 | 1117 |
| NK | 04650 | Natural killer cell mediated cytotoxicity | 219 | 215 | 1530 |
| Notch | 04330 | Notch signaling pathway | 67 | 67 | 1097 |
| p53 | 04115 | p53 signaling pathway | 144 | 142 | 1034 |
| Tcell | 04660 | T cell receptor signaling pathway | 191 | 187 | 1568 |
| TGF | 04350 | TGF-beta signaling pathway | 154 | 154 | 1314 |
| VascSm | 04270 | Vascular smooth muscle contraction | 193 | 189 | 1903 |
| VEGF | 04370 | VEGF signaling pathway | 131 | 131 | 1104 |