| Literature DB >> 31822265 |
Elham Khodayari Moez1, Morteza Hajihosseini1, Jeffrey L Andrews2, Irina Dinu3.
Abstract
BACKGROUND: Although microarray studies have greatly contributed to recent genetic advances, lack of replication has been a continuing concern in this area. Complex study designs have the potential to address this concern, though they remain undervalued by investigators due to the lack of proper analysis methods. The primary challenge in the analysis of complex microarray study data is handling the correlation structure within data while also dealing with the combination of large number of genetic measurements and small number of subjects that are ubiquitous even in standard microarray studies. Motivated by the lack of available methods for analysis of repeatedly measured phenotypic or transcriptomic data, herein we develop a longitudinal linear combination test (LLCT).Entities:
Mesh:
Year: 2019 PMID: 31822265 PMCID: PMC6902471 DOI: 10.1186/s12859-019-3221-7
Source DB: PubMed Journal: BMC Bioinformatics ISSN: 1471-2105 Impact factor: 3.169
Fig. 1Calculation of the power of LLCT using simulated data generated with different within-geneest correlation (a), sample size (b), geneset size (c), number of repeated measurements (d) and within-subject correlation (e). Type I error is set at 5%. For each plot, the simulation variables except the one mentioned on the title varies but remains comparable among the curves
Fig. 2Comparison of the powers of LLCT method and the method of pathway analysis via regression (PAVR) proposed by Adewale et a1. using simulated data generated with different within-geneset correlation (a, b and c); different sample size (d and e); different geneset size (f and g and h) and different number of repeated measurements (h and i). Β1 denotes the gene effect and Β3 denotes the gene effect over time referring to eq. 7
Summary information (mean (standard deviation)) of covariates and outcomes at different time points: GAW19 application, studies of related and unrelated subjects
| Age | Antihypertensive Medication | Smoking Status | Systolic Blood Pressure (SBP) | Diastolic Blood Pressure (DBP) | Hypertension Status (HTN) | |
|---|---|---|---|---|---|---|
| Related Subjects | ||||||
| First visit | 39.58 (16.88) | 0.1 (0.3) | 0.23 (0.42) | 121.73 (18.98) | 71.48 (9.99) | 0.18 (0.39) |
| Second visit | 42.76 (15.93) | 0.19 (0.39) | 0.18 (0.39) | 124.96 (19.34) | 71.94 (10.01) | 0.28 (0.45) |
| Third visit | 46.34 (15.10) | 0.29 (0.45) | 0.2 (0.4) | 125.21 (18.04) | 70.73 (10.02) | 0.36 (0.48) |
| Forth visit | 50.88 (12.76) | 0.43 (0.5) | 0.11 (0.32) | 128.24 (17.63) | 77.76 (11.06) | 0.52 (0.5) |
| Unrelated Subjects | ||||||
| First visit | 53.84 (14.77) | 0.22 (0.42) | 0.25 (0.43) | 130.3 (23.36) | 72.96 (9.48) | 0.37 (0.48) |
| Second visit | 58.26 (12.30) | 0.36 (0.48) | 0.11 (0.32) | 135.01 (20.17) | 72.34 (10.09) | 0.59 (0.49) |
| Third visit | 59.52 (10.85) | 0.53 (0.50) | 0.17 (0.38) | 130.46 (19.24) | 69.14 (9.74) | 0.59 (0.49) |
| Forth visit | 62.16 (9.26) | 0.63 (0.49) | 0.06 (0.25) | 135.5 (23.44) | 77.06 (15.4) | 0.71 (0.46) |
The number of significant gene sets found by LLCT at different levels of confidence, testing a variety of outcomes and datasets
| Datasets | Type I Error | SBP | DBP | SBP& DBPa | SBP-DBPb | HTN |
|---|---|---|---|---|---|---|
| Adjusted for smoking status | ||||||
| Related Subjects | 1% | 30 | 23 | 20 | 73 | 65 |
| 5% | 170 | 135 | 141 | 360 | 321 | |
| 10% | 255 | 278 | 310 | 434 | 392 | |
| Unrelated Subjects | 1% | 12 | 3 | 5 | 27 | 5 |
| 5% | 136 | 39 | 60 | 389 | 82 | |
| 10% | 408 | 78 | 245 | 735 | 162 | |
| Adjusted for antihypertensive medications | ||||||
| Related Subjects | 1% | 98 | 13 | 63 | 127 | 12 |
| 5% | 402 | 127 | 271 | 541 | 99 | |
| 10% | 413 | 242 | 390 | 614 | 159 | |
| Unrelated Subjects | 1% | 17 | 3 | 11 | 17 | 2 |
| 5% | 142 | 60 | 86 | 116 | 22 | |
| 10% | 465 | 75 | 186 | 382 | 88 | |
| No Adjustment | ||||||
| Related Subjects | 1% | 18 | 17 | 14 | 43 | 54 |
| 5% | 158 | 141 | 122 | 259 | 327 | |
| 10% | 263 | 273 | 277 | 386 | 417 | |
| Unrelated Subjects | 1% | 9 | 2 | 3 | 17 | 2 |
| 5% | 234 | 37 | 70 | 273 | 71 | |
| 10% | 537 | 68 | 231 | 682 | 168 | |
aThe multiple analysis of systolic and diastolic blood pressure. In this analysis, the outcome is a linear combination of SBP and DBP with the highest association with the linear combinations of gene expressions
bPulse pressure which is the difference between systolic and diastolic blood pressures
The list of KEGG pathways significantly expressed over three healing states of hemostasis, inflammation and proliferation for different wounds of skin and tongue, and their corresponding p-values calculated by LLCT method
| Geneset size | Hemostasis (< 12 h) | Inflammation (12–72 h) | Proliferation (3–7 days) | Differentially expressed at unwounded status? | Prioritization Score | ||||
|---|---|---|---|---|---|---|---|---|---|
| q-value* | q-value* | q-value* | |||||||
| Metabolism | |||||||||
| Nucleotide metabolism | |||||||||
| Purine metabolism | 789 | < 0.01 | < 0.01 | 0.01 | 0.03 | < 0.01 | < 0.01 | Yes | 9.0% |
| Glycan biosynthesis and metabolism | |||||||||
| Mucin type O-glycan biosynthesis | 156 | 0.01 | 0.03 | <0.01 | <0.01 | 0.01 | 0.01 | Yes | 10.7% |
| Glycosaminoglycan degradation | 108 | 0.03 | 0.09 | 0.01 | 0.02 | <0.01 | < 0.01 | No | 9.6% |
| Genetic Information Processing | |||||||||
| Transcription | |||||||||
| Spliceosome | 984 | <0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | Yes | 0.4% |
| Replication and repair | |||||||||
| DNA replication | 237 | <0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | Yes | 0.4% |
| Mismatch repair | 135 | <0.01 | 0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 5.7% |
| Folding, sorting and degradation | |||||||||
| Protein processing in endoplasmic reticulum | 1164 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 0.4% |
| Environmental Information Processing | |||||||||
| Signal transduction | |||||||||
| MAPK signaling pathway | 2127 | 0.02 | 0.08 | 0.02 | 0.04 | <0.01 | <0.01 | No | 9.4% |
| Ras signaling pathway | 1566 | 0.02 | 0.07 | 0.01 | 0.02 | <0.01 | <0.01 | No | 8.5% |
| Calcium signaling pathway | 1206 | 0.01 | 0.03 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 3.1% |
| NF-kappa B signaling pathway | 702 | <0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | < 0.01 | Yes | 0.4% |
| HIF-1 signaling pathway | 804 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 0.4% |
| Hedgehog signaling pathway | 378 | <0.01 | < 0.01 | 0.01 | 0.02 | 0.01 | 0.01 | Yes | 8.9% |
| JAK-STAT signaling pathway | 972 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 0.4% |
| Signaling molecules and interaction | |||||||||
| Cytokine-cytokine receptor interaction | 1308 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 0.4% |
| Neuroactive ligand-receptor interaction | 1557 | <0.01 | 0.01 | <0.01 | <0.01 | <0.01 | <0.01 | No | 2.2% |
| Cellular Processes | |||||||||
| Transport and catabolism | |||||||||
| Autophagy - animal | 996 | <0.01 | 0.01 | 0.02 | 0.04 | <0.01 | <0.01 | No | 10.8% |
| Lysosome | 771 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 0.4% |
| Cell growth and death | |||||||||
| Apoptosis | 951 | 0.03 | 0.09 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 5.1% |
| Necroptosis | 849 | 0.01 | 0.03 | <0.01 | <0.01 | <0.01 | <0.01 | No | 3.5% |
| Cellular community - eukaryotes | |||||||||
| Adherens junction | 609 | 0.02 | 0.06 | 0.01 | 0.01 | 0.01 | 0.01 | Yes | 12.6% |
| Organismal Systems | |||||||||
| Immune system | |||||||||
| Toll-like receptor signaling pathway | 615 | 0.01 | 0.02 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 3.0% |
| RIG-I-like receptor signaling pathway | 426 | 0.02 | 0.06 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 3.8% |
| Cytosolic DNA-sensing pathway | 345 | 0.00 | 0.00 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 4.3% |
| Hematopoietic cell lineage | 492 | 0.01 | 0.04 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 3.6% |
| Natural killer cell mediated cytotoxicity | 720 | 0.02 | 0.06 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 6.7% |
| IL-17 signaling pathway | 552 | <0.01 | 0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 2.2% |
| B cell receptor signaling pathway | 540 | 0.03 | 0.09 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 5.0% |
| Fc epsilon RI signaling pathway | 444 | 0.02 | 0.07 | 0.01 | 0.03 | <0.01 | <0.01 | No | 8.9% |
| Fc gamma R-mediated phagocytosis | 696 | 0.04 | 0.11 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 5.4% |
| Leukocyte transendothelial migration | 753 | 0.03 | 0.08 | <0.01 | 0.01 | <0.01 | <0.01 | Yes | 8.4% |
| Circulatory system | |||||||||
| Adrenergic signaling in cardiomyocytes | 1092 | <0.01 | 0.01 | 0.01 | 0.01 | <0.01 | <0.01 | Yes | 10.5% |
| Development | |||||||||
| Axon guidance | 1389 | <0.01 | <0.01 | <0.01 | <0.01 | 0.02 | 0.03 | Yes | 5.4% |
| Nervous system | |||||||||
| Glutamatergic synapse | 777 | 0.02 | 0.07 | <0.01 | <0.01 | 0.01 | 0.01 | Yes | 8.6% |
| Cholinergic synapse | 831 | 0.03 | 0.08 | <0.01 | <0.01 | <0.01 | <0.01 | No | 7.8% |
| Dopaminergic synapse | 1041 | <0.01 | <0.01 | <0.01 | 0.01 | 0.01 | 0.02 | Yes | 0.4% |
| Endocrine system | |||||||||
| Oxytocin signaling pathway | 1125 | <0.01 | 0.01 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 5.1% |
| AGE-RAGE signaling pathway in diabetic complications | 717 | 0.01 | 0.03 | <0.01 | <0.01 | <0.01 | <0.01 | Yes | 3.1% |
*P-values adjusted for False Discovery Rate (FDR)
Fig. 3Time-course expression patterns of gene pathways. These gene sets were expressed non-differentially at unwounded state (time 0) but they are expressed differentially over all three stages of hemostasis (before 12 h after wounding), inflammatory (12–72 h after wounding) and proliferation (3–7 days after wounding)