| Literature DB >> 15128448 |
Dmitry N Grigoryev1, Shwu-Fan Ma, Rafael A Irizarry, Shui Qing Ye, John Quackenbush, Joe G N Garcia.
Abstract
Microarray-driven gene-expression profiles are generally produced and analyzed for a single specific experimental model. We have assessed an analytical approach that simultaneously evaluates multi-species experimental models within a particular biological condition using orthologous genes as linkers for the various Affymetrix microarray platforms on multi-species models of ventilator-associated lung injury. The results suggest that this approach may be a useful tool in the evaluation of biological processes of interest and selection of process-related candidate genes.Entities:
Mesh:
Year: 2004 PMID: 15128448 PMCID: PMC416470 DOI: 10.1186/gb-2004-5-5-r34
Source DB: PubMed Journal: Genome Biol ISSN: 1474-7596 Impact factor: 13.583
Relationship of EGO orthologs between selected Affymetrix GeneChips
| GeneChip | Reference genes on U133A | Orthologs for reference genes | Reference gene ortholog pairs |
| U95A | 11,437 | 9,677 | 15,545 |
| U74A | 7,665 | 6,013 | 10,346 |
| U34A | 4,872 | 4,630 | 7,786 |
The number of orthologs linked to the U133A arrays is lower than the number of reference genes because of the same ortholog being shared by different reference genes. The number of reference gene ortholog pairs is always higher than the number of reference genes itself; this is attributed to the multiple orthologs for the common reference genes (see Figure 4).
Figure 1Overlaps between rat (U34A GeneChip), mouse (U74A GeneChip) and human (U95A GeneChip) Affymetrix array platforms based on the human (U133A GeneChip) ortholog assignments. The sum of numbers inside each circle represents the total number of ortholog pairs formed with reference genes on the U133A GeneChip by corresponding arrays (see also Table 1). The reference genes formed 3,077 pairs with corresponding orthologs that were represented on all depicted arrays.
Figure 2Schema of the centric approach in ortholog-linked database building and putative ortholog detection. (a) An example of putative ortholog creation for the ornithine decarboxylase 1 (ODC-1) gene. U74A and U34A probe IDs were EGO orthologs (solid line) for the U133A and U95A ODC-1 gene but were not directly linked (dashed line) either in EGO or in the Affymetrix ortholog table. (b) The reference genes common to all arrays (see Figure 1) and their corresponding orthologs for U95A-U74A, U95A-U34A, and U74A-U34A pairs were permutated and all possible combinations counted (dashed lines). EGO combinations were retrieved from RESOURCERER-generated tables for these paired arrays and counted (solid lines). The difference in the predicted and existing pairs represents the number of putative orthologs to be created, based on homology to the common reference gene.
Figure 3Experimental data used for populating the ortholog-link database. (a) Using Affymetrix MAS 5.0 software, absent (black), marginal (white) and present (gray) transcript-abundance calls were counted for each experimental dataset and the values obtained expressed as a percentage of all calls. (b) By masking poorly performing probes for U95A, U74A and U34A, the present call ratio for these GeneChips was increased by 25%. As dog mRNA was hybridized to the human U133A chip, the present call ratio for this hetero-hybridization was much lower than that in other experiments. We therefore corrected U133A probe sets for differences in gene sequence between human and canine, which increased the present call ratio by more than 50%.
Figure 4Overall distribution of orthologs among reference genes. Most of the reference genes (1,088) had only one ortholog on each of the U95A, U74A and U34A arrays used in these studies. The first bar shown here represents the number of reference genes that had three orthologs. The majority of remaining reference genes had two orthologs on one of the studied arrays. Overall, about 62% of reference genes had at least one multiple ortholog set.
Figure 5Distribution of co-regulated and inversely regulated biological bioprocesses identified by linkage to GO. (a) Genes involved in a co-regulated bioprocess (inflammatory response; GO 6954) and (b) an inversely regulated bioprocess (DNA-dependent regulation of transcription; GO 6355). Solid areas under the curve represent upregulated genes and gray areas under the curve represent downregulated genes. (c) A summary of all co-regulated (top curve) and inversely regulated (bottom curve) GO bioprocesses identified by MAPPFinder corresponding to the increment in the fold-change cutoff.
Genes showing significant changes in expression throughout all biological systems tested
| Gene symbol | Fold change | PubMatrix terms | |||||
| Lung | Lung injury | Mechanical ventilation | Endothelium | Pulmonary endothelium | |||
| IL-1B* | 1.53 | 0.023 | 1536 | 287 | 3 | 1759 | 70 |
| IL-6 | 1.84 | 0.015 | 963 | 173 | 75 | 527 | 30 |
| F3 | 1.52 | 0.017 | 411 | 54 | 4 | 713 | 18 |
| PAI-1* | 1.47 | 0.003 | 201 | 31 | 3 | 7367 | 22 |
| COX2* | 1.79 | 0.011 | 257 | 28 | 1 | 206 | 9 |
| IL-13 | 1.30 | 0.013 | 327 | 21 | 0 | 62 | 2 |
| AQP-1 | -1.30 | 0.038 | 49 | 9 | 0 | 35 | 6 |
| PLAUR* | 1.47 | 0.033 | 83 | 8 | 0 | 113 | 5 |
| FGA* | 1.30 | 0.024 | 22 | 4 | 10 | 189 | 12 |
| C/EBP | 1.40 | 0.013 | 27 | 3 | 0 | 9 | 2 |
| CCL2 | 2.00 | 0.024 | 11 | 1 | 0 | 9 | 2 |
| ADMR | -1.35 | 0.022 | 19 | 1 | 0 | 16 | 2 |
| CXCR4 | 1.62 | 0.007 | 26 | 0 | 0 | 79 | 4 |
| GJA-1 | 1.33 | 0.026 | 7 | 0 | 0 | 30 | 1 |
| IL1R2* | 1.88 | 0.026 | 11 | 0 | 0 | 7 | 0 |
| GADD45A | 1.71 | 0.004 | 21 | 0 | 0 | 4 | 0 |
| BTG-1* | 1.38 | 0.007 | 2 | 0 | 0 | 1 | 0 |
| TFF-2* | -1.32 | 0.045 | 9 | 0 | 0 | 1 | 0 |
ADMR, adrenomedullin receptor; AQP-1, aquaporin 1; BTG-1, B-cell translocation gene 1; CCL2, cell chemokine 2; C/EBP, CCAAT/enhancer-binding protein; COX2, prostaglandin G/H synthase and cyclooxygenase 2; CXCR4, chemokine (C-X-C motif) receptor 4; F3, coagulation factor III (thromboplastin, tissue factor); FGA, fibrinogen alpha; GADD45A, growth arrest and DNA-damage-inducible, alpha; GJA-1, gap junction protein, alpha 1 (connexin 43); IL-1B, interleukin 1 beta; IL1R2, interleukin 1 receptor, type II; IL-6, interleukin 6; IL-13, interleukin 13; PAI-1 - plasminogen activator inhibitor type 1; PLAUR, plasminogen activator, urokinase receptor; TFF-2, trefoil factor 2 (spasmolytic protein 1). *Not detected by common single-experiment analysis.
MAPPFinder results for significantly upregulated genes throughout all species tested
| GO ID | GO name | Number of genes with FC >1.3 | Number of measured genes | Number of genes in GO | Percent changed genes | Percent present genes | z-score |
| 6955 | Immune response | 17 | 91 | 559 | 18.7 | 16.3 | 6.617 |
| 6954 | Inflammatory response | 13 | 55 | 161 | 23.6 | 34.2 | 6.076 |
| 8285 | Negative regulation of cell proliferation | 6 | 33 | 120 | 18.2 | 27.5 | 5.484 |
| 7050 | Cell cycle arrest | 4 | 12 | 53 | 33.3 | 22.6 | 5.001 |
| 7596 | Blood coagulation | 4 | 25 | 68 | 16.0 | 36.8 | 4.091 |
| 6960 | Antimicrobial humoral response | 4 | 25 | 92 | 16.0 | 27.2 | 4.091 |
| 6917 | Induction of apoptosis | 3 | 17 | 60 | 17.6 | 28.3 | 4.091 |
| 7267 | Cell-cell signaling | 12 | 98 | 284 | 12.2 | 34.5 | 3.303 |
| 6935 | Chemotaxis | 5 | 40 | 97 | 12.5 | 41.2 | 3.255 |
Comparison of candidate gene list generated by multi-species cross-platform analysis with that obtained using a single-experiment analysis
| U133 | U95 | U74 | U34 | |||||
| Gene ID | FC | pV | FC | pV | FC | pV | FC | pV |
| TCF21 | x | x | X | x | x | |||
| ADPRH | x | x | X | X | ||||
| AREG | x | x | X | X | ||||
| S100A9 | x | x | X | X | ||||
| EST | x | x | X | X | ||||
| EST | x | x | X | |||||
| GAPD | x | x | X | |||||
| LGALS3 | x | x | X | |||||
| YWHAZ | x | x | x | X | ||||
| KCNJ6 | x | x | X | X | ||||
| BHLHB2 | x | x | X | |||||
| CDKN1A | X | x | x | |||||
| PRKAR2A | X | x | x | |||||
| GABRD | x | x | X | |||||
| TNFSF6 | x | x | X | |||||
| IFRD1 | x | x | X | |||||
| GCH1 | x | x | X | |||||
| NPY1R | x | x | ||||||
| EST | X | X | X | x | x | |||
| SLC2A3 | X | X | x | x | ||||
| DNAJA1 | X | X | x | x | ||||
| BAIAP3 | X | X | x | x | ||||
| PSMB5 | X | x | x | |||||
| GBP2 | X | x | x | |||||
| CYCS | X | x | x | |||||
| CD2 | X | x | x | |||||
| EIF2S1 | X | x | x | |||||
| CD79B | X | x | x | |||||
| ENSA | X | x | x | |||||
| LAT | x | x | ||||||
| RPLP2 | X | X | X | |||||
| ARPC4 | X | X | X | |||||
| NOS1 | X | X | X | |||||
| EGFR | X | X | X | |||||
| P4HA1 | X | X | X | |||||
| ATF3 | X | X | X | |||||
| RAB5A | X | X | X | |||||
| HSPA8 | X | X | ||||||
| CEBPB | X | X | ||||||
| ARG2 | X | X | ||||||
| MYBPH | X | X | ||||||
| MST1 | ||||||||
| ACADL | X | X | ||||||
| PLCG2 | X | X | ||||||
| PLAA | X | X | ||||||
| PDHB | X | X | ||||||
| BTG2 | X | X | ||||||
| AIF1 | X | X | ||||||
| XCL1 | X | X | ||||||
| GCLC | X | |||||||
| SP-C | x | x | ||||||
| ODC-1 | X | X | x | x | ||||
| SP-B | X | |||||||
| ACE | X | |||||||
| SP-D | NA | NA | ||||||
| TGFA | NA | NA | ||||||
| IL-8 | NA | NA | NA | NA | ||||
| TNFA | NA | NA | NA | NA | ||||
FC, fold change in gene expression; pV, p-value produced by variance-independent double-tailed t-test of mechanical stretch vs control. X or x denotes changes in gene expression greater than 1.3-fold or p < 0.05; lower-case x represent genes that satisfied both filtering conditions. NA (not available) represents genes that were not present on the array. Rows in bold depict genes presented in Table 2.
Data sets and analytical tools sources
| Affymetrix gene arrays | Rat. RG_U34A profiles | |
| Mouse. MG_U74A profiles | ||
| Dog. HG_U133A profiles | ||
| HPAEC. HG-U95Av2 profiles | ||
| Software and tools | RESOURCERER | |
| NetAffx | ||
| GenMAPP and MAPPFinder | ||
| PubMatrix | ||
| Python 2.2 | ||
| Stats Module for Python |