| Literature DB >> 20473674 |
Mary E Winn1, Matthew A Zapala, Iiris Hovatta, Victoria B Risbrough, Elizabeth Lillie, Nicholas J Schork.
Abstract
The use of mouse blood as a model for human blood is often considered in the development of clinically relevant, gene expression-based disease biomarkers. However, the ability to derive biologically meaningful insights from microarray-based gene expression patterns in mouse whole blood, as in human whole blood, is hindered by high levels of globin mRNA. In order to characterize the effects of globin reduction on gene expression of peripheral mouse blood, we performed gene set enrichment analysis on genes identified as expressed in blood via microarray-based genome-wide transcriptome analysis. Depletion of globin mRNA enhanced the quality of microarray data as shown by improved gene expression detection and increased sensitivity. Compared to genes expressed in whole blood, genes detected as expressed in blood following globin reduction were enriched for low abundance transcripts implicated in many biological pathways, including development, g-protein signaling, and immune response. Broadly, globin reduction resulted in improved detection of expressed genes that serve as molecular binding proteins and enzymes in cellular metabolism, intracellular transport/localization, transcription, and translation, as well as genes that potentially could act as biomarkers for diseases such as schizophrenia. These significantly enriched pathways overlap considerably with those identified in globin-reduced human blood suggesting that globin-reduced mouse blood gene expression studies may be useful for identifying genes relevant to human disease. Overall, the results of this investigation provide a better understanding of the impact of reducing globin transcripts in mouse blood and highlight the potential of microarray-based, globin-reduced, mouse blood gene expression studies in biomarker development.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20473674 PMCID: PMC2890980 DOI: 10.1007/s00335-010-9261-y
Source DB: PubMed Journal: Mamm Genome ISSN: 0938-8990 Impact factor: 2.957
Fig. 1Boxplots of present calls in whole-blood RNA and globin-reduced blood RNA samples. The boxes represent the lower quartile through the upper quartile, while the whiskers extend to 1.5 times the interquartile range. Open circles denote outliers. A bold line denotes the median. WB whole-blood RNA, GR globin-reduced blood RNA
Distribution of probe sets detected in (1) globin-reduced samples only, (2) whole-blood samples only, or (3) both whole-blood and globin-reduced samples across all detectable probes (n = 5400)
| Percentile | Globin-reduced only (total = 3609) | Whole-blood only (total = 17) | Common (total = 1774) | |||
|---|---|---|---|---|---|---|
| Whole-blood RNA | Globin-reduced RNA | Whole-blood RNA | Globin-reduced RNA | Whole-blood RNA | Globin-reduced RNA | |
| <5% | 264 | 256 | 3 | 9 | 3 | 5 |
| 5–25% | 1063 | 1050 | 2 | 4 | 15 | 26 |
| 25–50% | 1269 | 1242 | 2 | 2 | 79 | 106 |
| 50-75% | 878 | 899 | 4 | 2 | 468 | 449 |
| 75–95% | 135 | 162 | 5 | 0 | 940 | 918 |
| >95% | 0 | 0 | 1 | 0 | 269 | 270 |
Detectable probes were ranked according to their average, normalized expression intensities in both whole-blood and globin-reduced RNA
Fig. 2Number of significantly enriched GeneGO pathways maps at a threshold of P ≤ 0.001 in mouse whole-blood RNA (gray) and globin-reduced RNA (black). The associated false discovery rate is less than 0.01 for a P value threshold less than or equal to 0.001
Top 25 statistically significant GeneGO disease categories in globin-reduced mouse blood RNA as compared to whole-blood RNA
| GeneGO disease (by Biomarker) | Globin-reduced RNA | Whole-blood RNA | McNemar test | ||
|---|---|---|---|---|---|
| Network objects |
| Network objects |
| ||
| Hemic and lymphatic diseases | 597/1927 | 1.02 × 10−11 | 251/1927 | 5.75 × 10−09 | <2.2 × 10−16 |
| Hematologic diseases | 424/1316 | 1.04 × 10−10 | 195/1316 | 1.21 × 10−11 | <2.2 × 10−16 |
| Neoplasms | 1571/5827 | 4.24 × 10−10 | 626/5827 | 3.45 × 10−09 | <2.2 × 10−16 |
| Anemia | 135/344 | 2.25 × 10−09 | 73/344 | 2.24 × 10−11 | 2.13 × 10−12 |
| Bone marrow diseases | 256/748 | 2.60 × 10−09 | 114/748 | 1.25 × 10−07 | <2.2 × 10−16 |
| Spherocytosis, hereditary | 13/13 | 1.50 × 10−08 | 9/13 | 3.16 × 10−07 | NS |
| Tay-Sachs disease | 11/11 | 2.40 × 10−07 | 8/11 | 8.45 × 10−07 | NS |
| Gangliosidoses | 12/13 | 5.99 × 10−07 | 8/13 | 5.53 × 10−06 | NS |
| Anemia, hemolytic, congenital | 66/153 | 6.44 × 10−07 | 40/153 | 1.81 × 10−09 | 9.44 × 10−07 |
| Myelodysplastic syndromes | 210/630 | 7.85 × 10−07 | 89/630 | 7.19 × 10−05 | <2.2 × 10−16 |
| Immunoproliferative disorders | 459/1538 | 1.34 × 10−06 | 182/1538 | 5.51 × 10−04 | <2.2 × 10−16 |
| Lymphoproliferative disorders | 458/1537 | 1.68 × 10−06 | 181/1537 | 7.34 × 10−04 | <2.2 × 10−16 |
| Neoplastic processes | 280/887 | 2.21 × 10−06 | 144/887 | 1.75 × 10−11 | <2.2 × 10−16 |
| Gangliosidoses GM2 | 11/12 | 2.22 × 10−06 | 8/12 | 2.32 × 10−06 | NS |
| Neoplasms by site | 1091/3996 | 3.36 × 10−06 | 430/3996 | 1.65 × 10−04 | <2.2 × 10−16 |
| Breast neoplasms | 539/1852 | 3.88 × 10−06 | 212/1852 | 1.09 × 10−03 | <2.2 × 10−16 |
| Anemia, hemolytic | 77/194 | 4.03 × 10−06 | 47/194 | 1.08 × 10−09 | 1.19 × 10−07 |
| Breast diseases | 539/1853 | 4.17 × 10−06 | 212/1853 | 1.12 × 10−03 | <2.2 × 10−16 |
| Lymphatic diseases | 464/1572 | 4.60 × 10−06 | 182/1572 | 1.63 × 10−03 | <2.2 × 10−16 |
| Leukemia, myeloid | 193/586 | 5.52 × 10−06 | 80/586 | 5.11 × 10−04 | <2.2 × 10−16 |
| Leukemia, nonlymphocytic, acute | 155/455 | 6.20 × 10−06 | 64/455 | 8.63 × 10−04 | <2.2 × 10−16 |
| Leukemia | 339/1115 | 8.07 × 10−06 | 128/1115 | 1.17 × 10−02 | <2.2 × 10−16 |
| Thalassemia | 29/55 | 9.49 × 10−06 | 18/55 | 1.71 × 10−06 | 2.57 × 10−03 |
| Neoplasms by histologic type | 1092/4027 | 1.59 × 10−05 | 442/4027 | 1.00 × 10−05 | <2.2 × 10−16 |
| RNA virus infections | 195/603 | 1.80 × 10−05 | 77/603 | 4.21 × 10−03 | <2.2 × 10−16 |
Network objects represent the proportion of disease-associated genes (biomarkers) identified per disease category, with the denominator representing the number of objects assayed by the Affymetrix Mouse 430 2.0 array
aGeneGO hypergeometric model
Nonhematologic-based diseases with a significantly improved proportion of gene expression levels detected in globin-reduced mouse blood RNA as compared to whole mouse blood RNA
| GeneGO disease (by biomarkers) | Whole-blood network objects | Globin-reduced network objects |
|
|---|---|---|---|
| Alzheimer’s disease | 87/721 | 194/721 | <2.20 × 10−16 |
| Amyotrophic lateral sclerosis | 19/167 | 54/167 | 9.08 × 10−9 |
| Cerebellar ataxia | 3/32 | 13/32 | 4.43 × 10−3 |
| Dementia | 97/829 | 226/829 | <2.20 × 10−16 |
| Down syndrome | 21/133 | 41/133 | 2.15 × 10−5 |
| Muscle hypertonia | 5/48 | 21/48 | 1.77 × 10−4 |
| Neurodegenerative diseases | 161/1289 | 354/1289 | <2.20 × 10−16 |
| Ophthalmoplegia | 1/34 | 12/34 | 2.57 × 10−3 |
| Parkinson’s disease | 38/283 | 64/283 | 9.44 × 10−7 |
| Schizophrenia | 106/752 | 208/752 | <2.20 × 10−16 |
All diseases are significantly enriched with a GeneGO hypergeometric P ≤ 0.05 in whole or globin-reduced blood RNA samples. Network objects represent the proportion of disease-associated genes identified per disease category, with the denominator representing the number of objects assayed by the Affymetrix Mouse 430 2.0 array
aMcNemar test comparing the proportion of network objects identified before and after globin reduction
Data set characteristics for human-mouse gene set enrichment analysis
| Study | Microarray | Globin reduction method | No. of globin-reduced samples | Probe sets present in ≥80% of samples | Significant GeneGO pathways ( |
|---|---|---|---|---|---|
| Mouse | Affymetrix Mouse 430 2.0 | GLOBINclear | 18 | 5428 | 267 |
| GSE2888 | Affymetrix HG U133A | Affymetrix protocol | 14 | 6048 | 279 |
| GSE16728 | Affymetrix HG U133 Plus 2.0 | GLOBINclear | 10 | 9363 | 296 |
Fig. 3Venn diagram comparing significant GeneGO pathway maps (P ≤ 0.001) enriched in globin-reduced mouse RNA and globin-reduced human (GSE2888 and GSE16728) RNA. The associated false discovery rate is approximately 0.01 for a P value threshold less than or equal to 0.001