| Literature DB >> 12885301 |
Lukasz Huminiecki1, Andrew T Lloyd, Kenneth H Wolfe.
Abstract
BACKGROUND: Extracting biological knowledge from large amounts of gene expression information deposited in public databases is a major challenge of the postgenomic era. Additional insights may be derived by data integration and cross-platform comparisons of expression profiles. However, database meta-analysis is complicated by differences in experimental technologies, data post-processing, database formats, and inconsistent gene and sample annotation.Entities:
Mesh:
Year: 2003 PMID: 12885301 PMCID: PMC183867 DOI: 10.1186/1471-2164-4-31
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
r2 values for Pearson correlations between PEM scores computed from Gene Expression Atlas (GEA), SAGEmap and TissueInfo expression profiles for six human tissue categories, for which data were available in the three databases.
| Tissue | TissueInfo vs. SAGEmap | GEA vs. SAGEmap | GEA vs. TissueInfo | ||||||||||
| Genesa | Pb | Signsc | Genesa | Pb | Signsc | Genesa | Pb | Signsc | |||||
| Brain | 346 | 0.59 | <0.01 | 92% | 479 | 0.49 | <0.01 | 84% | 1431 | 0.43 | <0.01 | 77% | |
| Prostate | 58 | 0.36 | <0.01 | 76% | 237 | 0.09 | <0.01 | 59% | 266 | 0.19 | <0.01 | 63% | |
| Vascular endothelium | 13 | 0.29 | 0.6 | 92% | 150 | 0.37 | <0.01 | 73% | 23 | 0.05 | 0.29 | 39% | |
| Ovary | 10 | 0.11 | 0.35 | 30% | 175 | 0.03 | 0.2 | 49% | 36 | 0.03 | 0.28 | 69% | |
| Kidney | 103 | 0.01 | 0.36 | 34% | 140 | 0.00 | 0.78 | 50% | 508 | 0.11 | <0.01 | 55% | |
| Pancreas | 21 | 0.02 | 0.55 | 62% | 141 | 0.00 | 0.59 | 48% | 82 | 0.02 | 0.19 | 55% | |
a Number of genes compared. SAGEmap and TissueInfo data were only used for genes whose PEM scores were significant by χ2 test (P = 0.05), and only tags that could be mapped unambiguously to UniGene were used. b Two-tailed significance values for Pearson correlation coefficients. c Percentage of genes whose PEM scores have the same sign for the two methods being compared.
Figure 1Correlations between the three databases Correlations between Gene Expression Atlas (GEA), SAGEmap, and TissueInfo preferential expression measures (PEM) for brain, prostate and vascular endothelium. Trend lines for linear regression and the corresponding r2 values are shown for each pairwise comparison.
Number of SAGE and EST tags in tissue categories derived from SAGEmap and TissueInfo.
| Brain | Prostate | Kidney | Vascular endothelium | Ovary | Pancreas | |
| SAGE | 326,481 | 150,116 | 67,923 | 110,460 | 96,911 | 64,577 |
| EST | 154,214 | 51,299 | 60,188 | 8,435 | 12,587 | 23,545 |
| SAGE+EST | 480,695 | 201,415 | 128,111 | 118,895 | 109,498 | 88,122 |
Top 50 brain-specific and bottom ten brain underexpressed genes (in italics) identified by integrated PEM scores from Gene Expression Atlas, SAGEmap and TissueInfo.
| %PEM a | UniGene cluster | LocusLink symbol | Description and references |
| 76% | Hs.6535 | n/a | novel cDNA clone, FLJ33742 fis, highly similar to Homo sapiens BNPI mRNA for brain-specific Na-dependent inorganic phosphate cotransporter (GenBank AK091061) |
| 76% | Hs.143535 | CAMK2A | calcium/calmodulin-dependent protein kinase, implied in control of fear and aggression [ |
| 75% | Hs.78854 | ATP1B2 | ATPase, Na+/K+ transporting [ |
| 74% | Hs.194301 | MAP1A | microtubule-associated protein, expressed strongly in brain, spinal cord and weakly in muscle [ |
| 73% | Hs.84389 | SNAP25 | synaptosomal associated protein; one of SNARE proteins required for neuronal exocytosis [ |
| 72% | Hs.2288 | VSNL1 | visinin like 1; known to be expressed in neuronal cells in retina [ |
| 72% | Hs.1787 | PLP1 | lipophilin – the primary constituent of myelin [ |
| 72% | Hs.296184 | GNAO1 | G protein alpha activating activity polypeptide O; multiple neurological abnormalities in knock-out mice [ |
| 71% | Hs.75819 | GPM6A | neuronal membrane glycoprotein M6A; similar to myelin proteolipid protein [ |
| 71% | Hs.69547 | MBP | myelin basic protein [ |
| 70% | Hs.74565 | APLP1 | amyloid beta (A4) precursor-like protein 1 [ |
| 69% | Hs.6139 | SYNGR1 | synaptogyrin 1 [ |
| 69% | Hs.154679 | SYT1 | synaptotagmin 1 [ |
| 68% | Hs.20912 | APCL | brain-specific adenomatous polyposis coli (APC) homologue [ |
| 66% | Hs.5422 | GPM6B | neuronal membrane glycoprotein M6B, highly similar to the myelin proteolipid protein [ |
| 66% | Hs.79000 | GAP43 | neuromodulin, implied in axon guidance [ |
| 65% | Hs.22777 | CA11 | carbonic anhydrase XI; brain-specific carbonic anhydrase isosyme [ |
| 65% | Hs.239356 | STXB1 | syntaxin binding protein 1 [ |
| 65% | Hs.182859 | LFG | lifeguard (alias neuromembrane protein 35) [ |
| 65% | Hs.7979 | SV2 | synaptic vesicle glycoprotein 2 [ |
| 64% | Hs.322430 | NDRG4 | ndrg family member 4, expressed in brain and heart [ |
| 64% | Hs.74554 | SYT11 | synaptotagmin 11; identified by genomic screen with highly conserved synaptotagmin motif [ |
| 64% | Hs.82749 | TM4SF2 | transmembrane 4 superfamily member 2; contains a trinucleotide repeat [ |
| 64% | Hs.169401 | APOE | apolipoprotein E; implied in Alzheimer's disease [ |
| 63% | Hs.166161 | DNM1 | dynamin 1 – brain expression at least 30-fold higher than in other tissues [ |
| 62% | Hs.6164 | NECL1 | nectin-like protein 1; alias brain immunoglobulin receptor precursor (unpublished, GenBank NM_021189) |
| 62% | Hs.76888 | INA | internexin – neuronal intermediate filament protein alpha [ |
| 61% | Hs.146580 | ENO2 | enolase 2 – neuronal-specific enolase [ |
| 61% | Hs.90005 | STMN2 | stathmin-like 2; expressed in neuronal growth cones [ |
| 61% | Hs.226133 | GAS7 | growth arrest-specific 7; expressed in neurons and growth-arrested fibroblasts [ |
| 60% | Hs.155524 | PNUTL2 | septin 4; human orthologue of the mouse h5 brain protein [ |
| 60% | Hs.117546 | NNAT | neuronatin; known to contain a 5' neurospecific silencer element which controls brain-specific expression [ |
| 60% | Hs.155247 | ALDOC | brain-type aldolase; fructose-bisphosphate aldolase C [ |
| 60% | Hs.7357 | CLIPR-59 | microtubule-binding protein involved in Golgi targeting; known to be preferentially expressed in brain [ |
| 60% | Hs.74583 | SPOCK2 | sparc/osteonectin like; testican 2; calcium-binding proteoglycan specific to brain; expressed in many neuronal cell types in olfactory bulb, cerebral cortex, thalamus, hippocampus, cerebellum, and medulla [ |
| 60% | Hs.80395 | MAL | hydrophobic integral membrane lipoprotein; a component of myelin [ |
| 60% | Hs.6349 | BC008967 | novel protein BC008967 |
| 60% | Hs.6755 | RPIP8 | RaP2 interacting protein; belongs to Ras family; expressed principally in brain [ |
| 60% | Hs.75090 | SPRING | involved in synaptic exocytosis; expressed only in brain [ |
| 60% | Hs.90063 | NCALD | neurocalcin delta; calcium-binding protein; expressed preferentially in brain [ |
| 58% | Hs.83384 | S100B | calcium-binding protein S100; expressed in glia, astrocytes and neurons [ |
| 58% | Hs.104925 | ENC1 | ectodermal-neural cortex; p53 induced gene; expressed preferentially in brain, particularly in amygdala and hippocampus [ |
| 58% | Hs.7782 | PNMA2 | onconeuronal antigen MA2; expressed in brain and testis [ |
| 58% | Hs.8526 | B3GNT6 | beta-1,3-N-acetylglucosaminyltransferase 6; expressed preferentially in foetal and adult brain [ |
| 56% | Hs.125359 | THY1 | Thy-1 cell surface antigen; expressed in brain and on some T cells; neuronal expression pattern changes in development suggesting role in neurogenesis [ |
| 56% | Hs.194534 | VAMP2 | synaptobrevin 2; one of proteins involved in fusion of synaptic vesicles with the presynaptic membrane [ |
| 56% | Hs.31463 | ELMO1 | engulfment and cell motility 1; orthologue of C. elegans gene ced-12; has two isforms – 4.4 kb expressed ubiquitously and 2.4 kb expressed only in brain [ |
| 55% | Hs.3763 | APBB1 | amyloid beta (A4) precursor protein-binding; expressed preferentially in brain [ |
| 55% | Hs.286055 | CHN2 | chimerin 2; expressed preferentially in cerebellum in granule cells [ |
| 55% | Hs.12305 | DKFZP566B183 | novel protein DKFZP566B183 |
a Integrated PEM score, calculated as the average of PEMGEA/PEMGEAmax, PEMSAGEMAP/PEMSAGEMAPmax, PEMTISSUEINFO/PEMTISSUEINFOmax, where "max" refers to the maximum PEM value encountered in the tissue. This weighting scheme was used because brain microarray PEM values were several fold higher than those for SAGE and EST (PEMGEAmax = 5.21, PEMSAGEMAPmax = 1.19, PEMTISSUEINFOmax = 1.15)
Top ten prostate-specific genes identified by integrated PEM scores from Gene Expression Atlas, SAGEmap and TissueInfo.
| %PEM a | UniGene cluster | LocusLink symbol | Description and references |
| 95% | Hs.183752 | MSMB | beta microseminoprotein; synthesized by the epithelial cells of the prostate gland and secreted into the seminal plasma [ |
| 94% | Hs.1852 | ACPP | prostatic acid phosphatase [ |
| 82% | Hs.171995 | KLK3 | kallikrein 3; prostate specific antigen (PSA) [ |
| 73% | Hs.78344 | MYH11 | smooth muscle-specific myosin heavy polypeptide 11 [ |
| 70% | Hs.300772 | TPM2 | tropomyosin 2 (beta); binds actin [ |
| 55% | Hs.195464 | FLNA | filamin A; binds actin [ |
| 54% | Hs.75777 | TAGLN | transgelin; smooth muscle-specific actin binding protein [ |
| 53% | Hs.262476 | AMD1 | S-adenosylmethionine decarboxylase 1; key enzyme in the synthesis of seminal polyamines such as spermine, spermidine [ |
| 53% | Hs.9615 | MYL9 | myosin, smooth muscle-specific regulatory light polypeptide 9 [ |
| 48% | Hs.179817 | RDH11 | androgen-regulated short-chain dehydrogenase/reductase 1 [ |
a Prostate PEMGEAmax = 4.30, PEMSAGEMAPmax = 1.97, and PEMTISSUEINFOmax = 1.61