| Literature DB >> 21695066 |
Gwangsik Shin1, Tae-Wook Kang, Sungjin Yang, Su-Jin Baek, Yong-Su Jeong, Seon-Young Kim.
Abstract
BACKGROUND: Some oncogenes such as ERBB2 and EGFR are over-expressed in only a subset of patients. Cancer outlier profile analysis is one of computational approaches to identify outliers in gene expression data. A database with a large sample size would be a great advantage when searching for genes over-expressed in only a subset of patients. DESCRIPTION: GENT (Gene Expression database of Normal and Tumor tissues) is a web-accessible database that provides gene expression patterns across diverse human cancer and normal tissues. More than 40000 samples, profiled by Affymetrix U133A or U133plus2 platforms in many different laboratories across the world, were collected from public resources and combined into two large data sets, helping the identification of cancer outliers that are over-expressed in only a subset of patients. Gene expression patterns in nearly 1000 human cancer cell lines are also provided. In each tissue, users can retrieve gene expression patterns classified by more detailed clinical information.Entities:
Keywords: Affymetrix; cancer; gene expression; human tissues
Year: 2011 PMID: 21695066 PMCID: PMC3118449 DOI: 10.4137/CIN.S7226
Source DB: PubMed Journal: Cancer Inform ISSN: 1176-9351
The number of tissue samples according to tissue types (U133plus2 and U133A).
| Abdomen | 13 | 0 | 0 | 0 | 13 |
| Adipose | 1 | 59 | 0 | 12 | 72 |
| Adrenal gland | 14 | 5 | 0 | 0 | 19 |
| Bladder | 39 | 14 | 87 | 15 | 155 |
| Blood | 4693 | 639 | 3130 | 1099 | 8974 |
| Brain | 785 | 568 | 592 | 1627 | 3572 |
| Breast | 1954 | 251 | 2635 | 91 | 4931 |
| Cervix | 74 | 12 | 64 | 34 | 184 |
| Colon | 1294 | 206 | 256 | 27 | 1783 |
| Endometrium | 72 | 61 | 0 | 9 | 142 |
| Esophagus | 48 | 9 | 24 | 28 | 109 |
| GIST | 64 | 0 | 0 | 0 | 64 |
| Head and neck | 202 | 14 | 21 | 2 | 239 |
| Heart | 0 | 0 | 0 | 41 | 41 |
| Kidney | 573 | 105 | 366 | 66 | 1110 |
| Liver | 182 | 25 | 156 | 52 | 415 |
| Lung | 441 | 225 | 582 | 364 | 1612 |
| Muscle | 0 | 177 | 0 | 331 | 508 |
| Myometrium | 0 | 0 | 0 | 24 | 24 |
| Ovary | 859 | 21 | 341 | 9 | 1230 |
| Pancreas | 132 | 55 | 13 | 8 | 208 |
| Prostate | 308 | 45 | 244 | 83 | 680 |
| Sarcoma | 493 | 0 | 0 | 0 | 493 |
| Skin | 290 | 28 | 499 | 59 | 876 |
| Small intestine | 13 | 6 | 0 | 22 | 41 |
| Stomach | 268 | 57 | 46 | 18 | 389 |
| Testis | 4 | 6 | 184 | 13 | 207 |
| Thyroid | 62 | 25 | 44 | 25 | 156 |
| Tongue | 0 | 11 | 0 | 4 | 15 |
| Uterus | 155 | 12 | 0 | 24 | 191 |
| Vagina | 3 | 5 | 0 | 0 | 8 |
| Vulva | 21 | 14 | 0 | 0 | 35 |
| Total | 13057 | 2655 | 9284 | 4087 | 29083 |
The number of cancer cell lines according to tissue types (U133plus2 and U133A).
| Adrenal gland | 0 | 2 | 2 |
| Biliary tract | 0 | 6 | 6 |
| Bladder | 30 | 40 | 70 |
| Blood | 229 | 142 | 371 |
| Bone | 12 | 32 | 44 |
| Brain | 149 | 117 | 266 |
| Breast | 239 | 199 | 438 |
| Cervix | 23 | 23 | 46 |
| Colon | 143 | 56 | 199 |
| Connective tissue | 9 | 0 | 9 |
| Endometrium | 0 | 11 | 11 |
| Esophagus | 12 | 25 | 37 |
| EWT | 7 | 0 | 7 |
| Eye | 5 | 2 | 7 |
| Kidney | 29 | 40 | 69 |
| Leukemia | 0 | 4 | 4 |
| Liver | 33 | 16 | 49 |
| Lung | 358 | 325 | 683 |
| Lymphoma | 73 | 1 | 74 |
| Muscle | 10 | 0 | 10 |
| Myeloma | 7 | 24 | 31 |
| Ovary | 24 | 43 | 67 |
| Pancreas | 50 | 18 | 68 |
| Pharynx | 6 | 0 | 6 |
| Placenta | 9 | 2 | 11 |
| Prostate | 12 | 18 | 30 |
| Rectum | 7 | 0 | 7 |
| Sarcoma | 8 | 0 | 8 |
| Skin | 129 | 73 | 202 |
| Soft tissue | 0 | 19 | 19 |
| Stomach | 56 | 24 | 80 |
| Testis | 0 | 4 | 4 |
| Thyroid | 12 | 13 | 25 |
| Upper aerodigestive | 0 | 24 | 24 |
| Urinary tract | 0 | 20 | 20 |
| Vulva | 9 | 3 | 12 |
| Total | 1714 | 1336 | 3050 |
Figure 1.Pattern of ERBB2 expression across diverse normal and tumor tissues, A) U133plus2 data set, B) U133A data set.
Figure 2.Pattern of ERBB2 expression across diverse cancer cell lines in U133plus2 data. A) Data is shown as multiple boxplots. B) Data is shown as a flash chart so users can identify a cell line name by pointing a mouse on each dot.
Figure 3.Pattern of LAMB3 expression among ovarian cancer subtypes. A) A screenshot of sub-type specific search option. B) Pattern of LAMB3 expression in different stages of ovarian cancer patients from GSE12172 data set.
Figure 4.Analysis of laboratory effects by comparing distribution of correlation coefficients among three different groups: Distribution of all pairwise correlations between the samples in the dataset (black), distribution of average similarities between the sample subgroups from different laboratories within the same biological group (green), distribution of average similarities between the sample subgroups from different biological groups within the same laboratory (red).