| Literature DB >> 12377104 |
Lawrence Benbow1, Lynn Wang, Maureen Laverty, Suxing Liu, Ping Qiu, Richard W Bond, Eric Gustafson, Joseph A Hedrick, Mitchell Kostich, Jonathan R Greene, Luquan Wang.
Abstract
BACKGROUND: The EST database provides a rich resource for gene discovery and in silico expression analysis. We report a novel computational approach to identify co-expressed genes using EST database, and its application to IL-8.Entities:
Year: 2002 PMID: 12377104 PMCID: PMC131052 DOI: 10.1186/1471-2164-3-29
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
dbEST cDNA library summary. The cDNA libraries are catalogued into libraries where IL-8, GRO-α, or GRO-β is represented. Tumor libraries are cDNA libraries prepared from tumor tissues or tumor cell lines based on dbEST library annotation. See Additional file 1 for the detailed description of cDNA libraries where IL-8 is represented.
| Total Libraries | Tumor Libraries | |||||
| Gene(s) | ESTs | Clones | Libraries | ESTs | Clones | Libraries |
| IL-8 | 329649 | 306888 | 53 | 173152 | 171521 | 16 |
| GRO-α | 337364 | 349476 | 34 | 175968 | 174072 | 13 |
| GRO-β | 224993 | 213645 | 27 | 161390 | 159808 | 12 |
| dbEST Total | 3350920 | 3043217 | 4419 | 896990 | 864427 | 2734 |
Contigs identified in IL-8 tissue and IL-8-tumor tissue. Contigs are catalogued using different Z-score and total clone cutoffs. Total Clone: The total number of clones from all cDNA libraries within a contig. IL-8 contigs: Contigs identified in IL-8 tissue. IL-8-tumor contigs: Contigs identified in IL-8-tumor tissue. NC: no Z-score cutoff is applied.
| 5 | 15222# | 3130 | 991 | 352 | |
| 10 | 14700 | 2662 | 755 | 274 | |
| 50 | 8063 | 1010 | 249 | 98 | |
| 100 | 3243 | 393 | 102 | 36 | |
| 5 | 10016^ | 1987 | 577 | 198 | |
| 10 | 9776 | 1825 | 520 | 183 | |
| 50 | 6677 | 985 | 281 | 102 | |
| 100 | 3020 | 403 | 115 | 42 | |
# See Additional file 2 for detailed list. ^See Additional file 3 for detailed list.
EST Clone distribution for some known IL-8 related genes in the IL-8 tissue. Total Clones: The total number of clones from all cDNA libraries. IL-8 Clones: The number of clones from IL-8 tissue.
| Gene | Total Clones | IL-8 Clones | IL-8 Z-score |
| IL-6 | 44 | 24 | 4.1 |
| GRO-α | 52 | 25 | 3.2 |
| GRO-β | 46 | 22 | 3 |
| GRO-γ | 8 | 5 | 3.9 |
| ENA-78 | 6 | 4 | 2.8 |
IL-8 related genes (IL-8 Z-score >= 3 and EST clone >= 100). Total Clones: The total number of clones from all cDNA libraries. IL-8 Clones: The number of clones from IL-8 tissue. These genes are sorted by the ratio of IL-8 clones/total clones and given a gene id (I-1, I-2, ..., I-36).
| I-1 | IL-8 (SW:P10145) | 100 |
| I-2 | serum albumin (SW:P02768) | 88 |
| I-3 | fibrinogen gamma-a chain (SW:P02679) | 87 |
| I-4 | aldolase B (P05062) | 87 |
| I-5 | fibrinogen gamma-B chain (GI:71828) | 74 |
| I-6 | kruppel-like factor 2 (SW:Q9Y5W3) | 71 |
| I-7 | GW112 protein (GI:11544538) | 69 |
| I-8 | presenilin 1 (SW:P49768) | 65 |
| I-9 | selenoprotein P (GI:2654365) | 58 |
| I-10 | beta-fibrinogen (GI:182430) | 57 |
| I-11 | splice variant of serum albumin (GI:28592) | 56 |
| I-12 | alpha-1-antitrypsin (SW:P01009) | 56 |
| I-13 | complement factor B (GI:291922) | 53 |
| I-14 | MSTP032(GI:13376832) | 52 |
| I-15 | serotransferrin (SW:P02787) | 51 |
| I-16 | splice variant of fibrinogen B beta (GI:14423575) | 51 |
| I-17 | lumican (SW:P51884) | 50 |
| I-18 | apolipoprotein A-I (SW:P02647) | 50 |
| I-19 | splice variant of complement component C4A (GI:387438) | 50 |
| I-20 | beta-2-microglobulin (SW:P01884) | 49 |
| I-21 | claudin-3 (SW:O15551) | 49 |
| I-22 | arylacetamide deacetylace (SW:P22760) | 48 |
| I-23 | osteoinductive factor (SW:P20774) | 48 |
| I-24 | hypothetical protein (GI:6807713) | 47 |
| I-25 | tumor-associated calcium signal transducer 1 (GI:182906) | 47 |
| I-26 | transgelin 2 (GI:434763) | 46 |
| I-27 | complement component C4A (GI:443671) | 45 |
| I-28 | secreted apoptosis related protein 1 (GI:2415415) | 45 |
| I-29 | thymosin beta-4 precursor(GI:2143995) | 45 |
| I-30 | secretory granule proteoglycan core protein (SW:P10124) | 44 |
| I-31 | ceruloplasmin (SW:P00450) | 44 |
| I-32 | nonspecific crossreacting antigen (GI:88276) | 44 |
| I-33 | plasminogen (SW:P00747) | 41 |
| I-34 | biglycan (GI:13279002) | 41 |
| I-35 | hypothetical protein (GI:6807932) | 40 |
| I-36 | neural proliferation differentiation and control protein-1 (SW:Q9NQX5) | 39 |
# SW: Swissprot accession, GI: GenBank Identifier.
Figure 1Distribution of IL-8 related genes in tumor cDNA libraries. The percentage ratio of IL-8-tumor clones/IL-8 clones (Rtumor) was calculated for each IL-8 related gene. The result is plotted here with gene id on the x-axis and Rtumor on the y-axis. A bimodal distribution is obtained where most genes are either highly associated (>80%) with tumor libraries or they show very little association (<10%) with tumor libraries. Five genes have an Rtumor value of >95%: KLF2 (99.5%), PS1 (98.9%), NPDC1 (97.9%), GW112 protein (97.4%), and claudin-3 (96.6%).
IL-8-tumor related genes (Zscore >= 3 and EST clone >= 100). Total Clones: The total number of clones from all cDNA libraries. IL-8 Tumor Clones: The number of clones from IL-8-tumor tissue. These are sorted by the ratio of IL-8-tumor clones/total clones and given a gene id (IT-1, IT-2, ..., IT-42).
| IT-1 | kruppel-like factor 2 (SW:Q9Y5W3) | 70 |
| IT-2 | gw112 protein (GI:11544538) | 68 |
| IT-3 | presenilin 1 (SW:P49768) | 65 |
| IT-4 | claudin-3 (SW:O15551) | 47 |
| IT-5 | similar to EGP314 (GI:6678752) | 43 |
| IT-6 | lumican (SW:P51884) | 42 |
| IT-7 | nonspecific crossreacting antigen precursor (GI:88276) | 41 |
| IT-8 | complement factor B (GI:291922) | 40 |
| IT-9 | transgelin 2 (SW:P37802) | 40 |
| IT-10 | neural proliferation, differentiation and control protein-1 (SW:Q9NQX5) | 39 |
| IT-11 | similar to atrophin-1(GI:1732417) | 38 |
| IT-12 | immunoglobulin mu chain (GI:553361) | 37 |
| IT-13 | hypothetical protein (GI:6807713) | 37 |
| IT-14 | hypothetical protein (GI:6807932) | 36 |
| IT-15 | alpha-1 type III collagen (GI:180414) | 36 |
| IT-16 | heparan sulfate proteoglycan perlecan (GI:11602963) | 35 |
| IT-17 | immunoglobulin alpha-1 heavy chain constant region (GI:184749) | 35 |
| IT-18 | secreted apoptosis related protein 1 (GI:2415415) | 32 |
| IT-19 | anterior gradient 2 (Xenopus laevis) homolog (GI:3779197) | 31 |
| IT-20 | elongation factor 1-alpha 1 (SW:P10126) | 31 |
| IT-21 | signal recognition particle 9 kd protein (SW:P49458) | 31 |
| IT-22 | similar to CDK5 activator-binding protein C53 (GI:10435740) | 31 |
| IT-23 | alpha-2-macroglobulin (SW:P01023) | 31 |
| IT-24 | beta-2-microglobulin (GI:179318) | 30 |
| IT-25 | vascular endothelial growth factor (GI:3712669) | 30 |
| IT-26 | unknown | 30 |
| IT-27 | similar to UNC-93 (GI:4263743) | 30 |
| IT-28 | human C4 complement (GI:387438) | 30 |
| IT-29 | t-complex protein 1, eta subunit (SW:Q99832) | 29 |
| IT-30 | plasma protease C1 inhibitor (SW:P05155) | 28 |
| IT-31 | tumor necrosis factor receptor 1 (SW:P19438) | 28 |
| IT-32 | DRAL gene product (GI:1160932) | 28 |
| IT-33 | unknown | 28 |
| IT-34 | AF-6 (GI:3452572) | 28 |
| IT-35 | ubiquitin (SW:P02248) | 27 |
| IT-36 | unnamed protein product (GI:7023247) | 27 |
| IT-37 | 45 kda calcium-binding protein (SW:Q61112) | 27 |
| IT-38 | ERF-2 (GI:509778) | 27 |
| IT-39 | E74-like factor 3 (GI:1754538) | 26 |
| IT-40 | connective tissue growth factor (GI:984956) | 26 |
| IT-41 | mouse phospholipase D3 (GI:7242181) | 25 |
| IT-42 | KIAA1536 protein (GI:7959339) | 25 |
# SW: Swissprot accession, GI: GenBank Identifier.
Figure 2Distribution of IL-8-tumor related genes in all tumor cDNA libraries. The percentage ratio of IL-8-tumor clones/tumor clones (Ril8tumor) was calculated for IL-8-tumor related genes. The result is plotted here with gene id on the x-axis and Ril8tumor on the y-axis, and is ordered by decreasing Ril8tumor. Only the top half genes (21) with relatively high Ril8tumor are displayed. The 3 most specific genes are GW112 protein (86.2%), PS1 (81.1%), and KLF2 (76.4%).