
A reference database for tumor-related genes co-expressed with interleukin-8 using genome-scale in silico analysis.

Lawrence Benbow, Lynn Wang, Maureen Laverty, Suxing Liu, Ping Qiu, Richard W Bond, Eric Gustafson, Joseph A Hedrick, Mitchell Kostich, Jonathan R Greene, Luquan Wang.

Abstract

BACKGROUND: The EST database provides a rich resource for gene discovery and in silico expression analysis. We report a novel computational approach to identify co-expressed genes using the EST database, and its application to IL-8.
RESULTS: IL-8 is represented in 53 dbEST cDNA libraries. We calculated the frequency of occurrence of all the genes represented in these cDNA libraries, and ranked the candidates based on a Z-score. Additional analysis suggests that most IL-8 related genes are differentially expressed between non-tumor and tumor tissues. To focus on IL-8's function in tumor tissues, we further analyzed and ranked the genes in 16 IL-8 related tumor libraries.
CONCLUSIONS: This method generated a reference database for genes co-expressed with IL-8 and could facilitate further characterization of functional association among genes.


Year:  2002        PMID: 12377104      PMCID: PMC131052          DOI: 10.1186/1471-2164-3-29

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Chemokines are a large superfamily of small, structurally related peptides originally discovered as neutrophil attractants. Interleukin-8 (IL-8) is a potent member of the supergene family of CXC chemokines with the ELR motif (ELR+). ELR+ CXC chemokines are potent angiogenic factors, whereas ELR- CXC chemokines are potent angiostatic factors. Studies have shown that these angiogenesis-related activities are correlated with tumorigenesis in many tumor types and are distinct from their ability to recruit neutrophils [1,2]. IL-8 is inducible in a wide range of cells including lymphocytes, monocytes, endothelial cells, fibroblasts, hepatocytes, and keratinocytes [3-5]. IL-8 is also found to be constitutively expressed in several tumor tissues including bronchogenic carcinoma, non-small cell lung cancer, colorectal carcinoma, breast cancer, melanoma, prostate cancer, gastric carcinoma, and ovarian cancer. While IL-8 has been implicated in growth-potentiation [6], angiogenesis [7], metastasis [3,8], and tumorigenesis [3,6] of various tumors, its specific role remains unclear.
Several observations support the assertion that a complex interaction between IL-8 and several other growth factors, cytokines, or other proteins is responsible for these tumor-related events [5,9-13]. Growth regulated protein alpha (GRO-α), beta (GRO-β), and ENA-78 have been reported to be co-induced with IL-8 in A549 cells stimulated with two proinflammatory cytokines, IL-1β and TNF-α [14]. This result is consistent with the presence of nuclear factor kappa B (NF-κB) consensus binding sites in the promoter regions of all three genes [4,15]. In addition, GRO-α and -β are also members of the ELR+ CXC cytokine family that are reported to be important mediators of tumorigenesis through their angiogenic properties [2,16]. They both share one of the IL-8 receptors, CXCR2, which has been postulated to regulate ELR+ CXC chemokine-mediated angiogenesis and the resulting tumorigenesis [1].
These observations suggest that GRO-α and -β may exhibit expression profiles similar to IL-8. Coordinated expression of these and other factors with IL-8 in certain tumor tissues suggests that they may be functionally associated with IL-8 in these tumors. In order to fully understand IL-8's role in these events, it is important to investigate these coordinately expressed genes. A systematic, unbiased approach to identification and ranking of proteins related to IL-8's expression in tumor tissues/cells may help to define the scope of IL-8's role in tumorigenicity and important related interactions with other factors.
There are over 3 million human Expressed Sequence Tag (EST) records in GenBank (Table 1), and the number is still growing rapidly. EST sequences in GenBank are derived from cDNA libraries generated from a vast array of tissue types including normal, disease-state, and variously treated tissues. The number of EST clones is reported to be proportional to the abundance of cognate transcripts in the tissue or cell type used to make the cDNA library, and thus the EST distribution can provide a quantitative assessment of differential expression of a gene [17]. The expanding tissue diversity and EST coverage have increased the statistical power of EST-distribution based expression analysis. An approach using EST expression data as a binary variable (present or absent in a cDNA library) has previously identified prostate cancer-associated genes [18].
We report here a novel in silico approach for identification of genes whose mRNAs are enriched in libraries where IL-8 is represented. We focused on ESTs from cDNA libraries in which IL-8 has been sequenced at least once and analyzed the EST frequency of occurrence for all other genes in these IL-8 related cDNA libraries. Those cDNA libraries were further catalogued into tumor and non-tumor libraries, which allowed us to identify genes whose expression profile is closely related to that of IL-8 in tumor tissues.
Table 1

dbEST cDNA library summary. The cDNA libraries are catalogued into libraries where IL-8, GRO-α, or GRO-β is represented. Tumor libraries are cDNA libraries prepared from tumor tissues or tumor cell lines based on dbEST library annotation. See Additional file 1 for the detailed description of cDNA libraries where IL-8 is represented.

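| Gene(s) | Total Libraries: ESTs | Total Libraries: Clones | Total Libraries: Libraries | Tumor Libraries: ESTs | Tumor Libraries: Clones | Tumor Libraries: Libraries |
|---|---|---|---|---|---|---|
| IL-8 | 329649 | 306888 | 53 | 173152 | 171521 | 16 |
| GRO-α | 337364 | 349476 | 34 | 175968 | 174072 | 13 |
| GRO-β | 224993 | 213645 | 27 | 161390 | 159808 | 12 |
| dbEST Total | 3350920 | 3043217 | 4419 | 896990 | 864427 | 2734 |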

Results

Genes co-expressed with IL-8

We found that IL-8 has been sequenced in 53 dbEST cDNA libraries (Table 1 and Additional file 1). Using these cDNA libraries (referred to as IL-8 tissue), we generated a reference database for all the genes and their expression profile relationship (measured by Z-score) with IL-8. The complete list of genes is provided in Additional file 2, along with a distribution table based on contig size and Z-score (Table 2). A gene could be represented by multiple contigs, which arise from splice variants and sequencing errors. In this database, we provide the statistics for each contig, including the number of EST clones, cDNA libraries, EST clones in IL-8 tissue, and cDNA libraries in IL-8 tissue. First we evaluated the performance of this search using the distribution analysis results for some known genes (Table 3). The co-expression of IL-8 with IL-6, GRO-α, and GRO-β in many tissues has been well documented [5,14,19], and those genes are identified here as IL-8 related genes with an IL-8 Z-score >= 3. GRO-γ and ENA-78 [14] also show a high IL-8 Z-score, although the available EST clone number for these two genes is much smaller. Successful identification of these genes demonstrates the value of our in silico approach. Besides Z-score, this database can be sorted in many different ways. For example, we can adjust the stringency of correlation by using the number of IL-8 cDNA libraries as a cutoff parameter.
Table 2

Contigs identified in IL-8 tissue and IL-8-tumor tissue. Contigs are catalogued using different Z-score and total clone cutoffs. Total Clone: The total number of clones from all cDNA libraries within a contig. IL-8 contigs: Contigs identified in IL-8 tissue. IL-8-tumor contigs: Contigs identified in IL-8-tumor tissue. NC: no Z-score cutoff is applied.

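| Contig set | Total Clone Cutoff | Z-score Cutoff: NC | Z-score Cutoff: 1 | Z-score Cutoff: 2 | Z-score Cutoff: 3 |
|---|---|---|---|---|---|
| IL-8 Contigs | 5 | 15222# | 3130 | 991 | 352 |
| IL-8 Contigs | 10 | 14700 | 2662 | 755 | 274 |
| IL-8 Contigs | 50 | 8063 | 1010 | 249 | 98 |
| IL-8 Contigs | 100 | 3243 | 393 | 102 | 36 |
| IL-8 Tumor Contigs | 5 | 10016^ | 1987 | 577 | 198 |
| IL-8 Tumor Contigs | 10 | 9776 | 1825 | 520 | 183 |
| IL-8 Tumor Contigs | 50 | 6677 | 985 | 281 | 102 |
| IL-8 Tumor Contigs | 100 | 3020 | 403 | 115 | 42 |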

# See Additional file 2 for detailed list. ^See Additional file 3 for detailed list.

Table 3

EST Clone distribution for some known IL-8 related genes in the IL-8 tissue. Total Clones: The total number of clones from all cDNA libraries. IL-8 Clones: The number of clones from IL-8 tissue.

| Gene | Total Clones | IL-8 Clones | IL-8 Z-score |
|---|---|---|---|
| IL-6 | 44 | 24 | 4.1 |
| GRO-α | 52 | 25 | 3.2 |
| GRO-β | 46 | 22 | 3 |
| GRO-γ | 8 | 5 | 3.9 |
| ENA-78 | 6 | 4 | 2.8 |
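
The cutoff-based selection used throughout the Results (a minimum Z-score combined with a minimum total clone count) can be illustrated with a minimal Python sketch. The `Contig` record and the `select` function are hypothetical names introduced here for illustration; the values are those read from Table 3, and the sketch is not the authors' PERL implementation.

```python
from typing import NamedTuple


class Contig(NamedTuple):
    """Hypothetical record for one row of the reference database."""
    gene: str
    total_clones: int
    il8_clones: int
    z_score: float


# Values as read from Table 3 (known IL-8 related genes).
TABLE_3 = [
    Contig("IL-6", 44, 24, 4.1),
    Contig("GRO-alpha", 52, 25, 3.2),
    Contig("GRO-beta", 46, 22, 3.0),
    Contig("GRO-gamma", 8, 5, 3.9),
    Contig("ENA-78", 6, 4, 2.8),
]


def select(contigs, min_z=3.0, min_total_clones=0):
    """Keep contigs passing both cutoffs, ranked by decreasing Z-score."""
    hits = [
        c for c in contigs
        if c.z_score >= min_z and c.total_clones >= min_total_clones
    ]
    return sorted(hits, key=lambda c: c.z_score, reverse=True)


if __name__ == "__main__":
    for contig in select(TABLE_3, min_z=3.0, min_total_clones=40):
        print(contig.gene, contig.z_score)
```

Raising `min_total_clones` trades sensitivity for statistical confidence: genes with few ESTs, such as GRO-γ, drop out even when their Z-score is high.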
The significance of this study lies in the generation of a complete reference database for IL-8 co-expressed genes. We present here a list of 36 genes with high EST clone count (>= 100) and high Z-score (>= 3) (referred to as IL-8 genes, I-1, I-2, ..., I-36) (Table 4) to illustrate the usage of this reference database. A high EST clone count is used to ensure the statistical significance of the data mining results.
IL-8 tissues represent both tumor tissues/cells (IL-8-tumor) and non-tumor tissues/cells (Table 1 and Additional file 1). To generate a tumor expression profile, the percentage of IL-8-tumor clones versus IL-8 clones (Rtumor) was calculated for the 36 IL-8 genes. The resulting graph is shown in Figure 1. The Rtumor profile reveals a bimodal distribution. In the case of IL-8, tumor clones account for approximately 17% of all IL-8 clones (Rtumor = 17%). Eleven IL-8 genes exhibit an Rtumor of greater than 80%, five of these being greater than 95% (kruppel-like factor 2 (KLF2), presenilin 1 (PS1), neural proliferation and differentiation control protein-1 (NPDC1), GW112 and claudin-3). On the other hand, eight genes were not found in any IL-8-tumor related library (Rtumor = 0).
Table 4

IL-8 related genes (IL-8 Z-score >= 3 and EST clone >= 100). Total Clones: The total number of clones from all cDNA libraries. IL-8 Clones: The number of clones from IL-8 tissue. These genes are sorted by the ratio of IL-8 clones/total clones and given a gene id (I-1, I-2, ..., I-36).

| Gene ID | Definition# | IL-8 Clones/Total Clones (%) |
|---|---|---|
| I-1 | IL-8 (SW:P10145) | 100 |
| I-2 | serum albumin (SW:P02768) | 88 |
| I-3 | fibrinogen gamma-a chain (SW:P02679) | 87 |
| I-4 | aldolase B (P05062) | 87 |
| I-5 | fibrinogen gamma-B chain (GI:71828) | 74 |
| I-6 | kruppel-like factor 2 (SW:Q9Y5W3) | 71 |
| I-7 | GW112 protein (GI:11544538) | 69 |
| I-8 | presenilin 1 (SW:P49768) | 65 |
| I-9 | selenoprotein P (GI:2654365) | 58 |
| I-10 | beta-fibrinogen (GI:182430) | 57 |
| I-11 | splice variant of serum albumin (GI:28592) | 56 |
| I-12 | alpha-1-antitrypsin (SW:P01009) | 56 |
| I-13 | complement factor B (GI:291922) | 53 |
| I-14 | MSTP032 (GI:13376832) | 52 |
| I-15 | serotransferrin (SW:P02787) | 51 |
| I-16 | splice variant of fibrinogen B beta (GI:14423575) | 51 |
| I-17 | lumican (SW:P51884) | 50 |
| I-18 | apolipoprotein A-I (SW:P02647) | 50 |
| I-19 | splice variant of complement component C4A (GI:387438) | 50 |
| I-20 | beta-2-microglobulin (SW:P01884) | 49 |
| I-21 | claudin-3 (SW:O15551) | 49 |
| I-22 | arylacetamide deacetylase (SW:P22760) | 48 |
| I-23 | osteoinductive factor (SW:P20774) | 48 |
| I-24 | hypothetical protein (GI:6807713) | 47 |
| I-25 | tumor-associated calcium signal transducer 1 (GI:182906) | 47 |
| I-26 | transgelin 2 (GI:434763) | 46 |
| I-27 | complement component C4A (GI:443671) | 45 |
| I-28 | secreted apoptosis related protein 1 (GI:2415415) | 45 |
| I-29 | thymosin beta-4 precursor (GI:2143995) | 45 |
| I-30 | secretory granule proteoglycan core protein (SW:P10124) | 44 |
| I-31 | ceruloplasmin (SW:P00450) | 44 |
| I-32 | nonspecific crossreacting antigen (GI:88276) | 44 |
| I-33 | plasminogen (SW:P00747) | 41 |
| I-34 | biglycan (GI:13279002) | 41 |
| I-35 | hypothetical protein (GI:6807932) | 40 |
| I-36 | neural proliferation differentiation and control protein-1 (SW:Q9NQX5) | 39 |

# SW: Swissprot accession, GI: GenBank Identifier.

Figure 1

Distribution of IL-8 related genes in tumor cDNA libraries. The percentage ratio of IL-8-tumor clones/IL-8 clones (Rtumor) was calculated for each IL-8 related gene. The result is plotted here with gene id on the x-axis and Rtumor on the y-axis. A bimodal distribution is obtained where most genes are either highly associated (>80%) with tumor libraries or they show very little association (<10%) with tumor libraries. Five genes have an Rtumor value of >95%: KLF2 (99.5%), PS1 (98.9%), NPDC1 (97.9%), GW112 protein (97.4%), and claudin-3 (96.6%).

For comparison with GRO-α and GRO-β, the ratios of IL-8 clones, GRO-α clones and GRO-β clones to total clones (Ril8, RGRO-α and RGRO-β) for each IL-8 gene were analyzed. Approximately half of the IL-8 genes show similar correlation with all three chemokines. As an example, GW112 exhibits high correlation with IL-8, GRO-α and -β (Ril8 = 69%, RGRO-α = 71%, RGRO-β = 68%). On the other hand, KLF2 (Ril8 = 71%, RGRO-α = 16%, RGRO-β = 2%) and PS1 (Ril8 = 65%, RGRO-α = 5%, RGRO-β = 3%) show high correlation with IL-8 and only slight association with GRO-α or GRO-β.
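
One way to make the three-chemokine comparison concrete is the following Python sketch. The function name `chemokine_profile`, the category labels, and the 10-percentage-point `tolerance` are assumptions chosen for illustration only; the text does not state how "similar correlation" was judged. The ratios are the ones quoted above.

```python
def chemokine_profile(r_il8, r_gro_alpha, r_gro_beta, tolerance=10.0):
    """Classify a gene from its three clone ratios (all in percent).

    `tolerance` is a hypothetical threshold, not a value from the study.
    """
    ratios = (r_il8, r_gro_alpha, r_gro_beta)
    if max(ratios) - min(ratios) <= tolerance:
        return "shared"          # similar association with all three chemokines
    if r_il8 - max(r_gro_alpha, r_gro_beta) > tolerance:
        return "IL-8-specific"   # clearly stronger association with IL-8
    return "mixed"


# (Ril8, RGRO-alpha, RGRO-beta) as quoted in the text.
EXAMPLES = {
    "GW112": (69, 71, 68),
    "KLF2": (71, 16, 2),
    "PS1": (65, 5, 3),
}

if __name__ == "__main__":
    for gene, ratios in EXAMPLES.items():
        print(gene, chemokine_profile(*ratios))
```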

Genes co-expressed with IL-8 in tumor tissues or cell lines

A set of tumor cDNA libraries in which IL-8 is expressed (a subset of the IL-8 tissue cDNA libraries) was generated and is referred to as IL-8-tumor tissue. This virtual tissue consists of 16 cDNA libraries (171,521 cDNA clones) (Table 1 and Additional file 1). An IL-8-tumor gene database was established based on these libraries. The complete list of genes is provided in Additional file 3, along with a distribution table based on contig size and Z-score (Table 2). We present here 42 candidate genes with relatively high levels of expression in IL-8-tumor tissue (EST clone count >= 100 and IL-8-tumor Z-score >= 3) (referred to as IL-8-tumor genes, IT-1, IT-2, ..., IT-42) (Table 5). To determine whether these genes are specifically related to IL-8-tumor tissue or whether they are also commonly found in tumors lacking IL-8 expression, the ratio between IL-8-tumor clones and general tumor clones (Ril8tumor) was plotted for IL-8-tumor genes (Figure 2). The Ril8tumor for all IL-8-tumor genes is greater than 38%, which is much higher than the expected background ratio (approximately 20%) between IL-8-tumor clones (171521, Table 1) and general tumor clones (864428, Table 1). A few of these genes are highly specific to IL-8-tumor tissues. Most notably GW112, PS1, and KLF2 show Ril8tumor values of 86%, 81% and 76% respectively.
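
The background comparison in the paragraph above is simple arithmetic, sketched here in Python. The clone totals are the ones quoted in the text; the function names `r_il8tumor` and `fold_over_background` are hypothetical, and the per-gene clone counts passed to them would come from the reference database.

```python
# Clone totals quoted in the text (from Table 1).
IL8_TUMOR_CLONES = 171521
ALL_TUMOR_CLONES = 864428

# Expected share of IL-8-tumor clones among all tumor clones if a gene
# showed no preference for IL-8-tumor libraries (about 20%).
BACKGROUND_PERCENT = 100.0 * IL8_TUMOR_CLONES / ALL_TUMOR_CLONES


def r_il8tumor(il8_tumor_clones, tumor_clones):
    """Percentage of a gene's tumor clones that fall in IL-8-tumor tissue."""
    return 100.0 * il8_tumor_clones / tumor_clones


def fold_over_background(ril8tumor_percent):
    """How many times the observed ratio exceeds the background ratio."""
    return ril8tumor_percent / BACKGROUND_PERCENT


if __name__ == "__main__":
    print(f"background: {BACKGROUND_PERCENT:.1f}%")
    # The lowest Ril8tumor reported for the 42 IL-8-tumor genes is 38%.
    print(f"fold at 38%: {fold_over_background(38.0):.2f}")
```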
Table 5

IL-8-tumor related genes (Zscore >= 3 and EST clone >= 100). Total Clones: The total number of clones from all cDNA libraries. IL-8 Tumor Clones: The number of clones from IL-8-tumor tissue. These are sorted by the ratio of IL-8-tumor clones/total clones and given a gene id (IT-1, IT-2, ..., IT-42).

| Gene ID | Definition# | IL-8 Tumor Clones/Total Clones (%) |
|---|---|---|
| IT-1 | kruppel-like factor 2 (SW:Q9Y5W3) | 70 |
| IT-2 | GW112 protein (GI:11544538) | 68 |
| IT-3 | presenilin 1 (SW:P49768) | 65 |
| IT-4 | claudin-3 (SW:O15551) | 47 |
| IT-5 | similar to EGP314 (GI:6678752) | 43 |
| IT-6 | lumican (SW:P51884) | 42 |
| IT-7 | nonspecific crossreacting antigen precursor (GI:88276) | 41 |
| IT-8 | complement factor B (GI:291922) | 40 |
| IT-9 | transgelin 2 (SW:P37802) | 40 |
| IT-10 | neural proliferation, differentiation and control protein-1 (SW:Q9NQX5) | 39 |
| IT-11 | similar to atrophin-1 (GI:1732417) | 38 |
| IT-12 | immunoglobulin mu chain (GI:553361) | 37 |
| IT-13 | hypothetical protein (GI:6807713) | 37 |
| IT-14 | hypothetical protein (GI:6807932) | 36 |
| IT-15 | alpha-1 type III collagen (GI:180414) | 36 |
| IT-16 | heparan sulfate proteoglycan perlecan (GI:11602963) | 35 |
| IT-17 | immunoglobulin alpha-1 heavy chain constant region (GI:184749) | 35 |
| IT-18 | secreted apoptosis related protein 1 (GI:2415415) | 32 |
| IT-19 | anterior gradient 2 (Xenopus laevis) homolog (GI:3779197) | 31 |
| IT-20 | elongation factor 1-alpha 1 (SW:P10126) | 31 |
| IT-21 | signal recognition particle 9 kd protein (SW:P49458) | 31 |
| IT-22 | similar to CDK5 activator-binding protein C53 (GI:10435740) | 31 |
| IT-23 | alpha-2-macroglobulin (SW:P01023) | 31 |
| IT-24 | beta-2-microglobulin (GI:179318) | 30 |
| IT-25 | vascular endothelial growth factor (GI:3712669) | 30 |
| IT-26 | unknown | 30 |
| IT-27 | similar to UNC-93 (GI:4263743) | 30 |
| IT-28 | human C4 complement (GI:387438) | 30 |
| IT-29 | t-complex protein 1, eta subunit (SW:Q99832) | 29 |
| IT-30 | plasma protease C1 inhibitor (SW:P05155) | 28 |
| IT-31 | tumor necrosis factor receptor 1 (SW:P19438) | 28 |
| IT-32 | DRAL gene product (GI:1160932) | 28 |
| IT-33 | unknown | 28 |
| IT-34 | AF-6 (GI:3452572) | 28 |
| IT-35 | ubiquitin (SW:P02248) | 27 |
| IT-36 | unnamed protein product (GI:7023247) | 27 |
| IT-37 | 45 kda calcium-binding protein (SW:Q61112) | 27 |
| IT-38 | ERF-2 (GI:509778) | 27 |
| IT-39 | E74-like factor 3 (GI:1754538) | 26 |
| IT-40 | connective tissue growth factor (GI:984956) | 26 |
| IT-41 | mouse phospholipase D3 (GI:7242181) | 25 |
| IT-42 | KIAA1536 protein (GI:7959339) | 25 |

# SW: Swissprot accession, GI: GenBank Identifier.

Figure 2

Distribution of IL-8-tumor related genes in all tumor cDNA libraries. The percentage ratio of IL-8-tumor clones/tumor clones (Ril8tumor) was calculated for IL-8-tumor related genes. The result is plotted here with gene id on the x-axis and Ril8tumor on the y-axis, and is ordered by decreasing Ril8tumor. Only the top half genes (21) with relatively high Ril8tumor are displayed. The 3 most specific genes are GW112 protein (86.2%), PS1 (81.1%), and KLF2 (76.4%).


Discussion

The vast amount of available EST data allows expression analysis based solely on computational methods. It is possible to construct a "virtual tissue" based on the expression pattern of a particular gene or group of genes (e.g. tumor marker genes). It is also possible to group genes by tissue type through combining genes from the same source tissue (e.g. tumor, brain, etc.). In this study we have generated "virtual tissues" based on (a) expression of IL-8 in all tissues and (b) expression of IL-8 in tumor tissues. Next we generated lists of genes that are most highly co-expressed with IL-8 in these two virtual tissues. Since the method is limited neither to genes previously correlated with IL-8 expression nor to known genes, we have the opportunity to identify previously overlooked correlations with known or novel genes. In addition, the relative strength of the correlations is measured by a Z-score based on the number of clones representing IL-8 or IL-8-tumor tissue compared with the total number of clones. Assuming that co-expression is related to function, these Z-scores provide a basis for ranking genes according to potential involvement in IL-8's function. This reference database for genes co-expressed with IL-8 can be cross-referenced with other large scale expression analyses, such as microarray experiments, to help decipher the regulation network and the functions of IL-8 and IL-8 co-expressed genes.
Among the genes most highly correlated with IL-8 in tumor tissues are KLF2, PS1, VEGF, tumor necrosis factor receptor-1 (TNFR-1), lumican (a keratan sulfate proteoglycan), claudin-3, and perlecan. Cytokines (IL-2 and IL-7) induce the expression of KLF2 in activated T cells, which correlates with their survival [20]. The authors of that study suggest that KLF2 may be involved in avoiding activation-induced cell death. It is possible that IL-8 may also mediate similar actions. This type of action could increase the ability of tumor cells to survive under conditions where cells normally apoptose.
Lumican expression has previously been correlated with higher tumor grade, lower estrogen levels in the tumor and younger age of patients in human breast cancer [21]. In addition, lumican is structurally related to both biglycan, which is among the top IL-8-tumor related genes, and the human embryonal carcinoma marker antigen TRA-1-60 [22]. These relationships support the hypothesis that lumican may be a tumor related protein involved with IL-8's tumorigenic function.
PS1 was also found to be highly correlated with IL-8 in both the entire list of tissues and specifically in tumor tissues. Based on EST distribution, presenilin 2 (PS2) is less related to IL-8-tumor libraries (Z-score = 0.3, IL-8-tumor clone/Total Clone = 10%). In the breast tumor microarray experiments, unlike PS1, the peak level of expression for PS2 is not in BT-549. PS1 is a gene involved in early-onset familial Alzheimer's disease. There is accumulating evidence that mutations in PS1 accelerate neurodegeneration and facilitate apoptosis, and some researchers [23] suggest an association with the p53 signal transduction pathway. It was suggested that down-regulation of PS1 by wildtype p53 and also p21WAF-1 may be independent mechanisms leading to apoptosis and tumor suppression [24]. Our data correlating PS1 with IL-8-tumor related tissue also suggest a potential anti-apoptotic role for PS1 in tumorigenesis.
Vascular endothelial growth factor (VEGF), like IL-8, is a potent mediator of angiogenesis. VEGF has been shown to regulate angiogenesis and metastasis of bladder cancer [25], and several recent studies have reported correlations with IL-8 in several cell types including bladder cancer (TCC), non-small cell lung cancer (NSCLC) [26,27], human brain microvascular endothelial cells (HBMECS) [28] and monocytes [29]. These data collectively demonstrate the correlation of VEGF with IL-8 in tumor tissues.
TNF-α is a well-characterized, potent inducer of IL-8 transcription and an anti-apoptotic agent [30-32]. Roebuck provides a detailed analysis of IL-8 promoter structure including a promoter recruitment mechanism which involves TNF-α and the cooperativity of NF-κB and NF-IL-6 binding sites [4,32]. TNF-α exerts many of its effects through TNFR-1 and TNFR-2 receptors, thus co-expression of these receptors with IL-8 might be anticipated. Our analysis does reveal a correlation between TNFR-1 and IL-8 expression. Using another bioinformatic approach, Eidelman et al. report a close relationship between IL-8 secretion in cystic fibrosis cells and expression of genes from the TNFR-1/NF-κB pathway. As is the case for several other genes found in this study, the TNFR-1/NF-κB pathway is associated with p53/p21WAF-1 tumor suppressor systems [33-37].
Our analysis identified several interesting genes co-expressed with IL-8 in tumor tissues which have not previously been associated with IL-8. Along with lumican, claudin-3, KLF2, PS1, VEGF and TNFR-1 mentioned above, these include several secreted proteins (secreted apoptosis related protein 1 and NPDC1) which are also likely to function in coordination with IL-8; several genes from the well-known family of complement factors (C1 esterase, human complement C4A and complement factor B); and several unknown or hypothetical genes. Interestingly, several of the genes correlate with the p53/p21WAF-1 tumor suppressor network. We expect that some of these genes may present important therapeutic targets at one or more levels in the network of IL-8 mediated interactions related to tumorigenesis.
Like all EST-based expression analyses, this in silico method has limitations. First, EST-based expression analysis is not suitable for transcripts with low abundance (no or few representative ESTs in the database), which precludes statistical analysis. Second, EST-based expression analysis can only indicate the co-expression of genes, but cannot accurately measure the expression level. Third, to generate a comprehensive reference database, we decided not to exclude cDNA libraries based on the sequencing depth or special manipulation (e.g. subtraction and normalization). The depth of sequencing and special manipulation are not consistent across cDNA libraries, which could also influence the sensitivity of the analysis.
The computational expression analysis methods developed here are not limited to the identification of IL-8 related genes, but can also be applied to many other proteins of interest. This method is complementary to other large-scale expression analysis methods (e.g. microarray) in that it is not limited by the physical presence of a gene on a microarray; thus it offers a unique approach to discovering potential functional links between genes through expression profiling.

Methods

In silico expression analysis

All non-commercial software used in these studies was written in PERL 5.0. Human EST sequence and cDNA library information were retrieved from GenBank (Release 120) and an in-house relational database model (Sybase, SQL Server Release 11.0, CA, Sybase Inc.) was created to mirror the public human EST database (dbEST). The EST sequences were first binned into clusters if they share the same 21 mer tag beginning with ATG or CTG. Next, ESTs in each cluster were assembled into contigs using the PHRAP sequence assembly software (Phil Green, Unpublished). Based on the ESTs corresponding to IL-8 in its cognate contig, we generated a set of cDNA libraries in which IL-8 is represented. This set of libraries is referred to as IL-8 tissue and can be considered as a virtual tissue consisting of 53 cDNA libraries (306,888 cDNA clones) (Table 1 and Additional file 1). We chose contigs consisting of at least 5 EST clones and present in at least 3 IL-8 libraries. The frequency of occurrence (F) and Z-score for all the contigs in these cDNA libraries were calculated as:

\[ F = \frac{N_{IL8}}{N_{total}} \]

where $N_{IL8}$ is the number of EST clones in IL-8 tissue for a particular contig and $N_{total}$ is the total EST clone count in that contig. Contigs were collected into groups based on contig size (number of EST clones in a contig), and small contig groups (< 1000 contigs) were merged together with neighboring groups (similar in contig size) to make sure that there were at least 1000 contigs in each group. The rationale for grouping contigs based on size is that the distribution pattern of F (i.e. mean and standard deviation) may vary for contigs of different size. Within each group:

\[ Z\text{-score} = \frac{F - \dfrac{\sum F}{n}}{SDV} \]

where n is the total number of contigs in a contig group, ΣF is the sum of F in a contig group, and SDV is the standard deviation of F in a contig group. Assuming a Gaussian distribution, the fractions of the population lying more than 1, 2, and 3 standard deviations above the mean are 15.87%, 2.28%, and 0.13% respectively.
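
The frequency and Z-score calculation described above can be sketched in Python as follows. This is a minimal illustration, not the authors' PERL code: the function names, the `(contig_id, n_il8, n_total)` tuple layout, the exact merging rule for small size groups (ascending contig size, leftovers folded into the last group), and the use of the population standard deviation are all assumptions, since the text does not specify them.

```python
import math
from itertools import groupby
from statistics import mean, pstdev


def group_by_size(contigs, min_group):
    """Group contigs by size, merging neighbouring sizes until each
    group holds at least `min_group` contigs (assumed merging rule)."""
    ordered = sorted(contigs, key=lambda c: c[2])
    groups, current = [], []
    for _, same_size in groupby(ordered, key=lambda c: c[2]):
        current.extend(same_size)
        if len(current) >= min_group:
            groups.append(current)
            current = []
    if current:  # leftover largest contigs join the previous group
        if groups:
            groups[-1].extend(current)
        else:
            groups.append(current)
    return groups


def z_scores(contigs, min_group=1000):
    """Map contig_id -> Z-score of F = n_il8 / n_total within its size group."""
    scores = {}
    for group in group_by_size(contigs, min_group):
        freqs = [n_il8 / n_total for _, n_il8, n_total in group]
        mu = mean(freqs)      # equals sum(F) / n
        sd = pstdev(freqs)    # population SD; the source does not say which
        for (contig_id, _, _), f in zip(group, freqs):
            scores[contig_id] = (f - mu) / sd if sd > 0 else 0.0
    return scores


def upper_tail(z):
    """Fraction of a Gaussian population more than z SDs above the mean."""
    return 0.5 * math.erfc(z / math.sqrt(2))


if __name__ == "__main__":
    toy = [("a", 1, 10), ("b", 5, 10), ("c", 9, 10), ("d", 2, 20)]
    print(z_scores(toy, min_group=3))
    print([round(100 * upper_tail(z), 2) for z in (1, 2, 3)])
```

The `upper_tail` helper reproduces the Gaussian fractions quoted in the text and gives a rough sense of how many contigs would exceed each cutoff by chance.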
Similarly, a set of tumor cDNA libraries (based on their GenBank Annotations) in which IL-8 is represented was generated. This set of libraries is referred to as IL-8-tumor tissue, consisting of 16 cDNA libraries (171,521 cDNA clones) (Table 1 and Additional file 1). Using the method described above, a database for IL-8-tumor candidate genes (using IL-8-tumor tissue) was generated and IL-8-tumor Z-scores were calculated. Databases for GRO-α and GRO-β related genes were generated in a similar way using the cDNA libraries where GRO-α or GRO-β were represented respectively (Table 1).
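
Building a "virtual tissue" as described in this section amounts to selecting the libraries in which a target gene is represented and then counting, for every contig, the clones that fall inside that library set. The Python sketch below illustrates the idea under stated assumptions: each record is a hypothetical dict with `library`, `contig` and `is_tumor` keys standing for one EST clone, and the function names are invented for illustration rather than taken from the authors' software.

```python
def virtual_tissue(est_records, target_contig, tumor_only=False):
    """Return the set of library IDs in which `target_contig` has at
    least one EST; optionally restrict to tumor-annotated libraries."""
    return {
        r["library"]
        for r in est_records
        if r["contig"] == target_contig and (r["is_tumor"] or not tumor_only)
    }


def clone_counts(est_records, libraries):
    """Map contig -> (clones inside the virtual tissue, total clones)."""
    counts = {}
    for r in est_records:
        inside, total = counts.get(r["contig"], (0, 0))
        counts[r["contig"]] = (inside + (r["library"] in libraries), total + 1)
    return counts


if __name__ == "__main__":
    # Toy records; library and contig names are made up.
    records = [
        {"library": "L1", "contig": "IL8", "is_tumor": True},
        {"library": "L2", "contig": "IL8", "is_tumor": False},
        {"library": "L1", "contig": "geneX", "is_tumor": True},
        {"library": "L3", "contig": "geneX", "is_tumor": True},
    ]
    il8_tissue = virtual_tissue(records, "IL8")
    il8_tumor_tissue = virtual_tissue(records, "IL8", tumor_only=True)
    print(sorted(il8_tissue), sorted(il8_tumor_tissue))
    print(clone_counts(records, il8_tissue))
```

The same two functions would serve for GRO-α or GRO-β by changing `target_contig`, which mirrors how the text describes reusing the procedure for other genes.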

Authors' contributions

LB and LUW carried out the in silico expression analysis and drafted the manuscript. SL participated in the cataloging of tumor EST libraries. LYW, RB, JG, JH, PQ, EG, ML, and MK participated in design of the study. All authors read and approved the final manuscript.

Additional File 1

IL-8 cDNA Libraries. List of all cDNA libraries in which IL-8 has been sequenced at least once.

Additional File 2

Contigs identified in IL-8 tissue. List of contigs that have at least one EST in IL-8 tissue.

Additional File 3

Contigs identified in IL-8-tumor tissue. List of contigs that have at least one EST in IL-8-tumor tissue.
  37 in total

Review 1.  Tumor angiogenesis is regulated by CXC chemokines.

Authors:  B B Moore; D A Arenberg; C L Addison; M P Keane; R M Strieter
Journal:  J Lab Clin Med       Date:  1998-08

2.  The significance of digital gene expression profiles.

Authors:  S Audic; J M Claverie
Journal:  Genome Res       Date:  1997-10       Impact factor: 9.043

3.  Tumor necrosis factor-induced apoptosis stimulates p53 accumulation and p21WAF1 proteolysis in ME-180 cells.

Authors:  N J Donato; M Perez
Journal:  J Biol Chem       Date:  1998-02-27       Impact factor: 5.157

4.  Regulation of interleukin 8 expression in human malignant melanoma cells.

Authors:  R K Singh; M L Varney
Journal:  Cancer Res       Date:  1998-04-01       Impact factor: 12.701

5.  Expression of lumican in human breast carcinoma.

Authors:  E Leygue; L Snell; H Dotzlaw; K Hole; T Hiller-Hitchcock; P J Roughley; P H Watson; L C Murphy
Journal:  Cancer Res       Date:  1998-04-01       Impact factor: 12.701

Review 6.  The role of CXC chemokines in the regulation of angiogenesis in non-small cell lung cancer.

Authors:  D A Arenberg; P J Polverini; S L Kunkel; A Shanafelt; J Hesselgesser; R Horuk; R M Strieter
Journal:  J Leukoc Biol       Date:  1997-11       Impact factor: 4.962

7.  Regulation of Mel-CAM/MUC18 expression on melanocytes of different stages of tumor progression by normal keratinocytes.

Authors:  I M Shih; D E Elder; M Y Hsu; M Herlyn
Journal:  Am J Pathol       Date:  1994-10       Impact factor: 4.307

8.  Interleukin 8: an autocrine growth factor for malignant mesothelioma.

Authors:  G Galffy; K A Mohammed; P A Dowling; N Nasreen; M J Ward; V B Antony
Journal:  Cancer Res       Date:  1999-01-15       Impact factor: 12.701

9.  Inhibition of presenilin 1 expression is promoted by p53 and p21WAF-1 and results in apoptosis and tumor suppression.

Authors:  J P Roperch; V Alvaro; S Prieur; M Tuynder; M Nemani; F Lethrosne; L Piouffre; M C Gendron; D Israeli; J Dausset; M Oren; R Amson; A Telerman
Journal:  Nat Med       Date:  1998-07       Impact factor: 53.440

10.  Effects of tumor necrosis factor-alpha on antimitogenicity and cell cycle-related proteins in MCF-7 cells.

Authors:  D I Jeoung; B Tang; M Sonenberg
Journal:  J Biol Chem       Date:  1995-08-04       Impact factor: 5.157

  1 in total

1.  Genome wide in silico SNP-tumor association analysis.

Authors:  Ping Qiu; Luquan Wang; Mitch Kostich; Wei Ding; Jason S Simon; Jonathan R Greene
Journal:  BMC Cancer       Date:  2004-01-29       Impact factor: 4.430
