Literature DB >> 16255782

CEBPG transcription factor correlates with antioxidant and DNA repair genes in normal bronchial epithelial cells but not in individuals with bronchogenic carcinoma.

D'Anna N Mullins1, Erin L Crawford, Sadik A Khuder, Dawn-Alita Hernandez, Youngsook Yoon, James C Willey.   

Abstract

BACKGROUND: Cigarette smoking is the primary cause of bronchogenic carcinoma (BC), yet only 10-15% of heavy smokers develop BC and it is likely that this variation in risk is, in part, genetically determined. We previously reported a set of antioxidant genes for which transcript abundance was lower in normal bronchial epithelial cells (NBEC) of BC individuals compared to non-BC individuals. In unpublished studies of the same NBEC samples, transcript abundance values for several DNA repair genes were correlated with these antioxidant genes. From these data, we hypothesized that antioxidant and DNA repair genes are co-regulated by one or more transcription factors and that inter-individual variation in expression and/or function of one or more of these transcription factors is responsible for inter-individual variation in risk for BC.
METHODS: The putative transcription factor recognition sites common to six of the antioxidant genes were identified through in silico DNA sequence analysis. The transcript abundance values of these transcription factors (n = 6) and an expanded group of antioxidant and DNA repair genes (n = 16) were measured simultaneously by quantitative PCR in NBEC of 24 non-BC and 25 BC individuals.
RESULTS: CEBPG transcription factor was significantly (p < 0.01) correlated with eight of the antioxidant or DNA repair genes in non-BC individuals but not in BC individuals. In BC individuals the correlation with CEBPG was significantly (p < 0.01) lower than that of non-BC individuals for four of the genes (XRCC1, ERCC5, GSTP1, and SOD1) and the difference was nearly significant for GPX1. The only other transcription factor correlated with any of these five target genes in non-BC individuals was E2F1. E2F1 was correlated with GSTP1 among non-BC individuals, but in contrast to CEBPG, there was no significant difference in this correlation in non-BC individuals compared to BC individuals.
CONCLUSION: We conclude that CEBPG is the transcription factor primarily responsible for regulating transcription of key antioxidant and DNA repair genes in non-BC individuals. Further, we conclude that the heavy smokers selected for development of BC are those who have sub-optimal regulation of antioxidant and DNA repair genes by CEBPG.

Entities:  

Mesh:

Substances:

Year:  2005        PMID: 16255782      PMCID: PMC1310620          DOI: 10.1186/1471-2407-5-141

Source DB:  PubMed          Journal:  BMC Cancer        ISSN: 1471-2407            Impact factor:   4.430


Background

BC is currently the leading cause of cancer-related death in the United States, causing 28% of all cancer deaths [1]. Although cigarette smoking is the primary risk factor, only 10–15% of heavy smokers (greater than 20 pack years) develop BC [1-3]. Antioxidant and DNA repair enzymes that provide protection from the effects of cigarette smoke are expressed in the progenitor cells for BC, normal bronchial epithelial cells (NBEC) [1]. Inherited inter-individual variation in the function of these genes plays a role in determining risk for BC [4-6]. Antioxidant enzymes protect NBEC from reactive oxygen species produced by interaction with and metabolism of xenobiotics such as pollution and cigarette smoke [4-7] as well as those produced by normal cellular metabolism. Reactive oxygen species cause many damaging reactions including denaturation of proteins, cross-linking of lipids and proteins and modification of nucleic acid bases, which can lead to cancer [7]. DNA repair enzymes repair the frequent damage to DNA caused by oxidant stress as well as other stresses, including bulky adducts derived from carcinogens in cigarette smoke [8]. We previously reported that an interactive transcript abundance index comprising antioxidant genes was lower in NBEC of BC individuals compared to non-BC individuals, suggesting that BC individuals are selected on the basis of poor antioxidant protection [9]. In that study, there was a tendency towards correlation in transcript abundance between several pairs of antioxidant or DNA repair genes in non-BC individuals, but not in BC individuals. Gene pairs included in that observation were GSTP1/GPX1, CAT/GPX3, and GPX3/SOD1. Correlation is one typical characteristic of co-regulated genes. Another is shared transcription factor recognition sites in the regulatory regions of those genes [10]. Based on the above findings, it was hypothesized first, that there is inter-individual variation in regulation of key antioxidant and DNA repair genes by one or more transcription factors and second, that individuals with sub-optimal regulation are selected for development of BC if they smoke cigarettes. To test these hypotheses, transcription factor recognition sites common to the regulatory regions of the above correlated gene pairs were identified through in silico DNA sequence analysis, and their transcript abundance measured simultaneously with an expanded group of ten antioxidant and six DNA repair genes.

Methods

NBEC sample procurement

Brush biopsy samples of normal bronchial epithelium were obtained for research studies at the time of diagnostic bronchoscopy according to previously described methods [9,11]. Normal bronchial epithelium in the lung not involved with cancer was brushed prior to biopsy of the suspected cancerous area. Samples were collected in a manner satisfying all requirements of the Institutional Review Board for the Medical University of Ohio. Each BC diagnosis and subtype identification was determined by histopathological examination in the Department of Pathology at the Medical University of Ohio. NBEC samples from a total of 49 individuals, including 24 non-BC individuals and 25 BC individuals, were evaluated in this study. The biographical characteristics of these individuals are presented in Table 1.
Table 1

Demographic data of patients from whom the NBEC samples were obtained.

Subject #GroupAgeGenderHistologySmoking Hx1Ethnicity
63NBCI277M75W3
64NBCI47F45W
136NBCI38M25A.A.4
139NBCI44F17.5W
150NBCI70F45W
156NBCI46FNS5A.A.
157NBCI60M>100W
194NBCI57M3W
210NBCI40M34W
257NBCI69F20W
261NBCI73FNSW
282NBCI83F60W
285NBCI69FNSW
296NBCI43F20W
305NBCI50F40W
315NBCI64MN/A6W
330NBCI39MNSW
331NBCIN/AN/AN/AN/A
334NBCI51M>50W
336NBCI31FNSW
337NBCI32M22N/A
339NBCI59M50W
361NBCI73FNSH7
363NBCI50M20A.A.

34BCI880MNSCLC940W
71BCI63MNSCLC100W
85BCI73FSQ10>100W
88BCI85MSQ75W
99BCI63MNSCLC45W
118BCI72MSQ30W
146BCI64FSCLC1145W
147BCI76MSCLC75W
158BCI88MSCLC115.5W
167BCI60FNSCLC50W
171BCI67MSCLC100W
191BCI75MSQ54W
211BCI71MSQ50W
212BCI65MSQ67.5W
247BCI75FSQ50W
255BCI60FNSCLC30W
259BCI68MCS12137.5W
271BCI58MAC1394.5W
287BCI65FNSCLC50W
300BCI56MSQ34W
306BCI46MSQ30W
314BCI69FBC14NSW
329BCI76FPD15>37.5W
335BCI75MSCLC58A.A.
B3BCI63MSQ60W

1Pack years; 2Non-bronchogenic carcinoma individual; 3White; 4African-American; 5Non-smoker; 6Not available; 7Hispanic; 8Bronchogenic carcinoma individual; 9Non-small cell lung cancer; 10Squamous carcinoma; 11Small cell lung cancer; 12Carcinoma-in-situ; 13Adenocarcinoma;14Bronchogenic Cancer, histology not specified; 15Poorly differentiated carcinoma.

Transcript abundance measurement

Total RNA samples extracted from NBEC were reverse transcribed using M-MLV reverse transcriptase and oligo dT primers as previously described [9,11]. Standardized RT (StaRT)-PCR was used for transcript abundance measurement in these studies. With StaRT-PCR, an internal standard for each gene within a standardized mixture of internal standards (SMIS) is included in each PCR reaction. After amplification, products were electrophoresed on an Agilent 2100 Bioanalyzer using DNA Chips with DNA 1000 Kit reagents for visualization according to the manufacturer's protocol (Agilent Technologies Deutschland GmbH, Waldbronn, Germany). The StaRT-PCR technology is licensed to Gene Express, Inc. (Toledo, OH). Many of the reagents are available commercially and were obtained through Gene Express, Inc. for this study. StaRT-PCR reagents for each of the measured genes that were not commercially available, including primers and SMIS, were prepared according to previously described methods [11,12]. Sequence information for the primers is provided in Table 2.
Table 2

Sequence for each primer used for StaRT-PCR virtually-multiplexed transcript abundance measurement or for internal standard preparation (CT) [15].

GeneAccession #PrimerSequencePositionProduct
ACTBX00351Forward5' ATC CTC ACC CTG AAG TAC CC 3'231
Reverse5' CCA TCT CTT GCT CGA AGT CC 3'704493 bp
CT5' CCA TCT CTT GCT CGA AGT CCG CCA GCC AGG TCC AGA CGC A 3'568377 bp
CATX04076Forward5' CCA GAA GAA AGC GGT CAA GA 3'1492
Reverse5' AAC CTT CAT TTT CCC CTG GG 3'1822350 bp
CT5' AAC CTT CAT TTT CCC CTG GGC CAG TGA TGA GCG GGT TAC A 3'1699247 bp
CEBPBNM_005194Forward5' TGT CCA AAC CAA CCG CAC AT 3'1412
Reverse5' AGC AAC AAG CCC GTA GGA AC 3'1657265 bp
CT5' AGC AAC AAG CCC GTA GGA ACA CGC GTT CAG CCA TGT TTA A 3'1571199 bp
CEBPGU20240Forward5' CGG TTG AAA AGC AAG CAG AAA GCA 3'488
Reverse5' GAT CCC AGA AAA TAG CCT CCA ATG 3'814350 bp
CT5' GAT CCC AGA AAA TAG CCT CCA ATG AAC ATT CAA GCC ACA AGC TC 3'726282 bp
E2F1M96577Forward5' TGA TAC CCC AAC TCC CTC TA 3'2076
Reverse5' AAA GCA GGA GGG AAC AGA GC 3'2452396 bp
CT5' AAA GCA GGA GGG AAC AGA GCA CTG CAG GGA CCA CAG G 3'2363327 bp
E2F3Y10479Forward5' TGA AAG CCC CTC CAG AAA CAA G 3'1019
Reverse5' GCA GCA GGG GAG GCA GTA AGT T 3'1336339 bp
CT5' GCA GCA GGG GAG GCA GTA AGT TGG GGA GGC CAG AGG AGA AAG GT 3'1253278 bp
E2F6AF059292Forward5' GGG CCT GCT GCC ATC AAA AAT A 3'99
Reverse5' CCG CTT TCG GAC TCC CAG TTT 3'283205 bp
CT5' CCG CTT TCG GAC TCC CAG TTA GCG ATA CAT CAA AAC GAG G 3'184125 bp
ERCC1M13194Forward5' CTG GAG CCC CGA GGA AGC 3'739
Reverse5' CAC TGG GGG TTT CCT TTG 3'1049328 bp
CT5' CAC TGG GGG TTT CCT TGG AAG GCC AGA TCT TCT CTT 3928240 bp
ERCC2X52221Forward5' GGC CTT CTT CAC CAG CTA C 3'1608
Reverse5' GTA GTC CGT CTT GCC CCT G 3'2004415 bp
CT5' GTA GTC CGT CTT GCC CCT GTG GAA CTG GTC CCG CAG GT 3'2597346 bp
ERCC4U64315Forward5' AGT GCA TCT CCA TGT CCC GCT ACT A 3'2213
Reverse5' CGA TGT TCT TAA CGT GGT GCA TCA A 3'2578390 bp
CT5' CGA TGT TCT TAA CGT GGT GCA TCA ACA GGC TGT GGC TTG CTT TGT 3'2433265 bp
ERCC5D16305Forward5' AAG GAA AGA GAA AGA AGC AGC AGC CA 3'3087
Reverse5' CAA ACA CAG ATC TGG CGG TCA CGA GG 3'3501440 bp
CT5' CAA ACA CAG ATC TGG CGG TCA CGA GGA GCT TCC TTC ACT GAG TTC TGC GAA T 3'3401366 bp
EVI1NM_005241Forward5' CGC CGG ATA TCC ACG AAG A 3'302
Reverse5' ATG CTG AGA GCG AAT GTG C 3'711428 bp
CT5' ATG CTG AGA GCG AAT GTG CTT AAA TGC CTT GGG ACA CT 3'587323 bp
GPX1Y00433Forward5' CCT GGT GGT GCT CGG CTT CC 3'522
Reverse5' CAA TGG TCT GGA AGC GGC GG 3'852350 bp
CT5' CAA TGG TCT GGA AGC GGC GGA CCG GAG ACC AGG TGA TGA G 3'757279 bp
GPX3D16360Forward5' GCA GAG CCG GGG ACA AGA GAA 3'113
Reverse5' CTG CTC TTT CTC TCC ATT GAC 3'471379 bp
CT5' CTG CTC TTT CTC TCC ATT GAC GCT CTT CCT GTA GTG CAT TCA 3'298227 bp
GSTM1,2,4,5J03817Forward5' GGG ACG CTC CTG ATT ATG AC 3'122
Reverse5' GCA AAC CAT GGC CGC TTC CC 3'442340 bp
CT5' GCA AAC CAT GGC CGC TTC CCT TCT CCA AAA TGT CCA CAC G 3'301219 bp
GSTM3J05459Forward5' GTG CGA GTC GTC TAT GGT TC 3'23
Reverse5' AGT TGT GTG CGG AAA TCC AT 3'342339 bp
CT5' AGT TGT GTG CGG AAA TCC ATT GCT CTG GGT GAT CTT GTT C 3'230247 bp
GSTP1X08058Forward5' TCC GCT GCA AAT ACA TCT CC 3'305
Reverse5' TGT TTC CCG TTG CCA TTG AT 3'616331 bp
CT5' TGT TTC CCG TTG CCA TTG ATT AGG ACC TCA TGG ATC AGC A 3'485220 bp
GSTT1X79389Forward5' GCT CTA CCT GGA CCT GCT GT 3'12
Reverse5' GGA ACA CAG GGA ACA TCA CC 3'351359 bp
CT5' GGA ACA CAG GGA ACA TCA CCT AGA GCA GGA TGG CCA CAC T 3'199227 bp
GSTZ1U86529Forward5' TCA CCC CCT ACC CTA CCA TCA GC 3'806
Reverse5' ATT TCA GCG CGG GCA TTC TTT 3'1267482 bp
CT5' ATT TCA GCG CGG GCA TTC TTT CCG CAT TCT CAT CTC AGC CTC AC 3'1161399 bp
mGST1J03746Forward5' GTC GGA GCA CGG ATC TAC CAC A 3'404
Reverse5' TTC CTC TGC TCC CCT CCT ACC TA 3'623242 bp
CT5' TTC CTC TGC TCC CCT CCT ACC TAT TTT CAG CAA CCT GTA AGC C 3'505144 bp
SOD1X02317Forward5' TGA AGG TGT GGG GAA GCA TTA 3'153
Reverse5' TTA CAC CAC AAG CCA AAC GAC 3'492360 bp
CT5' TTA CAC CAC AAG CCA AAC GAC TGA TGC AAT GGT CTC CTG AGA 3'384273 bp
XPAD14533Forward5' CTC GGC GAC GGC GGC TGC GGC TAC TGG AG 3'178
Reverse5' TGT CGG ACT TCC TTT GCT TCT TCT AAT GC 3'629480 bp
CT5' TGT CGG ACT TCC TTT GCT TCT TCT AAT GCT CTT TTT TCT AAA TCA CAG TCT 3'487360 bp
XRCC1M36089Forward5' CCC CTG AAG AGA CCA AAG CA 3'1906
Reverse5' CCA TTG AAG GCT GTG ACG TA 3'2241355 bp
CT5' CCA TTG AAG GCT GTG ACG TAT CAG GGA CTG GCA GAT G 3'2142276 bp
Including an internal standard within a SMIS in each measurement controls for all known sources of variation during PCR, including inhibitors in samples, and generates virtually-multiplexed transcript abundance data that are directly comparable across multiple experiments and institutions [13]. The performance characteristics of StaRT-PCR are superior to other forms of commercially available quantitative PCR technology in the areas critical to this study. With respect to these studies, the key property of a quantitative PCR method is not whether the PCR products are measured kinetically or at endpoint, but rather whether there are internal standards in each measurement or not. The overall performance characteristics of StaRT-PCR, including extensive validation of the method in independent laboratories have been presented in several recent articles and chapters [13-15]. With respect to the genes measured in this study, for each gene the StaRT-PCR reagents had lower detection threshold of less than 10 molecules, linear dynamic range of more than six orders of magnitude (less than 10 to over 107 molecules), and signal-to-analyte response of 100%. In addition, the presence of an internal standard controls for inter-sample variation in presence of PCR inhibitors (which often are gene-specific) and ensures no false negatives (if the PCR fails the internal standard PCR product is not observed and there are no data to report). False positives are eliminated through use of a control PCR reaction with no cDNA in it.

Statistical analysis

More than 6,000 transcript abundance measurements were conducted in multiple experiments over two years to assess the six transcription factors and sixteen antioxidant and DNA repair genes in NBEC samples from 49 individuals (24 non-BC individuals and 25 BC individuals). Correlation of each of the six transcription factors with each of the antioxidant or DNA repair genes was determined by Pearson's correlation following logarithmic transformation. The transformation was necessary due to the wide biological variation in expression of each gene among the individuals. Significance level was defined as p < 0.01 following Bonferroni adjustment for multiple comparison, specifically comparison of each of six transcription factors to each of the antioxidant or DNA repair genes. Comparison for significant differences between pairs of correlation coefficients was done by Fisher's Z-transformation test [16]. Analysis of the relationship between virtually-multiplexed transcript abundance data for each gene with age was assessed by Pearson's correlation, with gender by t-test, and with smoking history by ANOVA followed by Duncan's test.

Transcription factor recognition site analysis

The El Dorado (Build 35) program from the Genomatix software package was used to locate the correlated genes within the genome and define 1101 base pairs of the promoter regions (1000 base pairs upstream of and 100 base pairs into the transcription start site) for each gene (Genomatix Software GmbH, Munich, Germany, [17]). The 1101 base pair sequences obtained from the El Dorado program then were used as the target sequences for putative transcription factor recognition site identification using the MatInspector Version 4.2 program, which yielded sites for 11 transcription factors (Genomatix Software GmbH, Munich, Germany, [17]). The parameters used were the standard (0.75) core similarity and the optimized matrix similarity [18]. StaRT-PCR reagents were optimized for ten of these transcription factors, including CEBPB, CEBPE, CEBPG, E2F1, E2F3, E2F4, E2F5, E2F6, EVI1, and PAX5. Four transcription factors were expressed at low and invariant levels among multiple NBEC samples and were therefore excluded from the study. The remaining six, CEBPB, CEBPG, E2F1, E2F3, E2F6, and EVI, were evaluated for correlation with an expanded group of ten antioxidant and six DNA repair genes.

Results

Virtually-multiplexed transcript abundance data were obtained for each gene in each of the 49 samples, except for E2F1 measurement in sample 147 (Table 3). A gene-specific inhibitor in sample 147 prevented amplification of E2F1. Neither the internal standard, nor the native cDNA PCR product was observed. The presence of gene-specific PCR inhibition was observable in some other samples as reduction in peak heights in internal standard PCR products relative to that expected for the number of internal standard molecules present at the beginning of the PCR reaction. However, in each such case, the PCR amplification was efficient enough to enable quantification.
Table 3

Virtually-multiplexed transcript abundance data.

SUBJECT #GROUPCEBPBCEBPGE2F1E2F3E2F6EVI1CATERCC1ERCC2ERCC4ERCC5
63NBCI7.2E+036.4E+022.7E+021.9E+023.4E+026.1E+012.0E+041.1E+053.8E+031.1E+024.3E+04
64NBCI7.9E+031.7E+032.0E+032.0E+011.7E+022.5E+012.5E+043.0E+053.4E+038.1E+012.0E+05
136NBCI6.2E+032.1E+027.0E+021.0E-025.0E+016.0E+012.9E+037.4E+035.9E+025.6E+011.9E+04
139NBCI4.5E+033.4E+032.3E+045.6E+021.0E+031.0E+036.1E+051.2E+062.2E+043.9E+034.6E+05
150NBCI8.5E+037.4E+021.6E+021.1E+024.1E+013.2E+023.5E+041.7E+055.7E+036.9E+027.2E+04
156NBCI2.1E+041.2E+037.5E+02ND1ND1.4E+021.5E+041.8E+052.0E+032.9E+023.0E+04
157NBCI2.3E+044.1E+033.1E+032.1E+026.1E+021.4E+023.5E+055.6E+058.4E+031.6E+032.1E+05
194NBCI6.5E+032.1E+032.9E+022.6E+028.5E+024.7E+024.5E+046.1E+054.7E+034.4E+028.9E+04
210NBCI1.0E+042.1E+037.6E+024.0E+026.1E+023.6E+027.6E+047.6E+043.4E+037.7E+027.2E+04
257NBCI1.1E+041.8E+032.7E+028.9E+021.7E+039.7E+021.0E+052.6E+051.6E+037.6E+027.4E+04
261NBCI7.6E+031.3E+032.5E+021.7E+021.6E+029.4E+014.4E+042.7E+055.1E+034.6E+021.3E+05
282NBCI6.4E+031.2E+035.4E+024.1E+012.9E+011.2E+024.1E+041.0E+056.0E+038.1E+024.0E+04
285NBCI2.6E+034.4E+022.5E+031.1E+03ND1.2E+023.7E+049.1E+041.6E+037.7E+031.6E+04
296NBCI1.9E+041.0E+031.0E+03ND2.8E+01ND6.9E+042.5E+053.6E+031.7E+027.1E+04
305NBCI1.8E+039.1E+016.1E+02ND8.7E+012.6E+022.0E+044.9E+044.3E+022.8E+021.3E+04
315NBCI3.5E+031.3E+031.2E+032.0E+027.5E+016.1E+024.7E+042.1E+052.7E+032.4E+036.2E+04
330NBCI2.7E+032.4E+024.0E+021.1E+023.5E+024.0E+023.6E+048.3E+041.5E+031.7E+032.8E+04
331NBCI7.3E+031.3E+038.0E+02NDNDND8.6E+042.6E+056.5E+026.4E+026.2E+04
334NBCI3.7E+036.1E+021.1E+032.3E+011.9E+014.0E+017.8E+041.4E+052.5E+031.3E+035.1E+04
336NBCI3.6E+038.4E+022.8E+034.3E+021.2E+022.5E+028.9E+042.1E+057.9E+033.1E+039.5E+04
337NBCI5.0E+039.3E+021.1E+031.6E+022.5E+02ND6.5E+041.9E+053.3E+031.3E+034.2E+04
339NBCI2.5E+033.2E+025.2E+023.4E+014.3E+016.3E+015.0E+048.7E+042.3E+037.7E+023.2E+04
361NBCI7.9E+032.5E+036.7E+025.8E+021.4E+033.5E+026.7E+042.5E+051.5E+032.8E+034.7E+04
363NBCI6.2E+031.7E+031.4E+031.1E+021.9E+025.2E+017.6E+041.4E+053.7E+037.3E+027.2E+04

34BCI1.9E+031.7E+035.7E+022.4E+021.2E+023.7E+026.7E+045.6E+056.1E+031.5E+031.2E+05
71BCI7.6E+031.2E+031.3E+031.7E+011.8E+022.1E+019.5E+046.7E+051.8E+042.4E+022.2E+05
85BCI1.0E+049.7E+028.7E+026.3E+01NDND2.6E+044.9E+041.1E+033.9E+022.3E+04
88BCI1.5E+034.0E+024.2E+021.1E+02NDND3.1E+042.9E+042.7E+033.6E+023.9E+04
99BCI1.4E+043.3E+032.6E+032.2E+023.1E+031.4E+022.1E+052.0E+053.7E+039.3E+021.4E+05
118BCI1.5E+031.2E+032.8E+02NDND1.4E+022.4E+043.5E+041.9E+031.2E+031.7E+04
146BCI1.9E+041.2E+032.3E+021.2E+021.6E+036.0E+013.1E+041.4E+054.9E+032.7E+026.2E+04
147BCI2.8E+031.3E+03N/A23.6E+022.8E+023.8E+026.1E+043.0E+052.7E+031.1E+031.8E+04
158BCI4.9E+038.8E+021.9E+022.8E+014.0E+026.8E+012.2E+041.4E+051.7E+031.5E+021.3E+04
167BCI6.7E+039.2E+024.2E+021.8E+028.2E+023.8E+028.9E+041.1E+052.2E+032.8E+024.1E+04
171BCI1.0E+043.9E+033.2E+035.1E+028.9E+012.4E+025.3E+058.3E+051.3E+046.3E+021.6E+05
191BCI1.6E+042.4E+031.5E+021.5E+027.1E+011.2E+022.0E+049.6E+042.5E+033.4E+021.2E+04
211BCI3.8E+031.1E+036.5E+026.3E+023.2E+022.9E+028.3E+043.5E+052.8E+033.2E+029.1E+04
212BCI1.8E+042.8E+036.0E+023.4E+022.0E+022.2E+023.4E+042.1E+055.2E+033.0E+023.6E+04
247BCI6.5E+037.5E+026.4E+021.8E+02ND1.2E+027.8E+044.4E+045.5E+022.7E+031.3E+04
255BCI1.9E+041.1E+036.1E+021.8E+012.3E+027.6E+011.1E+056.5E+051.2E+041.2E+033.8E+05
259BCI8.4E+036.4E+025.1E+02ND6.7E+014.0E+014.7E+041.5E+052.6E+033.8E+027.9E+04
271BCI4.1E+036.6E+021.4E+036.5E+012.7E+021.9E+029.1E+041.5E+051.6E+03ND1.5E+05
287BCI8.7E+031.1E+039.5E+017.4E+014.0E+012.6E+024.7E+041.2E+056.5E+031.1E+024.1E+04
300BCI4.9E+034.4E+024.1E+024.4E+01ND5.1E+013.4E+046.9E+041.0E+031.0E+032.6E+04
306BCI6.5E+036.8E+024.9E+024.4E+01ND1.7E+013.7E+042.5E+052.8E+033.2E+024.1E+04
314BCI4.4E+039.3E+024.8E+023.1E+02ND1.9E+025.3E+046.2E+044.5E+035.2E+023.1E+04
329BCI1.4E+043.5E+022.4E+02NDNDND7.9E+041.5E+051.4E+03ND1.5E+05
335BCI4.2E+033.7E+022.2E+033.3E+022.4E+022.4E+024.8E+041.9E+059.8E+033.1E+023.3E+04
B3BCI7.4E+039.3E+021.4E+022.4E+021.8E+023.8E+023.4E+041.6E+052.2E+032.8E+024.1E+04

SUBJECT #GROUPGPX1GPX3GSTM15GSTM3GSTP1GSTT1GSTZ1MGST1SOD1XPAXRCC1

63NBCI8.4E+051.5E+037.3E+033.9E+031.9E+06ND3.1E+031.7E+051.6E+052.5E+032.1E+04
64NBCI4.4E+053.1E+021.5E+042.6E+033.5E+066.4E+032.0E+031.6E+056.5E+052.2E+034.2E+04
136NBCI2.0E+055.2E+026.3E+032.2E+035.3E+05ND8.9E+023.2E+045.3E+041.1E+034.4E+03
139NBCI1.8E+063.2E+035.2E+048.3E+033.2E+07ND2.8E+048.4E+052.2E+062.6E+041.6E+05
150NBCI2.1E+052.8E+031.1E+041.1E+031.5E+06ND3.1E+033.1E+042.0E+052.0E+033.1E+04
156NBCI4.8E+053.9E+031.9E+046.5E+032.9E+06ND4.9E+035.8E+041.3E+054.0E+032.5E+04
157NBCI2.4E+062.0E+032.6E+043.3E+031.8E+078.5E+033.6E+033.6E+051.8E+066.2E+031.7E+05
194NBCI5.4E+054.0E+031.2E+041.1E+037.8E+067.7E+033.2E+039.6E+045.8E+053.7E+032.4E+05
210NBCI3.2E+053.3E+034.1E+032.6E+033.9E+061.8E+033.7E+037.2E+043.3E+054.8E+032.9E+04
257NBCI6.3E+052.5E+031.1E+041.2E+022.8E+068.7E+032.9E+032.3E+041.2E+052.6E+034.9E+04
261NBCI7.6E+054.7E+031.6E+047.4E+033.1E+061.5E+031.8E+031.2E+054.5E+054.9E+037.2E+04
282NBCI4.6E+052.5E+031.3E+042.3E+031.9E+061.5E+041.3E+036.5E+044.5E+052.0E+033.1E+04
285NBCI4.3E+055.3E+038.4E+046.2E+023.5E+067.3E+039.2E+033.0E+042.3E+05ND2.0E+04
296NBCI8.1E+051.5E+037.4E+034.1E+033.6E+068.1E+034.5E+032.0E+055.4E+052.6E+035.2E+04
305NBCI1.4E+056.5E+024.0E+033.2E+031.6E+06ND6.9E+024.1E+041.8E+051.3E+035.7E+03
315NBCI4.2E+057.6E+035.1E+033.1E+037.9E+067.2E+034.3E+037.3E+044.5E+051.0E+043.8E+04
330NBCI1.2E+052.0E+035.8E+033.5E+031.1E+063.1E+011.3E+032.9E+042.3E+051.4E+031.7E+04
331NBCI6.1E+053.4E+031.3E+043.6E+037.3E+061.0E+041.8E+031.2E+051.3E+061.9E+035.7E+04
334NBCI6.5E+054.0E+032.7E+041.7E+043.7E+064.6E+033.0E+036.8E+045.9E+052.9E+032.6E+04
336NBCI4.7E+052.7E+034.4E+041.6E+033.4E+061.2E+048.3E+036.2E+045.4E+054.2E+031.2E+05
337NBCI2.8E+052.8E+033.8E+034.3E+031.5E+065.3E+035.6E+034.9E+043.3E+051.7E+034.4E+04
339NBCI3.1E+051.6E+043.0E+041.1E+033.5E+066.6E+032.0E+036.6E+042.4E+052.5E+032.2E+04
361NBCI3.7E+058.4E+021.5E+041.3E+037.8E+063.1E+032.7E+034.8E+046.4E+056.1E+037.7E+04
363NBCI6.1E+052.7E+032.2E+042.0E+038.1E+069.9E+032.4E+039.7E+047.0E+057.0E+033.6E+04

34BCI8.8E+053.1E+033.4E+035.1E+021.8E+06ND1.9E+036.1E+042.9E+053.1E+035.6E+04
71BCI8.8E+051.3E+043.2E+042.2E+036.7E+06ND2.5E+034.1E+057.5E+053.1E+035.6E+05
85BCI2.3E+051.1E+032.6E+044.1E+031.2E+061.0E+048.1E+024.8E+041.5E+053.5E+031.8E+04
88BCI1.3E+051.5E+033.5E+031.1E+036.9E+05ND5.6E+022.4E+041.9E+052.4E+031.6E+04
99BCI9.0E+052.1E+031.5E+045.5E+039.6E+066.7E+038.0E+031.2E+059.4E+058.5E+034.7E+04
118BCI8.0E+043.8E+031.1E+044.8E+031.7E+064.0E+034.5E+024.4E+041.7E+051.1E+036.1E+03
146BCI4.1E+052.9E+044.7E+044.7E+022.0E+067.1E+031.6E+046.6E+041.7E+053.8E+046.5E+04
147BCI4.1E+052.5E+038.2E+037.3E+028.8E+051.7E+031.3E+032.3E+046.5E+052.3E+027.4E+04
158BCI2.6E+042.6E+031.4E+044.7E+032.3E+062.9E+031.2E+032.3E+043.7E+041.9E+032.3E+04
167BCI1.9E+052.5E+035.3E+031.8E+032.4E+061.4E+042.3E+039.5E+044.0E+052.9E+032.8E+04
171BCI1.6E+061.7E+031.6E+044.0E+037.8E+066.0E+036.5E+032.5E+051.4E+061.4E+046.0E+04
191BCI4.5E+051.9E+033.5E+032.5E+021.7E+063.6E+035.6E+022.1E+041.6E+052.5E+032.1E+04
211BCI6.0E+051.7E+041.1E+041.3E+021.0E+075.5E+034.1E+032.4E+047.1E+054.3E+038.3E+04
212BCI6.5E+041.6E+039.6E+031.1E+034.0E+068.1E+039.5E+027.5E+049.4E+043.1E+033.6E+04
247BCI2.3E+052.0E+021.4E+042.1E+031.3E+067.1E+031.1E+032.4E+041.3E+051.3E+031.1E+04
255BCI1.3E+067.3E+031.5E+042.5E+032.8E+063.8E+048.1E+031.1E+052.8E+062.0E+031.0E+05
259BCI3.0E+065.9E+031.4E+047.0E+039.2E+065.1E+035.1E+031.1E+055.0E+054.8E+032.1E+04
271BCI7.4E+059.5E+023.8E+032.6E+032.4E+063.7E+033.1E+037.1E+046.3E+054.1E+033.0E+04
287BCI5.8E+058.7E+031.1E+042.3E+031.9E+061.5E+046.3E+028.8E+041.2E+054.7E+033.3E+04
300BCI2.7E+051.5E+037.7E+035.0E+033.7E+067.1E+031.9E+036.7E+043.2E+058.0E+021.3E+04
306BCI4.2E+052.0E+029.7E+032.3E+032.2E+06ND4.8E+035.6E+041.1E+064.5E+021.9E+04
314BCI1.6E+053.7E+031.7E+041.1E+031.2E+061.9E+021.5E+032.7E+042.7E+052.8E+033.4E+04
329BCI2.3E+058.9E+031.6E+045.3E+037.2E+067.8E+031.4E+035.7E+048.9E+051.9E+034.8E+04
335BCI4.1E+056.5E+032.1E+047.1E+034.3E+061.8E+027.0E+036.9E+045.2E+053.9E+034.4E+04
B3BCI4.0E+053.2E+036.0E+033.8E+021.6E+067.7E+031.8E+031.8E+042.6E+053.7E+031.7E+04

Virtually-multiplexed transcript abundance data for each gene (in the form of molecules/106 β-actin molecules) from all experiments were included in the same Standardized Expression Database (SED). These data are now directly comparable to previously published virtually-multiplexed transcript abundance data from this laboratory [15], or to Virtually-multiplexed transcript abundance data collected by others who use the NCI-funded (R24 CA 95806) Standardized Expression Measurement (SEM) Center. The data presented here represent more than 6,000 Virtually-multiplexed transcript abundance measurements conducted in multiple experiments. The sixteen antioxidant or DNA repair genes and each of the six transcription factors except for E2F1 were measured in each NBEC sample from 49 individuals (24 non-BC individuals and 25 BC individuals).

1ND, Not detectable (When 60 molecules of Competitive Template (CT) were added to the reaction, CT was detected but native template was not.)

2NA, Not available. E2F1 was measured in all of the samples except for one BC individual (24 non-BC individuals and 24 BC individuals).

Bivariate analysis

In non-BC individuals there was significant (p < 0.01) correlation between CEBPG and eight of the 16 antioxidant or DNA repair genes, specifically XRCC1, ERCC5, GSTP1, SOD1, GPX1, ERCC1, CAT and ERCC2 (Table 4). In contrast, in BC individuals samples CEBPG was not correlated with any of the antioxidant or DNA repair genes. These relationships were not observed with any of the other transcription factors studied.
Table 4

Bivariate analysis of virtually-multiplexed transcript abundance data values for each antioxidant or DNA repair gene versus each transcription factor.

Non-BC Individuals n = 24BC Individuals n = 25ALL n = 49
Antioxidant/DNA Repair Genes vs Transcription Factorsr Valuep Valuer Valuep Valuer Valuep Value

CAT vs CEBPB0.1310.1810.151
CAT vs CEBPG0.650.0040.350.480.55<0.0006
CAT vs E2F1*0.540.040.680.0020.56<0.0006
CAT vs E2F30.480.120.1810.370.06
CAT vs E2F60.2610.30.840.250.48
CAT vs EVI1-0.0110.2110.081
ERCC1 vs CEBPB0.320.780.2710.290.24
ERCC1 vs CEBPG0.77<0.00060.420.240.62<0.0006
ERCC1 vs E2F10.350.540.390.360.370.06
ERCC1 vs E2F30.390.360.2110.310.18
ERCC1 vs E2F60.1710.630.0050.420.02
ERCC1 vs EVI1-0.0210.380.360.171
ERCC2 vs CEBPB0.2510.1910.220.84
ERCC2 vs CEBPG0.630.0060.390.30.53<0.0006
ERCC2 vs E2F10.390.360.320.720.330.12
ERCC2 vs E2F30.580.020.2210.420.02
ERCC2 vs E2F60.370.420.510.060.420.02
ERCC2 vs EVI10.1910.290.960.230.66
ERCC4 vs CEBPB-0.350.6-0.111-0.161
ERCC4 vs CEBPG0.2410.370.420.250.48
ERCC4 vs E2F10.420.240.0410.21
ERCC4 vs E2F30.60.010.330.60.330.12
ERCC4 vs E2F6-0.0410.0410.071
ERCC4 vs EVI10.2410.330.660.270.36
ERCC5 vs CEBPB0.40.30.2810.330.12
ERCC5 vs CEBPG0.79<0.00060.1210.460.005
ERCC5 vs E2F10.440.180.450.180.440.01
ERCC5 vs E2F30.390.36-0.1110.131
ERCC5 vs E2F60.410.30.350.540.380.04
ERCC5 vs EVI10.071-0.0410.011
GPX1 vs CEBPB0.490.060.2410.320.12
GPX1 vs CEBPG0.72<0.00060.1910.40.02
GPX1 vs E2F10.480.120.380.360.430.02
GPX1 vs E2F30.221-0.00410.081
GPX1 vs E2F60.0610.360.480.280.3
GPX1 vs EVI1-0.0610.210.11
GPX3 vs CEBPB-0.1810.1410.021
GPX3 vs CEBPG0.1310.0110.071
GPX3 vs E2F1-0.031-0.171-0.121
GPX3 vs E2F30.320.78-0.210.051
GPX3 vs E2F6-0.2610.440.180.191
GPX3 vs EVI10.0610.0810.061
GSTM1-5 vs CEBPB-0.0810.430.180.171
GSTM1-5 vs CEBPG0.2510.0210.161
GSTM1-5 vs E2F10.510.060.2310.410.02
GSTM1-5 vs E2F30.290.96-0.1610.11
GSTM1-5 vs E2F6-0.30.90.0061-0.11
GSTM1-5 vs EVI10.221-0.1210.071
GSTM3 vs CEBPB0.0110.0610.041
GSTM3 vs CEBPG-0.0071-0.2810.011
GSTM3 vs E2F10.2910.350.540.340.12
GSTM3 vs E2F3-0.310.84-0.490.06-0.40.03
GSTM3 vs E2F6-0.111-0.251-0.161
GSTM3 vs EVI1-0.271-0.251-0.250.54
GSTP1 vs CEBPB0.1910.380.360.280.36
GSTP1 vs CEBPG0.74<0.00060.1810.510.001
GSTP1 vs E2F10.60.010.460.120.56<0.0006
GSTP1 vs E2F30.320.78-0.2510.071
GSTP1 vs E2F60.110.350.480.260.42
GSTP1 vs EVI10.1110.1310.121
GSTT1 vs CEBPB0.0310.450.120.240.6
GSTT1 vs CEBPG0.390.360.1610.30.24
GSTT1 vs E2F10.071-0.151-0.051
GSTT1 vs E2F30.350.54-0.110.171
GSTT1 vs E2F60.0510.2110.111
GSTT1 vs EVI1-0.2610.221-0.041
GSTZ1 vs CEBPB0.1110.360.420.250.54
GSTZ1 vs CEBPG0.510.060.0810.280.3
GSTZ1 vs E2F10.640.0040.50.060.58<0.0006
GSTZ1 vs E2F30.420.240.1410.250.54
GSTZ1 vs E2F6-0.0510.480.120.320.18
GSTZ1 vs EVI10.0210.2710.161
mGST vs CEBPB0.310.780.350.480.320.12
mGST vs CEBPG0.560.020.2510.420.02
mGST vs E2F10.580.020.540.040.58<0.0006
mGST vs E2F30.031-0.151-0.061
mGST vs E2F60.1710.290.960.270.36
mGST vs EVI1-0.1610.071-0.041
SOD1 vs CEBPB0.1310.1510.141
SOD1 vs CEBPG0.660.0020.00910.360.06
SOD1 vs E2F10.590.020.550.040.56<0.0006
SOD1 vs E2F30.251-0.0710.091
SOD1 vs E2F60.1210.1410.141
SOD1 vs EVI1-0.1710.031-0.061
XPA vs CEBPB0.310.840.420.240.310.18
XPA vs CEBPG0.360.540.330.660.340.12
XPA vs E2F1-0.0510.221-0.021
XPA vs E2F3-0.0710.141-0.011
XPA vs E2F60.550.040.460.120.40.02
XPA vs EVI10.0410.0710.041
XRCC1 vs CEBPB0.360.480.2810.320.18
XRCC1 vs CEBPG0.83<0.00060.2710.591<0.0006
XRCC1 vs E2F10.320.780.370.480.350.12
XRCC1 vs E2F30.470.120.2210.350.06
XRCC1 vs E2F60.2610.540.040.410.02
XRCC1 vs EVI1-0.00910.1210.061

The correlation of logarithmically transformed virtually-multiplexed transcript abundance data (Table 3) from each of the six transcription factors with each of the sixteen antioxidant or DNA repair genes was determined. The transformation was necessary due to the wide range of expression of each gene among the samples. The correlation coefficient (r value) and level of significance (p value) for each correlation are presented. Significance (p < 0.01) was determined using a two-tailed test following Bonferroni adjustment for six multiple comparisons (comparison of each of six transcription factors to each of the antioxidant or DNA repair genes).

*values for E2F1 obtained in all but one BC individuals.

For XRCC1, ERCC5, GSTP1, and SOD1 the correlation with CEBPG was significantly lower in BC individuals compared to non-BC individuals and the difference was nearly significant for GPX1 (Fig. 1b). Scatter plots of the relationship between CEBPG and XRCC1 in non-BC individuals or BC individuals (Fig. 2a,b) are representative of the other four genes. Neither CEBPG, nor XRCC1, ERCC5, GSTP1, SOD1 or GPX1 was significantly correlated with age, gender, or smoking history in non-BC individuals, BC individuals, or the combined group.
Figure 1

Correlation of each transcription factor with XRCC1, ERCC5, GSTP1, SOD1, or GPX1. (a-f) Each panel presents the correlation coefficients (r values) for one transcription factor in relation to each of the five genes: (a) CEBPB, (b) CEBPG, (c) E2F1, (d) E2F3, (e) E2F6, (f) EVI1. The p value for each significant correlation is provided above the bar. For CEBPG, presented in panel b, the difference in r value between non-BC individuals and BC individuals was significant or nearly significant for each correlated gene, and the p value for each comparison is provided below the corresponding pair of bars.

Figure 2

Scatter plot representation of bivariate correlation of CEBPG with XRCC1. (a, b) CEBPG/XRCC1 data from Figure 1b presented as scatter plots: (a) non-BC individuals, (b) BC individuals.

In non-BC individuals, based on the r2 values from Pearson's correlation analysis, CEBPG accounts for much of the variance in expression of XRCC1 (69%), ERCC5 (62%), GSTP1 (55%), SOD1 (44%), and GPX1 (52%). E2F1 accounts for some of the remaining variance. For example, when samples from all 49 non-BC individuals and BC individuals were assessed as a single group, E2F1 was significantly correlated with ERCC5, GSTP1 and SOD1 (Table 4). Further, in non-BC individuals, E2F1 was correlated with GSTP1 (Fig. 1c) and the correlation was lower in BC individuals. However, the difference in correlation between non-BC individuals and BC individuals was not significant. None of the other transcription factors were correlated with XRCC1, ERCC5, GSTP1, SOD1, or GPX1 (Fig. 1a,d,e,f).

Comparison of gene expression with demographic characteristics

E2F1 and GSTZ1 each were positively correlated with age. GSTM1-5 was the only gene with a difference in expression by gender. There was a difference in ERCC2 expression between former and never smokers.

Discussion

In this study, we tested two hypotheses. First, that there is inter-individual variation in regulation of key antioxidant and DNA repair genes by one or more transcription factors. Second, that individuals with sub-optimal regulation are selected for development of BC if they smoke cigarettes. These hypotheses are supported by the findings that a) there was large inter-individual variation in transcript levels of CEBPG and each of the target genes and in non-BC individuals, b) CEBPG transcript abundance values were significantly correlated by bivariate analysis with the transcript abundance values of four key antioxidant and DNA repair genes in non-BC individuals, and c) that there was no correlation between CEBPG and these genes in BC individuals. These results support the hypothesis that each of the antioxidant or DNA repair genes correlated with CEBPG in non-BC individuals is regulated by CEBPG. This is supported by the specificity of the CEBPG correlation. That is, there was lack of correlation between any of the other five transcription factors assessed and these target genes. Of particular note is the lack of correlation of the target genes with CEBPB, which binds to the same recognition site as CEBPG, and shares its recognition site within each of the antioxidant or DNA repair genes. However, there are alternative explanations for the observed correlation of CEBPG with antioxidant and DNA repair genes in non-BC individuals. One possibility is that CEBPG and each of the correlated antioxidant or DNA repair genes is regulated by a transcription factor that is as yet undiscovered, and/or has a recognition site that is not yet known and was not in the Genomatix software database. There also is more than one possible explanation for the observed lack of correlation between CEBPG and antioxidant or DNA repair genes in BC individuals. For example, the non-BC individual and BC individual groups are not perfectly matched with respect to age, gender or smoking history (Table 1) and each of these factors could contribute to the observed difference in correlation between groups. However, the lack of association of transcript abundance level for CEBPG, XRCC1, ERCC5, GSTP1, SOD1, or GPX1 with age, gender or smoking history argues against such an explanation. One way to examine this possibility is through additional, larger, more closely matched studies. Another possible explanation is that any differences in NBEC from BC individuals compared to non-BC individuals resulted from development of BC, instead of being a hereditary cause of increased risk for cancer. The best way to determine this will be to conduct a prospective study. In such a study, individuals matched for smoking history will be monitored for development of BC over time. The correlation of transcript abundance values for CEBPG relative to transcript abundance values for each of the antioxidant or DNA repair genes will be assessed. It is expected that the greatest incidence of BC will be among the heaviest smokers. Among the matched heaviest smokers, it is expected that CEBPG will be significantly correlated with each of the antioxidant or DNA repair genes among the non-BC individuals but not correlated in BC individuals. Thus, there are multiple possible explanations for the observed findings. However, based on the preponderance of data thus far available, we conclude that CEBPG is responsible for optimal transcriptional regulation of key antioxidant or DNA repair genes in NBEC and that there is inter-individual variation in the regulation of each of these genes by CEBPG. If this conclusion is correct, the individuals at greatest risk for BC will be those with the most extreme smoking history combined with sub-optimal regulation of the largest number of antioxidant and DNA repair genes. This, in turn, leads to increased representation among BC individuals of individuals with lack of correlation between CEBPG and each of the affected antioxidant and/or DNA repair genes. CEBPG is a truncated CEBP transcription factor [19] and possesses the sequences necessary for DNA binding and heterodimer formation, but lacks the sequences necessary for transactivation [20]. CEBPG forms heterodimers with other CEBP family members and in other tissues this leads to increased [21] or decreased [20] transcription of the regulated gene. CEBPG is known to have stimulatory effect on the IL-6 and IL-8 promoters in B cell lines [21], and can also act as a dominant negative regulator of CEBPA and CEBPB in fibroblast and B cell lines [20]. The data from CEBPG knockout mice support a role for CEBPG in protecting lungs from oxidant damage. CEBPG-/- knockout mice are healthy at birth but begin to die within 24 hours, and histological examination reveals emphysematous lungs [22]. In humans, risk for emphysema is associated with antioxidant capacity [23], and there is a strong correlation between risk for emphysema and risk for BC. However, it will be important to obtain direct experimental evidence in NBECs for the role of CEBPG in regulating the antioxidant and DNA repair genes included in this study. Correlation between CEBPG and target gene transcript levels may not be associated with correlation at the protein level. In this study, E2F1 correlation with DNA repair and antioxidant genes was less than the correlation observed with CEBPG, and the E2F1 correlation was observed in both non-BC individuals as well as BC individuals. The maintained correlation of E2F1 with DNA repair and antioxidant genes in BC individuals suggests that this function is more tightly controlled in the population and does not play a role in determination of risk for BC. E2F1 has previously been reported to regulate transcription of DNA repair enzyme genes in other cell types, including primary human fibroblasts and mouse epidermal cells [24,25]. Clearly this would have survival value since DNA repair gene up-regulation in response to E2F1 provides additional DNA repair when the DNA is replicating and is particularly vulnerable to damage. Epidemiologic assessment of the correlation between a particular variation in DNA sequence, or polymorphism, and risk for BC has been a dominant paradigm for many years. Thus far, these efforts have met with scant success [26]. A common limitation in design of such studies is that they involve assessment of a single polymorphism or occasionally, a few polymorphisms. Further, although the polymorphism assessed typically resides within a gene known to protect bronchial epithelium from carcinogens, oxidants, or DNA damage, the selection of the particular polymorphism for study is largely empiric, and not based on known functional properties. These are problems because multiple infrequent polymorphisms at different sites may all contribute to risk and unless the key polymorphisms can be identified through a functional test, a statistically valid assessment would require much larger study populations [27]. The findings of this study support a novel approach to identifying clinically useful biomarkers. According to the paradigm used in this study, a) a normal phenotype results from regulated transcription of a group of genes by one or more transcription factors, b) the corresponding risk-conferring or disease phenotype results from sub-optimal interaction among those same genes, and c) each phenotype is identifiable and distinguishable through virtually-multiplexed transcript abundance analysis. The data presented here support the utility of this paradigm in identifying genes associated with risk for BC. The next step will be to identify polymorphisms that affect regulation of XRCC1, ERCC5, GSTP1, SOD1, and GPX1 by CEBPG. Such polymorphisms should yield biomarkers suitable for more readily accessible samples, such as peripheral blood or buccal smears. A biomarker combining polymorphisms that affect regulation with those that affect function of antioxidant and DNA repair genes is likely to be the most accurate for identifying individuals at risk for BC. Biomarkers that accurately identify individuals at risk for BC will improve efficacy of chemoprevention and early detection clinical trials. The observed inter-sample variation in the presence of gene-specific inhibitors of PCR provides evidence supporting the need for inclusion of an internal standard in each quantitative PCR transcript abundance measurement. Including such internal standards in the form of standardized mixtures of internal standards improves the reproducibility of transcript abundance measurement and enables development of a standardized database comprising virtually-multiplexed transcript abundance data. Virtually-multiplexed transcript abundance data are highly suited to identification of genes that have correlated transcript abundance values. Correlation at the transcript abundance level is an important property of genes that are co-regulated at the transcription level.

Conclusion

We conclude that in non-BC individuals, CEBPG regulates transcription of key antioxidant or DNA repair genes in NBEC and that in smokers who develop BC, CEBPG regulation is sub-optimal for a sufficient number of antioxidant and/or DNA repair genes to cause increased risk.

Competing interests

JCW and ELC each have significant equity interest in Gene Express, Inc., which produces and markets StaRT-PCR™ reagents used in this study.

Authors' contributions

DNM participated in the design of the study, performed the TF identification, carried out TA measurements, coordinated and participated in the statistical analysis and drafted the manuscript. ELC contributed the preliminary data, participated in the design of the study and carried out TA measurements. SAK performed the statistical analysis for interpretation of data. DAH and YY consented patients and obtained the primary samples necessary for the study according to IRB regulations. JCW conceived of the study, participated in its design and coordination, and helped to draft the manuscript. All authors read and approved the final manuscript.

Pre-publication history

The pre-publication history for this paper can be accessed here:
  21 in total

1.  Lung cancer.

Authors:  Patricia Donahue Baldwin
Journal:  Clin J Oncol Nurs       Date:  2003 Nov-Dec       Impact factor: 1.027

Review 2.  Why have we failed to find the low penetrance genetic constituents of common cancers?

Authors:  Neil E Caporaso
Journal:  Cancer Epidemiol Biomarkers Prev       Date:  2002-12       Impact factor: 4.254

3.  Epidemiology of lung cancer with special reference to genetics, bioassays, women, and developing countries.

Authors:  G P Page; J L Green; D Lackland
Journal:  Semin Respir Crit Care Med       Date:  2000       Impact factor: 3.119

Review 4.  The base excision repair pathway.

Authors:  E Seeberg; L Eide; M Bjørås
Journal:  Trends Biochem Sci       Date:  1995-10       Impact factor: 13.807

5.  MatInd and MatInspector: new fast and versatile tools for detection of consensus matches in nucleotide sequence data.

Authors:  K Quandt; K Frech; H Karas; E Wingender; T Werner
Journal:  Nucleic Acids Res       Date:  1995-12-11       Impact factor: 16.971

Review 6.  Biology of disease: free radicals and tissue injury.

Authors:  B A Freeman; J D Crapo
Journal:  Lab Invest       Date:  1982-11       Impact factor: 5.662

7.  E2F integrates cell cycle progression with DNA repair, replication, and G(2)/M checkpoints.

Authors:  Bing Ren; Hieu Cam; Yasuhiko Takahashi; Thomas Volkert; Jolyon Terragni; Richard A Young; Brian David Dynlacht
Journal:  Genes Dev       Date:  2002-01-15       Impact factor: 11.361

8.  C/EBP gamma has a stimulatory role on the IL-6 and IL-8 promoters.

Authors:  Hongwei Gao; Sara Parkin; Peter F Johnson; Richard C Schwartz
Journal:  J Biol Chem       Date:  2002-08-12       Impact factor: 5.157

9.  Expression of plasma glutathione peroxidase in human liver in addition to kidney, heart, lung, and breast in humans and rodents.

Authors:  F F Chu; R S Esworthy; J H Doroshow; K Doan; X F Liu
Journal:  Blood       Date:  1992-06-15       Impact factor: 22.113

10.  The human glutathione S-transferase supergene family, its polymorphism, and its effects on susceptibility to lung cancer.

Authors:  B Ketterer; J M Harris; G Talaska; D J Meyer; S E Pemble; J B Taylor; N P Lang; F F Kadlubar
Journal:  Environ Health Perspect       Date:  1992-11       Impact factor: 9.031

View more
  20 in total

1.  Application of quantitative trait locus mapping and transcriptomics to studies of the senescence-accelerated phenotype in rats.

Authors:  Elena E Korbolina; Nikita I Ershov; Leonid O Bryzgalov; Natalia G Kolosova
Journal:  BMC Genomics       Date:  2014-12-19       Impact factor: 3.969

2.  Haplotype and diplotype analyses of variation in ERCC5 transcription cis-regulation in normal bronchial epithelial cells.

Authors:  Xiaolu Zhang; Erin L Crawford; Thomas M Blomquist; Sadik A Khuder; Jiyoun Yeo; Albert M Levin; James C Willey
Journal:  Physiol Genomics       Date:  2016-05-27       Impact factor: 3.107

3.  Cis-acting genetic variation at an E2F1/YY1 response site and putative p53 site is associated with altered allele-specific expression of ERCC5 (XPG) transcript in normal human bronchial epithelium.

Authors:  Thomas M Blomquist; Erin L Crawford; James C Willey
Journal:  Carcinogenesis       Date:  2010-03-15       Impact factor: 4.944

4.  DNA MMR systems, microsatellite instability and antioxidant activity variations in two species of wild bats: Myotis velifer and Desmodus rotundus, as possible factors associated with longevity.

Authors:  Juan C Conde-Pérezprina; Armando Luna-López; Viridiana Y González-Puertos; Tania Zenteno-Savín; Miguel Angel León-Galván; Mina Königsberg
Journal:  Age (Dordr)       Date:  2012-03-28

5.  Pattern of antioxidant and DNA repair gene expression in normal airway epithelium associated with lung cancer diagnosis.

Authors:  Thomas Blomquist; Erin L Crawford; D'Anna Mullins; Youngsook Yoon; Dawn-Alita Hernandez; Sadik Khuder; Patricia L Ruppel; Elizabeth Peters; David J Oldfield; Brad Austermiller; John C Anders; James C Willey
Journal:  Cancer Res       Date:  2009-11-03       Impact factor: 12.701

6.  C/EBPγ Is a Critical Regulator of Cellular Stress Response Networks through Heterodimerization with ATF4.

Authors:  Christopher J Huggins; Manasi K Mayekar; Nancy Martin; Karen L Saylor; Mesfin Gonit; Parthav Jailwala; Manjula Kasoji; Diana C Haines; Octavio A Quiñones; Peter F Johnson
Journal:  Mol Cell Biol       Date:  2015-12-14       Impact factor: 4.272

7.  CEBPγ facilitates lamellipodia formation and cancer cell migration through CERS6 upregulation.

Authors:  Hanxiao Shi; Atsuko Niimi; Toshiyuki Takeuchi; Kazuya Shiogama; Yasuyoshi Mizutani; Taisuke Kajino; Kenichi Inada; Tetsunari Hase; Takahiro Hatta; Hirofumi Shibata; Takayuki Fukui; Toyofumi Fengshi Chen-Yoshikawa; Kazuki Nagano; Takashi Murate; Yoshiyuki Kawamoto; Shuta Tomida; Takashi Takahashi; Motoshi Suzuki
Journal:  Cancer Sci       Date:  2021-05-04       Impact factor: 6.716

8.  Comparison of proteomic and transcriptomic profiles in the bronchial airway epithelium of current and never smokers.

Authors:  Katrina Steiling; Aran Y Kadar; Agnes Bergerat; James Flanigon; Sriram Sridhar; Vishal Shah; Q Rushdy Ahmad; Jerome S Brody; Marc E Lenburg; Martin Steffen; Avrum Spira
Journal:  PLoS One       Date:  2009-04-09       Impact factor: 3.240

9.  Identification of IFRD1 as a modifier gene for cystic fibrosis lung disease.

Authors:  YuanYuan Gu; Isaac T W Harley; Lindsay B Henderson; Bruce J Aronow; Ilja Vietor; Lukas A Huber; John B Harley; Jeffrey R Kilpatrick; Carl D Langefeld; Adrienne H Williams; Anil G Jegga; Jing Chen; Marsha Wills-Karp; S Hasan Arshad; Susan L Ewart; Chloe L Thio; Leah M Flick; Marie-Dominique Filippi; H Leighton Grimes; Mitchell L Drumm; Garry R Cutting; Michael R Knowles; Christopher L Karp
Journal:  Nature       Date:  2009-02-25       Impact factor: 49.962

Review 10.  DNA mismatch repair system: repercussions in cellular homeostasis and relationship with aging.

Authors:  Juan Cristóbal Conde-Pérezprina; Miguel Ángel León-Galván; Mina Konigsberg
Journal:  Oxid Med Cell Longev       Date:  2012-11-08       Impact factor: 6.543

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.