Literature DB >> 19832994

Identification of novel androgen-responsive genes by sequencing of LongSAGE libraries.

Tammy L Romanuik1, Gang Wang, Robert A Holt, Steven J M Jones, Marco A Marra, Marianne D Sadar.   

Abstract

BACKGROUND: The development and maintenance of the prostate is dependent on androgens and the androgen receptor. The androgen pathway continues to be important in prostate cancer. Here, we evaluated the transcriptome of prostate cancer cells in response to androgen using long serial analysis of gene expression (LongSAGE) libraries.
RESULTS: There were 131 tags (87 genes) that displayed statistically significant (p <or= 0.001) differences in expression in response to androgen. Many of the genes identified by LongSAGE (35/87) have not been previously reported to change expression in the direction or sense observed. In regulatory regions of the promoter and/or enhancer regions of some of these genes there are confirmed or potential androgen response elements (AREs). The expression trends of 24 novel genes were validated using quantitative real time-polymerase chain reaction (qRT-PCR). These genes were: ARL6IP5, BLVRB, C19orf48, C1orf122, C6orf66, CAMK2N1, CCNI, DERA, ERRFI1, GLUL, GOLPH3, HM13, HSP90B1, MANEA, NANS, NIPSNAP3A, SLC41A1, SOD1, SVIP, TAOK3, TCP1, TMEM66, USP33, and VTA1. The physiological relevance of these expression trends was evaluated in vivo using the LNCaP Hollow Fibre model. Novel androgen-responsive genes identified here participate in protein synthesis and trafficking, response to oxidative stress, transcription, proliferation, apoptosis, and differentiation.
CONCLUSION: These processes may represent the molecular mechanisms of androgen-dependency of the prostate. Genes that participate in these pathways may be targets for therapies or biomarkers of prostate cancer.

Entities:  

Mesh:

Substances:

Year:  2009        PMID: 19832994      PMCID: PMC2766392          DOI: 10.1186/1471-2164-10-476

Source DB:  PubMed          Journal:  BMC Genomics        ISSN: 1471-2164            Impact factor:   3.969


Background

Androgens mediate their effect through the androgen receptor (AR) and together they play integral roles in the development and maintenance of the prostate. In the absence of a functional androgen-axis during development, the prostate will fail to form[1]. The size of the prostate increases with the elevation of levels of androgens in males during puberty[2]. Androgens promote proliferation, differentiation, and survival of prostate cells[1]. Men that have used excess androgens in the form of anabolic steroids have a higher incidence of prostate cancer [3-5]. Association of prostate cancer with levels of androgens has also been reported in rodents[6,7]. Reduction of androgen in humans or dogs before puberty by castration is associated with decreased incidence of prostate cancer[8,9]. Castration of adult males causes apoptosis of prostatic epithelium, involution and reduction of the prostate [10-12]. Thus the prostate gland is an androgen-dependent organ where androgens are the predominant mitogenic stimulus[13]. The dependency of the prostate epithelium on androgens provides the underlying rationale for treating prostate cancer with chemical or surgical castration (androgen-deprivation)[14]. The AR is a ligand-activated transcription factor[15] that regulates transcription of genes that contain androgen response elements (AREs) in the upstream or downstream regulatory regions of the promoter and/or enhancer. Kallikrein 3 (KLK3) is an example of a gene that contains numerous functional AREs that the AR interacts with to increase transcription in response to androgens [16-19]. KLK3, also known as prostate-specific antigen (PSA), is the main tumor marker for prostate cancer and has been used clinically for 15 years[20]. Serum levels of PSA correlate with tumor volume[21]. However, as a screening and monitoring tool for prostate cancer, serum PSA levels are subject to false positives and false negatives[20]. Identification of the genes that change in expression in response to androgen in prostate cells is essential for the understanding of androgen-dependency of the normal prostate and the proliferation, survival, and hormonal progression of prostate cancer. There are several studies that have investigated genes that alter expression in response to a changing androgen-axis using SAGE [22-24]. Here we highlight several key differences in the current experimental design from previous studies: 1) a physiological concentration of metabolically stable androgen (R1881) was employed in vitro; 2) the transcriptome was catalogued using LongSAGE[25] opposed to (short)SAGE[26] because it generates lengthier tags allowing increased confidence in tag-to-gene mapping, and leaves fewer tags unmapped[25]; 3) the transcriptome of human prostate cancer cells was examined instead of murine cells [22]; 4) sequencing depth was increased by approximately 1.5-2 times more tags relative to other studies [23,24] to improve the potential for novel findings; 5) transcript expression was validated using an alternative assay as opposed to protein expression [24], and tens of novel genes were validated as opposed to only two[23]. Thus, we apply LongSAGE for the first time to create transcript libraries of prostate cancer cells maintained in the presence or absence of androgen. These libraries are publicly available at Gene Expression Omnibus. We describe 24 genes never before identified or validated to alter expression in response to androgen treatment. These genes were: ARL6IP5, BLVRB, C19orf48, C1orf122, C6orf66, CAMK2N1, CCNI, DERA, ERRFI1, GLUL, GOLPH3, HM13, HSP90B1, MANEA, NANS, NIPSNAP3A, SLC41A1, SOD1, SVIP, TAOK3, TCP1, TMEM66, USP33, and VTA1. Statistically significant changes in expression of ARL6IP5, CAMK2N1, ERRFI1, HSP90B1, and TAOK3 in response to reduced levels of circulating androgens were measured using in vivo samples.

Results and discussion

Summary of LongSAGE libraries

LongSAGE was employed to obtain quantitative gene expression profiles of human prostate cancer cells treated with or without synthetic androgen R1881. LNCaP human prostate cancer cells were chosen as the model cell line for evaluating androgen signaling because they respond to androgens, express a functional although mutated (T877A) AR, they can be grown in vitro as a monolayer or in vivo as a xenograft or in the Hollow Fiber model [27-29]. LNCaP cells have been used extensively in prostate cancer research. The time of 16 hours for treatment and concentration of R1881 (10 nM) were chosen based upon optimal induction of levels of KLK3 mRNA [30]. LongSAGE libraries were sequenced to a total of 121,760 (R1881) and 103,391 (vehicle) tags (Table 1). The libraries were filtered on several levels to leave only useful tags for analysis. First, bad tags were removed if they contained at least one N-base call in the LongSAGE tag sequence. Notably, when bad tags were filtered the percentages of duplicate ditags in the R1881 and vehicle LongSAGE libraries were 6% and 5%, respectively. Early SAGE studies suggest duplicate ditags likely represent polymerase chain reaction (PCR) artifacts due to the low probability the same two tags will ligate together to form ditags[26]. However, with LongSAGE library sequencing and highly expressed transcripts, this probability becomes significant[31]. A recent study[32] suggests that discarding duplicate ditags in LongSAGE analysis may introduce a bias affecting the fold differences in tag expression between libraries for all tags observed at a frequency >(113-224)/100,000. Therefore, we opted to retain duplicate ditags. PHRED software was used to call bases for the sequencing of the LongSAGE tags[33,34]. PHRED has a small, but significant error rate in base-calls. To ascertain which tags potentially contained these erroneous base-calls, we calculated a tag sequence quality factor (QF) and probability[35]. The second line of filtering removed LongSAGE tags with probabilities less than 0.95 (QF < 95%). Linkers of known sequence were introduced into SAGE libraries as primers for amplifying ditags prior to concatenation[26]. These linker sequences were designed so they do not map to the human genome. At a low frequency, linkers ligate to themselves creating linker-derived tags (LDTs). These LDTs do not represent transcripts and are removed from the LongSAGE libraries. After filtering, there were 97,981 total useful tags representing 23,828 tag sequences in the R1881 LongSAGE library, and 85,861 total useful tags representing 24,592 tag sequences in the vehicle LongSAGE library. Due to redundancy in the expressed sequences, the combined number of useful tag types in the R1881 and vehicle LongSAGE libraries was 38,574. The remainder of the data analysis in this manuscript was carried out using this filtered data.
Table 1

Composition of LongSAGE libraries

UnfilteredLibraryR1881Vehicle
Unfiltered Total Tags121,760103,391
No. of Bad Tags528383

Minus Bad TagsTotal Tags121,232103,008
Tag Types33,38531,764
No. of Duplicate Ditags6,7635,193
% of Duplicate Ditags5.5795.041
Average QFr of Tags8 9.6489.67
No. of Tags QF<95%22,81617,095

QF ≥ 95%Total Tags98,41685,913
Tag Types23,83024,594
Total Tags Combined184,329
Tag Types Combined38,576
No. of LDTss Type I21934
No. of LDTs Type II21618

Minus LDTsTotal Tags97,98185,861
Tag Types23,82824,592
Total Tags Combined183,842
Tag Types Combined38,574

r QF, Quality Factor

s LDTs, Linker-derived Tags

Composition of LongSAGE libraries r QF, Quality Factor s LDTs, Linker-derived Tags

Tag frequency and transcript abundance

Tag frequency spanned over three orders of magnitude corresponding to transcript abundance of 5 to 8,746 copies per cell (based on minimum and maximum observed tag counts of 1 and 1714; see Table 2 legend)[36]. The distribution of LongSAGE tag frequencies per 100,000 tags revealed the majority (64 and 67%) of tag types in each LongSAGE library (R1881 and vehicle, respectively) were singletons (tags counted only once). This result was consistent with other published SAGE libraries reporting 66% singletons[37]. Singletons can represent very low abundance transcripts (≤ 5 transcript copies per cell) or PCR/sequencing errors. Estimates indicate that less than 17% of LongSAGE tags in a library contain PCR/sequencing errors[38]. Coincidently, 17% of the total tags in the R1881 and vehicle LongSAGE libraries roughly equal the number of singletons in each LongSAGE library (Table 2). Although initial estimates suggest 6.8-10% of shortSAGE tags contain PCR/sequencing errors, more recent experimental evidence suggests the actual error rate is much lower (≤ 2%)[39]. This implies that an error rate of 17% may also be an overestimate for LongSAGE tags. Tag types counted 2-4 times per 100,000 tags (10-20 transcript copies per cell) and 5-9 times per 100,000 tags (25-45 transcript copies per cell) were the second and third most common groups of tag types, respectively. Generally, high frequency tags were less common. The majority of total tags in each LongSAGE library were derived from a few tag types detected between 10-99 times per 100,000 tags (50-495 transcript copies per cell).
Table 2

Characteristics of LongSAGE tag frequency distribution

Tag Frequency & AbundanceTag Count per 100,000t≤12-45-910-99100-999≥1,000
Transcript Copies per Cellu≤510-2025-4550-495500-4,995≥5,000
% Transcript Abundance in Cellv≤0.0010.002-0.0040.005-0.0090.01-0.0990.1-0.999≥1
R1881Total Tags15,14113,98511,05532,80021,9713,029
Tag Types15,1415,4641,7031,4171012
VehicleTotal Tags16,5621 0,22911,63326,46618,4532,518
Tag Types16,5624,4272,1951,313932
% of Tags that Map as Transcription Factorsα, z, δ9.148.947.956.000
% of Tags that Mapχ, β, δ29.4057.8276.2283.185100γ
% of Tags Significantly Differentially Expressedε, a, δ0.41.1516.1725.3858.12100

t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000

u Transcript copies per cell= (observed tag count/total tags in the library) × 500,000

v % Transcript abundance in cell= (transcript copies per cell/500,000) × 100%

w Calculation based on ~500,000 transcripts in a cell [36]

α % of tags that map as transcription factors = (no. of genes with "transcription regulation acivity"/no. of genes with unambiguous sense mappings and GO terms) × 100%

Mapped unambigously sense toRefSeqand subjected to Gene Ontology (GO) analysis

δ Tag types from each tag frequency class of R1881 and vehicle LongSAGE libraries were combined

χ % of tags that map = (no. of genes with sense mappings/combined total tag types) × 1 00%

β Mapped sense (incl. ambiguous) to RefSeq

γ One tag was mapped sense using Ensembl gene

ε % of tags significantly differentially expressed = (no. of significantly differently expressed tag types in class/combined total tag types in class) × 100%

a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

Characteristics of LongSAGE tag frequency distribution t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000 u Transcript copies per cell= (observed tag count/total tags in the library) × 500,000 v % Transcript abundance in cell= (transcript copies per cell/500,000) × 100% w Calculation based on ~500,000 transcripts in a cell [36] α % of tags that map as transcription factors = (no. of genes with "transcription regulation acivity"/no. of genes with unambiguous sense mappings and GO terms) × 100% Mapped unambigously sense toRefSeqand subjected to Gene Ontology (GO) analysis δ Tag types from each tag frequency class of R1881 and vehicle LongSAGE libraries were combined χ % of tags that map = (no. of genes with sense mappings/combined total tag types) × 1 00% β Mapped sense (incl. ambiguous) to RefSeq γ One tag was mapped sense using Ensembl gene ε % of tags significantly differentially expressed = (no. of significantly differently expressed tag types in class/combined total tag types in class) × 100% a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

Mapping distribution of LongSAGE tags

When mapped tags (v38 Ensembl) were clustered to amalgamate 1-off tags (see Methods, Gene Expression Analysis for a description) and tags that mapped ambiguously were removed, the tag types in the R1881 and vehicle LongSAGE libraries represented 7,484 genes and 7,441 genes, respectively (Table 3). Tag types that mapped ambiguously constituted 13% (R1881 and vehicle), while 36% (R1881) and 35% (vehicle) of tag types did not map to the genome (Table 3). Due to the fact that these tags were clustered, the majority of the tags that did not map to the genome probably represent true unannotated transcripts rather than PCR/sequencing errors. Approximately 28% of tags in each LongSAGE library mapped to the opposite strand of known genes. These LongSAGE tags either represent transcription from previously undescribed coding regions or true antisense transcripts. Each LongSAGE library contained tags representing transcripts from 32% of the genes in the Ensembl gene database. This percentage is indicative of the depth of coverage of the transcriptome achieved with LongSAGE. Alternatively, this percentage indicates that one third of known Ensembl genes were expressed in LNCaP cells under these experimental conditions. This percentage is substantial when considering tag types from the Mouse Atlas Project (8.55 million total LongSAGE tags generated from 72 libraries of mouse development) mapped to 57% of the Ensembl transcript database[35]. Approximately 63% (R1881) and 61% (vehicle) of the genes that mapped to Ensembl's database were associated with more than one tag type to suggest that most gene expression was represented by transcript variants which is consistent with previous observations[35]. When the mapped LongSAGE tags (Reference Sequence, RefSeq; May 18, 2006) were clustered to amalgamate 1-off tags and tags that mapped ambiguously were removed, 53% of tags mapped solely to known exons, 9% solely to known introns (novel transcript variants), and 38% to intergenic regions (novel genes or transcript variants).
Table 3

LongSAGE tag mappings

LibraryNo. of Tag Types that Mapped Unambiguously to (Genes)No. of Tag Types that Mapped AmbiguouslyNo. of Tag Types that Did Not MapTotal No. of Tag Types (Clustered)y
R188114,587 (7,484)3,75410,21528,556
Vehicle13,626 (7,441)3,2869,06625,978

x Ensembl gene (v38) was used for mapping

y Clustering amalgamated 1-off tags with likely 'parent' tags to improve the mapping capability of LongSAGE tags Clustering altered the number of tag types without changing the total number of tags in the libraries

LongSAGE tag mappings x Ensembl gene (v38) was used for mapping y Clustering amalgamated 1-off tags with likely 'parent' tags to improve the mapping capability of LongSAGE tags Clustering altered the number of tag types without changing the total number of tags in the libraries The two most abundant tag types in the LongSAGE libraries were shared by both libraries. The first highly abundant LongSAGE tag mapped to human mitochondrial NADH ubiquinone oxidoreductase chain 4. This gene is also highly expressed in other human tissues (i.e., cardiac tissue; SAGE Genie, ). The protein product of this gene transfers electrons from NADH to ubiquinone to generate adenosine triphosphate as metabolic energy. Using the Ensembl database, the second most abundant LongSAGE tag mapped to a non-coding gene of human mitochondria. In contrast to the higher abundance classes, the lower abundance classes were enriched for LongSAGE tags that mapped to genes with functions in regulating transcription (Table 2). This is particularly significant because the percentages of LongSAGE tags that mapped to the genome in the lower abundance class were reduced compared to the higher abundance classes (Table 2). Together this implies that the number of tags that map to genes with a function in transcription may be underestimated, as low abundance tags may be underrepresented.

Differential gene expression

Venn analysis identified that 36% and 38% of tag types were exclusive to the R1881 or vehicle LongSAGE libraries, respectively (Figure 1). The unique expression of tag types indicates differential expression depending upon androgen stimulation. The biological relevance of this differential expression is complicated by the fact that 85% (R1881) and 88% (vehicle) of these exclusive LongSAGE tags were singletons. Consistent with our observation that low abundance tags did not map as readily to the genome, the mutually exclusive tags also did not map as readily as tags shared between both libraries. Only 17% and 15% of tags exclusive to R1881 and vehicle LongSAGE libraries, respectively, mapped unambiguously sense to RefSeq, in contrast to 39% of shared tags. We therefore, concentrated on genes for which the tag abundance allowed for the determination of statistically significant changes in transcript abundance.
Figure 1

Relationship between LongSAGE library compositions. The Venn Diagram shows the tag types and genes exclusive to, and shared by each LongSAGE library, R1881 and vehicle. Tags were mapped unambiguously sense to RefSeq transcripts and redundant mappings were removed. Singletons are tags counted only once in each library, but may be common to both libraries.

Relationship between LongSAGE library compositions. The Venn Diagram shows the tag types and genes exclusive to, and shared by each LongSAGE library, R1881 and vehicle. Tags were mapped unambiguously sense to RefSeq transcripts and redundant mappings were removed. Singletons are tags counted only once in each library, but may be common to both libraries. A scatter plot illustrates observed tag counts in LongSAGE libraries relative to the confidence intervals (CIs; 95%, 99%, and 99.9%) of respective p-values (p ≤ 0.05, 0.01, and 0.001) by Audic and Claverie statistics[40] (Figure 2). 891 tags were differentially expressed (p ≤ 0.05) between the two LongSAGE libraries (Figure 2 and Table 4). LongSAGE tags statistically (p ≤ 0.001) differentially represented between the libraries were enriched in the higher abundance classes compared to the lower abundance classes (Table 2). Additionally, 90% of the LongSAGE tags were statistically (p ≤ 0.001) differentially represented between the libraries with ≥ 2-fold differences, compared to only 17% of tags with p-values greater than 0.001 (p > 0.001).
Figure 2

Confidence intervals highlight expressed tag types with non-linear relationships between LongSAGE libraries. Scatter plot dots represent tag types and their placement on the axis indicates the frequency of observation in either of the LongSAGE libraries. Tag types that fall outside the confidence interval (CI) lines are considered statistically significantly differentially expressed (Audic and Claverie test statistic); outer line, 99.9% CI; middle line, 99% CI; and inner line, 95% CI.

Table 4

Number of tag types found to be significantly differentially expressed between R1881 and vehicle libraries

Direction of Changep ≤ 0.001p ≤ 0.01p ≤ 0.05
Up Regulated83196455
Down Regulated48120436
Total131316891
% of All Tag Types0.34%0.82%2.31%

a Statistics according to the Audic and Claverie test statistic

Number of tag types found to be significantly differentially expressed between R1881 and vehicle libraries a Statistics according to the Audic and Claverie test statistic Confidence intervals highlight expressed tag types with non-linear relationships between LongSAGE libraries. Scatter plot dots represent tag types and their placement on the axis indicates the frequency of observation in either of the LongSAGE libraries. Tag types that fall outside the confidence interval (CI) lines are considered statistically significantly differentially expressed (Audic and Claverie test statistic); outer line, 99.9% CI; middle line, 99% CI; and inner line, 95% CI. A stringent p-value cutoff (p ≤ 0.001), not corrected for multiple tests, was employed prior to validation of changes in expression of a gene in response to androgen. LongSAGE tags that were differentially expressed, but mapped ambiguously to more than one gene, and/or differed by less than 2-fold between the treatment groups, were excluded from analysis. Application of these criteria reduced the LongSAGE tags from 131 to 93. These 93 tags represented 87 genes. Analysis of differentially expressed LongSAGE tags revealed that 54 LongSAGE tags that mapped to 52 genes were previously known to change in expression in the direction observed in response to androgen in prostate cancer cells. Of these, the expression of 41 genes increased as expected, including the well-known androgen-regulated gene, KLK3 (Table 5). The expression of 11 genes decreased in response to androgen and were consistent with previous reports (Table 6). Genes previously not reported to alter expression in response to androgen in prostate cancer cells were represented by 39 LongSAGE tags. These tags represented the expression of 20 genes that were increased, excluding mappings to non-coding and intergenic regions, (Table 7), and expression of 15 genes that was decreased (Table 8) in response to androgen. The 93 tags were represented by 87 genes because one tag did not map to the human genome (Table 7) and two tags mapped to intergenic regions of the human mitochondrial genome (Tables 7 and 8). Three genes were represented twice in the tables (CAMK2N1, PPAP2A, and SORD). One gene, KRT8, was categorized in both the known and not previously known categories due to the sense of the mapping (Tables 5 and 8).
Table 5

LongSAGE tags corresponding to genes known to increase expression in response to androgen in LNCaP cells

Tags/100,000d, t

LongSAGE Tag SequenceVehicleR1881Foldc, d ChangeRefSeq/Ensembl Access. No.HGNC Gene SymbolDescriptionφ
GTGACAAGTGACAGAGT11920NM_007011ABHD2Abhydrolase domain containing 2, transcript variant 1
ACGTCACCATTTTTAAC12420NM_004457ACSL3Acyl-CoA synthetase long-chain family member 3, transcript variant 1
TACTTTATAAGTATTGG14594.2NM_006988ADAMTS1*ADAM metallopeptidase with thrombospondin type 1 motif, 1
TAGCTCTATGGGGGGAG35752.1NM_000701ATP1A1ATPase, Na+/K+ transporting, alpha 1 polypeptide, transcript variant 1
GTTGTGGTTAATCTGGT481092.3NM_004048B2MBeta-2-microglobulin
ACTTAAGGAACTTATCT14423.0NM_015415BRP44Brain protein 44
AAAGGAAAATAAAAATT3279NM_018455CENPN*Centromeric protein N
CTGTGATGTGACTCCTG5306NM_030806Clorf21Chromosome 1 open reading frame 21
CAGATGAGATGTGAGCT5337NM_130898CREB3L4*cAMP responsive element binding protein 3-like-4
TGTTTATCCTAAACTGA211155.5NM_020548DEIDiazepam binding inhibitor (GAB A receptor modulator, acyl-Coenzyme A binding protein)
TCCCCGTGGCTGTGGGG1063563.36NM_014762DHCR2424-dehydrocholesterol reductase
GAAATTAGGGAAGCCTT9344NM_015036ENDOD1Endonuclease domain containing 1
AGATCCTACTTAGTATG16513.2NM_004462FDFT1Farnesyl-diphosphate farnesyltransferase 1
GTTCCAGTGAGGCCAAG35020NM_004117FKBP5*FK506 binding protein 5
ACCTAGCCACTGCTGGG12420NM_002247KCNMA1Potassium large conductance calcium-activated channel, subfamily M, alpha member 1, transcript variant 2
GGATGGGGATGAAGTAA503667.3NM_001648KLK3*Kallikrein 3, (prostate-specific antigen), transcript variant 1
CCTCCAGCTACAAAACA352236.4NM_002273KRT8Keratin 8
TAAAATATTGAAGTGTCNDb4240NM_015541LRIG1*Leucine-rich repeats and immunoglobulin-like domains 1
TCCCTGAGCACCATTGCND3540NM_015261NCAPD3*Non-SMC condensin complex subunit D3
GGACTTTCCTTCCCTCT17270NM_006096NDRG1N-myc downstream regulated gene 1
TTTAGGTAAACGAAAGC19562.9NM_014445N/AqStress-associated ER protein 1
AGGTTTTGCCTCATTCC13382.9ENSG00000196930N/AqSimilar to Vesicle-associated membrane protein-associated protein A mRNA
ATGCAGCCATATGGAAG2020810NM_002539ODC1Ornithine decarboxylase 1
GCCAAGGGGCCAGCTGC17452.6NM_002541OGDHOxoglutarate (alpha-ketoglutarate) dehydrogenase (lipoamide), nuclear gene encoding mitochondrial protein, transcript variant 1
TAATTTTTACTTTGTAC5398NM_017906PAK1IP1*PAK1 interacting protein 1
TATGTAATATGCTTTCT271646.1NM_003711PPAP2APhosphatidic acid phosphatase type 2A, transcript variant 1
AAACACCAACAACTGGG5316NM_003711PPAP2APhosphatidic acid phosphatase type 2A isoforms 1 and 2
GTGTTTACGTGATCCAC11820NM_004578RAB4ARAB4A, member RAS oncogene family
TATGTATAAATGGACCTND1620NM_021205RHOU*Ras homolog gene family, member U
TTTGAAATGAGGTCTGT14483.4NM_002970SATSpermidine/spermine N1-acetyltransferase
GCAACAGCAATAGGATT3227NM_014302SEC61GSec61 gamma subunit, transcript variant 1
GCGCTGGAGTGAGATGG591262.1NM_031287SF3B5Splicing factor 3B, subunit 5, l0kDa
GGATTTGAACATATGAAND1310NM_033102SLC45A3Solute carrier family 45, member 3
ACCTTGTGCCCGATTCT472385.1NM_003104SORDSorbitol dehydrogenase
AAAATCTGCCACTCAGGND1 210NM_003104SORDSorbitol dehydrogenase
GTGCAGGGAGACATCTG35520NM_012391SPDEFSAM pointed domain containing ets transcription factor
TTAAGGGATGATGGCTTND1 210NM_024636STEAP4STEAP family member 4
TACTACAGCTATATTTG1 6523. 3NM_016192TMEFF2Transmembrane protein with EGF-like and 2 follistatin-like domains 2
TGATGTCTGGTCTGAAT11 720NM_020182TMEPAITransmembrane, prostate androgen induced RNA, transcript variant 1
CAAATAAATTATGCGAT56410NM_005656TMPRSS2Transmembrane protease, serine 2
TGAAAAGCTTAATAAAT7284NM_005079TPD52Tumor protein D52, transcript variant 3
TTAAAGATTTAGACACC10363. 6ENSG00000140416TPM1Tropomyosin 1 apha chain
TTCTCTACACAATTGTA6366NM_006022TSC22D1TSC22 domain family, member 1, transcript variant 1

a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

b ND, not detected

c ND tags were assigned a value of 1 when calculating fold

d Appropriate significant figures are displayed

n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table

q N/A = there is no HGNC approved gene symbol for this tag

t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000

φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed

* Gene further characterized in this paper

Table 6

LongSAGE tags corresponding to genes known to decrease expression in response to androgen in LNCaP cells

Tags/100,000d, t

LongSAGE Tag SequenceVehicleR1881Fold Changec, d, jRefSeq/Ensembl Access. No.HGNC Gene SymbolDescriptionφ
CAAAAGCTTATTCTTGT293-10NM_016613C4orf18Chromosome 4 open reading frame 18, transcript variant 2
TCACACAGTGCCTGTCG191-20NM_020311CXCR7*Chemokine orphan receptor 1
ACAAACCCCCACCCCAG417-6NM_013330NME7Non-metastatic cells 7, protein expressed in, transcript variant 1, Nucleoside diphosphate kinase
AATCTCTCAATTATAGG349-4NM_006183NTS*Neurotensin
ATCAACTGGAGGCTCAG15NDb-20NM_005013NUCB2Nucleobindin 2
CCAAAATTAGGAAAAAC151-20NM_002577PAK2p21 (CDKNlA)-activated kinase 2k
TTACGTTTGGGAAAAAT192-9NM_032971PCDH11YProtocadherin 11 Y-linked, transcript variant ak
TGACTTTGGTGCCGTTA12ND-10NM_003629PIK3R3Phosphoinositide-3-kinase, regulatory subunit 3 (p55, gamma)
AGCAAATATGTCAAGGG4716-2.9NM_182948PRKACB*Protein kinase, cAMP-dependent, catalytic, beta, transcript variant 1
GACTATTCCATATTAAA271-30NM_018412ST7*Suppression of tumorigenicity 7, transcript variant A
GAGGGTTTTAAATGGAG799-9NM_001077UGT2B17UDP glucuronosyltransferase 2 family, polypeptide B17

a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

b ND, not detected

c ND tags were assigned a value of 1 when calculating fold change

d Appropriate significant figures are displayed

j Negative fold change value indicates down-regulation in response to R1881

k Tag has a single base pair permutation, insertion, or deletion with respect to gene

n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from table

t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000

φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed

* Gene further characterized in this paper

Table 7

LongSAGE tags corresponding to genes not previously reported to increase expression in response to androgen in LNCaP cells

Tags/100,000d, t

LongSAGE Tag SequenceVehicleR1881Fold Changec, dRefSeq/Ensembl Access. No.HGNC Gene SymbolDescriptionφ
TCTTTATTAGAAAAAAANDb1620NM_014265ADAM28ADAM metallopeptidase domain 28, transcript variant 1k
AGGAGCAAAGGAAGGGG511072.1NM_000713BLVRB*Biliverdin reductase B (flavin reductase (NADPH))
TTTTGGGGGCTTTTAGC16442.8NM_198446Clorf122*Chromosome 1 open reading frame 122
GGGCCCCAAAGCACTGC22693.1NM_199249C19orf48*Chromosome 19 open reading frame 48
CCCCAGTTGCTGATCTC24602.5NM_001003962CAPNS1*Calpain, small subunit 1, transcript variant 2
CTTAAGAAAAATGCACT12320NM_018948ERRFI1 *ERBB receptor feedback inhibitor 1
TACAGTATGTTCAAAGT13524.0NM_002065GLUL*Glutamate-ammonia ligase (glutamine synthetase), transcript variant 1g, i
TTAATAGTGGGGCTTTC10393.9NM_022130GOLPH3*Golgi phosphoprotein 3 (coat protein)
GCCAGGGCGGGCCACTGND1620NM_178580HM13*Histocompatibility (minor) 13, transcript variant 2
GAGGAAGAAGAAGCAGCND1410NM_003299HSP90B1*Heat shock protein 90kDa beta (Grp94), member 1
GGCAAGGGGGGTCCCCA12020NM_002273KRT8Keratin 8m
ACTCCAAAAAAAAAAAA41812.0XM_376154N/AqSimilar to 40S ribosomal protein S15 (RIG protein), transcript variant 1
GGGTTGGCTTGAAACCA6305ENSG00000210151N/ANon-coding predicted mitochondrial gene m
GAGAGCTCCCGTGAGTG721221.7NC_001807PN/AIntergenic region of mitochondrial genome
TCGGACGTACATCGTTA402235.6No mapN/AN/A
GCAAAAAAATCAAGTCT22663.0NM_018946NANS*N-acetylneuraminic acid phosphate synthase (sialic acid synthase)
TCTTTTAGCCAATTCAG23620NM_006167NKX3-1NK3 transcription factor related, locus 1 m
TACTTTTGGCCTGGCTG6356NM_173854SLC41A1*Solute carrier family 41, member 1
GAGAGCCTCAGAATGGG5265NM_016281TAOK3*TAO kinase 3
GAAGTTATGAAGATGCT411062.6NM_030752TCP1*T-complex protein 1, transcript variant 1
CAGTTCTCTGTGAAATC40932.3NM_016127TMEM66*Transmembrane protein 66
ATGGCTTTGTTTTGGTTND1410NM_201624USP33*Ubiquitin specific protease 33, transcript variant 2

a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

b ND, not detected

c ND tags were assigned a value of 1 when calculating fold change

d Appropriate significant figures are displayed

e Gene family, but not this family member, previously described to change expression in response to androgens

g Protein known to change expression in reponse to androgens

h Gene known to change expression in response to androgens, but in the opposite direction as reported here

i Gene known to change expression in response to androgens in cells other than prostate

k Tag has a single base pair mutation, insertion, or deletion with respect to gene map

mTag maps to the strand opposite of the gene

n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table

p NC_001807, refers to the complete genome of mitochondria in humans

All mitochondrial genes in the RefSeq database are assigned the same accession number by NCBI q N/A, there is no HGNC approved gene symbol or description for this tag t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000

φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper

Table 8

LongSAGE tags corresponding to genes not previously reported to decrease expression in response to androgen in LNCaP cells

Tags/100,000d, t

LongSAGE Tag SequenceVehicleR1881Foldc, d, j ChangeRefSeq/Ensembl Access. No.HGNC Gene SymbolDescriptionφ
GTCTAGAATCTGTACCC298-4NM_006407ARL6IP5*ADP-ribosylation-like factor 6 interacting protein 5
TCAAGAGCCGAAGGAAT12NDb-10NM_014165C6orf66*Chromosome 6 open reading frame 66
GTATTTGCAAAAATGCC11824-4.9NM_018584CAMK2N1*Calcium/calmodulin-dependent protein kinase II inhibitor 1
AAAAGAGAAAGCACTTT305-6NM_018584CAMK2N1*Calcium/calmodulin-dependent protein kinase II inhibitor 1
TTATAACTGAATTTAGT5111-4.6NM_006835CCNI*Cyclin Ii, h
GCCAGGAGAAGGGACAG347-5NP_775809CNBD1N/Am
TGGTACTCATTTCAGGC12ND-10NM_015954DERA*2-deoxyribose-5-phosphate aldolase homolog
AATCATAATGGATTCTT16ND-20NM_024641MANEA*Mannosidase, endo-alpha
CTAAGACTTCACCAGCC192-10ENSG00000210082N/A qNon-coding predicted mitochondrial rRN A genek
CATTTGGTATTTTCGTC308-4NC_001807PN/AIntergenic region of mitochondrial genome
GTATTTCAGTGTCTGTC339-4NM_015469NIPSNAP3A*Nipsnap homolog 3A
GTGTGTGGTGCCCCCAG235-5NM_024066PRNPIP*Prion protein interacting protein
GTGTTAACCAGCTAAAG12260-2.0NM_002948RPL15Ribosomal protein L15
GCACAAGAAGATTAAAA5825-2.3NR_002746SNORD47Small nucleolar RNA, C/D box 47 on chromosome 1
AAAAAGCAGATGACTTG7737-2.1NM_000454SOD1*Superoxide dismutase 1, soluble (amyotrophic lateral sclerosis 1 (adult))
GTTTGGTTATAAATTCT263-10NM_148893SVIP*Hypothetical protein DKFZp313A2432, transcript variant 1
TATTAGAGAATGAAAAG172-9NM_016485WA1*VPS20-associated 1 homologue

a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001)

b ND, not detected

c ND tags were assigned a value of 1 when calculating fold change

d Appropriate significant figures are displayed

h Gene known to change expression in response to androgens, but in the opposite direction as reported here

i Gene known to change expression in response to androgens in cells other than prostate

j Negative fold change value indicates down-regulation by R1881

k Tag has a single base pair permutation, insertion, or deletion with respect to gene

mTag maps to the strand opposite of the gene

n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table

p NC_001807 refers to the complete genome of mitochondria in humans

All mitochondrial genes in the RefSeq database are assigned the same accession number by NCBI q N/A = there is no HGNC approved gene symbol for this tag t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000

φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper

LongSAGE tags corresponding to genes known to increase expression in response to androgen in LNCaP cells a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001) b ND, not detected c ND tags were assigned a value of 1 when calculating fold d Appropriate significant figures are displayed n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table q N/A = there is no HGNC approved gene symbol for this tag t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000 φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper LongSAGE tags corresponding to genes known to decrease expression in response to androgen in LNCaP cells a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001) b ND, not detected c ND tags were assigned a value of 1 when calculating fold change d Appropriate significant figures are displayed j Negative fold change value indicates down-regulation in response to R1881 k Tag has a single base pair permutation, insertion, or deletion with respect to gene n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from table t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000 φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper LongSAGE tags corresponding to genes not previously reported to increase expression in response to androgen in LNCaP cells a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001) b ND, not detected c ND tags were assigned a value of 1 when calculating fold change d Appropriate significant figures are displayed e Gene family, but not this family member, previously described to change expression in response to androgens g Protein known to change expression in reponse to androgens h Gene known to change expression in response to androgens, but in the opposite direction as reported here i Gene known to change expression in response to androgens in cells other than prostate k Tag has a single base pair mutation, insertion, or deletion with respect to gene map mTag maps to the strand opposite of the gene n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table p NC_001807, refers to the complete genome of mitochondria in humans All mitochondrial genes in the RefSeq database are assigned the same accession number by NCBI q N/A, there is no HGNC approved gene symbol or description for this tag t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000 φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper LongSAGE tags corresponding to genes not previously reported to decrease expression in response to androgen in LNCaP cells a Statistics according to the Audic and Claverie test statistic (p ≤ 0.001) b ND, not detected c ND tags were assigned a value of 1 when calculating fold change d Appropriate significant figures are displayed h Gene known to change expression in response to androgens, but in the opposite direction as reported here i Gene known to change expression in response to androgens in cells other than prostate j Negative fold change value indicates down-regulation by R1881 k Tag has a single base pair permutation, insertion, or deletion with respect to gene mTag maps to the strand opposite of the gene n Ambiguously mapped tags and tags with a fold change less than 2-fold have been excluded from the table p NC_001807 refers to the complete genome of mitochondria in humans All mitochondrial genes in the RefSeq database are assigned the same accession number by NCBI q N/A = there is no HGNC approved gene symbol for this tag t Tag count per 100,000 = (observed tag count/total tags in the library) × 100,000 φ In cases where a tag mapped to >1 transcript variant of the same gene the RefSeq accession number for transcript variant 1 was displayed * Gene further characterized in this paper Interestingly some antisense tags were identified as differentially expressed in response to androgen. Antisense to NKX3-1 is of particular note. Transcription of this gene is regulated by androgen in a time- and concentration-dependent manner [41] with an ARE confirmed in its enhancer region [42]. Anti-sense RNA is involved in transcriptional silencing of sense transcript, imprinting control, post-transcriptional down-regulation of sense transcript or even stabilizing/promotion of the expression of the sense transcript [43]. In the case of NKX3-1, antisense transcript may be a negative feedback mechanism; however, this remains to be determined.

Validation of changes in gene expression in response to androgen

Quantitative real time-polymerase chain reaction (qRT-PCR) was used to validate changes in gene expression in response to androgen of 39 (13 known; 26 novel) of the 87 total genes identified by LongSAGE. Of the 35 genes previously not reported to change expression in response to androgens in prostate cancer cells, only 26 were quantified by qRT-PCR, because technical limitations and gaps in the transcriptome databases prevented the analysis of 9 genes. That is, specific qRT-PCR primers could not be designed due to repetition in the genome, or because the tag mapped to an unannotated transcript variant. There were 24 of the 26 (92%) novel genes that displayed statistically significant differential expression in response to androgen as measured by qRT-PCR (Figure 3A). BLVRB, C19orf48, C1orf122, ERRFI1, GLUL, GOLPH3, HM13, HSP90B1, NANS, SLC41A1, TAOK3, TCP1, TMEM66, and USP33 all increased levels of expression in response to androgen, while ARL6IP5, C6orf66, CAMK2N1, CCNI, DERA, MANEA, NIPSNAP3A, SOD1, SVIP, and VTA1 decreased in response to androgen (Figure 3A). Under the experimental conditions and primers used, we did not measure statistically significant changes in expression of PRNPIP and CAPNS1. A false discovery rate (FDR)[44] of 29% was expected of the LongSAGE data based on the Audic and Claverie p-value ≤ 0.001. This FDR represents the anticipated percentage of type I errors (i.e., false positives). We observed only 2/26 (8%) false positives, suggesting that the other filter parameters (e.g., ≥ 2-fold difference in expression level) may have the increased the chances of validation by qRT-PCR. Moreover, the expression trends for all 13 genes known to change expression in response to androgen in prostate cancer cells correlated between the LongSAGE and qRT-PCR data. ADAMTS1, CENPN, CREB3L4, FKBP5, KLK3, LRIG1, NCAPD3, PAK1IP1, and RHOU all increased levels of expression in response to androgen while CXCR7, NTS, PRKACB, and ST7 decreased in response to androgen (Figure 3B).
Figure 3

Androgen regulation of genes as measured by qRT-PCR. Candidate genes not previously implicated to change expression in response to androgens in prostate cancer cells, and Genes known to change levels of expression in response to androgens. LNCaP cells were treated for 16 hours prior to harvesting RNA, and analysing mRNA levels by qRT-PCR. Fold-change was calculated by normalizing the mean normalized expression (MNE) of transcripts in R1881-treated cells to the mock vehicle-treated cells. In doing this, the vehicle treatment fold-change became one and standard deviation (SD) zero. Error bars represent ± SD for biological sextuplets. [*] Asterisk indicates significant differential gene expression according to the Two-Sample Student's T-test (p ≤ 0.05) for unequal variance.

Androgen regulation of genes as measured by qRT-PCR. Candidate genes not previously implicated to change expression in response to androgens in prostate cancer cells, and Genes known to change levels of expression in response to androgens. LNCaP cells were treated for 16 hours prior to harvesting RNA, and analysing mRNA levels by qRT-PCR. Fold-change was calculated by normalizing the mean normalized expression (MNE) of transcripts in R1881-treated cells to the mock vehicle-treated cells. In doing this, the vehicle treatment fold-change became one and standard deviation (SD) zero. Error bars represent ± SD for biological sextuplets. [*] Asterisk indicates significant differential gene expression according to the Two-Sample Student's T-test (p ≤ 0.05) for unequal variance.

Known or potential AREs in the regulatory regions of androgen-regulated genes

AR directly regulates transcription in response to androgen by binding to AREs in the promoter and/or enhancer regions of target genes. ChIP-chip database mining for suggested AREs combined with a literature search for known AREs revealed some of the genes that alter expression in response to androgen do contain AREs (Table 9). For the 87 genes identified using the cut-off p-value of 0.001 and 2-fold change in response to androgen, there were eight genes with AREs in their promoter, enhancer or intron regions[16,42,45-49]. AREs were detected in the proximity of seven genes by data mining of ChIP-chip studies of ARE on chromosomes 19, 20, 21, 22 [50,51]. Additionally, sequence analysis of the promoters [52] found eight genes from our gene list to contain potential AREs (Table 9). Identification of potential AREs in the regulatory regions of the newly identified genes that alter expression in response to androgen (BLVRB, C19orf48, HM13, SOD1) may be directly regulated by AR.
Table 9

Genes with confirmed or potential AREs that change expression in response to androgen

HGNC Gene SymbolAccess. No.Expression change by R1881 stimulationChromosomeDistance of the ARE from TSS* (bp)Reference
Conventional

B2MNM_0040482.315-1902[42]
NKX3-1NM_006167208-3013[42]
KLK3NM_0016487.319-170[16,48]
-4006
-4075
-4115
FKBP5NM_00411720665.6 k (Intron 5)[47]
NDRG1NM_006096708-984[45]
TMEPAINM_0201822020-2134[45]
TMPRSS2NM_0056561021-148[46]
TPD52NM_00507948-359[49]

Identified by ChIP-Chip

BLVRBNM_0007132.119-56.7 k[51]
C19orf48NM_1992493.119-363 k[51]
CAPNS1NM_0010039622.519-165 k[51]
HM13NM_1785802020-330 k[51]
ADAMTS1NM_0069884.221276 k[50]
310 k
481 k
TMPRSS2NM_0056561021-1063 k[50]
-462 k
13.5 k
SOD1NM_000454-2.121-496 k[50]

Potential based on the sequence

B2MNM_0040482.315-440[52]
NDRG1NM_006096708-1018[52]
NKX3-1NM_006167208-1272[52]
SORDNM_0031045.115-1995[52]
TMEPAINM_0201822020-225[52]
TMPRSS2NM_0056561021-771[52]
TPD52NM_00507948-609[52]
TSC22D1NM_006022613-1711[52]

*TSS: Transcription starting site

Genes with confirmed or potential AREs that change expression in response to androgen *TSS: Transcription starting site

Cell-type specificity of gene expression

To determine if expression of candidate genes was unique to LNCaP cells, we assayed for constitutive levels of expression of 18 known and novel candidate genes in prostate cancer cell lines DU145[53] and PC-3[54] using qRT-PCR (Figure 4). Genes chosen included those that both increased (ADAMTS1, CAPNS1, CENPN, CREB3L4, ERRFI1, FKBP5, HSP90B1, KLK3, LRIG1, NCAPD3, PAK1IP1, and TAOK3) and decreased expression in response to androgen (ARL6IP5, CAMK2N1, CCNI, CXCR7, PRKACB and ST7). No obvious trends were observed depending on whether expression of the genes increased, or decreased, in response to androgen. All genes tested, except ERRFI1, were expressed at a lower level in PC-3 and DU145 cells relative to LNCaP cells. This suggests that the majority of genes that alter levels of expression in response to androgen were enriched in LNCaP cells relative to PC-3 and DU145 cells. These data are consistent with both DU145 and PC3 cells being androgen-insensitive and lacking a functional AR[53,54].
Figure 4

Differential expression of candidate genes in LNCaP, DU145, and PC-3 cells. Levels of transcripts in LNCaP, DU145, and PC-3 cells were analyzed by qRT-PCR. Error bars represent ± standard deviation(SD) for biological triplicates. [*] Asterisks indicate the significant differential gene expression in each cell line compared to LNCaP cells according to the Two-Sample Student's T-test (p ≤ 0.05) for equal (unpaired) or unequal variance as determined appropriate with the F-test.

Differential expression of candidate genes in LNCaP, DU145, and PC-3 cells. Levels of transcripts in LNCaP, DU145, and PC-3 cells were analyzed by qRT-PCR. Error bars represent ± standard deviation(SD) for biological triplicates. [*] Asterisks indicate the significant differential gene expression in each cell line compared to LNCaP cells according to the Two-Sample Student's T-test (p ≤ 0.05) for equal (unpaired) or unequal variance as determined appropriate with the F-test.

In vivo changes in gene expression in response to androgen-deprivation

The LNCaP Hollow Fibre model combined with qRT-PCR was employed to capture in vivo gene expression representative of physiological levels and castrated levels of androgen (Figure 5). We expected that the genes that had increased levels of expression in vitro in response to androgens, would decrease expression in vivo in response to castration (androgen-deprivation). Conversely, we expected that the genes that had decreased levels of expression in vitro in response to androgens, would increase expression in vivo in response to castration. These in vivo results would be consistent with androgen-responsiveness of the candidate genes. Of the candidate genes examined, 13 of 16 genes showed significant changes in gene expression in response to androgen-deprivation (Figure 5). As anticipated, expression of ARL6IP5, CAMK2N1, CXCR7, and ST7 increased, while CENPN, CREB3L4, ERRFI1, FKBP5, KLK3, LRIG1, NCAPD3, PAK1IP1, and TAOK3 decreased levels of expression in response to castration. No significant changes in gene expression in vivo was measured for ADAMTS1, HSP90B1, or PRKACB, suggesting that in vivo, other factors may influence their expression. Alternatively, the expression kinetics of each specific gene and half-life of its transcript may vary considerably. The time of harvesting samples and measuring changes in expression of genes in response to androgen-deprivation was at 10 days in vivo compared to 16 hr in vitro in response to addition of androgens (10 nM R1881). Different levels of androgen may also have profound effects on proliferation and differentiation. Physiological levels of androgen in male Nude mice may be considerably lower than the levels used in vitro. Androgen at 10 nM inhibits proliferation of LNCaP cells in vitro while 0.1 nM is optimal for proliferation[55].
Figure 5

Androgen regulation of genes in the . Levels of transcripts in LNCaP cells from the Hollow Fibre model were analyzed by qRT-PCR. Cx, castrated mice, 10 days post castration, n = 12; Pre-Cx, pre-castration, day 0 of castration, n = 15. Exception: LRIG1 gene expression in Cx samples was represented by 11 mice. Fold-change was calculated by normalizing the mean normalized expression (MNE) of transcripts in the Pre-Cx sample to the castrate sample. In doing this, the Cx sample fold-change became one and standard deviation (SD) zero. Error bars represent ± SD. [*] Asterisks indicate the significant differential gene expression with respect to Cx according to the Two-Sample Student's T-test (p ≤ 0.05) for unequal variance.

Androgen regulation of genes in the . Levels of transcripts in LNCaP cells from the Hollow Fibre model were analyzed by qRT-PCR. Cx, castrated mice, 10 days post castration, n = 12; Pre-Cx, pre-castration, day 0 of castration, n = 15. Exception: LRIG1 gene expression in Cx samples was represented by 11 mice. Fold-change was calculated by normalizing the mean normalized expression (MNE) of transcripts in the Pre-Cx sample to the castrate sample. In doing this, the Cx sample fold-change became one and standard deviation (SD) zero. Error bars represent ± SD. [*] Asterisks indicate the significant differential gene expression with respect to Cx according to the Two-Sample Student's T-test (p ≤ 0.05) for unequal variance.

Conclusion

Androgens are essential for the growth, development and maintenance of the prostate. Here, we created LongSAGE libraries to obtain quantitative gene expression profiles of LNCaP human prostate cancer cells treated with, or without, androgen and revealed the following: 1) 33,385 tag types in the R1881 LongSAGE library and 31,764 tag types in the vehicle LongSAGE library; 2) the majority (64% to 67%) of tag types in each LongSAGE library were singletons which may represent very low abundance transcripts (≤ 5 transcript copies per cell); 3); when mapped tags were clustered and ambiguous mappings were removed, the tag types in the R1881 and vehicle LongSAGE libraries represented 7,484 genes and 7,441 genes, respectively; 4) 53% of tags mapped solely to known exons, 9% solely to known introns (novel transcript variants), and 38% to intergenic regions (novel genes or transcript variants); 5) the most highly abundant LongSAGE tag mapped to human mitochondrial NADH ubiquinone oxidoreductase chain 4 involved in metabolic energy; 6) the lower abundance classes were enriched for genes with functions in regulating transcription; 7) 87 genes were differentially expressed by two-fold (p ≤ 0.001) in response to androgen representing 0.34% of the total tag types (131 differentially expressed tag types/38,574 total tag types); 8) some of these genes have confirmed or potential AREs; 9) novel androgen regulated genes (direct or indirect) identified and validated were ARL6IP5, BLVRB, C19orf48, C1orf122, C6orf66, CAMK2N1, CCNI, DERA, ERRFI1, GLUL, GOLPH3, HM13, HSP90B1, MANEA, NANS, NIPSNAP3A, SLC41A1, SOD1, SVIP, TAOK3, TCP1, TMEM66, USP33, and VTA1; 9) expression of ADAMTS1, ARL6IP5, CAMK2N1, CAPNS1, CENPN, CREB3L4, CCNI, CXCR7, FKBP5, HSP90B1, KLK3, LRIG1, NCAPD3, PAK1IP1, PRKACB, ST7, and TAOK3 was increased in LNCaP cells compared to prostate cancer cells lacking a functional AR; and 10) significant differences in levels of expression of ARL6IP5, CAMK2N1, CENPN, CREB3L4, CXCR7, ERRFI1, FKBP5, KLK3, LRIG1, NCAPD3, PAK1IP1, ST7, and TAOK3 were measured in vivo in response to androgen-deprivation. The products of these genes are involved in amino acid and protein synthesis, cofactor transport, protein trafficking, response to oxidative stress, as well as signaling pathways that regulate gene expression, proliferation, apoptosis, and differentiation. These genes are potentially critical for the function and maintenance of the prostate and represent targets for clinical intervention.

Methods

Cell culture

LNCaP human prostate cancer cells (American Type Culture Collection, Bethesda, MD, USA) were maintained in RPMI-1640 media (Stem Cell Technologies, Vancouver, BC, Canada) supplemented with 10% v/v fetal bovine serum (FBS; HyClone, Logan, UT, USA), 100 units/mL penicillin and 100 units/mL streptomycin (antibiotics; Invitrogen, Burlington, ON, Canada). DU145 and PC-3 human prostate cancer cells were maintained in DMEM (Stem Cell Technologies) supplemented with 10% v/v FBS and 5% v/v FBS, respectively with antibiotics. All cells were maintained at 37°C with 5% CO2.

Long serial analysis of gene expression

RNA sample generation

1 × 106 LNCaP cells were seeded in 10 cm-diameter dishes. The next day, cells were serum-starved (0% serum) for 48 hours and then treated for 16 hours with 10 nM synthetic androgen R1881 (also known as methyltrienolone; PerkinElmer, Woodbridge, ON, Canada), or solvent (vehicle) control, ethanol (final concentration 2.85 × 10-4%). Total RNA was extracted using TRIZOL Reagent (Invitrogen) following the manufacturer's instructions. RNA quality and quantity were assessed using the Agilent 2100 Bioanalyzer (Agilent Technologies, Mississauga, ON, Canada) and RNA 6000 Nano LabChip kit (Caliper Technologies, Hopkinton, MA, USA).

LongSAGE library production

LongSAGE[25] libraries were constructed with 5 μg of total RNA using the Invitrogen I-SAGE Long kit and protocol with alterations as previously published[35]. Briefly, double-stranded cDNA was synthesized from total RNA and digested with Nla III. The sample was split in half and linkers type I and II were added and ligated to Nla III overhangs. An Mme I digestion resulted in 17-21 base-pair (bp) LongSAGE tags. The tags with unique linkers were combined and ligated together to form ditags. Ditags (131 bp) were amplified with primers designed to recognize sequences within linkers type I and II using PCR. This scale-up PCR was performed in 48 wells of a 96 well plate (50 μL/well) using a 1/20th dilution of template cDNA and 25 and 27 cycles of PCR (R1881 and vehicle LongSAGE library, respectively). Following an Nla III digestion to remove the linkers, the 36 bp ditags were concatenized. Concatemers sized 1300-1700 bp were digested with Nla III (1 minute) to increase the efficiency of cloning into pZErO-1 vectors. Cloned concatemers were transformed into One Shot TOP10 Electrocompetent Escherichia coli and colonies were picked with the Q-Pix robot (Genetix) and cultured in 2× Yeast-Tryptone media with 50 μg/mL zeocin and 7.5% (v/v) glycerol.

Sequencing

Glycerol stocks of transformed bacteria were used to inoculate larger cultures for alkaline lysis plasmid preparation[56]. Plasmid preparations were separated by agarose gel electrophoresis and visualized by ultraviolet light and sybr green. 1/24th BigDye v3.1 terminator cycle sequencing reactions were performed with tetrad thermal cyclers (BioRad, Waltham, MA, USA) and visualized with capillary DNA sequencers, models 3700 and 3730 xl (Applied Biosystems, Foster City, CA, USA). Each library was sequenced to a depth of ~100,000 LongSAGE tags. Flanking vector sequences were removed and the LongSAGE tags were extracted from each sequence read. On average, 34 and 38 LongSAGE tags were sequenced in each read (R1881 and vehicle libraries, respectively). Sequence data were filtered for non-recombinant clones.

Gene expression analysis

LongSAGE expression data was analyzed with DiscoverySpace 3.2.4 and 4.01 software . Duplicate ditags (identical copies of a ditag) and singletons (tags counted only once) were retained for analysis. Sequence data were filtered for bad tags (tags with one N-base call) and linker-derived tags (artifact tags). Only LongSAGE tags with a sequence quality factor (QF) greater than 95% were included in analysis[35]. Where indicated, a clustering algorithm was used to amalgamate 1-off tags (tags one bp incorrect from a complete map to a transcript) with likely 'parent' tags to improve the mapping capability of LongSAGE tags by apparently reducing PCR/sequencing errors[35]. This clustering algorithm altered the number of tag types (i.e., species) without changing the total number of tags. In instances where clustering was used, the 95% QF cutoff was not. To filter data for candidate transcript validation, a p-value cutoff (p ≤ 0.001) was employed according to the Audic and Claverie test statistic[40]. The Audic and Claverie statistical method was used to identify differentially expressed tags between LongSAGE libraries because the method takes into account the sizes of the libraries and tag counts. LongSAGE tags that mapped ambiguously to more than one gene, and tags that differed by less than 2-fold were excluded from the candidate list. LongSAGE tags were mapped to reference sequence (RefSeq; May 30th, 2005) and Ensembl Gene (v31.35d), unless otherwise stated.

Quantitative real-time polymerase chain reaction

qRT-PCR was performed on TRIZOL-extracted RNA from LNCaP (serum-starved ± R1881 or the exception in Figure 4 in 10% serum), DU145 (10% serum) and PC-3 (5% serum) cells maintained in vitro, and LNCaP cells maintained in the in vivo Hollow Fibre model[29] (see below). Contaminating genomic DNA was removed from in vitro RNA samples using DNA-free or TURBO DNA-free (Ambion, Austin, TX, USA). Input RNA (1 μg) was reverse transcribed with SuperScript III First Strand Synthesis kit (Invitrogen). A 10 μL qRT-PCR reaction included 1 μl of template cDNA (0.1 μL for limited LNCaP Hollow Fibre samples), 1× Platinum SYBR Green qPCR SuperMix-UDG with ROX (Invitrogen) and 0.3 μM each of forward and reverse intron-spanning primers that produce products between 85-115 bp in size (see Additional file 1 for primer sequences). qRT-PCR reactions were cycled as follows in a 7900 HT Sequence Detection System (Applied Biosystems): 50°C for 2 min, 95°C for 2 min, (95°C for 0.5 min, 55-56°C for 0.3-0.5 min, and 72°C for 0.5 min) for 40-45 cycles, 95°C for 0.25 min, 60°C for 0.25 min, and 95°C for 0.25 min. All qRT-PCR reactions were performed in technical triplicates for each of at least three biological replicates. cDNAs (from different conditions) and genes [target and reference (glyceraldehyde-3-phosphate, GAPDH)] to be directly compared were assayed in the same instrument run. No-template reactions (negative controls) were run for each gene to ensure that DNA had not contaminated the qRT-PCR reactions. Only qRT-PCR data with single-peak dissociation curves were included in this analysis. Efficiency checks were performed for each primer pair in each cell line. PCR products were sequenced to verify the identity of quantified transcripts. The two-tailed, two-sample Student's T-tests were performed to identify significant differences in transcript expression. The F-test was used to identify unequal variance among samples to be compared.

LNCaP Hollow Fibre model

Animals

Five-week-old male athymic BALB/c Nude mice were obtained from Taconic Farms (Hudson, NY, United States of America) and kept in the British Columbia Cancer Research Centre (Vancouver, BC, Canada). Mice were maintained on a Harlan/Teklad irradiated diet with a constant supply of autoclaved water and housed in cages (three animals/cage) at 21°C ± 3°C with light/dark cycling (light between 6 AM and 6 PM). All animal experiments were performed according to a protocol approved by the Committee on Animal Care of the University of British Columbia.

Hollow fibre model

Polyvinylidene difluoride hollow fibres (Mr 500,000 molecular weight cutoff; 1-mm internal diameter; Spectrum Laboratories, Rancho Dominguez, CA, USA) were prepared and implanted as previously described[29]. Briefly, LNCaP human prostate cancer cells (3 × 107 cells) at passage 47 (provided by Dr. L.W.K. Chung at the Emory University School of Medicine, Atlanta, GA, USA) were injected into hollow fibres. The fibres were sealed and subcutaneously (s.c.) implanted into mice. Seven days post fibre implantation (day zero), mice were either castrated or left intact as controls. Blood was drawn via the tail vein each week to measure serum KLK3 levels to monitor the response to castration. Serum KLK3 levels were determined by enzymatic immunoassay kit (Abbott Laboratories, Abbott Park, IL, USA). Bundles of fibres were removed at day zero (Pre-Cx; four fibres) and day 10 (Cx; four fibres). Total RNA was isolated immediately from cells harvested from the fibres. Compromised fibres that were contaminated with mouse cells, as indicated by an infiltration of red blood cells that was determined by visual inspection, were not used in this study.

Abbreviations

AR: androgen receptor; AREs: androgen response elements; ARL6IP5: ADP-ribosylation like factor-6 interacting protein 5; CAMK2N1: calcium/calmodulin-dependent protein kinase II inhibitor 1; CI: confidence interval; Cx: castration; ERRF11: ERBB receptor feedback inhibitor; FBS: fetal bovine serum; FDR: false discovery rate; GAPDH: glyceraldehyde-3-phosphate; GLUL: glutamate-ammonia ligase; GOLPH3: golgi phosphoprotein 3; HM13: Histocompatibility (minor) 13; HSP90B1: heat shock protein 90 kDa beta member 1; KLK3: kallikrein 3 = PSA; LDT: linker-derived tag; LongSAGE: long serial analysis of gene expression; MANEA: mannosidase: endo alpha; MNE: mean normalized expression; NANS: n-acetylneuraminic acid synthase; NIPSNAP3A: nipsnap homologue 3A; PCR: polymerase-chain reaction; PSA: prostate-specific antigen = KLK3; QF: quality factor; qRT-PCR: quantitative real-time polymerase chain reaction; R1881: methyltrienolone: synthetic androgen; RefSeq: reference sequence; SAGE: serial analysis of gene expression; SD: standard deviation; shortSAGE: short serial analysis of gene expression; SLC41A1: solute carrier family 41: member 1; SOD1: superoxide dismutase 1; SVIP: small VCP/p97-interacting protein; TAOK3: tao kinase 3; TCP1: T-complex 1; VTA1: vps20-associated 1.

Authors' contributions

TLR conducted the experiments, analyzed the data and wrote the manuscript. GW generated the total RNA, analyzed the data, and helped to draft the manuscript. MAM provided support for the SAGE library construction with sequencing by RAH. SJM aided in the analysis of data. MDS conceived the study, designed the experiments, and coordination and wrote the manuscript. All authors read and approved the final manuscript.

Author's Information

Tammy L. Romanuik, PhD: Graduate Student, Genome Sciences Centre, BC Cancer Agency Current address: Medical Genetics, UBC Gang Wang, PhD: Postdoctoral fellow, Genome Sciences Centre, BC Cancer Agency Robert A. Holt, PhD: Head of Sequencing, Genome Sciences Centre, BC Cancer Agency Steven J. M. Jones, PhD: Associate Director/Head of Bioinformatics, Genome Sciences Centre, BC Cancer Agency Marco A. Marra, PhD: Director, Genome Sciences Centre Marianne D. Sadar, PhD: Program Leader for Prostate Cancer Research, BC Cancer Agency

Additional file 1

Primer sequences and amplification product sizes for candidate transcripts. The data provided represent the primer sequences used in quantitative real-time polymerase chain reaction to validate changes in gene expression in response to androgen. Click here for file
  55 in total

1.  Characterization of a new in vivo hollow fiber model for the study of progression of prostate cancer to androgen independence.

Authors:  Marianne D Sadar; Violetta A Akopian; Eliana Beraldi
Journal:  Mol Cancer Ther       Date:  2002-06       Impact factor: 6.261

2.  Correction of sequence-based artifacts in serial analysis of gene expression.

Authors:  Viatcheslav R Akmaev; Clarence J Wang
Journal:  Bioinformatics       Date:  2004-02-10       Impact factor: 6.937

3.  The expression of three abundance classes of messenger RNA in mouse tissues.

Authors:  N D Hastie; J O Bishop
Journal:  Cell       Date:  1976-12       Impact factor: 41.582

Review 4.  Antisense transcription: a critical look in both directions.

Authors:  T Beiter; E Reich; R W Williams; P Simon
Journal:  Cell Mol Life Sci       Date:  2009-01       Impact factor: 9.261

5.  New biochemical methods to determine androgen sensitivity of prostatic cancer: the relative enzymatic index (REI).

Authors:  J T Isaacs; W W Scott; D S Coffey
Journal:  Prog Clin Biol Res       Date:  1979

6.  Isolation of a human prostate carcinoma cell line (DU 145).

Authors:  K R Stone; D D Mickey; H Wunderli; G H Mickey; D F Paulson
Journal:  Int J Cancer       Date:  1978-03-15       Impact factor: 7.396

7.  Sex steroids as a cause of adenocarcinoma of the dorsal prostate in Nb rats, and their influence on the growth of transplants.

Authors:  R L Noble
Journal:  Oncology       Date:  1977       Impact factor: 2.935

8.  Impotence therapy and cancer of the prostate.

Authors:  P D Guinan; W Sadoughi; H Alsheik; R J Ablin; D Alrenga; I M Bush
Journal:  Am J Surg       Date:  1976-05       Impact factor: 2.565

9.  Establishment and characterization of a human prostatic carcinoma cell line (PC-3).

Authors:  M E Kaighn; K S Narayan; Y Ohnuki; J F Lechner; L W Jones
Journal:  Invest Urol       Date:  1979-07

10.  The development of prostatic adenocarcinoma in Nb rats following prolonged sex hormone administration.

Authors:  R L Noble
Journal:  Cancer Res       Date:  1977-06       Impact factor: 12.701

View more
  42 in total

1.  11-Ketotestosterone Is the Dominant Circulating Bioactive Androgen During Normal and Premature Adrenarche.

Authors:  Juilee Rege; Adina F Turcu; Josephine Z Kasa-Vubu; Antonio M Lerario; Gabriela C Auchus; Richard J Auchus; Joshua M Smith; Perrin C White; William E Rainey
Journal:  J Clin Endocrinol Metab       Date:  2018-12-01       Impact factor: 5.958

2.  Human CNNM2 is not a Mg(2+) transporter per se.

Authors:  Gerhard Sponder; Lucia Mastrototaro; Katharina Kurth; Lucia Merolle; Zheng Zhang; Nasrin Abdulhanan; Alina Smorodchenko; Katharina Wolf; Andrea Fleig; Reinhold Penner; Stefano Iotti; Jörg R Aschenbach; Jürgen Vormann; Martin Kolisek
Journal:  Pflugers Arch       Date:  2016-04-11       Impact factor: 3.657

3.  cPLA2 is protective against COX inhibitor-induced intestinal damage.

Authors:  David C Montrose; Krishna Kadaveru; Jillian N M Ilsley; Sierra H Root; Thiruchanduri V Rajan; Manish Ramesh; Frank C Nichols; Bruce T Liang; Dmitry Sonin; Arthur R Hand; Simona Zarini; Robert C Murphy; Glenn S Belinsky; Masako Nakanishi; Daniel W Rosenberg
Journal:  Toxicol Sci       Date:  2010-06-18       Impact factor: 4.849

4.  Changes in leukocyte gene expression profiles induced by antineoplastic chemotherapy.

Authors:  Rebeca González-Fernández; Manuel Morales; Julio Avila; Pablo Martín-Vasallo
Journal:  Oncol Lett       Date:  2012-04-03       Impact factor: 2.967

5.  Differential upregulation of the hypothetical transmembrane protein 66 (TMEM66) in multiple sclerosis patients with potential inflammatory response.

Authors:  Safa Taha; Muna Aljishi; Isa Alsharoqi; Moiz Bakhiet
Journal:  Biomed Rep       Date:  2014-11-17

6.  LNCaP Atlas: gene expression associated with in vivo progression to castration-recurrent prostate cancer.

Authors:  Tammy L Romanuik; Gang Wang; Olena Morozova; Allen Delaney; Marco A Marra; Marianne D Sadar
Journal:  BMC Med Genomics       Date:  2010-09-24       Impact factor: 3.063

7.  Spongian diterpenoids inhibit androgen receptor activity.

Authors:  Yu Chi Yang; Labros G Meimetis; Amy H Tien; Nasrin R Mawji; Gavin Carr; Jun Wang; Raymond J Andersen; Marianne D Sadar
Journal:  Mol Cancer Ther       Date:  2013-02-26       Impact factor: 6.261

8.  Solute Carrier Family SLC41, what do we really know about it?

Authors:  Andrea Fleig; Monika Schweigel-Röntgen; Martin Kolisek
Journal:  Wiley Interdiscip Rev Membr Transp Signal       Date:  2013

9.  Digital gene expression analysis of two life cycle stages of the human-infective parasite, Trypanosoma brucei gambiense reveals differentially expressed clusters of co-regulated genes.

Authors:  Nicola J Veitch; Paul C D Johnson; Urmi Trivedi; Sandra Terry; David Wildridge; Annette MacLeod
Journal:  BMC Genomics       Date:  2010-02-22       Impact factor: 3.969

10.  Heat-shock responsive genes identified and validated in Atlantic cod (Gadus morhua) liver, head kidney and skeletal muscle using genomic techniques.

Authors:  Tiago S Hori; A Kurt Gamperl; Luis Ob Afonso; Stewart C Johnson; Sophie Hubert; Jennifer Kimball; Sharen Bowman; Matthew L Rise
Journal:  BMC Genomics       Date:  2010-01-28       Impact factor: 3.969

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.