Literature DB >> 18389087

Rarity of Somatic Mutation and Frequency of Normal Sequence Variation Detected in Sporadic Colon Adenocarcinoma Using High-Throughput cDNA Sequencing.

Takatsugu Kan1, Bogdan C Paun, Yuriko Mori, Fumiaki Sato, Zhe Jin, James P Hamilton, Tetsuo Ito, Yulan Cheng, Stefan David, Alexandru V Olaru, Jian Yang, Rachana Agarwal, John M Abraham, Stephen J Meltzer.   

Abstract

We performed high-throughput cDNA sequencing in colorectal adenocarcinoma and matching normal colorectal epithelium. All six hundred three genes in the UCSC database that were expressed in colon cancers and contained open reading frames of 1000 nucleotides or less were selected for study (total basepairs/bp, 366,686). 304,350 of these 366,686 bp (83.0%) were amplified and sequenced successfully. Seventy-eight sequence variants present in germline (i.e. normal) as well as matching somatic (i.e. tumor) DNA were discovered, yielding a frequency of 1 variant per 3,902 bp. Fifty-one of these sequence variants were homozygous (26 synonymous, 25 non-synonymous), while 27 were heterozygous (11 synonymous, 16 non-synonymous). Cancer tissue contained only one sequence-altered allele of the gene ATP50, which was present heterozygously alongside the wild-type allele in matching normal epithelium. Despite this relatively large number of bp and genes sequenced, no somatic mutations unique to tumor were found. High-throughput cDNA sequencing is a practical approach for detecting novel sequence variations and alterations in human tumors, such as those of the colon.

Entities:  

Year:  2007        PMID: 18389087      PMCID: PMC2287164     

Source DB:  PubMed          Journal:  Bioinform Biol Insights        ISSN: 1177-9322


Introduction

It is widely believed that somatic as well as germline mutations play important roles in the origin and progression of colorectal cancers (Calvert and Frucht, 2002). Many genes have been investigated for mutation to elucidate mechanisms of colorectal cancer development, with these investigations demonstrating the involvement of mutations in colorectal carcinogenesis and progression. Samuels et al. reported that PIK3CA, a catalytic subunit of the class IA phosphatidylinositol 3-kinases, was somatically mutated in 32% of colorectal cancers, resulting in the attenuation of apoptosis and facilitated tumor invasion(Samuels et al. 2004). A comprehensive study entitled, “The Tyrosine Phosphatome” was accomplished by sequencing all genes involved in tyrosine phosphorylation in a large cancer cohort consisting of 175 colorectal cancer patients(Wang et al. 2004). Most mutational studies, however, have been preoccupied with the prevalence of somatic mutations in a specific single candidate gene in relatively small colorectal cancer patient cohorts. Recently, Sjoblom et al. reported the genome-wide frequencies of somatically mutated genes in human breast and colorectal cancers(Sjoblom et al. 2006). However, the methods these used were extremely expensive, time-consuming, and labor-intensive for a typical laboratory to perform. More practical strategies, amenable to smaller laboratories with more conservative budgets, would be of great value in the continuing quest to answer questions in the fields of tumor genomics and mutatomics. To this end, we present herein a circumscribed, practical mutational study employing high-throughput cDNA sequencing in colon adenocarcinoma, in which we demonstrate the eminent feasibility and results of determining sequence variation efficiently and at low cost.

Materials and Methods

Tissue samples

Colorectal cancer and its matching normal colonic mucosa from a patient undergoing surgical resection at the Baltimore VA Hospital after signing informed research consent was used for this study. Clinicopathological data were as follows: 75 year-old male; moderately-differentiated colorectal adenocarcinoma of the ascending colon; tumor size, 2.5 × 1.1 × 0.5 cm; TNM stage (Fifth Edition of the TNM classification of the UICC, 1997), T2N0MX, without any other malignancies. Both colorectal adenocarcinoma and normal colonic epithelium (obtained at the location within the surgically resected specimen furthest from the tumor) were cut into smaller pieces and frozen in liquid nitrogen immediately after removal. A frozen aliquot of each specimen was crushed into pieces and lysed immediately in either TRIZOL reagent (Invitrogen Corp., Carlsbad, CA,) to extract total RNA, or lysis buffer of a DNeasy Tissue kit (QIAGEN Inc., Valencia, CA) to extract DNA, according to these manufacturers’ instructions.

Cell lines

HeLa S3, HT29, HCT15, HCT116, LoVo, CaCo2, LS174T, LS411N, and DLD1, purchased from the American Type Culture Collection (ATCC), and KYSE30, 70, 110, 150, 220, 410, 770, 850 and OE33, obtained from Dr. Yutaka Shimada at Kyoto University in Japan (Shimada et al. 1992), were enrolled in the current study in order to validate our findings in the ATP50 gene. Culture conditions for each cell line were according to ATCC and the establisher’s recommendations. All cell lines were supplemented with 10% fetal bovine serum plus an appropriate concentration of penicillin and streptomycin.

Gene selection

To increase our chances of successfully amplifying and sequencing cDNAs, we restricted our study to genes that are known to be expressed in colorectal cancer cells, based on a gene expression database at the University of California, Santa Cruz (UCSC) [http://genome.ucsc.edu/index.html]. From among this gene set, we selected a subset of genes (approximately 600) containing open reading frames (ORFs) 1000 nucleotides or shorter in length. To automate design of the large number of primer sets required, we developed an in-house primer design algorithm based on the publicly available primer design software program, Primer3 (http://frodo.wi.mit.edu/cgi-bin/primer3/primer3_www_slow.cgi). PCR products were designed to range from 300 to 500 bp in length. ORFs of cDNAs longer than 500 bp were divided into 2 or 3 fragments; primers were then designed with adjacent fragments overlapped, in order to completely cover these longer ORFs. Finally, for each heterozygous sequence alteration, genomic DNA primers (available on request) were designed to confirm cDNA sequencing results.

RT-PCR

Total RNA extracted from colorectal adenocarcinoma and normal colonic epithelium was reverse-transcribed using a SuperScript III First-Strand kit (Invitrogen, Carlsbad, CA), and respective cDNA pools were made. RT-PCR was performed using an AccuPrime Supermix I Kit (Invitrogen). The PCR protocol was as follows: 1 min at 96 °C followed by 35 cycles of 30 sec at 94 °C, 45 sec at 58 °C, and 1 min at 72 °C. Secondary PCR was performed on purified template from the first RT-PCR product, using the same protocol.

Sequencing

A BigDye Terminator v3.1 Kit (Applied Biosystems, Foster City, CA) was used for the sequencing reaction, and sequence products were read on an SCE 9610 automated 96-capillary sequencer (Spectruby BaseSpectrum v2.10 (SpectruMedix) and analyzed with Mutation Surveyor v2.2 (SoftGenetics LLC, State College, PA). Each time candidate sequence alterations were discovered in cDNA from colorectal cancer tissue, identical procedures were followed in matched normal epitheliam to confirm whether or not they represented somatic alterations. After candidate alterations were confirmed, the entire procedure was repeated separately on a fresh aliquot of cDNA from both the cancer and normal specimens in order to exclude amplification or technical errors due to two-stage PCR. Genomic DNA sequencing was also performed on heterozygous sequence variants to confirm that identical sequence alterations were present in genomic DNA.

Methylation-specific PCR (MSP)

Because the gene ATP50 was apparently mutated, raising the possibility that it was a tumor suppressor gene, we evaluated this gene for alternative inactivation via promoter hypermethylation. MSP primer sequences of ATP50 for the methylated reaction were: forward (5′-CGAGTGGGAGC-GATTTAGGAC-3′) and reverse (5′-AACGC-CAAAATTACGACACG-3′), which amplify a 94-bp product. β-actin was selected as an internal control gene, using previously published MSP primers (Eads et al. 2001). CpGenome Universal Methylated DNA (Chemicon International, Inc., Temecula, CA) was used as a positive control. The detailed MSP procedure has been previously published (Sato et al. 2002).

Microsatellite instability (MSI) assay

MSI at each locus was determined by analyses of the length of each PCR-amplified microsatellite. MSI status was confirmed by MSI assays at five consensus loci (BAT25, BAT26, D2S123, D5S346, and D17S250) according to criteria from a National Cancer Institute workshop (Boland et al. 1998). Detailed procedures were as previously described (Mori et al. 2001).

Results

Project overview

A total of 603 genes (S-Table 1) were selected based on their length (under 1,000 bp) and their predicted expression in colorectal cancers according to the UCSC database. One thousand thirty-eight primer pairs (available on request) were designed to cover the entire ORFs (total bp, 366,687) of these 603 genes. Sequence data from 862 (83.0%) of these 1,038 primer sets were successfully analyzed, meaning that approximately 304,350 total bp were successfully sequenced (all primer sets for RT-PCR and cDNA sequencing are available on request).
S-Table-1.

Gene list

GenBank Accession No.Gene SymbolGenBank Accession No.Gene SymbolGenBank Accession No.Gene Symbol
NM_000100.2CSTBNM_198057.1TSC22D3NM_002811.3PSMD7
NM_000999.2RPL38NM_022652.2DUSP6NM_002812.3PSMD8
NM_001827.1CKS2NM_021121.2EEF1B2NM_002813.4PSMD9
NM_001863.3COX6B1NM_001412.2EIF1AXNM_170750.1PSMD10
NM_001866.2COX7BNM_004094.3EIF2S1NM_004577.3PSPH
NM_001867.2COX7CNM_001423.1EMP1NM_175847.1PTBP1
NM_002489.2NDUFA4NM_001424.3EMP2NM_198974.1PTK9
NM_002491.1NDUFB3NM_001425.1EMP3NM_002823.2PTMA
NM_002966.1S100A10NM_207168.1ENSANM_002824.4PTMS
NM_003009.2SEPW1NM_001005915.1ERBB3NM_133377.1RAD1
NM_003063.1SLNNM_001983.2ERCC1NM_153824.1PYCR1
NM_003095.1SNRPFNM_001984.1ESDNM_000320.1QDPR
NM_003133.1SRP9NM_001439.1EXTL2NM_004161.3RAB1A
NM_003498.3SNNNM_001997.2FAUNM_004162.3RAB5A
NM_003746.1DYNLL1NM_005247.2FGF3NM_002868.2RAB5B
NM_003860.2BANF1NM_002007.1FGF4NM_198896.1RAB6A
NM_003945.3ATP6V0ENM_002010.1FGF9NM_002870.2RAB13
NM_004045.2ATOX1NM_023108.1FGFR1NM_183235.1RAB27A
NM_004485.2GNG4NM_001449.3FHL1NM_201434.1RAB5C
NM_004541.2NDUFA1NM_054014.1FKBP1ANM_198829.1RAC1
NM_004772.1C5orf13NM_002013.2FKBP3NM_002872.3RAC2
NM_005274.1GNG5NM_016725.1FOLR1NM_133630.1RAD51L3
NM_005517.2HMGN2NM_004477.1FRG1NM_002881.2RALB
NM_005694.1COX17NM_000146.2FTLNM_006325.2RAN
NM_005770.3SERF2NM_198903.1GABRG2NM_002884.1RAP1A
NM_005887.1DLEU1NM_000166.2GJB1NM_015646.3RAP1B
NM_005949.1MT1FNM_024009.2GJB3NM_032626.5RBBP6
NM_005954.2MT3NM_002061.2GCLMNM_181558.1RFC3
NM_005978.3S100A2NM_006708.1GLO1NM_181578.1RFC5
NM_006156.1NEDD8NM_002066.1GMLNM_134427.1RGS3
NM_006274.2CCL19NM_016592.1GNASNM_005614.2RHEB
NM_006304.1SHFM1NM_005301.2GPR35NM_000326.3RLBP1
NM_006353.2HMGN4NM_002083.2GPX2NM_002938.2RNF4
NM_006698.2BLCAPNM_002084.2GPX3NM_183045.1RNF6
NM_006829.2C10orf116NM_203506.1GRB2NM_002946.3RPA2
NM_007233.1TP53AP1NM_001512.2GSTA4NM_002947.3RPA3
NM_007281.1SCRG1NM_147149.1GSTM4NM_033301.1RPL8
NM_012456.1TIMM10NM_145871.1GSTZ1NM_033251.1RPL13
NM_012458.2TIMM13NM_004492.1GTF2A2NM_000985.2RPL17
NM_012460.2TIMM9NM_002095.3GTF2E2NM_000984.3RPL23A
NM_013332.1HIG2NM_000858.3GUK1NM_000992.2RPL29
NM_013343.1LOH3CR2ANM_005318.2H1F0NM_001001.3RPL36AL
NM_014041.1SPCS1NM_002106.3H2AFZNM_021029.3RPL36A
NM_014051.2TMEM14ANM_005324.3H3F3BNM_001002.3RPLP0
NM_014221.1MTCP1NM_005326.3HAGHNM_002949.2MRPL12
NM_014356.2C6orf123NM_005327.1HADHSCNM_001007.3RPS4X
NM_014445.2SERP1NM_005330.3HBE1NM_001015.3RPS11
NM_014624.3S100A6NM_004494.1HDGFNM_001019.3RPS15A
NM_014792.2KIAA0125NM_139011.1HFENM_001020.3RPS16
NM_016096.2ZNF706NM_005340.3HINT1NM_001022.3RPS19
NM_016305.1SS18L2NM_002118.3HLA-DMBNM_001023.2RPS20
NM_016565.2CHCHD8NM_002128.3HMGB1NM_001025.3RPS23
NM_020142.3LOC56901NM_002129.2HMGB2NM_002960.1S100A3
NM_020179.1FN5NM_004965.6HMGN1NM_005620.1S100A11
NM_020181.1C14orf162NM_002131.2HMGA1NM_000664.3ACACA
NM_020248.1CTNNBIP1NM_173158.1NR4A1NM_198970.1AES
NM_020408.3C6orf149NM_002136.1HNRPA1NM_001636.1SLC25A6
NM_021104.1RPL41NM_031314.1HNRPCNM_001001787.1ATP1B1
NM_021127.1PMAIP1NM_002138.3HNRPDNM_001687.4ATP5D
NM_021177.3LSM2NM_021644.2HNRPH3NM_001002256.1ATP5G3
NM_023937.2MRPL34NM_006896.2HOXA7NM_004047.2ATP6V0B
NM_031286.2SH3BGRL3NM_153715.1HOXA10NM_198589.1BSG
NM_031287.2SF3B5NM_156037.1HOXB6NM_004927.2MRPL49
NM_032412.2ORF1-FL49NM_004502.2HOXB7NM_006136.2CAPZA2
NM_032574.1LOC84661NM_024016.2HOXB8NM_022845.2CBFB
NM_032747.1USMG5NM_014620.2HOXC4NM_001760.2CCND3
NM_052871.2MGC4677NM_153693.1HOXC6NM_171827.1p32/CD8A
NM_052971.1LEAP-2NM_006897.1HOXC9NM_001773.1CD34
NM_080677.1DYNLL2NM_014212.2HOXC11NM_000611.4CD59
NM_138448.2ACYP2NM_024501.1HOXD1NM_001780.3CD63
NM_139286.3CDC26NM_134421.1HPCAL1NM_004359.1CDC34
NM_194327.1GALIGNM_182638.1HPS1NM_058197.2p16/CDKN2A
NM_198835.1ACACANM_005524.2HES1NM_001280.1CIRBP
NM_020115.3ACRV1NM_198431.1HSPA4NM_001833.1CLTA
NM_001124.1ADMNM_001540.2HSPB1NM_022645.2CSH2
NM_000674.1ADORA1NM_005528.1DNAJC4NM_012140.3SLC25A10
NM_000676.2ADORA2BNM_181353.1ID1NM_148979.1CTSH
NM_001630.1ANXA8NM_002166.4ID2NM_000396.2CTSK
NM_001154.2ANXA5NM_174856.1IDH3BNM_001336.2CTSZ
NM_080649.1APEX1NM_004508.2IDI1NM_001915.2CYB561
NM_000041.2APOENM_005533.2IFI35NM_004418.2DUSP2
NM_152876.1FASNM_021068.1IFNA4NM_004427.2PHC2
NM_000486.3AQP2NM_000612.2IGF2NM_001970.3EIF5A
NM_053286.1AQP6NM_001552.1IGFBP4NM_001419.2ELAVL1
NM_001659.1ARF3NM_000576.2IL1BNM_198194.1STOM
NM_001660.2ARF4NM_172200.1IL15RANM_202001.1ERCC1
NM_001663.2ARF6NM_005536.2IMPA1NM_023110.1FGFR1
NM_001664.2RHOANM_014214.1IMPA2NM_201557.1FHL2
NM_004040.2RHOBNM_198219.1ING1NM_004468.3FHL3
NM_175744.3RHOCNM_198337.1INSIG1NM_057092.1FKBP2
NM_005168.2RND3NM_002198.1IRF1NM_016730.1FOLR1
NM_001665.2rho GNM_004030.1IRF7NM_004477.1FRG1
NM_004309.3ARHGDIANM_181493.1ITPANM_002032.1FTH1
NM_001177.3ARL1NM_002228.3JUNNM_002035.1FVT1
NM_004311.2ARL3NM_002231.2CD82NM_001487.1BLOC1S1
NM_004314.1ART1NM_004137.2KCNMB1NM_004483.3GCSH
NM_032468.2ASPHNM_033360.2KRASNM_004124.2GMFB
NM_005171.2ATF1NM_002295.2RPSANM_000581.2GPX1
NM_004024.2ATF3NM_005563.3STMN1NM_002085.1GPX4
NM_001677.3ATP1B1NM_005564.2LCN2NM_147148.1GSTM4
NM_001679.2ATP1B3NM_005566.1LDHANM_002095.3GTF2E2
NM_001001977.1ATP5ENM_201544.1LGALS8NM_002107.3H3F3A
NM_001002015.1ATP5F1NM_004987.3LIMS1NM_005342.1HMGB3
NM_005175.2ATP5G1NM_005574.2LMO2NM_002133.1HMOX1
NM_001002258.1ATP5G3NM_002346.1LY6ENM_002134.2HMOX2
NM_001003701.1ATP5JNM_002353.1TACSTD2NM_156036.1HOXB6
NM_001694.2ATP6V0CNM_014220.1TM4SF1NM_024017.3HOXB9
NM_001697.2ATP5ONM_002354.1TACSTD1NM_000194.1HPRT1
NM_004322.2BADNM_030885.2MAP4NM_005343.2HRAS
NM_053056.1CCND1NM_203378.1MBNM_174856.1IDH3B
NM_138578.1BCL2L1NM_002386.2MC1RNM_000628.3IL10RB
NM_004050.2BCL2L2NM_182763.1MCL1NM_181431.1FOXK2
NM_000713.1BLVRBNM_012328.1DNAJB9NM_181468.1ITGB4BP
NM_005180.5PCGF4NM_005370.4RAB8ANM_201543.1LGALS8
NM_004331.2BNIP3LNM_177524.1MESTNM_002359.2MAFG
NM_032515.3BOKNM_005371.3METTL1NM_004528.2MGST3
NM_004332.1BPHLNM_017459.1MFAP2NM_022792.2MMP19
NM_007306.1BRCA1NM_145791.1MGST1NM_002448.1MSX1
NM_198590.1BSGNM_002413.3MGST2NM_005962.3MXI1
NM_001207.3BTF3NM_002414.3CD99NM_079424.1MYL6
NM_001731.1BTG1NM_002415.1MIFNM_032104.1PPP1R12B
NM_007311.2BZRPNM_022791.2MMP19NM_004547.4NDUFB4
NM_172369.1C1QGNM_002434.1MPGNM_182739.1NDUFB6
NM_001217.2CA11NM_021126.3MPSTNM_005005.1NDUFB9
NM_000387.3SLC25A20NM_012331.2MSRANM_020529.1NFKBIA
NM_006888.2CALM1NM_002451.3MTAPNM_002520.4NPM1
NM_005184.1CALM3NM_015675.1GADD45BNM_002607.2PDGFA
NM_005185.2CALML3NM_001002841.1(MYL4NM_005022.2PFN1
NM_001745.2CAMLGNM_079423.1MYL6NM_000942.4PPIB
NM_001003962.1CAPNS1NM_002478.3MYOD1NM_206873.1PPP1CA
NM_004346.2CASP3NM_032103.1PPP1R12BNM_183079.1PRNP
NM_001755.2CBFBNM_005594.2NACANM_002765.2PRPS2
NM_004059.3CCBL1NM_182744.1NBL1NM_145888.1KLK10
NM_001759.2CCND2NM_014222.2NDUFA8NM_002790.2PSMA5
NM_199246.1CCNG1NM_004548.1NDUFB10NM_152255.1PSMA7
NM_001239.2CCNHNM_004549.3NDUFC2NM_176783.1PSME1
NM_001763.1CD1ANM_002496.1NDUFS8NM_183236.1RAB27A
NM_000733.2CD3ENM_181827.1NF2NM_004583.2RAB5C
NM_001769.2CD9NM_001001716.1NFKBIBNM_133629.1RAD51L3
NM_005191.2CD80NM_005008.2NHP2L1NM_021033.4RAP2A
NM_152942.1TNFRSF8NM_198175.1NM23A/NME1NM_002899.2RBP1
NM_001244.2TNFSF8NM_000904.1NQO2NM_000976.2RPL12
NM_001001392.1CD44NM_000270.1NPNM_001016.2RPS12
NM_198793.1CD47NM_199185.1NPM1NM_000331.2SAA1
NM_000560.2CD53NM_006172.1NPPANM_005981.3TSPAN31
NM_203330.1CD59NM_002524.2NRASNM_002970.1SAT
NM_004357.3CD151NM_004559.2YBX1NM_006745.2SC4MOL
NM_001786.2CDC2NM_007105.1SLC22A18ASNM_006746.3SCML1
NM_033534.1CDC2L2NM_005602.4CLDN11NM_001037.3SCN1B
NM_052827.1CDK2NM_175568.1P2RX4NM_003000.1SDHB
NM_000075.2CDK4NM_175081.1P2RX5NM_183352.1SEC13L1
NM_078467.1CDKN1ANM_002567.2PBPNM_014563.2TRAPPC2
NM_004064.2CDKN1BNM_002573.2PAFAH1B3NM_003016.2SFRS2
NM_000077.3CDKN2ANM_181696.1PRDX1NM_152235.1SFRS8
NM_078626.1CDKN2CNM_000281.2TCF1/PCBD1NM_004593.1SFRS10
NM_005195.2CEBPDNM_032403.1PCDHGC3NM_173217.1ST6GAL1
NM_001806.2CEBPGNM_002592.2PCNANM_170679.1SKP1A
NM_001809.2CENPANM_033023.1PDGFANM_005984.1SLC25A1
NM_004365.2CETN3NM_002608.1PDGFBNM_022875.1SMN2
NM_005507.2CFL1NM_213612.1SLC25A3NM_004596.3SNRPA
NM_001817.1CEACAM4NM_002642.3PIGCNM_198216.1SNRPB
NM_152253.1CHKBNM_002648.2PIM1NM_198220.1SNRPB2
NM_013324.4CISHNM_006224.2PITPNANM_177542.1SNRPD2
NM_001281.2CKAP1NM_002653.3PITX1NM_004175.3SNRPD3
NM_001284.2AP3S1NM_000929.1PLA2G5NM_022807.2SNRPN
NM_001288.4CLIC1NM_001005376.1PLAURNM_000454.4SOD1
NM_001291.2CLK2NM_021910.1FXYD3NM_006943.2SOX12
NM_001293.1CLNS1ANM_021105.1PLSCR1NM_001047.1SRD5A1
NM_007097.2CLTBNM_153321.1PMP22NM_003132.1SRM
NM_004368.2CNN2NM_174930.2PMS2L5NM_003135.1SRP19
NM_007310.1COMTNM_001003686.1PMS2L3NM_003144.2SSR1
NM_001300.3KLF6NM_022716.1PRRX1NM_003155.1STC1
NM_001861.2OX4I1NM_002696.1POLR2GNM_177528.1SULT1A2
NM_001305.3CLDN4NM_006232.2POLR2HNM_004177.3STX3A
NM_001306.2CLDN3NM_006233.4POLR2INM_004604.3STX4A
NM_001307.3CLDN7NM_021129.2PPA1NM_003164.2STX5A
NM_001878.2CRABP2NM_203430.1PPIANM_177534.1SULT1A1
NM_004379.2CREB1NM_000943.4PPICNM_003166.2SULT1A3
NM_001310.2CREBL2NM_177951.1PPM1ANM_181491.1SURF5
NM_181571.1CREMNM_177969.1PPM1BNM_014231.3VAMP1
NM_005206.3CRKNM_206877.1PPP1CBNM_014232.1VAMP2
NM_005207.2CRKLNM_002710.1PPP1CCNM_005638.3SYBL1
NM_001889.2CRYZNM_006241.3PPP1R2NM_006754.2SYPL1
NM_139014.1MAPK14NM_002715.1PPP2CANM_003187.3TAF9
NM_177436.1CSE1LNM_178002.1PR 53/PPP2R4NM_005643.2TAF11
NM_022644.2CSH2NM_000945.3PPP3R1NM_172208.1TAPBP
NM_177560.2CSNK2A1NM_005399.3PRKAB2NM_134324.1TARBP2
NM_001320.5CSNK2BNM_207578.1PRKACBNM_201437.1TCEA1
NM_001321.1CSRP2NM_212461.1PRKAG1NM_213648.1TCF7
NM_000396.2CTSKNM_138981.1MAPK10NM_181738.1PRDX2
NM_004394.1DAPNM_002756.2MAP2K3NM_201443.1TEAD4
NM_020548.4DBINM_002764.2PRPS1NM_003201.1TFAM
NM_001924.2GADD45ANM_139277.1KLK7NM_174886.1TGIF
NM_004083.4DDIT3NM_002774.2KLK6NM_003255.3TIMP2
NM_001355.2DDTNM_213633.1PSG4NM_003270.2TSPAN6
NM_030655.2DDX11NM_203287.1PSG11NM_003271.3TSPAN4
NM_213566.1DFFANM_148976.1PSMA1NM_021137.3TNFAIP1
NM_000791.2DHFRNM_002789.3PSMA4NM_000363.3TNNI3
NM_007326.1CYB5R3NM_002791.1PSMA6NM_005079.1TPD52
NM_138281.1DLX4NM_002794.3PSMB2NM_003287.2TPD52L1
NM_203316.1DPAGT1NM_002801.2PSMB10NM_199362.1TPD52L2

Sequence variants

Seventy-eight sequence variants within 50 genes were found among the 603 genes studied (Table 1) (S-Table 2 for detailed information). Thus, the frequency of sequence variants was 1 per 3,902 bp (78 total variants/304,350 total bp). Of these 78 sequence alterations, 51 were homozygous (26 synonymous, 25 non-synonymous) and 27 were heterozygous (11 synonymous, 16 non-synonymous). All sequence alterations were detected in both colorectal cancer tissue and matched normal colonic epithelium, with the exception of an alteration in the gene ATP50 (NM_001697), which manifested a unique expression mechanism (Fig. 1). Forty-four sequence alterations had been previously reported, but 34 sequence alterations were completely novel, having never been reported in the SNP database at The National Center for Biotechnology Information (NCBI).
Table 1.

Sequence variants

HomozygousHeterozygous
SynonymousNon-synonymousSynonymousNon-synonymous
Number of alterations26251116
S-Table 2.

Sequence variants

Homozygous alterationHeterozygous alteration
GenBank Accession No.Gene SymbolSynonymous alterationNCBI SNP DatabaseNon-Synonymous alterationNCBI SNP DatabaseSynonymous alterationNCBI SNP DatabaseNon-Synonymous alterationNCBI SNP Database
NM_000320QDPR Leu132LeuG396A,rs2597775
NM_000331SAA1C209T, Ala70Valrs1136743
T224C, Val75Alars1136747
NM_001007RPS4X Leu164LeuG492A,rs7580
NM_001020RPS16 Gly5GlyC15T,rs17626
T27G, Ser9Serrs17628
NM_001047SRD5A1A309AG, Pro103Prors3822430
G348GA, Ala116Alars8192186
NM_001320CSNK2B Tyr46TyrT138C,rs14365
NM_001636SLC25A6 Phe136PheT408C,rs7205
NM_001697ATP50 Gly36GlyT108C,rs17728665A218AG, Lys73Arg*
NM_001760CCND3T775TG, Ser259Alars1051130
NM_001817CEACAM4T668A, Val223Glu.*
NM_001861COX4I1G7GA, Ala3Thrrs17855751
NM_001889CRYZ Gly18GlyG54A,rs4650284T138C, Gly46Gly*
NM_002131HMGA1 Ser2SerT6C,*G49A, Glu17Lys*
G78T, Arg26Arg*G112A, Gly38Arg*
C255A, Gly85Gly*Pro48LeuC143T,*
C217T, Arg73Gly*
C236T, A237G,*
Pro79Leu G286A,*
Glu96Lys
NM_002136HNRPA1 Gly248GlyC744T,*
NM_002414CD99C369CT, Ala121Alars4575010
NM_002642PIGC Gly89GlyT267C,rs2230471
NM_002813PSMD9T50C, Val17Alars2230681
NM_003144SSR1C388T, His130Tyr*
NM_003255TIMP2G303GA, Ser101Serrs2277698
NM_004064CDKN1BT326TG, Val109Glyrs2066827
NM_004137KCNMB1G193GA, Glu65Lysrs11739136
NM_004175SNRPD3 Ala101AlaT303C,rs3176991
NM_004365CETN3G28GC, Val10Leurs4873
NM_004468FHL3 Pro180ProG540A,rs7366048
NM_004549NDUFC2C136CG, Leu46Valrs8875
NM_005171ATF1C327CT, Tyr109Tyrrs1129406
NM_005191CD80G135GA,rs2228017
Val45Val
NM_005301GPR35G85GA, Ala29Thr. A880AC,*rs3749172
NM_005342HMGB3 Asn186LysC558G,*Ser294Arg
NM_005594NACAT543TA, Ile181Ilers4788
NM_005984SLC25A1 Lys277LysA831G,*
NM_006353HMGN4 Gly66GlyG198Ars4871
NM_006896HOXA7 Ala32AlaT96G,rs2301720G52A, Ala18Thrrs2301721
NM_007310COMT His12HisC36T,rs4633G322A, Val108Metrs4680
NM_007311BZRP His53ArgA158G, Ala68Alars6971G204GA,rs6972
NM_012328DNAJB9 Pro61ProG183A,rs1043615
NM_013332HIG2 Glu28GluA84G,*
NM_014212HOXC11 Ser12SerT36G,rs4759315
NM_014232VAMP2T346A, Ser116Thr*
NM_021068IFNA4A146AC,*
His49Pro G178GC,*
Gly60Arg T190TA,*
Phe64Ile G187GC,rs3203576
NM_024009GJB3Glu63Gln C357CT,*
Asn119Asn
NM_033251RPL13G334A, Ala112Thrrs9930567
NM_052871MGC4677G28T, Ala10Serrs28673896C12CT, Thr4Thr*C109CT, Arg37Cys*
NM_058197CDKN2AArg54GlyA160T,*
NM_145888KLK10A318C, Gly106Glyrs2075688T347C, Leu116Pro*
C336G, Thr112Thrrs1061368
G423A, Leu141Leurs2075689`
NM_172200IL15RAC248T, Pro83Leu`*
A337C, Thr113Pro*
NM_172369C1QGGly215GluG644A,*
NM_181571CREMIle137ThrT410C,*
NM_198970TEAD4Pro194LeuC580CT,*
NM_201544LGALS8Met56Val.A166AG,rs1041937
G542GC, Gly181Ala*

No report found.

Figure 1.

cDNA sequencing of ATP50. Two different alteration sites were detected. At the 108th nucleotide, colorectal cancer tissue had only a mutant cytosine nucleotide, while normal colon contained both a thymine (wild) and a cytosine (mutant). Both codons GGT and GGC encoded glycine (synonymous alteration). At the 218th mucleotide, colorectal cancer tissue had only a mutant guanine nucleotide, while normal tissue contained both an adenine (wild) and a guanine (mutant). AAA encoded lysine and AGA encoded arginine (non-synonymous alteration). Gly, glycine; Arg, arginine.

Tumor-specific regulation of gene expression

Tumor-specific regulation of gene expression was found for NM_001697 (ATP50, Homo sapiens ATP synthase, H+ transporting, mitochondrial F1 complex, O subunit). The sequence alterations T108C (GGT to GGC, homozygous, Gly36Gly) and A218G (AAA to AAG, homozygous, Lys73Arg) were observed only in cancer-derived cDNA, while the alterations T108TC (CGT and GGC, heterozygous, 36Gly) and A218AG (AAA and AAG, heterozygous, 73Lys and 73 Arg) were observed in cDNA from normal epithelium. Surprisingly, both T108TC (CGT and GGC, heterozygous, 36Gly) and A218AG (AAA and AAG, heterozygous, 73Lys and 73 Arg), which were identical to the two alterations observed in normal cDNA, were observed in genomic DNA from both cancer and normal tissue (Figures 2, 3). This result implied that the cancer exhibited monoallelic expression from the variant allele of ATP50, while the normal epithelium manifested biallelic heterozygous expression, i.e. from both the reported normal allele and our discovered variant mutant allele simultaneously.
Figure 2.

Representative result of cDNA sequencing of ATP50 in normal and cancer tissues. The red box in the left panel shows the 36th codon, while the red box in the right panel shows the 73rd codon. All alterations were confirmed by both forward and reverse sequencing.

Figure 3.

Genomic DNA sequencing of ATP50 in normal and cancer tissues. Upper panels, forward sequencing; lower panels, reverse sequencing. The red boxes show the 36th and 73rd codons. Both normal and colorectal cancer tissues contain identical heterozygosities at the 108th and 218th nucleotides.

MSP

One possible mechanism for monoallelic expression observed for ATP50 was DNA methylation of its promoter region. MSP showed, however, that there was no methylation of the ATP50 promoter in colorectal cancer (S-Fig. 1).

Somatic mutations

There were no somatic mutations found among the 603 genes studied or within the p53 gene.

MSI status

MSI assays showed that there was no microsatellite instability in genomic DNA (S-Fig. 2).

Discussion

In the current study, we assumed that if a mutant protein was involved in carcinogenesis or tumor progression, this mutant would be expressed and therefore detectable in tumor mRNA. i.e. we assumed that somatic mutations involved in carcinogenesis or tumor progression would be detectable by direct cDNA sequencing. By using this strategy, we avoided the need for sequencing each exon of genomic DNA, reasoning that genes which are never expressed in normal or malignant colon probably do not participate in colorectal carcinogenesis. We discovered 78 sequence variants (44 of which had been previously reported as single-nucleotide polymorphisms, but 34 of which had never been reported) among the 603 genes (304,350 bp of ORFs) studied. Recently, Sjoblom T. et al. performed genome-wide sequencing in breast and colorectal cancers, revealing that an average of 52 mutations occurred in each colorectal cancer(Sjoblom et al. 2006). According to the article by Sjoblom et al. the somatic mutation frequency in colon cancers was 3.2 somatic mutations/Mb, on average (Table 1 of their paper). Therefore, the probability of our finding zero somatic mutations among the 603 genes (304,350 bp) that we studied was 37.76% (please see formula below), suggesting that our findings were statistically quite consistent with Sjoblom’s results: The Sjoblom team also defined “CAN-genes” (candidate cancer genes) as those that were frequently mutated in colorectal cancers, and found that 69 genes could be included in this category. Although the CAN-genes KRAS, GNAS and TP53 were studied by us, no somatic mutations were found in these genes. Furthermore, in addition to the genes mentioned above, NRAS, HRAS, p16, and p27 were included in the current study, but these genes also contained no somatic mutations. Finally, results of MSI assays revealed MS-stability (MSS), implying an absence of mutations in the major DNA mismatch repair genes (although these genes were not studied due to their long ORFs). It is possible that other molecular pathogenetic pathways were involved in this colorectal tumorigenesis, such as those containing APC, MCC, DCC, or the TGF-β cascade: these genes were also not examined in the current study due to ORF length. Approximately 24,000,000 bp among the entire genomic DNA sequence are reported as ORFs in the UCSC database. The average density of each SNP is once per 1.9 kilobases (i.e. 1,419,190 SNPs/2.7 gigabases of human genome sequence)(Sachidanandam et al. 2001). We sequenced 304,350 bp of ORFs (viz., 1.26% of the total ORFs in the UCSC database: 304,350 bp/24,000,000 bp) and discovered 78 sequence variants, yielding a frequency of 1 alteration per 3,902 bp (78/304,350 bp). Our observed sequence variant distribution may provide a basis with which to estimate the number of SNPs in a single individual with colon cancer. That is, the SNPs reported above are one possible subset of the entire database; there is no guarantee that a given individual will always harbor all SNPs in the database. The human ATP50 gene (X83218, NM_001697), encoding a 213-amino acid ATP synthase OSCP subunit, is a key structural component of the stalk of the mitochondrial respiratory chain F1F0-ATP synthase, which is a vital element in the cellular pathway of energy conversion (Senior, 1988). Although a mutant strain of yeast in which the delta subunit of F1F0-ATP synthase had been inactivated by insertional mutagenesis showed little or no ATPase activity(Giraud and Velours, 1994), and dysfunction of ATP synthase can cause a variety of degenerative diseases(Wallace, 1994), there have been no previous reports detailing a relationship between ATP synthase and tumorigenesis. We found restricted monoallelic (i.e. monoallelically silenced) expression of an altered allele from ATP50 in our colon cancer tissue, which would be expected to exert the same effect as would a somatic mutation of this gene. Genomic DNA sequencing of ATP50 revealed that this monoallelic expression was not due to LOH. We therefore studied the methylation status of the CpG island in the promoter region of ATP50 by MSP, but we found no methylation of this region. Other epigenetic mechanisms, such as histone deacetylation, might have contributed to monoallelic expression of ATP50. There was no monoallelic expression of ATP50 in 20 cancer cell lines that we examined. Although monoallelic expression of this altered ATP50 allele may be involved in a subset of colorectal cancers, further study is required to clarify the potential functional role of this gene in carcinogenesis. This study poses several advantages as well as limitations. Firstly, it has been reported that some synonymous mutations may influence the stability of mRNA(Duan and Antezana, 2003; Chamary and Hurst, 2005) because they affect the thermodynamic stability of mRNA secondary structures (Fitch, 1974; Klambt, 1975). Nonsense-mediated mRNA decay (NMD) is also known as a surveillance pathway that rapidly degrades mRNAs containing premature termination codons(Culbertson and Leeds, 2003; Amrani et al. 2006). These mechanisms may cause instability of mRNA, accelerate the degradation of mRNA, and consequently result in difficulty in detecting sequence alterations by cDNA sequencing. Since we used cDNA as our starting material for sequencing, we may have ignored some key genes because of RNA degradation. Nevertheless, many sequence variants were detected reasonably well in the current study, suggesting that degradation of mRNA occurred rarely, if at all, as a consequence of sequence alterations. Instead, we considered it more important to increase our chances of finding sequence alterations by using cDNA rather than genomic DNA because of the lower cost, time, and labor involved in sequencing cDNA, as well as the increased relevance of only studying genes that are expressed in the colon. Secondly, it is conceivable that we lost some gene sequence information due to extremely low expression levels. Therefore, we employed two-stage PCR to increase our chances of successful sequencing, thereby achieving a relatively high success rate of 862/1,038 reactions, or 83.0%. Possibly, this result still may have included genes that were not expressed in our particular colorectal cancer, even though we used the UCSC database to select genes that were purportedly expressed in colorectal cancers. Our sequencing success rate appears favorable when compared to genomic DNA sequencing, where 92% of genes were successfully analyzed (Wang et al. 2004). The total number of exons sequenced in our study was 2107, implying that at least 2107 primer pairs would have been necessary to conduct this study had it been attempted by genomic DNA sequencing; in contrast, we accomplished this task using only 1038 primer sets for cDNA sequencing. This contrast demonstrates that our method is useful to explore mutations because it is not only more cost-effective, but also less demanding in time and labor. Gene list Sequence variants No report found.
  19 in total

1.  A model for messenger RNA sequences maximizing secondary structure due to code degeneracy.

Authors:  D Klämbt
Journal:  J Theor Biol       Date:  1975-07       Impact factor: 2.691

Review 2.  Early nonsense: mRNA decay solves a translational problem.

Authors:  Nadia Amrani; Matthew S Sachs; Allan Jacobson
Journal:  Nat Rev Mol Cell Biol       Date:  2006-06       Impact factor: 94.444

3.  A map of human genome sequence variation containing 1.42 million single nucleotide polymorphisms.

Authors:  R Sachidanandam; D Weissman; S C Schmidt; J M Kakol; L D Stein; G Marth; S Sherry; J C Mullikin; B J Mortimore; D L Willey; S E Hunt; C G Cole; P C Coggill; C M Rice; Z Ning; J Rogers; D R Bentley; P Y Kwok; E R Mardis; R T Yeh; B Schultz; L Cook; R Davenport; M Dante; L Fulton; L Hillier; R H Waterston; J D McPherson; B Gilman; S Schaffner; W J Van Etten; D Reich; J Higgins; M J Daly; B Blumenstiel; J Baldwin; N Stange-Thomann; M C Zody; L Linton; E S Lander; D Altshuler
Journal:  Nature       Date:  2001-02-15       Impact factor: 49.962

4.  Instabilotyping: comprehensive identification of frameshift mutations caused by coding region microsatellite instability.

Authors:  Y Mori; J Yin; A Rashid; B A Leggett; J Young; L Simms; P M Kuehl; P Langenberg; S J Meltzer; O C Stine
Journal:  Cancer Res       Date:  2001-08-15       Impact factor: 12.701

5.  The consensus coding sequences of human breast and colorectal cancers.

Authors:  Tobias Sjöblom; Siân Jones; Laura D Wood; D Williams Parsons; Jimmy Lin; Thomas D Barber; Diana Mandelker; Rebecca J Leary; Janine Ptak; Natalie Silliman; Steve Szabo; Phillip Buckhaults; Christopher Farrell; Paul Meeh; Sanford D Markowitz; Joseph Willis; Dawn Dawson; James K V Willson; Adi F Gazdar; James Hartigan; Leo Wu; Changsheng Liu; Giovanni Parmigiani; Ben Ho Park; Kurtis E Bachman; Nickolas Papadopoulos; Bert Vogelstein; Kenneth W Kinzler; Victor E Velculescu
Journal:  Science       Date:  2006-09-07       Impact factor: 47.728

6.  Hypermethylation of the p14(ARF) gene in ulcerative colitis-associated colorectal carcinogenesis.

Authors:  Fumiaki Sato; Noam Harpaz; David Shibata; Yan Xu; Jing Yin; Yuriko Mori; Tong-Tong Zou; Suna Wang; Kena Desai; Anatoly Leytin; Florin M Selaru; John M Abraham; Stephen J Meltzer
Journal:  Cancer Res       Date:  2002-02-15       Impact factor: 12.701

Review 7.  A National Cancer Institute Workshop on Microsatellite Instability for cancer detection and familial predisposition: development of international criteria for the determination of microsatellite instability in colorectal cancer.

Authors:  C R Boland; S N Thibodeau; S R Hamilton; D Sidransky; J R Eshleman; R W Burt; S J Meltzer; M A Rodriguez-Bigas; R Fodde; G N Ranzani; S Srivastava
Journal:  Cancer Res       Date:  1998-11-15       Impact factor: 12.701

8.  ATP synthase of yeast mitochondria. Isolation of the F1 delta subunit, sequence and disruption of the structural gene.

Authors:  M F Giraud; J Velours
Journal:  Eur J Biochem       Date:  1994-06-15

9.  Mammalian mutation pressure, synonymous codon choice, and mRNA degradation.

Authors:  Jubao Duan; Marcos A Antezana
Journal:  J Mol Evol       Date:  2003-12       Impact factor: 2.395

10.  Mutational analysis of the tyrosine phosphatome in colorectal cancers.

Authors:  Zhenghe Wang; Dong Shen; D Williams Parsons; Alberto Bardelli; Jason Sager; Steve Szabo; Janine Ptak; Natalie Silliman; Brock A Peters; Michiel S van der Heijden; Giovanni Parmigiani; Hai Yan; Tian-Li Wang; Greg Riggins; Steven M Powell; James K V Willson; Sanford Markowitz; Kenneth W Kinzler; Bert Vogelstein; Victor E Velculescu
Journal:  Science       Date:  2004-05-21       Impact factor: 47.728

View more

北京卡尤迪生物科技股份有限公司 © 2022-2023.