| Literature DB >> 20386695 |
Manasa Ramakrishna1, Louise H Williams, Samantha E Boyle, Jennifer L Bearfoot, Anita Sridhar, Terence P Speed, Kylie L Gorringe, Ian G Campbell.
Abstract
Ovarian cancer is a disease characterised by complex genomic rearrangements but the majority of the genes that are the target of these alterations remain unidentified. Cataloguing these target genes will provide useful insights into the disease etiology and may provide an opportunity to develop novel diagnostic and therapeutic interventions. High resolution genome wide copy number and matching expression data from 68 primary epithelial ovarian carcinomas of various histotypes was integrated to identify genes in regions of most frequent amplification with the strongest correlation with expression and copy number. Regions on chromosomes 3, 7, 8, and 20 were most frequently increased in copy number (> 40% of samples). Within these regions, 703/1370 (51%) unique gene expression probesets were differentially expressed when samples with gain were compared to samples without gain. 30% of these differentially expressed probesets also showed a strong positive correlation (r > or =0.6) between expression and copy number. We also identified 21 regions of high amplitude copy number gain, in which 32 known protein coding genes showed a strong positive correlation between expression and copy number. Overall, our data validates previously known ovarian cancer genes, such as ERBB2, and also identified novel potential drivers such as MYNN, PUF60 and TPX2.Entities:
Mesh:
Substances:
Year: 2010 PMID: 20386695 PMCID: PMC2851616 DOI: 10.1371/journal.pone.0009983
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Summary of samples analysed by SNP and expression array.
| Grade | FIGO Stage | |||||||
| Subtype | 1 | 2 | 3 | NK | 1 | 2 | 3 | NK |
| Clear Cell (9) | 2 | 2 | 3 | 2 | 3 | 1 | 2 | 3 |
| Endometrioid (14) | 2 | 4 | 8 | 0 | 8 | 2 | 4 | 0 |
| Mucinous (7) | 5 | 2 | 0 | 0 | 4 | 0 | 2 | 1 |
| Serous (37) | 3 | 11 | 20 | 3 | 3 | 10 | 17 | 7 |
| Undifferentiated (1) | 1 | 0 | 0 | 0 | 0 | 1 | 0 | 0 |
NK, grade or stage not known. Information for 68 tumours that had both high quality expression and copy number data is listed here. Four more samples that were used in the copy number analyses alone are detailed in Table S1.
Figure 1Overview of genomic aberrations in the ovarian cancer dataset (N = 72).
Frequency of occurrence of genomic gains (yellow) and losses (blue) across the genome, depicted in chromosome order from 1p to Xq.
Figure 2Detailed view of chromosomes showing frequent gains.
Frequent gains occur on chromosomes 3, 7, 8 and 20, with each point indicating the frequency of gain of a CN segment. The red line in all panels indicates the 40% frequency threshold.
Genes with increased expression on chromosomes 3 and 7.
| Region ID | Chr | Start | End | Samples “G” | Samples “N” | DE Probesets (%) | Most significant DE Genes |
| 3_1 | 3 | 157.223 | 157.972 | 30 | 37 | 3 (60) | SSR3; TIPARP; KCNAB1 |
| 3_2 | 3 | 158.260 | 159.895 | 31 | 36 | 8 (62) | MLF1; GFM1; RSRC1; CCNL1; PTX3; VEPH1; LXN; SHOX2 |
| 3_3 | 3 | 159.895 | 159.959 | 30 | 37 | 2 (100) | RARRES1 |
| 3_4 | 3 | 159.959 | 161.006 | 32 | 35 | 2 (50) | MFSD1; SCHIP1 |
| 3_5 | 3 | 161.006 | 161.392 | 30 | 37 | 3 (75) | SCHIP1;IL12A |
| 3_7 | 3 | 161.392 | 168.660 | 33 | 35 | 8 (24) | KPNA4; SMC4; B3GALNT1; NMD3; TRIM59; hCG_16001; IFT80 |
| 3_8 | 3 | 168.697 | 168.916 | 37 | 31 | 1 (50) | PDCD10 |
| 3_9 | 3 | 168.916 | 169.209 | 38 | 30 | 2 (100) | PDCD10; SERPINI1 |
| 3_10 | 3 | 169.209 | 172.478 | 41 | 27 | 12 (40) | MYNN; PHC3; SKIL; MDS1; ARPM1;TLOC1; PRKCI; EVI1; EIF5A2; SLC7A14 |
| 3_12 | 3 | 172.586 | 177.095 | 39 | 29 | 2 (8) | ECT2; AADACL1 |
| 3_14 | 3 | 177.366 | 180.518 | 39 | 29 | 4 (27) | TBL1XR1; PIK3CA |
| 3_15 | 3 | 180.518 | 180.608 | 35 | 33 | 3 (100) | ZNF639; MFN1;GNB4 |
| 3_17 | 3 | 180.608 | 181.970 | 36 | 32 | 6 (43) | ACTL6A; MRPL47; NDUFB5; GNB4; LOC442098; TTC14 |
| 3_18 | 3 | 181.971 | 184.153 | 34 | 34 | 4 (57) | FXR1; DNAJC19; DCUN1D1; ATP11B |
| 3_19 | 3 | 184.153 | 184.291 | 35 | 33 | 2 (100) | DCUN1D1;MCCC1 |
| 3_20 | 3 | 184.291 | 185.996 | 34 | 34 | 18 (50) | ABCF3 |
| 3_22 | 3 | 186.007 | 187.399 | 36 | 32 | 5 (31) | SENP2; TMEM41A |
| 3_24 | 3 | 187.519 | 189.379 | 35 | 33 | 10 (42) | RFC4 |
| 3_25 | 3 | 189.379 | 189.430 | 38 | 30 | 2 (100) | LPP; FLJ42393 |
| 3_27 | 3 | 193.766 | 193.936 | 35 | 33 | 1 (50) | FGF12 |
| 3_28 | 3 | 193.936 | 199.337 | 31 | 37 | 36 (46) | WDR53 |
| 7_1 | 7 | 141.416 | 141.431 | 34 | 33 | 1 (100) | MGAM |
1. Start and end position of regions in Mbp based on hg18 (March 2006 release).
2. Number of samples with copy number gains (G) and normal copy number (N) used in the expression analysis. Samples with copy number loss were not included.
3. Number of differentially expressed (DE) probesets by expression microarray. Regions with no DE named genes are not shown here but are listed in Table S4.
4. Only coding genes with a symbol (from Affymetrix array annotation) are listed here and hence can differ from the number quoted in brackets. For a full list see Table S5. Genes for each region are listed in decreasing order of significance, with only the top 15 most significant listed.
*indicates genes with a correlation coefficient of >0.7.
Genes with increased expression on chromosome 8.
| Region ID | Chr | Start | End | Samples “G” | Samples “N” | DE Probesets (%) | Most significant DE Genes |
| 8_1 | 8 | 53.390 | 55.545 | 29 | 39 | 11 (73) | ATP6V1H*; MRPL15; TCEA1; LYPLA1; RB1CC1; RGS20; NPBWR1; SOX17; UNQ9433 |
| 8_5 | 8 | 60.387 | 61.696 | 35 | 32 | 6 (60) | XKR4; TGS1*; TMEM68*; RP1 |
| 8_7 | 8 | 61.696 | 61.817 | 35 | 32 | 2 (100) | RAB2A; CHD7 |
| 8_13 | 8 | 62.549 | 65.928 | 33 | 33 | 3 (21) | RLBP1L1*; YTHDF3 |
| 8_15 | 8 | 66.237 | 68.051 | 33 | 33 | 14 (67) | ARMC1; VCPIP1; C8orf44; RRS1; SGK3; C8orf45; MYBL1; ADHFE1; MTFR1; C8orf46 |
| 8_16 | 8 | 68.051 | 68.292 | 29 | 36 | 6 (100) | COPS5; ARFGEF1; CSPP1 |
| 8_19 | 8 | 70.815 | 73.993 | 32 | 35 | 9 (41) | KCNB2; NCOA2; TRPA1; TRAM1; MSC |
| 8_21 | 8 | 74.016 | 78.270 | 34 | 33 | 8 (32) | TMEM70; STAU2; PXMP3; TERF1; UBE2W; TCEB1 |
| 8_25 | 8 | 80.419 | 84.683 | 36 | 31 | 10 (40) | CHMP4C; ZNF704; ZBTB10; SNX16; ZFAND1 |
| 8_27 | 8 | 85.122 | 87.055 | 34 | 33 | 16 (67) | C8orf59; REXO1L2P; REXO1L1; E2F5 |
| 8_30 | 8 | 87.250 | 89.422 | 34 | 33 | 5 (50) | WWP1; FAM82B; CPNE3; WDR21C; CNGB3 |
| 8_31 | 8 | 89.426 | 93.278 | 37 | 29 | 7 (35) | OTUD6B; NBN; TMEM55A; SLC26A7; RUNX1T1; TMEM64 |
| 8_33 | 8 | 93.587 | 98.637 | 38 | 28 | 17 (53) | UQCRB; TP53INP1; C8orf38; MTERFD1; PLEKHF2; PTDSS1; KIAA1429; RBM35A; INTS8; TSPYL5 |
| 8_34 | 8 | 98.637 | 99.159 | 35 | 32 | 7 (88) | MTDH; LAPTM4B; MATN2; RPL30 |
| 8_36 | 8 | 99.159 | 100.102 | 37 | 30 | 9 (82) | POP1*; NPAL2; STK3; VPS13B; HRSP12; OSR2; KCNS2 |
| 8_37 | 8 | 100.112 | 101.579 | 37 | 30 | 6 (55) | COX6C; RNF19A; POLR2K; VPS13B; FBXO43 |
| 8_38 | 8 | 101.579 | 101.675 | 36 | 31 | 2 (100) | ANKRD46; MGC39715 |
| 8_39 | 8 | 101.675 | 105.906 | 39 | 28 | 25 (50) | YWHAZ; WDSOF1; FLJ45248; ATP6V1C1; ZNF706; UBR5; FZD6; PABPC1; AZIN1; MGC39715 |
| 8_43 | 8 | 107.681 | 110.578 | 39 | 28 | 4 (31) | ENY2; TTC35; NUDCD1; OXR1 |
| 8_45 | 8 | 110.578 | 110.700 | 38 | 29 | 2 (67) | EBAG9; GOLSYN |
| 8_52 | 8 | 113.663 | 117.487 | 40 | 27 | 1 (25) | TRPS1 |
| 8_54 | 8 | 117.713 | 119.186 | 42 | 25 | 8 (89) | RAD21; C8orf53; MED30; EXT1; EIF3H; SLC30A8 |
| 8_56 | 8 | 119.298 | 121.983 | 44 | 23 | 6 (35) | MTBP;DCC1; TAF2; MRPL13; SAMD12; MAL2 |
| 8_59 | 8 | 122.661 | 122.935 | 40 | 27 | 1 (100) | HAS2 |
| 8_60 | 8 | 122.935 | 127.209 | 44 | 23 | 24 (60) | C8orf76*; RNF139; DERL1; ATAD2; TRMT12; NDUFB9; ZNF572; TMEM65; C8orf32; SQLE |
| 8_63 | 8 | 127.320 | 129.639 | 46 | 21 | 2 (25) | FAM84B |
| 8_66 | 8 | 129.735 | 131.499 | 47 | 20 | 4 (50) | FAM49B; MLZE; DDEF1 |
| 8_69 | 8 | 131.596 | 135.232 | 44 | 23 | 9 (45) | TG; OC90; KCNQ3; NDRG1; KIAA0143; PHF20L1; WISP1; SLA |
| 8_72 | 8 | 135.435 | 136.466 | 43 | 24 | 2 (67) | ZFAT1* |
| 8_76 | 8 | 137.616 | 139.944 | 42 | 25 | 2 (100) | COL22A1; FAM135B |
| 8_78 | 8 | 140.056 | 146.269 | 43 | 24 | 89 (74) | ZC3H3*; PUF60; GPR172A; CYHR1; SCRIB; HSF1 ; ZNF7*; MAF1; SHARPIN; BOP1 |
1.-4. Please see legend to Table 2, except that only the top 10 genes are listed and genes present in more than one region are only shown in one of these.
Genes with increased expression on chromosome 20.
| Region ID | Chr | Start | End | Samples “G” | Samples “N” | DE Probesets (%) | Most significant DE Genes |
| 20_1 | 20 | 29.299 | 31.465 | 34 | 34 | 34 (62) | POFUT1; PDRG1; PLAGL2 ;ASXL1; TM9SF4; TPX2; CDK5RAP1; MAPRE1; COMMD7; KIF3B; C20orf112; RP11-49G10.8; DEFB118; DUSP15; DNMT3B |
| 20_2 | 20 | 31.466 | 31.648 | 29 | 39 | 2 (100) | CBFA2T2; SNTA1 |
| 20_4 | 20 | 31.649 | 33.694 | 31 | 37 | 34 (74) | PIGU; DYNLRB1; GGTL3; RBM12; RALY; NCOA6*;CEP250*; APBA2BP; TRPC4AP; EIF6; EDEM2; GSS; UQCC; PXMP4; EIF2S2 |
| 20_5 | 20 | 33.696 | 33.760 | 36 | 32 | 5 (100) | RBM12; NFS1; RBM39; C20orf52 |
| 20_6 | 20 | 33.958 | 37.049 | 29 | 39 | 38 (64) | CTNNBL1*; LOC388796; KIAA0406P; DHX35*; C20orf77; ACTR5; MANBAL; FAM83D; DSN1; RBL1; C20orf198; RPN2; SCAND1; C20orf117; C20orf24 |
| 20_7 | 20 | 37.107 | 41.095 | 29 | 37 | 11 (65) | PLCG1; CHD6; LPIN3; TOP1; PTPRT; LOC149692; ZHX3; EMILIN3; MAFB |
| 20_8 | 20 | 41.095 | 41.113 | 30 | 36 | 1 (100) | PTPRT |
| 20_9 | 20 | 41.124 | 41.226 | 28 | 38 | 2 (100) | PTPRT |
| 20_10 | 20 | 42.962 | 45.772 | 30 | 37 | 45 (62) | PIGT; UBE2C; ZSWIM1; TOMM34; DNTTIP1*; NCOA5; SLC35C2; ACOT8; NEURL2; KCNS1; C20orf67; SNX21; ELMO2; ZMYND8; TP53RK; |
| 20_12 | 20 | 45.850 | 49.180 | 29 | 38 | 23 (61) | TMEM189*; MOCS3*; DPM1; STAU1; DDX27; CSE1L; ARFGEF2; ADNP; SPATA2*; PTPN1; SLC9A8*; C20orf199; PARD6B; ZNF313; KCNG1 |
| 20_14 | 20 | 49.222 | 54.379 | 30 | 37 | 10 (42) | ZFP64*; AURKA; PFDN4; ATP9A; MC3R; TSHZ2; SUMO1P1 |
| 20_15 | 20 | 54.379 | 54.417 | 27 | 41 | 2 (100) | CSTF1; AURKA |
| 20_16 | 20 | 54.417 | 55.828 | 31 | 36 | 8 (32) | C20orf43; RAE1; BMP7; RBM38; GCNT7 |
| 20_18 | 20 | 55.991 | 57.887 | 33 | 34 | 12 (57) | VAPB; TUBB1; RAB22A; TH1L; SLMO2; STX16; ATP5E; GNAS; SYCP2; PPP4R1L; NPEPL1 |
| 20_20 | 20 | 57.901 | 62.427 | 33 | 34 | 38 (45) | LSM14B*; YTHDF1; SS18L1; DIDO1; GTPBP5; PSMA7; TAF4; C20orf11; C20orf20; TCFL5; C20orf177; MYT1; PCMTD2; DNAJC5; TPD52L2; |
1.-4. Please see legend to Table 2.
Highly amplified genes.
| Chr | Start (Mb) | End (Mb) | Length (bp) | No. samples | Genes |
| 3 | 170.040 | 170.248 | 208141 | 6 | None |
| 3 | 178.305 | 178.589 | 283690 | 5 | TBL1XR1 |
| 3 | 180.121 | 180.410 | 288435 | 5 |
|
| 8 | 55.208 | 55.528 | 319922 | 5 | MRPL15* |
| 8 | 62.495 | 63.491 | 995369 | 6 | RLBP1L1*;NKAIN3;ASPH |
| 8 | 102.003 | 102.062 | 58823 | 5 | YWHAZ |
| 8 | 123.144 | 123.746 | 601615 | 6 | None |
| 8 | 123.856 | 124.369 | 513120 | 6 | DERL1; ZHX2; WDR67*; ZHX1*; C8orf76*; FAM83A |
| 8 | 124.369 | 125.825 | 1455953 | 6 | ATAD2; C8orf32*; FBXO32; ANXA13; KLHL38; FAM91A1; FER1L6; MTSS1; NDUFB9; RNF139; TATDN1; TMEM65; TRMT12* |
| 8 | 125.828 | 127.764 | 1936500 | 7 | KIAA0196; NSMCE2; SQLE; ZNF572*; TRIB1; FAM84B* |
| 8 | 127.764 | 128.973 | 1208920 | 7 |
|
| 8 | 128.973 | 130.166 | 1193146 | 8 | PVT1†, TMEM75* |
| 8 | 130.166 | 138.988 | 8821634 | 7 | ADCY8; DDEF1; EFR3A; FAM49B; KCNQ3; MLZE; OC90; LRRC6; NDRG1; PHF20L1; SLA; TG*; TMEM71; WISP1; ST3GAL1; ZFAT*; KHDRBS3; CCDC26 |
| 8 | 138.988 | 144.000 | 5382420 | 7 | FAM135B; COL22A1; KCNK9; NIBP*; CHRAC1*; EIF2C2; PTK2; DENND3*; SLC45A4; FLJ43860; GPR20; PTP4A3; ARC; BAI1; C8orf55; CYP11B1; CYP11B2; GML; JRK; LY6D; LY6K; LYNX1; LYPD2; PSCA; SLURP1; TSNARE1 |
| 17 | 35.104 | 35.105 | 529 | 5 |
|
| 19 | 34.125 | 34.639 | 513414 | 5 | UQCRFS1* |
| 19 | 34.639 | 35.610 | 971542 | 6 | C19orf12*; PLEKHF1; POP4*; |
| 19 | 35.968 | 36.703 | 734619 | 6 | TSHZ3* |
| 19 | 37.459 | 38.011 | 552023 | 5 | ANKRD27*; PDCD5*; RGS9BP; ECAT8; DPY19L3*; ZNF507* |
| 19 | 38.372 | 39.140 | 767924 | 5 | CEBPA; LRP3; SLC7A10; CHST8; KCTD15; CEBPG*; PEPD*; FLJ12355 |
| 20 | 29.427 | 29.849 | 421241 | 5 | BCL2L1; COX4I2; DEFB119; DEFB121; DEFB123; DEFB124; HM13; ID1; REM1; TPX2 |
1. Derived from Refseq annotation (September 2009). Genes in italics are known oncogenes (based on Cancer Gene Census [38]), *Genes that show a strong (r>0.6) positive correlation of copy number with expression, †Not on expression microarray. Note that some regions encompass multiple smaller amplicons, only genes within regions (+/−10 kb) defined by >5 samples are shown.
Candidate oncogenes and current literature.
| Gene | Chr | Start | End | Total gain (%) | Comments | Other genes in region |
| PDCD10 | 3 | 168.884 | 168.935 | 43 | Angiogenesis disorder | |
| PRKCI | 3 | 171.423 | 171.506 | 51 | Oncogene in ovarian and other cancers | SKIL, PHC3, MYNN |
| ECT2 | 3 | 173.955 | 174.022 | 50 | Cytokinesis | |
| TBL1XR1* | 3 | 178.221 | 178.398 | 50 | Oncogene in breast cancer | |
| PIK3CA* | 3 | 180.349 | 180.435 | 50 | Known oncogene | MRPL47, NDUFB5 |
| SENP2 | 3 | 186.787 | 186.832 | 51 | SUMO1 deconjugating peptidase. Possible role in degradation of beta-catenin | TMEM41A |
| MRPL15* | 8 | 55.210 | 55.224 | 42 | Mitochondrial ribosomal protein | |
| RLBP1L1* | 8 | 62.363 | 62.577 | 46 | Clavesin 1 (CLVS1), regulates endosome morphology | |
| YWHAZ* | 8 | 102.000 | 102.035 | 53 | 14-3-3 isoform zeta, oncogenic functions in inhibiting apoptosis and adhesion | |
| DERL1* | 8 | 124.095 | 124.124 | 60 | Endoplasmic reticulum protein | WDR67*, C8orf76* |
| ATAD2* | 8 | 124.401 | 124.478 | 60 | ATPase. E2F target, binds MYC, expression correlates with poor outcome in breast cancer | WDYHV1/C8ORF32*, FBXO32*, FAM91A1* |
| RNF139* | 8 | 125.556 | 125.570 | 60 | Translocation causes hereditary renal cancer. Interacts with VHL | NDUFB9*, TRMT12*, TMEM65*, SQLE* |
| FAM84B* | 8 | 127.634 | 127.640 | 61 | – | |
| FAM49B* | 8 | 130.923 | 131.021 | 61 | – | |
| NDRG1* | 8 | 134.319 | 134.379 | 60 | Diverse role in stress response including hypoxia | |
| ZFAT* | 8 | 135.559 | 135.794 | 60 | Zinc finger and AT hook protein, anti-apoptotic role | |
| PTK2* | 8 | 141.738 | 142.081 | 60 | Focal adhesion kinase. Involved in signal transduction for proliferation | CHRAC1*, NIBP/TRAPPC9*, SLC45A4* |
| PTP4A3* | 8 | 142.501 | 142.511 | 60 | Protein tyrosine phosphatase. Increases proliferation and metastasis | JRK*, TSTA3, ZC3H3, LY6E |
| PUF60 | 8 | 144.971 | 144.984 | 60 | mRNA splicing factor | CYC1, ZNF623, ZNF7, CYHR1 |
| ERBB2* | 17 | 35.098 | 35.138 | Known oncogene in breast cancer | ||
| TPX2* | 20 | 29.791 | 29.853 | 42 | Activator of Aurora-A and involved in spindle assembly | |
| UBE2C | 20 | 43.875 | 43.879 | 42 | Ubiquitin-conjugating enzyme E2C, degradation of mitotic cyclins and cell cycle progression | PIGT |
| ZFP64 | 20 | 50.134 | 50.242 | 43 | Zinc finger protein, Notch signalling | |
| AURKA | 20 | 54.378 | 54.401 | 43 | Aurora kinase, cell cycle regulation, chromosome segregation, microtubule/spindle function | CSTF1, RAE1, C20orf43 |
| SS18L1 | 20 | 60.152 | 60.191 | 46 | Synovial sarcoma translocation fusion gene | GTPBP5, LSM14B, TAF4 |
Genes were selected as follows: Gain in >40% and differentially expressed, with fold change expression in gain vs. neutral of >0.7 or correlation coefficient (r) of >0.7. Of these genes (n = 121), the most frequently gained in each chromosome were selected: Chr 3 n≥50, Chr8 n≥60, Chr 20 n≥42. 2. High level amplification in at least 5 samples (*), and differentially expressed, with fold change expression in gain vs. neutral of >0.6 or correlation coefficient (r) of >0.6. Chr19 genes (n = 12) are not shown here.
Figure 3Correlation between copy number and expression for a frequently gained region on cytoband 3q26.2.
A. Frequency of copy number gain on chromosome 3 from p-ter at left to q-ter at right as indicated by the ideogram. B. Genes on Chr3: 169.209–172.478 Mbp, a region gained in 60% (41/68) of all samples, including genes previously associated with ovarian cancer (PRKCI, MECOM or MDS1/EVI1) and potentially novel oncogenes (MYNN). C. A volcano plot presenting the results of expression analyses between amplified and unamplified samples in this region. The genes in the top right corner are significantly overexpressed in samples with copy number gain (p<0.05; above the red line at –logP 4.32) compared to samples without copy number change (selected genes are labelled). For full list of differentially expressed genes see Table S5. D. Plot comparing copy number and expression in all samples for the gene MYNN that showed the highest correlation (r = 0.74, Pearson's test) between copy number and expression for this region on 3q26.2.