J Tang1, Y Li1, K Lyon1, J Camps2, S Dalton1, T Ried2, S Zhao1. 1. Department of Biochemistry and Molecular Biology, Institute of Bioinformatics, University of Georgia, Athens, GA, USA. 2. Genetics Branch, Center for Cancer Research, National Cancer Institute/NIH, Bethesda, MD, USA.
Abstract
Herein we report a proof-of-principle study illustrating a novel dog-human comparison strategy that addresses a central aim of cancer research, namely cancer driver-passenger distinction. We previously demonstrated that sporadic canine colorectal cancers (CRCs) share similar molecular pathogenesis mechanisms as their human counterparts. In this study, we compared the genome-wide copy number abnormalities between 29 human and 10 canine sporadic CRCs. This led to the identification of 73 driver candidate genes (DCGs), altered in both species, and with 27 from the whole genome and 46 from dog-human genomic rearrangement breakpoint (GRB) regions, as well as 38 passenger candidate genes (PCGs), altered in humans only and located in GRB regions. We noted that DCGs significantly differ from PCGs in every analysis conducted to assess their cancer relevance and biological functions. Importantly, although PCGs are not enriched in any specific functions, DCGs possess significantly enhanced functionality closely associated with cell proliferation and death regulation, as well as with epithelial cell apicobasal polarity establishment/maintenance. These observations support the notion that, in sporadic CRCs of both species, cell polarity genes not only contribute in preventing cancer cell invasion and spreading, but also likely serve as tumor suppressors by modulating cell growth. This pilot study validates our novel strategy and has uncovered four new potential cell polarity and colorectal tumor suppressor genes (RASA3, NUPL1, DENND5A and AVL9). Expansion of this study would make more driver-passenger distinctions for cancers with large genomic amplifications or deletions, and address key questions regarding the relationship between cancer pathogenesis and epithelial cell polarity control in mammals.
Herein we report a proof-of-principle study illustrating a novel dog-human comparison strategy that addresses a central aim of cancer research, namely cancer driver-passenger distinction. We previously demonstrated that sporadic caninecolorectal cancers (CRCs) share similar molecular pathogenesis mechanisms as their human counterparts. In this study, we compared the genome-wide copy number abnormalities between 29 human and 10 canine sporadic CRCs. This led to the identification of 73 driver candidate genes (DCGs), altered in both species, and with 27 from the whole genome and 46 from dog-human genomic rearrangement breakpoint (GRB) regions, as well as 38 passenger candidate genes (PCGs), altered in humans only and located in GRB regions. We noted that DCGs significantly differ from PCGs in every analysis conducted to assess their cancer relevance and biological functions. Importantly, although PCGs are not enriched in any specific functions, DCGs possess significantly enhanced functionality closely associated with cell proliferation and death regulation, as well as with epithelial cell apicobasal polarity establishment/maintenance. These observations support the notion that, in sporadic CRCs of both species, cell polarity genes not only contribute in preventing cancer cell invasion and spreading, but also likely serve as tumor suppressors by modulating cell growth. This pilot study validates our novel strategy and has uncovered four new potential cell polarity and colorectal tumor suppressor genes (RASA3, NUPL1, DENND5A and AVL9). Expansion of this study would make more driver-passenger distinctions for cancers with large genomic amplifications or deletions, and address key questions regarding the relationship between cancer pathogenesis and epithelial cell polarity control in mammals.
A central aim of cancer research has been to identify a handful of cancer-causative alterations (drivers) from hundreds to thousands of abnormal changes found in a cancer genome (1, 2). Discoveries of such drivers in the past have contributed to the understanding of cancer etiology and yielded prognostic markers and therapeutic intervention targets such as BCR-ABL, leading to the development of the spectacularly successful anti-leukemia drug imatinib (Gleevec) (3). As high-throughput technologies such as next-generation sequencing and high density array become routinely available, hundreds of thousands of cancer genomes have been or will be characterized and millions of cancer-related somatic mutations will be uncovered (1, 2, 4, 5). Hence, the need to efficiently distinguish drivers from passengers (incidental changes or changes occurring as a consequence of cancer) becomes increasingly pressing and has become one of the National Cancer Institute (NCI)'s Provocative Questions that need to be urgently addressed (provocativequestions.nci.nih.gov).Drivers are typically more recurrent than passengers, and researchers have been using strategies such as increasing sample size and studying early stage cancers with advanced technologies and data analysis tools for their differentiation (1, 2, 4-6). Differing from these traditional approaches that study humancancers only, we have developed a novel human-dog comparative genomics and oncology strategy for cancer driver-passenger distinction for genes amplified or deleted in humancancers, rationalized as follows. The dog has become an increasingly important model for studying human physiology and diseases (7, 8), and sporadic caninecancer represents one of the best cancer models (9). First, these cancers are naturally-occurring and heterogeneous, unlike most genetically-modified or xenograft rodent models, and significantly, dogs share the same environment as humans and hence are exposed to the same carcinogens. Furthermore, the dog genome has been sequenced to a >7X coverage, and a relatively accurate version of its genome sequence assembly is available (10), unlike another companion animal, the cat, which was sequenced only to a 2.8X coverage (11). Critically, the dog genome is rearranged when compared to the human genome with over 300 inversions and translocations identified (10, 12), resulting in thousands of genes that are clustered in the human genome to be dispersedly located in the dog genome. We utilize these orthologous genes’ different genomic locations between the two species for driver-passenger distinction. Our hypothesis is that provided the two species share similar molecular pathways of pathogenesis for the cancer of interest, alterations that are common to both will be deemed drivers, whereas those that are found in only one species and are located in the dog-human genomic rearrangement sites will be considered as passengers.To test this hypothesis, we conducted proof of principle studies on CRC, one of the best understood cancers for studying cancer initiation and progression (5, 13, 14). Critically, we have previously demonstrated that sporadic canine CRCs likely follow similar cancer development and progression pathways as their human counterparts (15, 16), providing the molecular justification for applying this novel cross-species comparison strategy on CRC. In this study, we compared copy number abnormality (CNAs) between human and dog sporadic CRCs (15, 17) and identified 73 driver candidate genes (DCGs) and 38 passenger candidate genes (PCGs). We noted that DCGs significantly differ from PCGs in every analysis conducted to assess their cancer relevance and biological functions. Importantly, while PCGs’ functions appear random, DCGs are significantly enriched in functions closely associated with cell proliferation/death regulation, as well as with epithelial cell apicobasal polarity establishment/maintenance. These observations emphasize the importance of cell polarity genes in CRC pathogenesis, consistent with published studies elegantly illustrating that in a Drosophila genetic model with the oncogenic Ras mutation, loss of cell polarity both accelerates the growth and drives the invasion of Ras-induced benign tumors through JNK activation (18). In summary, this pilot study demonstrates that the proposed dog-human comparison strategy for cancer driver-passenger discrimination is valid. It also provides genome-wide mammalian data supporting the notion that alterations of cell polarity genes, hence loss of epithelial cell polarity, are among major drivers of CRC carcinogenesis, a critical but controversial subject in the field (19, 20).
RESULTS
CRC driver-passenger distinction via human-dog comparison
Genome-wide comparisons – 1stDCG identification
We first compared the genome-wide CNAs between 29 human sporadic CRCs (17) and 10 canine sporadic CRCs (15), with the vast majority being chromosome instability (CIN) tumors and harboring large genomic amplifications and deletions. This analysis led to the discovery of 27 genes, with each having at least one of its exons significantly altered in both species (Fig. 1), autosome-wide [sex chromosomes were excluded because the canine Y chromosome is not yet sequenced (10) and the extensive sequence duplication between X and Y chromosomes (21) complicates the analysis]. These 27 genes, which include the best known colorectal tumor suppressor APC (13) (Supplementary Table s1), are referred to as the first DCGs or 1stDCGs hereafter.
Fig. 1
CRC driver-passenger distinction via human-dog comparison
The top portion illustrates how 1stDCGs (27 total), shown in purple and with at least one of their exons significantly amplified/deleted in both species, were identified autosome-wide (see the main text and Supplementary Table s1 for details). The middle section indicates how 2ndDCGs (46 total), shown in green, and PCGs (38 total), shown in red, were identified from the human-dog GRB sites in the human genome (see the main text for criteria for DCG-PCG distinction; Supplementary Tables s1-s3 and Fig. s1 for details). The bottom area shows examples of DCG-PCG pairs identified from two GRB sites in the human genome that harbor a human-dog translocation breakpoint (left) or a human-dog inversion breakpoint (right). Genes in each example are clustered in the human genome, but are either located on different chromosomes (right), or far apart (0.2 Mb for the human versus 16.6Mb for the dog) if on the same chromosome (left), in the dog genome. In addition, genes in each example are all amplified in the human tumors; however, in the dog tumors, only DCGs [GADD45A and SLC2A1, both of which are already known cancer driver genes (22, 23)] are amplified, whereas PCGs (genes shown in red, none of which has been reported in the literature to be cancer driver genes) remain neither amplified nor deleted.
We then focused on the human genomic sites that harbor the human-dog GRB (a total of 324 GRBs were detected for the autosomes; see Supplementary Table s2) to identify potential driver-passenger pairs (Fig. 1), based on the following criteria. First, the greater GRB region, which includes the GRB itself and the two immediate flanking regions with each containing at least one gene, is altered in at least one humantumor. Second, at least one canine orthologous gene at one side of the GRB is altered in at least one dogtumor, while none of the canine orthologous genes at the other side of the GRB is disrupted in any of the dogtumors (Fig. 1). Once these two conditions were met, genes changed in both species were deemed drivers and those disrupted only in humantumors were considered as passengers (Fig. 1). This strategy allowed the identification of additional 46 DCGs (referred to as 2ndDCGs hereafter) and 38 PCGs from 25 GRB sites, with most genes in both groups recurrently altered in the human CRCs (Supplementary Tables s1 & s3).Combined the two approaches described above, a total of 73 DCGs (27 1stDCG and 46 2ndDCG) and 38 PCGs were identified. We then conducted both bioinformatic and experimental analyses to evaluate their cancer relevance and the biological functions, as described below.
DCGs’ cancer relevance is strong, but PCGs’ is not
Known cancer drivers found among DCGs but not among PCGs
Besides APC, both 1stDCGs and 2ndDCGs contain known cancer genes with many already being targeted for therapeutic intervention, including the DNA damage-inducible cell-cycle regulator GADD45A (22), the constitutive glucose transporter SLC2A1 (23), the signaling molecule A3 adenosine receptor ADORA3 (24), the chromatin modifier EZH2 (6), the receptor tyrosine phosphatase PTPRD (25), and the cancer biomarker MUC16 (26). The PCGs, however, do not contain any such genes.
DCGs significantly differ from PCGs in cancer relevance examined
To assess the cancer relevance of our candidate genes, we determined their presence/absence in well-known cancer databases including Cancer Gene Census (a catalog of genes of which mutations are causally implicated in cancer) (6), KEGG cancer pathways, COSMIC (a catalog of somatic mutations of cancers) (27), and Cancer Gene Index (a database of associations between genes and cancers developed at the NCI). Then, we searched the Mouse Genome Informatics database for phenotypic changes in existing gene-knockout mouse models, including tumorigenesis and other significant alterations. It has been reported that cancer genes interact with more partners than a typical human gene (28); we thus determined the predicted protein-protein interactions and DNA-protein interactions (which imply how well a gene is transcriptionally regulated) of each gene. Finally, we also searched the PubMed cancer subset, a collection of cancer-related literatures created by the NCI and the National Library of Medicine (www.nlm.nih.gov/bsd/pubmed_subsets/cancer_strategy.html). All the information was catalogued in Supplementary Table s1, on which we performed Hotelling's T-squared tests. As shown in Fig. 2, while the difference between the two DCG groups is insignificant (p = 0.5), DCGs indeed differ notably from PCGs in these cancer-related features examined (p ≤ 0.05).
Fig. 2
DCGs significantly differ from PCGs in cancer relevance examined
A) The top triangle shows the Hotelling's T-squared test results performed with the data documented in Supplementary Table s1 and described in the text. The numbers shown between the boxes are p-values from the tests. The table below indicates the somatic mutation status reported by a recent NGS study with 276 human CRCs (5) for each group of our candidate genes (see Supplementary Table s4 for details).
B) DCGs and PCGs responding differently in their mRNA expression changes to Taxol-treatment of HCT116 cells. The treatment of 20nM Taxol for 24 hours and 48 hours induced HCT116 cell death and cell cycle arrest (demonstrated by the multinucleate cells indicated with red arrows) (Top). The overall gene expression change at 20nM Taxol between different treatment times was indicated by the p-values, obtained from t-tests using data from Supplementary Table s5 and with “↑” representing significant increase and “↓” indicating significant decrease (middle) (see Supplementary Fig. s2 for p-values of DCGs/PCGs separated based on whether they were amplified or deleted in CRCs). The bottom shows the expression change of individual DCGs, including GADD45A and C1orf63 with similar drug-response profiles, as well as those having a slight expression decrease at 24 hours followed by a larger increase or a near complete recovery at 48 hours. Similar gene expression trends were observed for 10nM and 80nM Taxol treatment (see Supplementary Table s5).
DCGs significantly mutated in human CRCs but PCGs did not
We took advantage of a recent publication from The Cancer Genome Atlas, which investigated 276 human CRCs via next-generation sequencing (NGS) (5). As shown in Fig. 2 and Supplementary Table s4, both 1stDCGs and 2ndDCGs carry significantly more nonsynonymous somatic mutations than PCGs. Upon excluding missense mutations and only including nonsense mutations and fame-shift indels, DCGs become even more significant, with 14 1stDCGs (p = 0.0003) and 17 2ndDCGs (p = 0.02), but only 6 PCGs (p = 0.5), being identified. This provided additional evidence supporting the strong cancer relevance of DCGs, but not PCGs.
DCGs responded to Taxol-treatment of HCT116 cells but PCG did not
We treated HCT116, a near diploid humanCRC line, with Taxol (Paclitaxel), a anticancer drug shown to stabilize microtubules, block the progression of cell cycle and trigger cancer cell death (29, 30). For PCGs, we did not observe any significant changes in gene expression on the whole at different Taxol concentrations (10nM, 20nM, and 80nM) and treatment times (24 hours and 48 hours) (Fig. 2 and Supplementary Table s5). For DCGs (especially those deleted in CRCs; see Table s5 and Fig. s2), however, we noted an initial small expression decrease after 24 hours’ treatment of most genes, for many of which this was followed by a larger expression increase, making the expression level even higher than before the drug treatment or a near complete recovery at 48 hours (Fig. 2).We do not know if DCGs’ initial expression decrease described above is related to their enrichment in cell adhesion functions (see later sections), but both us (Fig. 2) and others (30) observed a drastic decrease in cell adhesion after 24 hours’ Taxol-treatment. The subsequent expression increase at 48 hours (Fig. 2) could possibly arise from the accelerated expression increment of cell cycle check-point genes, which arrested the cell cycle at G2/M checkpoints demonstrated by the significantly increased number of multinucleate cells observed at 48 hours as compared to 24 hours (see Fig. 2 and reference 30). Two DCGs, GADD45A and C1orf63, are especially noteworthy, with their expression level increasing along with the Taxol concentration and incubation time, reaching 18-fold for C1orf63 and 14-fold for GADD45A at 80nM for 48 hours (Fig. 2). GADD45A is known to be upregulated by stress and drug therapy, and is required at the G2/M checkpoint and arrests cell cycle progression (22). The function of C1orf63 is unknown currently; its similar Taxol response profile as GADD45A raises the possibility that C1orf63 may possess comparable functions.In summary, while much more work is clearly needed to understand the mechanisms of the observed drug response, the study demonstrated a clear difference between DCGs and PCGs in their overall response to Taxol treatment of HCT116 cells. This supports the strong cancer relevance of DCGs (but not PCGs) concluded from the bioinformatic analyses described above.
Many DCGs contributed to cell growth, proliferation or death regulation; PCGs did not
A total of 17 DCGs (23%) but only three PCGs (8%) were annotated to participate in cell growth/death regulation, cell cycle control, and DNA repair (Supplementary Table s6). Furthermore, we have experimentally demonstrated additional 12 DCGs being critical to the proliferation/death of humanCRC cells HCT116, DLD1 and SW620 (Fig. 3 and Supplementary Fig. s3). Four genes, including RASA3, DENND5A, AVL9, and NUPL1, are especially noteworthy. They are all predominantly deleted in CRCs (Table s1), and their siRNA knockdown promoted cell growth of HCT116 and other CRC cells (Fig. 3 and Fig. s3). These observations support that these genes are potential tumor suppressors of CRC. RASA3 (a RAS GTPase activator) and DENND5A (a GDP-GTP exchange factor for RAB39; see Supplementary Table s7) both participate in small GTPase signaling, AVL9 function in exocytosis in yeast (31), and NUPL1 is a nucleoporin gene (Fig. 4 and Table s1). Except for RASA3 which is known to inactivate RAS by enhancing its weak intrinsic GTPase activity (32), no any previous studies have been documented regarding their role in cancer.
Fig. 3
siRNA knockdown of DCGs promotes or inhibits the growth of HCT116 cells
A) Cell growth promotion: The six genes shown are expressed in HCT116 and siRNA knockdown was performed, as demonstrated by the bar graphs on the top where the p-value indicates the mRNA expression difference between scramble-control (S) and knockdown (KD) of each gene. Then, the cell growth was quantified by crystal violet-staining, represented by the images in the middle, with the color intensity quantification indicated by the bar graphs below the images and the difference between scramble-control and knockdown specified by the p-values. B) Cell growth inhibition: siRNA knockdown of three genes indicated hinders cell growth. These genes and additional ones were assayed similarly in two other CRC lines SW620 and DLD1 (Supplementary Fig. s3), as well the human cervical cancer line HeLa and an immortalized primary human astrocyte (IHA).
Fig. 4
DCGs (but not PCGs) are enriched in functions associated with epithelial cell polarity establishment and maintenance, and many DCGs participate in cell proliferation/death and/or cell cycle control
The top left images indicate that loss of epithelial cell apicobasal polarity occurs in both adenomas (tumor cells not yet penetrating the basement membrane) and adenocarcinomas (tumor cells already invading tissue layers below the epithelium). Genes shown to the right of the images are 1stDCGs (purple), 2ndDCGs (green), or PCGs (red) already known or likely functioning in epithelial cell polarity based on literature reports (see the main text), as well as gene ontology (GO) terms, INTERPRO domains, and other information provided by DAVID (33) (Supplementary Table s6). The genes with “?” were classified based on protein domain matches only. Genes shown below the images are DCGs or PCGs participating in cell proliferation/death and/or cell cycle regulations, based on annotation by DAVID (33) (Supplementary Table s6), published literature (see the main text), and experiments (Fig. 3 and Fig. s3). See Supplementary Fig. s4 for the classified functions of all DCGs and PCGs that may or may not relate to cell polarity.
In summary, 28 out of 73 total (38%) DCGs are closely associated with cell proliferation and death regulation, cell cycle control, and/or DNA repair (Figs. 3 and 4; Table s6), as known or putative cancer genes. For PCGs, however, only three out of 38 total (<10%) are involved in such processes (ASAH2B, CTGF, and SERBP1), some of which may be false negatives that could be resolved by increasing sample size.
DCGs (but not PCGs) significantly enriched in functions closely associated with epithelial cell apicobasal polarity establishment and maintenance
1stDCGs enriched in cell adhesion/motility functions and 2ndDCG enriched in vesicle-trafficking and ion-transport functions, whereas PCGs not enriched in any specific functions
We applied the frequently-used DAVID Gene Functional Classification Tools (33) on the candidate genes and observed notably more functional groups being over-represented in DCGs than in PCGs. These include cell adhesion (p = 0.02), adherent junctions (p = 0.04), cell motility (p = 0.05), and cell morphogenesis (p = 0.05) for 1stDCGs, as well as vesicle-trafficking (p = 0.03) and ion transport (p = 0.05) for 2ndDCGs (Supplementary Table s8). For PCGs, however, no any significant enrichment was found, with C2H2-zinc finger (ZNF) genes being the most enriched group (p = 0.14) (Table s8).Because 2ndDCGs and PCGs were identified from the 324 GRB sites rather than autosome-wide (Fig. 1), we performed the same enrichment analysis using the 4,401 genes encoded in 1Mb sequences surrounding each GRB (0.5Mb each of upstream and downstream of the GRB center) as the background, instead of the entire autosome gene set as above. The analysis revealed ion homeostasis/transport and vesicle-trafficking remaining significant (p ≤ 0.05) for 2ndDCGs as above (Table s8). For PCGs, however, still no significant enrichment was observed, and C2H2-ZNFs (p = 0.23) were replaced by nuclear lumen-associated genes as the most enriched, which stay nevertheless insignificant (p = 0.17) (Table s8).
GRB sites enriched in C2H2-ZNFs and G-protein coupled receptors (GPCRs)
The following analysis explained why up to five C2H2-ZNFs are among PCGs (Table s1). Compared to the whole autosome set, the 324 GBR sites, which were used to identify PCGs and 2ndDCGs (Fig. 1), are significantly enriched with C2H2-ZNF genes (p ≤ 0.0001; Supplementary Table s9). However, when comparing the 25 GBR sites, where PCGs and 2ndDCGs locate, against the entire 324 GBR set, the difference is insignificant (p > 0.3). Hence, consistent with published studies (34), the GRB sites harbor significantly more C2H2-ZNFs than other genomic regions [for which a possible explanation could be that, like segmental duplications that are also enriched at the GRB sites (12), C2H2-ZNF-formed gene clusters could mediate genomic rearrangements, which in turn expand this gene family via duplications and deletions]. Thus, the five C2H2-ZNFs in PCGs were pulled out by random chance. Similar conclusions were reached for GPCRs and olfactory receptors (ORs) (Tables s1 and s9). C2H2-ZNFs and GPCRs are the only gene families of which multiple members were found among PCGs.
Many DCGs likely function in cell polarity, while PCGs do not
DCGs’ significant enrichment in cell adhesion and trafficking functions described above indicate that many DCGs could possibly be cell polarity genes, rationalized as follows. Colon epithelial cells are asymmetric, exhibiting apical and basal polarity (Fig. 4). This allows them to form permeability barriers between two compartments in the body and to vectorially transport ions and solutes between the compartments. According to published literature (35), development of epithelial cell polarity requires cell-cell and cell-substratum adhesions achieved by genes functioning in various adhesion complexes, cell junctions, and cytoskeleton organization. In addition, the polarized plasma membrane, which is divided into functionally and structurally distinct apical and basolateral domains, is established and maintained by polarized intracellular protein sorting, trafficking and targeting involving a variety of genes and organelles including trans-Golgi network and endosomes.We found that up to 78% of 1stDCGs and 72% of 2ndDCGs, but only 26% of PCGs, might participate in these processes of establishing and maintaining epithelial cell polarity. Specifically, as many as 56% of 1stDCGs (15 genes out of 27 total, with 12 certain and three likely) and 39% of 2ndDCGs (18 genes out of 46 total, with 13 certain and five likely), but only 13% of PCGs (five genes out of 38 total, with three certain and two likely), possess functionality in cell-cell or cell-substratum adhesions, as listed in Fig. 4 and Table s6. For polarized ion-transport and trafficking, we noted up to 33% of 2ndDCGs (15 genes with 13 certain and two likely) and 22% of 1stDCGs (six genes with four certain and two likely), but only 13% of PCGs (five genes) having or likely having such functions (Fig. 4).Significantly, a substantial portion of DCGs are already known to be located in different portions of the polarized plasma membrane (Fig. 4 and Table s6); however, we did not find such information for any of the PCGs. Furthermore, a total of eight 1stDCGs and six 2ndDCGs are already-known epithelial cell polarity genes (Fig. 4 and Table s6). Many function in cell adhesion, including protocadherin PCDH9; the multifunctional CRCtumor suppressor APC; membrane-associated guanylate kinase MPP7 and DLGAP2 (DLG-associated protein 2) (36); basement membrane genes LAMA1 and FREM2; cytoskeleton genes ACTC1 and KIFAP3 (37); and small GTPase signaling molecules RAP1A and ARHGEF7 (38). Others function in polarized ion and small molecule transporting, including potassium/chloride cotransporter SLC12A6 (39), glucose/cation symporter SLC2A1, apical membrane-located v-type proton ATPase subunit ATP6V1B2 (40), and phosphodiesterase ENPP1 (40). Furthermore, 1stDCGs contain three additional putative polarity genes: 1) EFNB2, a member of Eph/Ephrin signaling pathway which regulates the mesenchymal-epithelial transition (MET) (41); 2) EYA2, a tyrosine phosphatase that dephosphorylates histone H2AX and involves in epithelial-mesenchymal transition (EMT) (42); and 3) MYCBP2, a MYC-binding protein whose functions include cytoskeleton organization (37). Similarly, 2ndDCGs also harbor five such genes: 1) AVL9, an exocytosis gene in yeast (31); 2) CHRNA7, a cholinergic receptor whose functions include regulating calcium homeostasis (43); 3) EPDR1, an ependymin-related gene that is involved a cell-substratum adhesion; 4) MLANA, a palmitoylated integral membrane protein likely involving in sorting and degradation of melanosome proteins (44); and 5) SFRP4, a WNT-signaling gene. In summary, as many as 41% of 1stDCGs and 24% of 2ndDCGs are already-known or likely polarity genes. For PCGs, only three (8%) are possibly such genes [CTGF, SEC24C, TRIM62 (45)].
Deleted (in CRC) DCGs’ expressions increase significantly in a cell culture system where cell-cell/cell-ECM (extracellular matrix) adhesions are establishing
The above analysis is corroborated by mRNA expression changes of DCGs and PCGs in an in vitro human embryonic stem cell (hESC) differentiation system, hESC WA09 → ISL1+ nascent mesoderm, during which cell-cell and cell-ECM adhesions are being built. Among 265 most significantly upregulated genes during this differentiation revealed by microarray analyses, 96 of them function in cell adhesion, ECM building, and/or are secreted (Supplementary Table s10), making focal adhesion and ECM-receptor interaction the most significantly enriched pathways among these upregulated genes (FDR < 0.2). As summarized in Fig. 5, we observed a significant expression increase on the whole for DCGs that were predominantly deleted in CRCs (Table s1), especially for those from 1stDCGs where the overall expression increased by two-fold (Supplementary Table s11). Meanwhile, the expression of PCGs barely changed on average. This result is consistent with the cell adhesion function listed for many of the deleted DCGs in Fig. 4.
Fig. 5
Expression changes of DCGs and PCGs in an in vitro hESC differentiation system
During this differentiation cell-cell/cell-ECM adhesions are establishing, demonstrated by microarray expression studies (see the main text). The expression level of individual genes of 1stDCGs, 2ndDCGs, and PCGs in hESC WA09 and its differentiated ISL1+ nascent mesoderm was determined by qRT-PCR (Supplementary Table s11). T-tests were then performed to determine the significance of the expression difference for the genes of each group, which were further separated depending upon if a gene was predominantly amplified (indicated by “Amplification” in the image) or deleted (indicated by “Deletion”) in CRCs examined (Supplementary Tables s1 & s11), between the two cell types. “↑” indicates a significant increase in gene expression.
Although both bioinformatic and experimental analyses described above supports the likelihood that DCGs participate in cell polarity building and maintenance whereas PCGs do not, we acknowledge that for numerous DCGs (Fig. 4), it remains to be experimentally determined whether they are polarity genes or not.
Other DCG-PCG differences in functional groups
Besides those described above, we also noted several other functional groups being more prominent in DCGs than in PCGs (Table s7). These include: 1) ubiquitin-dependent protein degradation genes (six for DCGs versus none for PCGs); 2) redox genes (five for DCGs versus none for PCGs); 3) calcium-binding/dependent genes (10 for DCGs versus one for PCGs); and 4) developmental genes (13 for DCGs versus three for PCGs).
DISCUSSION
Our novel dog-human comparison strategy for cancer diver-passenger discrimination is valid
This pilot project provides a proof of principle study successfully demonstrating the efficiency of our approach on CRCs with large genomic amplifications and deletions. First, the identified DCGs significantly differ from the PCGs in every analysis performed, including cancer relevance based on existing cancer databases and literature, significant somatic mutations in CRCs discovered by NGS (5), and response to the anticancer drug Taxol (Fig. 2). Importantly, both bioinformatic and experimental analyses revealed that, while PCGs’ functions appear random, DCGs hold significantly enhanced functionalities in cell proliferation and death regulation, and cell cycle controls, as well as in cell adhesion, polarized trafficking and other cell polarity-related features that are critical in maintaining colon epithelium integrity (Figs. 2-5) (see below). Alteration of these functions promotes tumor cell growth, invasion and migration. Hence, DCGs displayed strong cancer relevance but PCGs did not, supporting that our approach is valid.
Polarity genes are cancer driver genes?
Loss of cell polarity is a hallmark of cancers that originate from epithelial cells (19). Although its role as a driver or a passenger of cancer is still under debate at present (19), cell polarity has been proposed to be critical in cancer initiation by controlling asymmetric division of cancer stem cells, as well as in cancer progression by preventing cancer cell invasion and spreading (20). This has become increasingly appreciated in recent years as more studies demonstrate the importance of the EMT in cancer (46). Critically, loss of cell polarity has been shown to be cancer-driving in Drosophila genetic models (18).While further research is clearly needed, this study also underscores the importance of cell polarity in CRC pathogenesis. First, nearly half of our DCGs function in cell-cell adhesion or cell-substratum adhesion (Fig. 4). Disruption of these genes would free tumor cells to invade tissue layers below the basement membrane and spread to other places (Fig. 4), which in fact occurred in all human CRCs [classified as T1, T2, T3 or T4 tumors (17)] and most of the canine CRCs [many are invasive adenocarcinomas (15)] investigated in this study. Hence, alterations of these genes with cell polarity-related functions are indeed cancer drivers in this regard. Second, many DCGs (but not PCGs) contributes to cell proliferation and death regulation (Figs. 3 and 4). Hence, this pilot study indicates that, in sporadic CRCs of both species, genes closely associated with epithelial cell polarity establishment/maintenance not only play a critical role in preventing cancer cell invasion and spreading, but also likely serve as tumor suppressors by modulating cell growth. This is consistent with the conclusion from Drosophila genetic model studies (18), and supports the notion that alterations of polarity genes and loss of cell polarity are a cancer driver (20), which can be further tested by future experimental validation of those DCGs with putative polarity and cancer-driving roles (Fig. 4).Unlike previously published papers that are limited to the mere identification of genes recurrently amplified/deleted in CRCs, this human-dog comparison approach allowed us to further classify these genes as either cancer driver or passenger candidates, effectively narrowing down the targets for downstream functional validation. Expanding this pilot study to a larger scale would likely make many more driver-passenger distinctions and address key questions regarding cancer pathogenesis and epithelial cell polarity control. Meanwhile, we caution this pilot study demonstrating only that the proposed dog-human comparison for driver-passenger discrimination is valid for amplified/deleted genes in cancers with large genomic amplifications or deletions, e.g., CRCs with CIN (5, 13). For genes that are mutated via base substitutions or small indels [e.g., in CRCs with microsatellite instability (5, 13)] or abnormal epigenetic modifications (47), it remains to be tested whether this approach is effective or not.
MATERIALS AND METHODS
All analyses were performed with the canFam version 2.0 and the human genome NCBI build 36.1 (hg18). CNAs in both human (17) and dog (15) CRCs were identified as described previously (15). Amplified/deleted genes were identified using the RefGene annotation for the human and the xenoRefGene annotation for the dog, downloaded from the University of California Santa Cruz (UCSC) genome site (www.genome.ucsc.edu). Genes recurrently disrupted in both species were identified with the GISTIC algorithm (48). The 324 GRBs for the autosomes (Table s2) were identified by integrating our human/dog genomic synteny/rearrangement data (12) and the UCSC hg18-canfam2 net alignment data with manual curation.The Cancer Gene Census (6), containing 410 putative driver genes with 19 explicitly stated to be CRC-related when downloaded, and COSMIC (27) data v48 (containing 4,602 mutated genes when downloaded) were both from www.sanger.ac.uk, while the Cancer Gene Index was from cabig.nci.nih.gov/inventory/data-resources/cancer-gene-index. The humancancer pathways included hsa05200 to hsa05223 of the KEGG Release 54.0 obtained from www.genome.jp/kegg/. Substantial phenotypic change information (e.g., tumorigenesis, lethality, defect, and sterility) of existing gene-knockout mouse models was obtained from MGI phenotype database v4.35 from www.informatics.jax.org. Protein-protein interactions were determined using the I2D database v1.9 at ophid.utoronto.ca/, while DNA-protein interactions were determined by counting the transcription factor binding sites in DAVID v6.7 (david.abcc.ncifcrf.gov/) (33). Gene functional classification was investigated using DAVID v6.7 (33), the Pfam database (pfam.sanger.ac.uk), and literature reports.HCT116 cells and other CRC lines, kindly provided by Dr. Bert Vogelstein of Johns Hopkins University, were cultured and authenticated following the instruction provided by ATCC. Taxol (Paclitaxel) was purchased from Sigma-Aldrich (Product No. T7402). The expression levels of DCGs and PCGs in HCT116 cells, WA09 and ISL1+ nascent mesoderm cells were determined by quantitative reverse-transcriptase polymerase chain reaction (qRT-PCR) as described previously (49). The primers were listed in Supplementary Table s12, and GAPDH was used as the normalization gene. siRNA primers (Supplementary Table s13) were designed using Silencer® siRNA Construction Kit Template Design Tool and siRNA Target Finder (Ambion). Then, siRNA and scramble-control RNA were synthesized with Silencer® siRNA Construction Kit (cat no. AM1620) from Ambion. Transfection was performed with siPORT NeoFX Transfection Agent (cat no. AM4510) from Ambion. Cell growth and crystal violet-staining were performed as described (50). Images were quantified using ImagJ (rsbweb.nih.gov/ij/) and t-tests were performed between scramble-control and knockdown. WA09 hESCs were maintained in StemPro defined media (Invitrogen). Differentiation to ISL1+ nascent mesoderm was achieved by supplementation of defined media with Wnt3a (25 ng/ml) and BMP4 (50 ng/ml) for 4 days. Microarray experiments were then performed with Affymetrix Human Gene 1.0 ST arrays at the Emory Biomarker Service Center.
Authors: Thomas J Hudson; Warwick Anderson; Axel Artez; Anna D Barker; Cindy Bell; Rosa R Bernabé; M K Bhan; Fabien Calvo; Iiro Eerola; Daniela S Gerhard; Alan Guttmacher; Mark Guyer; Fiona M Hemsley; Jennifer L Jennings; David Kerr; Peter Klatt; Patrik Kolar; Jun Kusada; David P Lane; Frank Laplace; Lu Youyong; Gerd Nettekoven; Brad Ozenberger; Jane Peterson; T S Rao; Jacques Remacle; Alan J Schafer; Tatsuhiro Shibata; Michael R Stratton; Joseph G Vockley; Koichi Watanabe; Huanming Yang; Matthew M F Yuen; Bartha M Knoppers; Martin Bobrow; Anne Cambon-Thomsen; Lynn G Dressler; Stephanie O M Dyke; Yann Joly; Kazuto Kato; Karen L Kennedy; Pilar Nicolás; Michael J Parker; Emmanuelle Rial-Sebbag; Carlos M Romeo-Casabona; Kenna M Shaw; Susan Wallace; Georgia L Wiesner; Nikolajs Zeps; Peter Lichter; Andrew V Biankin; Christian Chabannon; Lynda Chin; Bruno Clément; Enrique de Alava; Françoise Degos; Martin L Ferguson; Peter Geary; D Neil Hayes; Thomas J Hudson; Amber L Johns; Arek Kasprzyk; Hidewaki Nakagawa; Robert Penny; Miguel A Piris; Rajiv Sarin; Aldo Scarpa; Tatsuhiro Shibata; Marc van de Vijver; P Andrew Futreal; Hiroyuki Aburatani; Mónica Bayés; David D L Botwell; Peter J Campbell; Xavier Estivill; Daniela S Gerhard; Sean M Grimmond; Ivo Gut; Martin Hirst; Carlos López-Otín; Partha Majumder; Marco Marra; John D McPherson; Hidewaki Nakagawa; Zemin Ning; Xose S Puente; Yijun Ruan; Tatsuhiro Shibata; Michael R Stratton; Hendrik G Stunnenberg; Harold Swerdlow; Victor E Velculescu; Richard K Wilson; Hong H Xue; Liu Yang; Paul T Spellman; Gary D Bader; Paul C Boutros; Peter J Campbell; Paul Flicek; Gad Getz; Roderic Guigó; Guangwu Guo; David Haussler; Simon Heath; Tim J Hubbard; Tao Jiang; Steven M Jones; Qibin Li; Nuria López-Bigas; Ruibang Luo; Lakshmi Muthuswamy; B F Francis Ouellette; John V Pearson; Xose S Puente; Victor Quesada; Benjamin J Raphael; Chris Sander; Tatsuhiro Shibata; Terence P Speed; Lincoln D Stein; Joshua M Stuart; Jon W Teague; Yasushi Totoki; Tatsuhiko Tsunoda; Alfonso Valencia; David A Wheeler; Honglong Wu; Shancen Zhao; Guangyu Zhou; Lincoln D Stein; Roderic Guigó; Tim J Hubbard; Yann Joly; Steven M Jones; Arek Kasprzyk; Mark Lathrop; Nuria López-Bigas; B F Francis Ouellette; Paul T Spellman; Jon W Teague; Gilles Thomas; Alfonso Valencia; Teruhiko Yoshida; Karen L Kennedy; Myles Axton; Stephanie O M Dyke; P Andrew Futreal; Daniela S Gerhard; Chris Gunter; Mark Guyer; Thomas J Hudson; John D McPherson; Linda J Miller; Brad Ozenberger; Kenna M Shaw; Arek Kasprzyk; Lincoln D Stein; Junjun Zhang; Syed A Haider; Jianxin Wang; Christina K Yung; Anthony Cros; Anthony Cross; Yong Liang; Saravanamuttu Gnaneshan; Jonathan Guberman; Jack Hsu; Martin Bobrow; Don R C Chalmers; Karl W Hasel; Yann Joly; Terry S H Kaan; Karen L Kennedy; Bartha M Knoppers; William W Lowrance; Tohru Masui; Pilar Nicolás; Emmanuelle Rial-Sebbag; Laura Lyman Rodriguez; Catherine Vergely; Teruhiko Yoshida; Sean M Grimmond; Andrew V Biankin; David D L Bowtell; Nicole Cloonan; Anna deFazio; James R Eshleman; Dariush Etemadmoghadam; Brooke B Gardiner; Brooke A Gardiner; James G Kench; Aldo Scarpa; Robert L Sutherland; Margaret A Tempero; Nicola J Waddell; Peter J Wilson; John D McPherson; Steve Gallinger; Ming-Sound Tsao; Patricia A Shaw; Gloria M Petersen; Debabrata Mukhopadhyay; Lynda Chin; Ronald A DePinho; Sarah Thayer; Lakshmi Muthuswamy; Kamran Shazand; Timothy Beck; Michelle Sam; Lee Timms; Vanessa Ballin; Youyong Lu; Jiafu Ji; Xiuqing Zhang; Feng Chen; Xueda Hu; Guangyu Zhou; Qi Yang; Geng Tian; Lianhai Zhang; Xiaofang Xing; Xianghong Li; Zhenggang Zhu; Yingyan Yu; Jun Yu; Huanming Yang; Mark Lathrop; Jörg Tost; Paul Brennan; Ivana Holcatova; David Zaridze; Alvis Brazma; Lars Egevard; Egor Prokhortchouk; Rosamonde Elizabeth Banks; Mathias Uhlén; Anne Cambon-Thomsen; Juris Viksna; Fredrik Ponten; Konstantin Skryabin; Michael R Stratton; P Andrew Futreal; Ewan Birney; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Sancha Martin; Jorge S Reis-Filho; Andrea L Richardson; Christos Sotiriou; Hendrik G Stunnenberg; Giles Thoms; Marc van de Vijver; Laura van't Veer; Fabien Calvo; Daniel Birnbaum; Hélène Blanche; Pascal Boucher; Sandrine Boyault; Christian Chabannon; Ivo Gut; Jocelyne D Masson-Jacquemier; Mark Lathrop; Iris Pauporté; Xavier Pivot; Anne Vincent-Salomon; Eric Tabone; Charles Theillet; Gilles Thomas; Jörg Tost; Isabelle Treilleux; Fabien Calvo; Paulette Bioulac-Sage; Bruno Clément; Thomas Decaens; Françoise Degos; Dominique Franco; Ivo Gut; Marta Gut; Simon Heath; Mark Lathrop; Didier Samuel; Gilles Thomas; Jessica Zucman-Rossi; Peter Lichter; Roland Eils; Benedikt Brors; Jan O Korbel; Andrey Korshunov; Pablo Landgraf; Hans Lehrach; Stefan Pfister; Bernhard Radlwimmer; Guido Reifenberger; Michael D Taylor; Christof von Kalle; Partha P Majumder; Rajiv Sarin; T S Rao; M K Bhan; Aldo Scarpa; Paolo Pederzoli; Rita A Lawlor; Massimo Delledonne; Alberto Bardelli; Andrew V Biankin; Sean M Grimmond; Thomas Gress; David Klimstra; Giuseppe Zamboni; Tatsuhiro Shibata; Yusuke Nakamura; Hidewaki Nakagawa; Jun Kusada; Tatsuhiko Tsunoda; Satoru Miyano; Hiroyuki Aburatani; Kazuto Kato; Akihiro Fujimoto; Teruhiko Yoshida; Elias Campo; Carlos López-Otín; Xavier Estivill; Roderic Guigó; Silvia de Sanjosé; Miguel A Piris; Emili Montserrat; Marcos González-Díaz; Xose S Puente; Pedro Jares; Alfonso Valencia; Heinz Himmelbauer; Heinz Himmelbaue; Victor Quesada; Silvia Bea; Michael R Stratton; P Andrew Futreal; Peter J Campbell; Anne Vincent-Salomon; Andrea L Richardson; Jorge S Reis-Filho; Marc van de Vijver; Gilles Thomas; Jocelyne D Masson-Jacquemier; Samuel Aparicio; Ake Borg; Anne-Lise Børresen-Dale; Carlos Caldas; John A Foekens; Hendrik G Stunnenberg; Laura van't Veer; Douglas F Easton; Paul T Spellman; Sancha Martin; Anna D Barker; Lynda Chin; Francis S Collins; Carolyn C Compton; Martin L Ferguson; Daniela S Gerhard; Gad Getz; Chris Gunter; Alan Guttmacher; Mark Guyer; D Neil Hayes; Eric S Lander; Brad Ozenberger; Robert Penny; Jane Peterson; Chris Sander; Kenna M Shaw; Terence P Speed; Paul T Spellman; Joseph G Vockley; David A Wheeler; Richard K Wilson; Thomas J Hudson; Lynda Chin; Bartha M Knoppers; Eric S Lander; Peter Lichter; Lincoln D Stein; Michael R Stratton; Warwick Anderson; Anna D Barker; Cindy Bell; Martin Bobrow; Wylie Burke; Francis S Collins; Carolyn C Compton; Ronald A DePinho; Douglas F Easton; P Andrew Futreal; Daniela S Gerhard; Anthony R Green; Mark Guyer; Stanley R Hamilton; Tim J Hubbard; Olli P Kallioniemi; Karen L Kennedy; Timothy J Ley; Edison T Liu; Youyong Lu; Partha Majumder; Marco Marra; Brad Ozenberger; Jane Peterson; Alan J Schafer; Paul T Spellman; Hendrik G Stunnenberg; Brandon J Wainwright; Richard K Wilson; Huanming Yang Journal: Nature Date: 2010-04-15 Impact factor: 49.962
Authors: Selvaraju Veeriah; Cameron Brennan; Shasha Meng; Bhuvanesh Singh; James A Fagin; David B Solit; Philip B Paty; Dan Rohle; Igor Vivanco; Juliann Chmielecki; William Pao; Marc Ladanyi; William L Gerald; Linda Liau; Timothy C Cloughesy; Paul S Mischel; Chris Sander; Barry Taylor; Nikolaus Schultz; John Major; Adriana Heguy; Fang Fang; Ingo K Mellinghoff; Timothy A Chan Journal: Proc Natl Acad Sci U S A Date: 2009-05-28 Impact factor: 11.205
Authors: Clémence Belleannée; Nicolas Da Silva; Winnie W C Shum; Dennis Brown; Sylvie Breton Journal: Am J Physiol Cell Physiol Date: 2010-01-13 Impact factor: 4.249
Authors: Bilian Jin; Bing Yao; Jian-Liang Li; C Robert Fields; Amber L Delmas; Chen Liu; Keith D Robertson Journal: Cancer Res Date: 2009-09-01 Impact factor: 12.701
Authors: Christine R Beck; Pamela Collier; Catriona Macfarlane; Maika Malig; Jeffrey M Kidd; Evan E Eichler; Richard M Badge; John V Moran Journal: Cell Date: 2010-06-25 Impact factor: 41.582
Authors: Dan Li; Nan Li; Yi-Fan Zhang; Haiying Fu; Mingqian Feng; Dina Schneider; Ling Su; Xiaolin Wu; Jing Zhou; Sean Mackay; Josh Kramer; Zhijian Duan; Hongjia Yang; Aarti Kolluri; Alissa M Hummer; Madeline B Torres; Hu Zhu; Matthew D Hall; Xiaoling Luo; Jinqiu Chen; Qun Wang; Daniel Abate-Daga; Boro Dropublic; Stephen M Hewitt; Rimas J Orentas; Tim F Greten; Mitchell Ho Journal: Gastroenterology Date: 2020-02-12 Impact factor: 22.682
Authors: Deli Liu; Huan Xiong; Angela E Ellis; Nicole C Northrup; Carlos O Rodriguez; Ruth M O'Regan; Stephen Dalton; Shaying Zhao Journal: Cancer Res Date: 2014-07-31 Impact factor: 12.701
Authors: Deli Liu; Huan Xiong; Angela E Ellis; Nicole C Northrup; Kevin K Dobbin; Dong M Shin; Shaying Zhao Journal: PLoS Genet Date: 2015-06-01 Impact factor: 5.917
Authors: Jonathan P Mochel; Albert E Jergens; Dawn Kingsbury; Hyun Jung Kim; Martín G Martín; Karin Allenspach Journal: AAPS J Date: 2017-12-12 Impact factor: 3.603
Authors: Nicholas A Graham; Aspram Minasyan; Anastasia Lomova; Ashley Cass; Nikolas G Balanis; Michael Friedman; Shawna Chan; Sophie Zhao; Adrian Delgado; James Go; Lillie Beck; Christian Hurtz; Carina Ng; Rong Qiao; Johanna Ten Hoeve; Nicolaos Palaskas; Hong Wu; Markus Müschen; Asha S Multani; Elisa Port; Steven M Larson; Nikolaus Schultz; Daniel Braas; Heather R Christofk; Ingo K Mellinghoff; Thomas G Graeber Journal: Mol Syst Biol Date: 2017-02-15 Impact factor: 13.068
Authors: Kristin Marie Valand Herstad; Helene Thorsen Rønning; Anne Marie Bakke; Lars Moe; Ellen Skancke Journal: Acta Vet Scand Date: 2018-05-11 Impact factor: 1.695
Authors: Jin Wang; Tianfang Wang; Micah A Bishop; John F Edwards; Hang Yin; Stephen Dalton; Laura K Bryan; Shaying Zhao Journal: Oncotarget Date: 2018-06-26
Authors: Jin Wang; Tianfang Wang; Yanfang Sun; Yuan Feng; William C Kisseberth; Carolyn J Henry; Irene Mok; Susan E Lana; Kevin Dobbin; Nicole Northrup; Elizabeth W Howerth; Shaying Zhao Journal: Cancers (Basel) Date: 2018-09-14 Impact factor: 6.639