| Literature DB >> 19285540 |
Jenny Mattison1, Louise van der Weyden, Tim Hubbard, David J Adams.
Abstract
The elucidation of the human and mouse genome sequence and developments in high-throughput genome analysis, and in computational tools, have made it possible to profile entire cancer genomes. In parallel with these advances mouse models of cancer have evolved into a powerful tool for cancer gene discovery. Here we discuss the approaches that may be used for cancer gene identification in both human and mouse and discuss how a cross-species 'oncogenomics' approach to cancer gene discovery represents a powerful strategy for finding genes that drive tumourigenesis.Entities:
Mesh:
Substances:
Year: 2009 PMID: 19285540 PMCID: PMC2756404 DOI: 10.1016/j.bbcan.2009.03.001
Source DB: PubMed Journal: Biochim Biophys Acta ISSN: 0006-3002
Overview of genomic technologies for cancer gene discovery.
| Cancer gene discovery approach | Resolution | Pros | Cons |
|---|---|---|---|
| Gene resequencing | Nucleotide | Can be an accurate way of finding somatic mutations at the nucleotide level. | PCR-based strategies are not readily scaleable to genome-wide and are expensive. |
| With new array-based sequence enrichment technology the entire exome can be profiled. | Array-based sequence enrichment is still developmental and many protocols do not reproducibly capture the exome. | ||
| Expression analysis | Transcript | Expression data can be used for diagnostic and prognostic purposes. | Because expression profiling is a quantitative measure of gene activity it reports gene expression changes that are both the cause and the effect of genetic and epigenetic changes at the DNA level which often makes the output of these studies an ‘expression signature’ rather than a cancer gene. |
| RNA-Seq based approaches can be used for profiling the transcriptome including expression levels, splicing and fusion gene discovery. | |||
| Comparative Genomic Hybridization (CGH) | Megabase | Large complex rearrangements can be discovered using this technique. | Largely outdated by array-based approaches. Not readily scaleable to high-throughput. |
| Reveals large regions of rearrangement which may contain many genes so finding causal rearrangements can be difficult. | |||
| Array-based Comparative Genomic Hybridization (aCGH) | 100s bp | High resolution. SNP-based platforms can report allele-specific changes. | Stromal contamination and immune cell infiltrates can influence the ability of these platforms to determine the copy number of the cancer. |
| Like all genomics platforms array-based CGH reports to copy number profile of a population of cells so tumour heterogeneity can be an issue. | |||
| Sequencing entire cancer genomes | Nucleotide | Can report nucleotide level variation, copy number information and can also report neutral changes in the genome such as balanced translocations and inversions. | Extremely expensive. Not clear how to computationally resolve highly rearranged regions. |
| Epigenetic profiling | Nucleotide | Can detect epigenetically silenced genes that would be missed by other approaches. | It has proved difficult to develop technology to scale to genome-wide epigenetic profiling at the nucleotide level. |
Fig. 1End-sequence profiling of tumour DNA. The tumour genome is fragmented and the ends of the fragmented DNA molecules are sequenced. These sequenced ends are then mapped to the reference genome. Ends that are an abnormal distance apart, or in an abnormal orientation, shown here as “invalid”, are indicative of rearrangements within the tumour genome. Redrawn with modifications from Raphael et al. [117].
Fig. 2Structure of insertional mutagens used for cancer gene discovery in the mouse. (A) The provirus contains two long terminal repeats (LTRs) flanking the genes required for viral assembly. Elements within the LTRs drive transcription of the viral genes but can also induce mutation of nearby cellular genes. Splicing of a viral splice donor (SD) or cryptic splice donor (not shown) to a splice acceptor or cryptic splice acceptor in the first intron or 5′ UTR of a cellular gene results in the formation of a chimeric transcript, in which the celluar gene is coupled to the viral promoter. Splicing of a splice donor or cryptic splice donor in a cellular gene to a viral splice acceptor (SA) or cryptic splice acceptor (not shown) can cause premature termination of gene transcription owing to the presence of polyadenylation signal (pA) and cryptic polyadenylation signals (not shown) in the LTR. Adapted from figure in Uren et al. (see [164,168]). Figure is not to scale. (B) Structure of the Sleeping Beauty transposon T2Onc [242]. The presence of splice acceptors (SA) and polyadenylation signals (pA) in both orientations enables premature termination of gene transcription from intragenic insertions in both orientations. The transposon also contains the murine stem cell virus (MSCV) 5′ LTR and a splice donor (SD) site that can induce promoter mutations in cellular genes. Elements for mutagenesis are flanked by 2 IR/DR elements, shown as arrows, which are required for transposon mobilisation.
Fig. 3Mechanisms of mutagenesis by the murine leukaemia virus. The provirus is shown in blue; coding and non-coding exons are shown in red and white, respectively. (A) Enhancer mutation. An enhancer element in the 5′ LTR of MuLV can cause upregulation of nearby cellular genes. Oncogenic insertions of this type are most frequently found upstream and in the antisense orientation with respect to the cellular gene(s) that they are mutating. (B) Promoter mutation. Insertion of MuLV into the promoter region of a cellular gene results in chimeric transcripts that are produced at higher levels than the endogenous gene transcript. (C) Truncating mutation. Intragenic MuLV insertions can cause premature termination of gene transcription, resulting in either gene upregulation or gene inactivation. The figure shows an insertion within the 3′ UTR region, which may remove mRNA-destabilising motifs, thereby stabilising the gene transcript.
Fig. 4Isolation of retroviral insertion sites. (A) Inverse PCR. Tumour DNA is digested using restriction enzyme X and the restricted DNA is allowed to circularise. Genomic DNA flanking retroviral insertions are amplified using PCR primers that bind within the insertion and point out towards the genomic DNA. A second round of PCR is performed using nested primers. The amplified DNA is sequenced and mapped to the mouse reference genome. (B) Splinkerette PCR. As for inverse PCR, except that instead of circularising the digested DNA, a splinkerette adapter (shown in yellow) is ligated and genomic DNA flanking the retroviral insertions is amplified using PCR primers that bind to the adapter and the retroviral LTR.