| Literature DB >> 30321343 |
Mark J Margres1, Manuel Ruiz-Aravena2, Rodrigo Hamede2,3, Menna E Jones2, Matthew F Lawrance1, Sarah A Hendricks4, Austin Patton1, Brian W Davis5,6, Elaine A Ostrander6, Hamish McCallum7, Paul A Hohenlohe4, Andrew Storfer1.
Abstract
Understanding the genetic basis of disease-related phenotypes, such as cancer susceptibility, is crucial for the advancement of personalized medicine. Although most cancers are somatic in origin, a small number of transmissible cancers have been documented. Two such cancers have emerged in the Tasmanian devil (Sarcophilus harrisii) and now threaten the species with extinction. Recently, cases of natural tumor regression in Tasmanian devils infected with the clonally contagious cancer have been detected. We used whole-genome sequencing and FST-based approaches to identify the genetic basis of tumor regression by comparing the genomes of seven individuals that underwent tumor regression with those of three infected individuals that did not. We found three highly differentiated candidate genomic regions containing several genes related to immune response and/or cancer risk, indicating that the genomic basis of tumor regression was polygenic. Within these genomic regions, we identified putative regulatory variation in candidate genes but no nonsynonymous variation, suggesting that natural tumor regression may be driven, at least in part, by differential host expression of key loci. Comparative oncology can provide insight into the genetic basis of cancer risk, tumor development, and the pathogenicity of cancer, particularly due to our limited ability to monitor natural, untreated tumor progression in human patients. Our results support the hypothesis that host immune response is necessary for triggering tumor regression, providing candidate genes that may translate to novel treatments in human and nonhuman cancers.Entities:
Mesh:
Year: 2018 PMID: 30321343 PMCID: PMC6251476 DOI: 10.1093/gbe/evy229
Source DB: PubMed Journal: Genome Biol Evol ISSN: 1759-6653 Impact factor: 3.416
Fig. 1.—Sampling of Sarcophilus harrisii. We collected tissue samples from seven individuals that underwent tumor regression and three individuals that did not for whole-genome sequencing. Sampling region as well as the sites of discovery (i.e., putative sites of origin) for DFT1 and DFT2 are indicated in the inset.
Fig. 2.—Manhattan plots showing differentiation between cases and controls for (A) SNPs, (B) indels, and (C) structural variants. Blue points in (A) and (B) indicate scaffolds that were identified as highly differentiated candidate genomic regions by segmentFst. The three candidate indel scaffolds in (B) are the same scaffolds as in (A). Because the x axis is variant index rather than true genomic position (i.e., the 36,000 scaffolds of the reference genome makes robust physical mapping problematic), the scaffolds only appear slightly shifted. (C) Blue points indicate empirical outliers that passed all filters (see Materials and Methods). Green points indicate empirical outliers that did not pass all filters. The y axis in all panels represents the negative log of the pFst statistic.
Candidate Genomic Regions Identified in the SNP-Based segmentFst Analysis with Corresponding Genes and Candidate SNPs
| Chromsome | Scaffold | Start–End | Gene | Putative Function | Candidate SNPs |
|---|---|---|---|---|---|
| 1 | GL834900 | 59090–264617 | Protein phosphatase 2, regulatory subunit B, delta ( | Cell growth/division control; Activated TLR4 signaling | – |
| BCL2/adenovirus E1B 19 kDa interacting protein 3 ( | Apoptosis and authophagy | – | |||
| Janus kinase and microtubule interacting protein 3 ( | Microtubule and kinase binding | 13 intronic | |||
| 2 | GL841602 | 886632–992998 | Xylulokinase ( | Metabolism | 8 intronic |
| 1 downstream | |||||
| 2 | GL841626 | 476121–923043 | Sodium channel, voltage-gated, type XI, alpha subunit ( | Action potential and pain signaling | 4 upstream |
| WD repeat domain 48 ( | Deubiquitinating complex regulator; Potential tumor suppressor | 2 intronic | |||
| Golgi reassembly stacking protein 1 ( | Protein sorting and modification | 1 upstream | |||
| Tetratricopeptide repeat domain 21A ( | Protein localization | – | |||
| Xin actin-binding repeat containing 1 ( | Cellular junction function; leukemia candidate gene | – | |||
| Solute carrier family 22, member 14 ( | Molecule transport | – | |||
| Solute carrier family 22, member 13 ( | Molecular transport and metabolism | – | |||
| Oxidative stress responsive 1 ( | Kinase response to environmental stress | – | |||
| 3 | GL849905 | 3117969–3162680 | Mediator complex subunit 12-like ( | Transcriptional coactivation | – |
| 4 | GL856968 | 1213224–1266653 | – | – | – |
| 5 | GL861622 | 8458–1345777 | Lymphoid-restricted membrane protein ( | Peptide delivery to MHC class I; related to b-cell lymphomas | – |
| Cancer susceptibility candidate 1 ( | Microtubule and tubulin binding; related to respiratory neoplasms | – | |||
| LYR motif containing 5 ( | Protein synthesis | – | |||
| Kirsten rat sarcoma viral oncogene homolog ( | Member of small GTPase superfamily; implicated in multiple carcinomas | 1 intronic | |||
| Lamin tail domain containing 1 ( | Structural activity; related to respiratory neoplasms | 30 intronic | |||
| Ras association domain family member 8 ( | Tumor suppressor protein; epithelial cell migration | – | |||
| Sarcospan ( | Structural support | – | |||
| 5 | GL861682 | 1183188–1317129 | Glutamate decarboxylase 2 ( | L-glutamic acid catalysis | 87 intronic |
| 5 | GL861682 | 1500464–1544340 | Myosin IIIA ( | Hearing; associated with bladder/colon cancer | 7 intronic |
| 5 | GL861847 | 655–43394 | – | – | – |
| 5 | GL862178 | 7209–28728 | ENSSHAG00000000783 | Unannotated coding sequence | 9 intronic |
| 5 | GL862286 | 963–11539 | – | – | – |
Putative immune or cancer-related function. Candidate SNPs were variants with .
Candidate Genomic Regions Identified in the Indel-Based segmentFst Analysis with Corresponding Genes and Candidate Indels
| Chromsome | Scaffold | Start–End | Gene | Putative Function | Candidate Indels |
|---|---|---|---|---|---|
| 3 | GL849905 | 2931754–3563755 | Transcriptional coactivation | 6 intronic | |
| 1 upstream | |||||
| 4 intronic | |||||
| G protein-coupled receptor 87 ( | Regulated by | 1 intronic | |||
| 3 downstream | |||||
| Purinergic receptor P2Y, G-protein coupled, 14 ( | Receptor activity; immune and neuroimmune function | – | |||
| G protein-coupled receptor 171 ( | Ligand-binding receptor | – | |||
| Siah E3 ubiquitin protein ligase 2 ( | Hypoxia response; involved in apoptosis and tumor suppression | – | |||
| 5 | GL861622 | 159894–1238762 | Microtubule and tubulin binding; related to respiratory neoplasms | 1 intronic | |
| Protein synthesis | – | ||||
| Small GTPase superfamily member; implicated in multiple carcinomas | – | ||||
| Structural activity; related to respiratory neoplasms | 7 intronic | ||||
| 1 upstream | |||||
| 1 downstream | |||||
| Tumor suppressor protein; epithelial cell migration | 1 intronic | ||||
| 5 | GL861682 | 1186331–1550521 | L-glutamic acid catalysis | 11 intronic | |
| Hearing; associated with bladder/colon cancer | 21 intronic | ||||
| 1 upstream |
Putative immune or cancer-related function.
Indels found in overlapping open-reading frames and, therefore, with ambiguous designations. Candidate indels were variants with . Abbreviated gene names were defined in table 1.
Fig. 3.—Pie charts depicting the relative frequencies of different mutational classes among the most differentiated (A) SNPs (n = 585) and (B) indels (n = 128) in the candidate genomic regions from figure 2. Nonsyn, nonsynonymous; Syn, synonymous; UTR, untranslated region.
Candidate Genes Identified in the Structural Variant Analysis
| Chromsome | Scaffold | Position | Type | Gene | Putative Function |
|---|---|---|---|---|---|
| 2 | GL841465 | 252851 | 58 bp insertion intron | DTW domain containing 2 ( | Mitochondrion protein targeting |
| GL841584 | 1176582 | 95 bp deletion intron | BMP binding endothelial regulator ( | Bone morphogenetic protein inhibition | |
| 3 | GL849630 | 133609 | 584 bp insertion upstream | Solute carrier family 38 member 11 ( | Amino acid transport |
| GL849631 | 677692 | 278 bp insertion intron | Cysteine-serine-rich nuclear protein 3 ( | Transcriptional activator regulator; apoptosis | |
| 4 | GL856953 | 1140408 | 247 bp insertion intron | Mitogen-activated protein kinase kinase kinase 5 ( | Apoptosis signal transduction |
| GL856993 | 210323 | Translocation of intron with intergenic region on chromosome 3 (GL849549) | Tripartite motif containing 33 ( | E3 ubiquitin-protein ligase; related to carcinomas | |
| 5 | GL861591 | 1045356 | 614 bp deletion intron | ENTH domain containing 1 ( | Unknown |
| GL861622 | 511673 | Translocation of downstream region with intron of tankyrase ( | Structural activity; related to respiratory neoplasms | ||
| 511686 | Translocation of downstream region with intron of | Structural activity; related to respiratory neoplasms | |||
| GL861672 | 1578145 | 69 bp insertion intron | GTP binding protein 4 ( | Cell signaling | |
| 6 | GL864736 | 1234425 | Translocation of intron with downstream region of | Wnt signaling; target of cancer treatment | |
| 1234867 | Translocation of intron with downstream region of | Wnt signaling; target of cancer treatment | |||
| GL864753 | 78910 | 395 bp insertion intron | SEC31 homolog A ( | Vesicle budding; related to inflammatory myofibroblastic tumors | |
| X | GL867607 | 3118283 | Translocation of intron with intergenic region on chromosome 1 (GL835007) | Dachshund family transcription factor 2 ( | Transcription factor |
Putative immune or cancer-related function. Abbreviated gene names were defined in tables 1 and 2.
Fig. 4.—VISTA plot showing conserved sequences across devils and six other mammals for a candidate genomic region containing MYO3A. (A) We plotted 1,400,000–1,500,000 on devil chromosome 5 scaffold GL861682. Two conserved intronic sequences (B and C), indicated by the red boxes in (A), possessed candidate SNPs that were highly differentiated () between cases and controls, suggesting that these SNPs may lead to regulatory variation in MYO3A. Genomic position in the reference human genome is listed on the x axis in all panels. Percent similarity is shown on the y axis where the blue line indicates 70% identity in all panels. Asterisks represent approximate location of candidate SNPs. CNS, conserved noncoding sequence; UTR, untranslated region.