| Literature DB >> 29985390 |
Philip Webster1,2,3, Joanna C Dawes1,2, Hamlata Dewchand1,2, Katalin Takacs1,2, Barbara Iadarola1,2, Bruce J Bolt1,2, Juan J Caceres4, Jakub Kaczor1,2, Gopuraja Dharmalingam1,2, Marian Dore1,2, Laurence Game1,2, Thomas Adejumo1,2, James Elliott1,2, Kikkeri Naresh3, Mohammad Karimi1,2, Katerina Rekopoulou1,2, Ge Tan1,2, Alberto Paccanaro4, Anthony G Uren5,6.
Abstract
Determining whether recurrent but rare cancer mutations are bona fide driver mutations remains a bottleneck in cancer research. Here we present the most comprehensive analysis of murine leukemia virus-driven lymphomagenesis produced to date, sequencing 700,000 mutations from >500 malignancies collected at time points throughout tumor development. This scale of data allows novel statistical approaches for identifying selected mutations and yields a high-resolution, genome-wide map of the selective forces surrounding cancer gene loci. We also demonstrate negative selection of mutations that may be deleterious to tumor development indicating novel avenues for therapy. Screening of two BCL2 transgenic models confirmed known drivers of human non-Hodgkin lymphoma, and implicates novel candidates including modifiers of immunosurveillance and MHC loci. Correlating mutations with genotypic and phenotypic features independently of local variance in mutation density also provides support for weakly evidenced cancer genes. An online resource http://mulv.lms.mrc.ac.uk allows customized queries of the entire dataset.Entities:
Mesh:
Year: 2018 PMID: 29985390 PMCID: PMC6037733 DOI: 10.1038/s41467-018-05069-9
Source DB: PubMed Journal: Nat Commun ISSN: 2041-1723 Impact factor: 14.919
Fig. 1Variable latency and immunophenotype of MuLV lymphoma from wild type and BCL2 transgenic mice. a The Vav-BCL2 transgene significantly reduced latency on an F1 background. b, c The Emu-BCL2-22 transgene significantly reduces latency on a C57BL/6 background but not F1 background. The Emu-Bcl2-22 C57BL/6 cohort had a significantly shorter latency than wild-type C57BL/6 controls and both C57BL/6 cohorts had longer latency compared with F1 equivalents (Supplementary Fig. 1). d–f Stacked bar charts on the right represent the immunophenotyping of spleen suspensions from each cohort. Each row represents one spleen. Colors in each row represent the proportion of B cells (blue CD19+) and T cells (yellow CD5+ CD4− CD8−, light orange CD5+ CD8+, dark orange CD5+ CD4+ CD8+, and red CD5+ CD4+) in each sample. BCL2 transgenes increase the proportion of B cells in all cohorts and the mixture of T cell lymphoma subtypes is highly variable. g The proportion of CD19+ B cells is increased by both BCL2 transgenes. h Histogram of all CD19+ proportions from all cohorts combined is a bimodal distribution that can be segregated into those consisting primarily of B cells (>50%) and T cells
Fig. 2Quantifying the progression of MuLV replication and clonal outgrowth of resulting lymphoma. Virus copy number and expression level was quantified by QPCR of genomic DNA (a) and RTQPCR of cDNA (b) extracted from spleen samples of time course animals. Error bars represent s.d. of 3 technical replicates per DNA/RNA sample. (c) Profiles of the relative abundance of the top 50 most clonal integrations from a cross section of mature lymphoma (upper 2 rows) and time course samples (lower 4 rows) are represented as bar graphs. Non-adjusted clonality is indicated in blue, normalized clonality (such that the most abundant integration has a value of 1) are the graphs in red. Asymptomatic animals from early time points display a relatively flat profile whereas later time points and mice with symptomatic lymphoma show clear signs of clonal outgrowth. Shannon entropy values (E) are displayed on each graph
Fig. 3Using distance-based measures and entropy as indicators of clonal outgrowth of lymphoma. a Dynamic Time Warping was used to cluster clonality profiles of all samples and identifies two major groups; early-stage samples (blue) and samples undergoing clonal outgrowth (red). Near identical clusters were obtained using the Kolgomorov–Smirnov statistic (Supplementary Data 1). b Samples are plotted comparing entropy score by rank and individual samples are colored by cluster branch, indicating both entropy scores and clustering give a similar bifurcation of samples. c Distribution of entropy scores between the two clusters indicates an entropy value of 3.5 effectively separates the groups. The mean (horizontal line), ±1 s.d. (box), and ±2 s.d. (vertical line) are indicated. d Distribution of entropy scores between different time points indicates a progressive increase in the frequency of clonal outgrowth (mean (horizontal line), ±1 s.d. (box), and ±2 s.d. (vertical line)). Superimposing the clonality profiles of all samples within each cluster indicates consistent shape within the low entropy group (e) and within the high entropy group (f). A normalized clonality value of 0.1 is used to differentiate clonal and subclonal mutations within the late-stage clonal outgrowth samples
Fig. 4Multiple criteria indicate selection of both clonal and subclonal mutations at CIS loci. Four heat maps representing the relative levels of selection observed between different categories of integrations. Fisher’s exact tests were performed counting the inserts within 100kb windows surrounding each of the top 50 clonal insert loci. Blue indicates comparisons between early and late-stage integrations. Red represents integration orientation bias (forward or reverse strand). Yellow represents specificity for B cell (>50% CD19) versus T cell lymphomas. Green represents specificity between different genotypes. p-Values for Fisher’s exact tests are indicated by color intensity
Fig. 5Genome-wide scanning of subclonal mutation distributions identifies regions undergoing selection. a Genome-wide contingency table tests of all mutations identifies loci that are late-stage specific, strand biased and genotype biased. The Venn diagram demonstrates substantial overlap between loci identified by these criteria. b Distribution of integrations over the Myc/Pvt1 locus. Each row of colored vertical lines represents the forward and reverse strand integrations of each category of mice. Gray bands below each colored row represent the level of selection evidenced by contingency table tests. Late-stage specific integrations are evident throughout the region; however, integrations upstream of Myc are primarily on the forward strand and T cell specific whereas integrations within the Pvt1 gene are in the reverse orientation and somewhat biased toward wild-type mice
Fig. 6Overlap of CIS loci with exome sequencing studies of hematologic malignancies. Human orthologues were identified for all candidate genes (an automated list KCRBM, and a curated list) using biomart and compared to lists of genes with coding mutations in 12 cohorts of hematologic malignancy in cBio portal. a All genes found mutated in at least two samples over all cohorts are listed with mutation counts from each cohort. The full overlap for all cohorts is listed in Supplementary Data 7. b The significance of overlap between the set of candidate orthologues and the set of mutated genes in each study is calculated using a Fisher’s exact test