| Literature DB >> 25355294 |
Carl Mårten Lindqvist1, Jessica Nordlund, Diana Ekman, Anna Johansson, Behrooz Torabi Moghadam, Amanda Raine, Elin Övernäs, Johan Dahlberg, Per Wahlberg, Niklas Henriksson, Jonas Abrahamsson, Britt-Marie Frost, Dan Grandér, Mats Heyman, Rolf Larsson, Josefine Palle, Stefan Söderhäll, Erik Forestier, Gudmar Lönnerholm, Ann-Christine Syvänen, Eva C Berglund.
Abstract
Genomic characterization of pediatric acute lymphoblastic leukemia (ALL) has identified distinct patterns of genes and pathways altered in patients with well-defined genetic aberrations. To extend the spectrum of known somatic variants in ALL, we performed whole genome and transcriptome sequencing of three B-cell precursor patients, of which one carried the t(12;21)ETV6-RUNX1 translocation and two lacked a known primary genetic aberration, and one T-ALL patient. We found that each patient had a unique genome, with a combination of well-known and previously undetected genomic aberrations. By targeted sequencing in 168 patients, we identified KMT2D and KIF1B as novel putative driver genes. We also identified a putative regulatory non-coding variant that coincided with overexpression of the growth factor MDK. Our results contribute to an increased understanding of the biological mechanisms that lead to ALL and suggest that regulatory variants may be more important for cancer development than recognized to date. The heterogeneity of the genetic aberrations in ALL renders whole genome sequencing particularly well suited for analysis of somatic variants in both research and diagnostic applications.Entities:
Keywords: RNA sequencing; acute lymphoblastic leukemia; clonal heterogeneity; whole genome sequencing
Mesh:
Substances:
Year: 2015 PMID: 25355294 PMCID: PMC4309499 DOI: 10.1002/humu.22719
Source DB: PubMed Journal: Hum Mutat ISSN: 1059-7794 Impact factor: 4.878
Clinical Characteristics of Whole Genome Sequenced ALL Patients
| Patient | Sex | Age | WBC | % Blasts | Immunophenotype | Genetic subtype | Treatment group | Clinical follow-up | Remission tissue |
|---|---|---|---|---|---|---|---|---|---|
| ALL_458 | Male | 3.6 | 12.3 | 90 | BCP-ALL | t(12;21) | IR | CCR1 (8.5) | PB |
| ALL_559 | Male | 5.9 | 128.0 | 95 | T-ALL | T-ALL | HR | CCR1 (7) | PB |
| ALL_707 | Male | 1.6 | 9.6 | 80–90 | BCP-ALL | Other | SR | CCR1 (5) | PB |
| ALL_501 | Female | 6.7 | 1.4 | 80 | BCP-ALL | Normal | SR | CCR1 (8) | BM |
Age at diagnosis in years.
White blood cell count at diagnosis (109 cells/l).
The full karyotypes are:
ALL_458: 46,XY.ish.t(12;21)(p13;q22),del(12)(p13p13),del(21)(q22q22)
ALL_559: 46,XY,t(7;9)(q3?4;q3?2)[10].ish.del(9)(p21p21)x2,der(11)t(7;11)(q3?4;p1?3)/46,XY[15]. The t(7;9) was detected by G-banding in 10 of 25 metaphases. Remaining aberrations, and t(7;9), were detected by FISH. The question marks indicate that the breakpoint on sub-band level is uncertain.
ALL_707: 46,XY,der(7)t(7;9)(q11;p13)del(9)(p21p24),der(9)t(7;9)(q11;p13),del(19)(q13)[24]/46,XY[1]
ALL_501: 46,XX[20]. Hyperdiploidy and the most common rearrangements (BCR-ABL1, PBX1-TCF3, ETV6-RUNX1, and MLL) were excluded by FISH and DNA index analysis.
The patients were treated according to the Nordic Society for Pediatric Haematology and Oncology (NOPHO) protocols [Schmiegelow et al., 2010].
Within parenthesis is the follow-up time in years.
BCP-ALL, B-cell precursor ALL; SR, standard risk; IR, intermediate risk; HR, high risk; CCR1, first continuous complete remission; PB, peripheral blood; BM, bone marrow.
Cytogenetic Representation of ALL Patients Included in the Validation Cohort
| Immunophenotype | Genetic subtype | Number of patients | Population (%) |
|---|---|---|---|
| BCP-ALL | HeH | 47 (28.0) | 26.3 |
| t(12;21) | 35 (20.8) | 16.7 | |
| Other | 21 (12.5) | 13.4 | |
| Normal / no result | 18 (10.7) | 21.7 | |
| t(9;22) | 8 (4.8) | 2.2 | |
| 11q23/MLL | 4 (2.4) | 3.6 | |
| iAMP21 | 4 (2.4) | 0.5 | |
| t(1;19) | 4 (2.4) | 1.9 | |
| dic(9;20) | 3 (1.8) | 1.8 | |
| > 67 chr | 1 (0.6) | 0.4 | |
| T-ALL | T-ALL | 23 (13.7) | 10.5 |
| Total | 168 (100) | 99.0 |
HeH, high hyperdiploidy (51–67 chromosomes); t(12;21), translocation between the chromosomes (12;21)(p13;q22)ETV6-RUNX1; t(9;22), translocation between the chromosomes (9;22)(q11;q34)BCR-ABL1; 11q23/MLL, translocation between MLL and various other genes; iAMP21, intrachromosomal amplification of chromosome 21; dic(9;20), dicentric chromosome (9;20)(p13;q11); > 67 chr, > 67 chromosomes; Other, other clonal aberrations; Normal, no genetic aberrations detected and a normal karyotype observed in at least 5 of 25 metaphases; No result, no karyotype reported or the cytogenetic analysis failed.
Within parenthesis is the percentage of samples of each subtype in the validation cohort.
Percentage of each subtype in 2367 patients diagnosed with ALL in the Nordic countries during 1996–2008. The frequency of subtypes changes with time, since new subtypes are discovered and new analysis methods are added. The reason why the population does not sum to 100% is that the rare subtype hypodiploidy is not represented in the validation cohort.
Figure 1A: Circos [Krzywinski et al., 2009] plots showing the genomic location of validated somatic single nucleotide variants (SNVs), insertion-deletions (indels), copy number alterations (CNAs), copy neutral loss of heterozygosity (LOH) events, and translocations in the whole genome sequenced ALL patients. SNVs and indels are shown as red (original clone) or blue (subclone) dots in the circle closest to the chromosomes. Inside the SNVs and indels, deletions are shown with red, duplications with blue and LOH with yellow circle segments. Black arcs indicate translocations. Gene names are color-coded as follows: gray, expressed genes with exonic indels or nsSNVs that were predicted to be damaging and genes that were highlighted as putative drivers in the validation cohort; black, genes involved in translocations; red, selected genes in CNA or LOH regions; green, selected differentially expressed genes that are located near breakpoints for translocations or putatively regulatory SNVs. B: Mutational patterns in the whole genome sequenced patients. The higher frequency of C>A mutations compared to C>T mutations in ALL_707 is significantly different from the other patients (chi-square test, P < 0.001).
Validated Exonic Single Nucleotide Variants and Insertion–Deletions in Whole Genome Sequenced ALL Patients
| Sample | Chr | Position | cDNA change | Protein change | Gene | Gene description | Effect | SIFT | PP2 | AF DNA | AF RNA | FPKM sample | FPKM control |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ALL_458 | 2 | 178416648 | c.844G>A | p.E282K | TTC30B | Tetratricopeptide repeat domain 30B | nsSNV | D | D | 0.12 | 0.14 | 2.2 | 0.8 |
| ALL_458 | 3 | 51749797 | c.2008C>T | p.R670W | GRM2 | Glutamate receptor, metabotropic 2 isoform a | nsSNV | D | D | 0.44 | NA | 0.1 | 0.1 |
| ALL_458 | 12 | 11285961 | c.883C>T | p.R295W | TAS2R30 | Type 2 taste receptor member 30 | nsSNV | T | B | 0.33 | 0.00 | 0.2 | 1.6 |
| ALL_559 | 1 | 10363472 | c.2229T>G | p.I743M | KIF1B | Kinesin family member 1B isoform b | nsSNV | D | D | 0.34 | 0.20 | 2.5 | 3.3 |
| ALL_559 | 1 | 11186751 | c.1069C>T | p.R357C | MTOR | FK506 binding protein 12-rapamycin associated | nsSNV | T | B | 0.56 | 0.60 | 12.3 | 10.8 |
| ALL_559 | 1 | 32745467 | c.1067A>C | p.E356A | LCK | Lymphocyte-specific protein tyrosine kinase | nsSNV | D | D | 0.61 | 0.64 | 306.8 | 67.2 |
| ALL_559 | 3 | 48617469 | c.5119C>T | p.R1707W | COL7A1 | Alpha 1 type VII collagen precursor | nsSNV | D | P | 0.47 | 1.00 | 0.0 | 0.3 |
| ALL_559 | 5 | 180053029 | c.1261C>T | p.P421S | FLT4 | fms-related tyrosine kinase 4 isoform 2 | nsSNV | D | P | 0.35 | NA | 0.0 | 1.0 |
| ALL_559 | 6 | 39864708 | c.2462G>A | p.R821Q | DAAM2 | Dishevelled associated activator of | nsSNV | T | P | 0.40 | 0.00 | 0.1 | 1.6 |
| ALL_559 | 6 | 84055977 | c.515G>T | p.G172V | ME1 | Cytosolic malic enzyme 1 | nsSNV | D | D | 0.39 | 0.00 | 0.1 | 0.0 |
| ALL_559 | 9 | 139397768 | c.5033T>C | p.L1678P | NOTCH1 | Notch1 preproprotein | nsSNV | D | D | 0.31 | 0.29 | 9.5 | 7.8 |
| ALL_559 | 15 | 41165511 | c.456C>T | p.N152N | RHOV | ras homolog gene family, member V | sSNV | NA | NA | 0.22 | NA | 0.1 | 0.1 |
| ALL_559 | 17 | 4536237 | c.1459G>A | p.V487M | ALOX15 | Arachidonate 15-lipoxygenase | nsSNV | D | P | 0.45 | NA | 0.0 | 0.0 |
| ALL_559 | 19 | 13041102 | c.438_439insG | p.G146fs | FARSA | Phenylalanyl-tRNA synthetase, alpha subunit | frameshift ins | NA | NA | 0.62 | 0.36 | 8.8 | 9.5 |
| ALL_707 | 12 | 25398281 | c.38G>A | p.G13D | KRAS | c-K-ras2 protein isoform a precursor | nsSNV | D | NA | 0.38 | 0.25 | 14.5 | 10.9 |
| ALL_707 | 20 | 10030811 | c.1594G>T | p.A532S | ANKEF1 | Ankyrin repeat and EF-hand domain containing 1 | nsSNV | D | D | 0.39 | NA | 0.1 | 0.2 |
| ALL_707 | 21 | 42609582 | c.544G>T | p.E182X | BACE2 | Beta-site APP-cleaving enzyme 2 isoform A | nonsense SNV | NA | NA | 0.41 | 0.00 | NA | NA |
| ALL_501 | X | 151092993 | c.857A>G | p.K286R | MAGEA4 | Melanoma antigen family A, 4 | nsSNV | T | P | 0.32 | NA | 0.0 | 0.0 |
| ALL_501 | X | 48382171 | c.12C>T | p.N4N | EBP | Emopamil binding protein (sterol isomerase) | sSNV | NA | NA | 0.46 | 0.00 | 30.4 | 16.0 |
| ALL_501 | 1 | 157659674 | c.1724C>T | p.A575V | FCRL3 | Fc receptor-like 3 precursor | nsSNV | T | B | 0.31 | 0.00 | 0.6 | 0.7 |
| ALL_501 | 2 | 61711139 | c.2610G>C | p.Q870H | XPO1 | exportin 1 | nsSNV | T | B | 0.46 | 0.48 | 115.1 | 107.1 |
| ALL_501 | 11 | 112123109 | c.410C>T | p.A137V | PLET1 | Hypothetical protein LOC349633 precursor | nsSNV | T | B | 0.38 | 0.00 | 0.2 | 0.2 |
| ALL_501 | 12 | 49426841 | c.11647_11648insGCTC | p.H3883fs | KMT2D | Lysine (K)-specific methyltransferase 2D | frameshift ins | NA | NA | 0.60 | 0.32 | 5.2 | 15.4 |
Nucleotide numbering uses +1 as the A of the ATG translation initiation codon in the reference sequence, with the initiation codon as codon 1.
SIFT predictions: D, damaging; T, tolerated.
PolyPhen2 (PP2) predictions: D, probably damaging; P, possibly damaging; B, benign.
AF from deep-sequencing data. The AF for the SNV in TAS2R30 is from WGS data, as this SNV was not covered in the deep-sequencing data.
The shown expression value represents the mean of 27 BCP-ALL (for ALL_458, ALL_707 and ALL_501) or 18 T-ALL (for ALL_559) samples.
Chr, chromosome; nsSNV, nonsynonymous SNV; sSNV, synonymous SNV; ins, insertion; AF, allele fraction; FPKM: fragments per kilobase of transcript per million mapped reads.
Somatic Copy Number Alterations in Whole Genome Sequenced ALL Patients
| Sample | Subtype | Chr | Start | End | Size (kb) | Type | Cytoband | Cytogenetic prediction | Affected genes | FPKM sample | FPKM control |
|---|---|---|---|---|---|---|---|---|---|---|---|
| ALL_458 | t(12;21) | 10 | 1 | 135534747 | 135,535 | Dup* | all chr 10 | NA | 691 genes | 21.9 | 13.3 |
| ALL_458 | t(12;21) | 11 | 96600814 | 134944379 | 38,344 | Del* | q21-q25 | NA | 261 genes | 11 | 10.6 |
| ALL_458 | t(12;21) | 12 | 11253986 | 13357791 | 2,104 | Del | p13.2-p13.1 | p13 | 25 genes including ETV6 | 3.1 | 6.5 |
| ALL_458 | t(12;21) | 12 | 68835364 | 69203100 | 368 | Del | q15 | NA | MDM2, NUP107, RAP1B, SLC35E3 | 85.4 | 87.6 |
| ALL_458 | t(12;21) | 22 | 22569501 | 22600200 | 31 | Del | q11.22 | NA | VPREB1 | 39.3 | 72.3 |
| ALL_559 | T-ALL | 3 | 70233601 | 74494601 | 4,261 | Del | p13-p12.3 | NA | 11 genes including FOXP1, RYBP, SHQ1 | 6 | 9.8 |
| ALL_559 | T-ALL | 9 | 46587 | 21295942 | 21,249 | LOH | p24.3-p21.3 | NA | 74 genes | 46.5 | 28.5 |
| ALL_559 | T-ALL | 9 | 21300000 | 22100401 | 800 | Del | p21.3 | p21×2 | 11 genes including CDKN2A, CDKN2B | 0.1 | 0.9 |
| ALL_559 | T-ALL | 9 | 22110997 | 32243981 | 10,133 | LOH | p21.3-p21.1 | NA | 13 genes | 1.5 | 1.5 |
| ALL_707 | Other | 9 | 197421 | 21980801 | 21,783 | Del | p24.3-p21.3 | p21-p24 | 82 genes | 25.7 | 19.9 |
| ALL_707 | Other | 9 | 21980802 | 22021001 | 40 | Del | p21.3 | p21-p24 | CDKN2A, CDKN2B | 0 | 1.2 |
| ALL_707 | Other | 9 | 22021002 | 36908001 | 14,887 | Del | p21.3-p13.2 | p21-p24 | 92 genes | 28.3 | 34.9 |
| ALL_707 | Other | 19 | 58519186 | 59089786 | 571 | Del | q13.43 | q13 | 23 genes | 18.9 | 16.8 |
| ALL_501 | Normal | 2 | 89165816 | 89554016 | 388 | Del | p11.2 | NA | NA | NA | NA |
| ALL_501 | Normal | 7 | 50412894 | 50463634 | 51 | Del | p12.2 | NA | IKZF1 | 84.6 | 71.1 |
The size represents the minimal overlap between different predictions. The breakpoints of the IKZF1 deletion were determined by analysis of softclipped reads in the WGS data.
A * indicates that the CNA is subclonal.
Results of cytogenetic analysis at diagnosis. NA indicates that no aberration was observed in the region by cytogenetic analysis.
Mean expression of genes located within the CNA in the sample harboring the CNA.
Mean expression of genes located within the CNA in the control data set, where each gene is represented by the mean of 27 BCP-ALL (for ALL_458, ALL_707 and ALL_501) or 18 T-ALL (for ALL_559) samples.
Chr, chromosome; Dup, duplication; Del, deletion; LOH, loss of heterozygosity; FPKM, fragments per kilobase of transcript per million mapped reads; CNA, copy number alteration.
Figure 2A: Density plot showing the allele fraction (AF) distribution of validated somatic single nucleotide variants (SNVs) in the four whole genome sequenced ALL patients. Each sample displays a density peak with AF between 0.41 and 0.46. In addition, ALL_458 and ALL_559 display density peaks with AF of 0.19 and 0.27, respectively, indicative of subclones. B: Mutational patterns of SNVs belonging to the original clone and the subclone for ALL_458 and ALL_559. C: Subclonal copy number alterations (CNAs) in ALL_458 visualized using Omni2.5 BeadChip data. The top panel shows the log R ratio (LRR) and the bottom panel shows the B-allele frequency (BAF) for the duplication of chromosome 10 (left) and the deletion of chromosome 11q21–25 (right). R corresponds to the total intensity of each probe, and LRR is the log2 of the ratio of the measured normalized R-value in the ALL sample and the normalized R-value of the reference. LRR = 0 indicates no change in copy number. BAF represents the AF, with values of 0 and 1 indicating homozygosity and 0.5 indicating heterozygosity in a diploid genomic region. The solid and dashed lines correspond to the estimated BAF if the CNA is present in 40% and 100% of the leukemic cells, respectively. The red dots show the genomic location of somatic SNVs, with the position on the y-axis corresponding to the AF.
Figure 3Bar plots showing the expression of five genes up- and downstream of genes involved in chromosomal rearrangements. The gene involved in the fusion, if annotated, is highlighted in bold. Genes exhibiting a fourfold relative difference and an absolute difference of at least 3 standard deviations between the sample and the control are marked with a *. A and B: Expression changes in ALL_559 associated with the fusions of TRBC2 with RIC3 (A) and a non-annotated gene close to TMEM38B (B). LMO1, TUB, and TAL2 are overexpressed in ALL_559 compared with 18 T-ALL samples. C and D: Expression changes in ALL_707 associated with the PAX5-ELN fusion. ELN, which constitutes the major part of the fusion gene, is overexpressed in ALL_707 compared with 27 BCP-ALL samples suggesting that the PAX5-ELN fusion gene is highly expressed.
Figure 4Recurrent somatic mutations detected in the validation cohort in the genes that were identified by WGS. Each column represents one patient, with the whole genome sequenced samples in the four leftmost columns. In the upper panel, each row represents one gene. Only samples and genes with at least one mutation in an exon, splice site, or untranslated region (UTR) in the validation cohort are shown. Each colored box indicates a mutation. For patients with more than one variant in the same gene, the color is prioritized according to the order shown in the legend. In the lower panel, the genetic subtype of each sample is shown.