| Literature DB >> 18987736 |
Timothy J Ley1, Elaine R Mardis, Li Ding, Bob Fulton, Michael D McLellan, Ken Chen, David Dooling, Brian H Dunford-Shore, Sean McGrath, Matthew Hickenbotham, Lisa Cook, Rachel Abbott, David E Larson, Dan C Koboldt, Craig Pohl, Scott Smith, Amy Hawkins, Scott Abbott, Devin Locke, Ladeana W Hillier, Tracie Miner, Lucinda Fulton, Vincent Magrini, Todd Wylie, Jarret Glasscock, Joshua Conyers, Nathan Sander, Xiaoqi Shi, John R Osborne, Patrick Minx, David Gordon, Asif Chinwalla, Yu Zhao, Rhonda E Ries, Jacqueline E Payton, Peter Westervelt, Michael H Tomasson, Mark Watson, Jack Baty, Jennifer Ivanovich, Sharon Heath, William D Shannon, Rakesh Nagarajan, Matthew J Walter, Daniel C Link, Timothy A Graubert, John F DiPersio, Richard K Wilson.
Abstract
Acute myeloid leukaemia is a highly malignant haematopoietic tumour that affects about 13,000 adults in the United States each year. The treatment of this disease has changed little in the past two decades, because most of the genetic events that initiate the disease remain undiscovered. Whole-genome sequencing is now possible at a reasonable cost and timeframe to use this approach for the unbiased discovery of tumour-specific somatic mutations that alter the protein-coding genes. Here we present the results obtained from sequencing a typical acute myeloid leukaemia genome, and its matched normal counterpart obtained from the same patient's skin. We discovered ten genes with acquired mutations; two were previously described mutations that are thought to contribute to tumour progression, and eight were new mutations present in virtually all tumour cells at presentation and relapse, the function of which is not yet known. Our study establishes whole-genome sequencing as an unbiased method for discovering cancer-initiating mutations in previously unidentified genes that may respond to targeted therapies.Entities:
Mesh:
Year: 2008 PMID: 18987736 PMCID: PMC2603574 DOI: 10.1038/nature07485
Source DB: PubMed Journal: Nature ISSN: 0028-0836 Impact factor: 49.962
Assessments of haploid and diploid coverage of the tumor and skin genomes from patient 933124.
| Tumor | Skin | |||
|---|---|---|---|---|
| Libraries | 4 | 3 | ||
| Runs | 98 | 34 | ||
| Reads obtained | 5,858,992,064 | 2,122,836,148 | ||
| Reads passing quality filter | 3,025,923,365 | 1,228,177,690 | ||
| Bases passing quality filter | 98,184,511,523 | 41,783,794,834 | ||
| Reads aligned by Maq | 2,729,957,053 | 1,080,576,680 | ||
| Reads unaligned by Maq | 295,966,312 | 138,276,594 | ||
|
| ||||
| SNVs detected with respect to hg18 (no Y) | 3,811,115 | 2,918,446 | ||
| SNVs (chr 1–22) detected with respect to hg18 | 3,681,968 | (100.0%) | 2,830,292 | (100.0%) |
| SNVs also present in dbSNP | 2,368,458 | (64.3%) | 2,161,695 | (76.4%) |
| SNVs also present in Venter genome | 1,499,010 | (40.7%) | 1,383,431 | (48.9%) |
| SNVs also present in Watson genome | 1,573,435 | (42.7%) | 1,456,822 | (51.5%) |
| SNVs not in dbSNP/Venter/Watson | 1,223,830 | (33.2%) | 591,131 | (20.9%) |
| SNVs not in dbSNP/Venter/Watson/skin | 925,200 | (25.1%) | – | |
|
| ||||
| HQ SNPs | 46,494 | (100.0%) | 46,572 | (100.0%) |
| HQ SNPs where reference allele is detected | 42,419 | (91.2%) | 38,454 | (82.6%) |
| HQ SNPs where variant allele is detected | 43,164 | (92.9%) | 39,220 | (84.2%) |
| HQ SNPs where both alleles are detected | 42,415 | (91.2%) | 38,454 | (82.6%) |
Figure 1Overlap of SNPs detected in 933124 and other genomes
(A) Venn diagram of overlap between SNPs detected in the 933124 tumor genome and the genomes of Watson and Venter. (B) Venn Diagram of overlap among 933124 tumor genome, skin genome, and dbSNP (ver. 127). Single nucleotide variants were defined with a MAQ SNP quality ≥ 15.
Figure 2Filters used to identify somatic point mutations in the tumor genome
See text for details.
Somatic mutations detected in the de novo and relapse AML samples.
| Gene | Consequence | Type | Solexa Tumor Reads WT:Variant | Solexa Skin Reads WT:Variant | Conservation Score of mutant base | Mutations in other AML cases |
|---|---|---|---|---|---|---|
| CDH24 | Y590X | nonsense | 9:9 | 16:0 | 0.998 | 0/187 |
| SLC15A1 | W77X | nonsense | 15:12 | 19:0 | 1.000 | 0/187 |
| KNDC1 | L799F | missense | 7:8 | 20:0 | NA | 0/187 |
| PTPRT | P1235L | missense | 9:13 | 16:0 | 1.000 | 0/187 |
| GRINL1B | R176H | missense | 15:10 | 14:0 | NA | 0/187 |
| GPR123 | T38I | Missense | 11:11 | 13:0 | NA | 0/187 |
| EBI2 | A338V | Missense | 7:12 | 18:2 | 1.000 | 0/187 |
| PCLKC | P1004L | missense | 19:9 | 15:1 | 0.98 | 0/187 |
| FLT3 | ITD | indel | 18:12 | 8:0 | NA | 51/185 |
| NPM1 | CATG insertion | indel | 36:6 | 33:0 | NA | 43/180 |
patient cohort defined in Link, et al23.
Figure 3Summary of Roche/454 FLX readcount data obtained for 10 somatic mutations and 2 validated SNPs in the primary tumor, relapse tumor, and skin specimens
The readcount data for the variant alleles in the primary tumor sample and relapse tumor sample are statistically different than that of the skin sample for all mutations (p<0.000001 for all mutations, Fisher’s exact test, denoted by a single asterisk in all cases). Note that the normal skin sample was contaminated with leukemic cells containing the somatic mutations. The patient’s WBC count was 105,000 (85% blasts) when the skin punch biopsy was obtained.
454 Readcount data for Somatic Mutations and known SNPs in primary tumor, skin, and relapse tumor samples.
| Primary AML
| Skin | Relapse (78% blasts) | ||||||||
|---|---|---|---|---|---|---|---|---|---|---|
| Gene | Variant | Variant | Ref | % Variant | Variant | Ref | % Variant | Variant | Ref | % Variant |
| CDH24 | Y590X | 5672 | 4890 | 53.70 | 564 | 10358 | 5.16 | 3108 | 4599 | 40.33 |
| SLC15A1 | W77X | 3817 | 4962 | 43.48 | 875 | 10773 | 7.51 | 4714 | 7173 | 39.66 |
| KNDC1 | L799F | 4640 | 4848 | 48.90 | 770 | 8972 | 7.90 | 3883 | 6342 | 37.98 |
| PTPRT | P1235L | 998 | 1058 | 48.54 | 126 | 1489 | 7.80 | 350 | 493 | 41.52 |
| GRINL1B | R176H | 2211 | 2674 | 45.26 | 318 | 4461 | 6.65 | 1447 | 2070 | 41.14 |
| GPR123 | T38I | 4618 | 4569 | 50.27 | 850 | 9751 | 8.02 | 3660 | 6057 | 37.67 |
| EBI2 | A338V | 12750 | 15453 | 45.21 | 458 | 10088 | 4.34 | 2646 | 3627 | 42.18 |
| PCLKC | P1004L | 9216 | 8815 | 51.11 | 6617 | 21786 | 23.29 | 8600 | 8822 | 49.36 |
| FLT3 | ITD | 4220 | 7810 | 35.08 | 3475 | 23159 | 13.05 | 3870 | 8495 | 31.30 |
| NPM1 | CATG ins | 1550 | 1974 | 43.98 | 143 | 2390 | 5.65 | 2303 | 3910 | 37.07 |
|
| ||||||||||
| BRCA2 | N372H | 778 | 752 | 50.85 | 763 | 876 | 46.55 | 285 | 303 | 48.47 |
| TP53 | P72R | 8989 | 1 | 99.99 | 8161 | 0 | 100.00 | 7914 | 6 | 99.92 |
The differences between variant frequencies in primary or relapse tumor samples and skin were highly significant for all somatic mutations (p<0.000001, Fisher’s exact test, one tailed). The BRCA2 variant is a known heterozygous SNP in this genome, and the TP53 variant is a known homozygous SNP.