| Literature DB >> 24409309 |
Johanna Hasmats1, Henrik Gréen2, Cedric Orear3, Pierre Validire4, Mikael Huss5, Max Käller1, Joakim Lundeberg1.
Abstract
Exome sequence capture and massively parallel sequencing can be combined to achieve inexpensive and rapid global analyses of the functional sections of the genome. The difficulties of working with relatively small quantities of genetic material, as may be necessary when sharing tumor biopsies between collaborators for instance, can be overcome using whole genome amplification. However, the potential drawbacks of using a whole genome amplification technology based on random primers in combination with sequence capture followed by massively parallel sequencing have not yet been examined in detail, especially in the context of mutation discovery in tumor material. In this work, we compare mutations detected in sequence data for unamplified DNA, whole genome amplified DNA, and RNA originating from the same tumor tissue samples from 16 patients diagnosed with non-small cell lung cancer. The results obtained provide a comprehensive overview of the merits of these techniques for mutation analysis. We evaluated the identified genetic variants, and found that most (74%) of them were observed in both the amplified and the unamplified sequence data. Eighty-nine percent of the variations found by WGA were shared with unamplified DNA. We demonstrate a strategy for avoiding allelic bias by including RNA-sequencing information.Entities:
Mesh:
Year: 2014 PMID: 24409309 PMCID: PMC3883664 DOI: 10.1371/journal.pone.0084785
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Patient characteristics.
| Patient | Gender | Chemo | Relapse | pT | pN | pM | Cancersubtype | Smoking status | UnamplifiedDNA (Tumor) | WGA DNA(Normal/Tumor) | RNA (Tumor) |
|
| Female | – | Yes | 2 | 0 | 0 | AC | Current | ✓ | ✓ | ✓ |
|
| Female | – | No | 2 | 0 | 0 | AC | Current | ✓ | ✓ | |
|
| Male | – | No | 2 | 0 | 1 | AC | Former | ✓ | ✓ | ✓ |
|
| Male | Cisplatin/Vinorelbine | Yes | 4 | 1 | 0 | AC | Former | ✓ | ✓ | |
|
| Female | Cisplatin/Vinorelbine | Yes | 1 | 2 | 0 | AC | Former | ✓ | ✓ | ✓ |
|
| Female | Cisplatin/Vinorelbine | No | 2 | 2 | 0 | AC | Current | ✓ | ✓ | ✓ |
|
| Female | Cisplatin/Vinorelbine | Yes | N/A | N/A | 0 | AC | Current | ✓ | ✓ | |
|
| Male | – | Yes | 2 | 2 | 0 | AC | Former | ✓ | ✓ | ✓ |
|
| Male | Cisplatin/Vinorelbine | No | 2 | 2 | 0 | AC | Current | ✓ | ✓ | ✓ |
|
| Female | – | Yes | 2 | 0 | 0 | AC | Former | ✓ | ✓ | ✓ |
|
| Male | Cisplatin/Vinorelbine | Yes | 2 | 2 | 0 | AC | Former | ✓ | ✓ | |
|
| Male | Cisplatin/Vinorelbine | No | 3 | 1 | 0 | AC | Former | ✓ | ✓ | ✓ |
|
| Male | – | Yes | 2 | 0 | 0 | AC | Current | ✓ | ✓ | ✓ |
|
| Male | – | No | 2 | 1 | 0 | SCC | Current | ✓ | ✓ | ✓ |
|
| Female | Cisplatin/Vinorelbine | Yes | 1 | 2 | 0 | AC | Current | ✓ | ✓ | ✓ |
|
| Female | – | No | 1 | 1 | 0 | AC | Current | ✓ | ✓ |
AC = Adenocarcinoma.
SCC = Squamous cell carcinoma.
pT = Postsurgical histopathological classification of primary tumour.
pN = Postsurgical histopathological classification of regional node.
pM = Postsurgical histopathological classification of distant metastasis.
Summary of variant calling following DNA sequencing.
| WGA tumor | TotalHetero | TotalHomo | RatioHetero/homo | Transition/Transversion rate | Coverage intarget regions | Totalreads |
| 118(T) | 7359 | 4136 | 1.78 | 3.42 | 39.67 | 35.55 M |
| 127(T) | 8068 | 4627 | 1.74 | 3.32 | 72.28 | 37.43 M |
| 140(T) | 7904 | 4411 | 1.79 | 3.25 | 59.33 | 34.73 M |
| 146(T) | 8104 | 4247 | 1.91 | 3.23 | 61.05 | 34.47 M |
| 210(T) | 7806 | 4333 | 1.80 | 2.96 | 53.72 | 54.23 M |
| 225(T) | 8088 | 4258 | 1.90 | 2.91 | 83.70 | 43.54 M |
| 247(T) | 7091 | 3498 | 2.03 | 2.58 | 24.65 | 19.77 M |
| 255(T) | 7387 | 4215 | 1.75 | 3.29 | 62.69 | 33.76 M |
| 278(T) | 8328 | 4612 | 1.81 | 3.11 | 107.10 | 57.96 M |
| 295(T) | 7517 | 4451 | 1.69 | 3.24 | 48.06 | 28.13 M |
| 322(T) | 9019 | 4173 | 2.16 | 2.53 | 49.57 | 19.14 M |
| 344(T) | 8107 | 4300 | 1.89 | 3.08 | 59.31 | 29.42 M |
| 396(T) | 7408 | 4169 | 1.78 | 3.06 | 31.25 | 35.00 M |
| 412(T) | 8755 | 4752 | 1.84 | 2.84 | 98.08 | 45.19 M |
| 421(T) | 8552 | 4489 | 1.91 | 2.95 | 81.62 | 37.04 M |
| 541(T) | 9669 | 4511 | 2.14 | 3.11 | 35.20 | 37.75 M |
|
| 8073 | 4324 | 1.87 | 3.06 | 60.46 | 36.44 M |
|
| 683 | 286 | 0.14 | 0.25 | 23.45 | 10.40 |
|
| ||||||
| 118(T) | 9006 | 4823 | 1.87 | 2.95 | 48.39 | 26.08 M |
| 127(T) | 14029 | 3152 | 4.45 | 3.27 | 87.78 | 32.40 M |
| 140(T) | 8725 | 4875 | 1.79 | 3.16 | 64.98 | 35.76 M |
| 146(T) | 15193 | 3123 | 4.86 | 3.18 | 85.07 | 29.08 M |
| 210(T) | 8860 | 5117 | 1.73 | 3.17 | 79.28 | 26.47 M |
| 225(T) | 14270 | 3081 | 4.63 | 3.30 | 73.46 | 24.50 M |
| 247(T) | 8879 | 5066 | 1.75 | 3.22 | 108.08 | 53.61 M |
| 255(T) | 8743 | 4891 | 1.79 | 3.26 | 81.93 | 42.86 M |
| 295(T) | 8717 | 5067 | 1.72 | 3.03 | 54.27 | 110.79 M |
| 278(T) | 8656 | 4998 | 1.73 | 3.21 | 88.89 | 47.00 M |
| 322(T) | 8650 | 4991 | 1.73 | 3.11 | 73.08 | 37.62 M |
| 344(T) | 14161 | 3006 | 4.71 | 3.15 | 77.05 | 26.14 M |
| 396(T) | 7627 | 4455 | 1.71 | 3.27 | 36.48 | 17.41 M |
| 412(T) | 8473 | 4940 | 1.72 | 3.21 | 73.48 | 38.37 M |
| 421(T) | 9085 | 4977 | 1.83 | 3.14 | 81.82 | 45.31 M |
| 541(T) | 10900 | 5329 | 2.05 | 3.17 | 73.85 | 37.67 M |
|
| 10248 | 4493 | 2.50 | 3.18 | 74.24 | 39.44 M |
|
| 2574 | 855 | 1.29 | 0.09 | 17.11 | 21.30 |
|
| ||||||
| 118(N) | 7476 | 4154 | 1.80 | 3.43 | 73.07 | 38.26 M |
| 127(N) | 9290 | 4619 | 2.01 | 2.64 | 101.20 | 23.63 M |
| 140(N) | 7288 | 3984 | 1.83 | 3.41 | 59.76 | 35.17 M |
| 146(N) | 8292 | 4613 | 1.80 | 3.32 | 67.14 | 37.40 M |
| 210(N) | 7894 | 4334 | 1.82 | 3.13 | 48.37 | 39.87 M |
| 225(N) | 11154 | 4935 | 2.26 | 2.07 | 127.39 | 75.50 M |
| 247(N) | 8925 | 4940 | 1.81 | 3.27 | 123.26 | 37.12 M |
| 255(N) | 8560 | 4771 | 1.79 | 3.25 | 83.78 | 26.35 M |
| 295(N) | 6506 | 3414 | 1.91 | 3.18 | 27.59 | 29.18 M |
| 278(N) | 6961 | 3966 | 1.76 | 3.10 | 30.67 | 28.21 M |
| 322(N) | 7747 | 4610 | 1.68 | 3.29 | 56.95 | 25.81 M |
| 344(N) | 8082 | 4298 | 1.88 | 3.09 | 60.13 | 17.95 M |
| 396(N) | 7276 | 4035 | 1.80 | 3.18 | 27.70 | 34.08 M |
| 412(N) | 8010 | 4156 | 1.93 | 2.84 | 38.52 | 26.65 M |
| 421(N) | 8561 | 4579 | 1.87 | 3.11 | 64.07 | 30.20 M |
| 541(N) | 10411 | 4840 | 2.15 | 3.03 | 78.53 | 46.44 M |
|
| 8277 | 4391 | 1.88 | 3.08 | 66.76 | 34.49 M |
|
| 1224 | 423 | 0.15 | 0.34 | 30.79 | 13.08 |
Figure 1Genetic variants unique or shared between analysis.
A) Venn diagram illustrating the distribution of SNVs for patient 140. Unique variants found by WGA are represented in light blue, and unique variants found without amplification are shown in light green. Shared variants identified by both methods are shown in green. B) Boxplot of the coverage of genetic variants found uniquely by WGA, without amplification, and with both methods for patient 140. The two leftmost boxes represent shared variant calls with coverage in those positions for WGA and without amplification, respectively. The two rightmost boxes represent the coverage over unique positions for each method. C) Venn diagram illustrating the distribution of SNVs for patient 295. Unique variants found by WGA are represented in light blue, and unique variants found without amplification are shown in light green. Shared variants identified by both methods are shown in green. D) Boxplot of the coverage of genetic variants found uniquely by WGA, without amplification, and with both methods for patient 295. The two leftmost boxes represent shared variant calls with coverage in those positions for WGA and without amplification, respectively. The two rightmost boxes represent the coverage over unique positions for each method.
Summary data for SNVs in positions identified in both WGA and unamplified sequence data, and using only one of the two methods.
| Patient-ID | Number of SNV | ||
| WGA | Shared | Unamplified | |
| 118 | 722 (6%) | 10985 (73%) | 3260 (23%) |
| 140 | 759 (6%) | 11781 (81%) | 1953 (14%) |
| 210 | 1099 (9%) | 11413 (75%) | 2715 (19%) |
| 247 | 1964 (17%) | 9483 (59%) | 4591 (33%) |
| 255 | 606 (5%) | 11162 (78%) | 2615 (19%) |
| 278 | 861 (7%) | 12250 (84%) | 1535 (11%) |
| 295 | 650 (5%) | 11464 (77%) | 2716 (19%) |
| 322 | 3866 (26%) | 10821 (61%) | 3023 (22%) |
| 396 | 2076 (17%) | 10151 (71%) | 2119 (17%) |
| 412 | 1559 (11%) | 12216 (81%) | 1374 (10%) |
| 421 | 1204 (9%) | 12108 (78%) | 2133 (15%) |
| 541 | 1398 (9%) | 13350 (75%) | 3096 (19%) |
|
|
|
|
|
|
|
|
|
|
Figure 2Example of coverage representation in unamplified and whole genome amplified DNA in patient 295 across the SPINK1 gene.
Figure 3The relationship between variant and total reads per position depending on analysis.
A) The relationship between the numbers of variant reads divided by total reads for SNVs identified by sequencing WGA and unamplified DNA per position in patient 140. B) The relationship between the numbers of variant reads divided by total reads for SNVs identified by sequencing WGA and unamplified DNA per position in patient 295.
Comparison of unique and tumor specific variants in unamplified DNA, WGA DNA and RNA.
| PatientID | % of uniqueunamplified withcoverage <10 inWGA bamfile | % of unique WGAwith coverage <10 inunamplified bamfile | % unique WGAin RNAwith coverage <10 | WGA tumorspecific | % tumor specificconfirmed inunamplified | % WGA tumor specific inRNA (bam) withcoverage <10 |
| 118 | 71.7 | 57.6 | 69.9 | 1080 | 70.4 | 68.9 |
| 140 | 72.7 | 57.7 | 65.0 | 1644 | 77.9 | 64.2 |
| 210 | 84.8 | 21.8 | 62.5 | 630 | 33.3 | 59.4 |
| 247 | 90.9 | 5.7 | – | 2095 | 13.8 | – |
| 255 | 81.6 | 43.1 | 65.0 | 698 | 62.9 | 63.0 |
| 295 | 79.7 | 43.7 | 100.0 | 3510 | 81.5 | 100.0 |
| 278 | 73.3 | 50.9 | 63.4 | 1819 | 82.1 | 58.6 |
| 322 | 79.8 | 9.4 | – | 5377 | 11.4 | – |
| 396 | 79.7 | 53.6 | 64.9 | 1778 | 39.0 | 63.9 |
| 412 | 72.3 | 32.4 | 58.9 | 2580 | 60 | 60.5 |
| 421 | 78.3 | 27.2 | 100.0 | 994 | 38.1 | 100.0 |
| 541 | 82.0 | 25.9 | – | 1098 | 32.2 | – |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|