| Literature DB >> 34753956 |
Yongmei Zhao1, Li Tai Fang2, Tsai-Wei Shen3, Sulbha Choudhari3, Keyur Talsania3, Xiongfong Chen3, Jyoti Shetty4, Yuliya Kriga4, Bao Tran4, Bin Zhu5, Zhong Chen6, Wanqiu Chen6, Charles Wang6, Erich Jaeger7, Daoud Meerzaman8, Charles Lu9, Kenneth Idler9, Luyao Ren10, Yuanting Zheng10, Leming Shi10, Virginie Petitjean11, Marc Sultan11, Tiffany Hung12, Eric Peters12, Jiri Drabek13,14, Petr Vojta13,14, Roberta Maestro14,15, Daniela Gasparotto14,15, Sulev Kõks14,16,17, Ene Reimann14,18, Andreas Scherer14,19, Jessica Nordlund14,20, Ulrika Liljedahl14,20, Jonathan Foox21, Christopher E Mason21, Chunlin Xiao22, Huixiao Hong23, Wenming Xiao24.
Abstract
With the rapid advancement of sequencing technologies, next generation sequencing (NGS) analysis has been widely applied in cancer genomics research. More recently, NGS has been adopted in clinical oncology to advance personalized medicine. Clinical applications of precision oncology require accurate tests that can distinguish tumor-specific mutations from artifacts introduced during NGS processes or data analysis. Therefore, there is an urgent need to develop best practices in cancer mutation detection using NGS and the need for standard reference data sets for systematically measuring accuracy and reproducibility across platforms and methods. Within the SEQC2 consortium context, we established paired tumor-normal reference samples and generated whole-genome (WGS) and whole-exome sequencing (WES) data using sixteen library protocols, seven sequencing platforms at six different centers. We systematically interrogated somatic mutations in the reference samples to identify factors affecting detection reproducibility and accuracy in cancer genomes. These large cross-platform/site WGS and WES datasets using well-characterized reference samples will represent a powerful resource for benchmarking NGS technologies, bioinformatics pipelines, and for the cancer genomics studies.Entities:
Mesh:
Year: 2021 PMID: 34753956 PMCID: PMC8578599 DOI: 10.1038/s41597-021-01077-5
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Fig. 1Study design for the experiment. DNA was extracted from either fresh cells or FFPE processed cells. Both fresh DNA and FFPE DNA were profiled on WGS and WES platforms for intra-center, inter-center and cross-platform reproducibility benchmarking. For fresh DNA, six centers performed WGS and WES in parallel following manufacture recommended protocols with limited deviation. Three library preparation protocols (TruSeq-Nano, Nextera Flex, and TruSeq PCR-free,) were used with four different quantities of DNA inputs (1, 10, 100, and 250 ng). DNA from HCC1395 and HCC1395BL was pooled at various ratios to create mixtures of 75%, 50%, 20%, 10%, and 5%. For FFPE samples, each fixation time point (1h, 2 h, 6 h, 24 h) had six blocks that were sequenced at two different centers. All libraries from these experiments were sequenced on the HiSeq series. In addition, nine libraries using the TruSeq PCR-free preparation were run on a NovaSeq for WGS analysis.
Summary of all experiment data including WGS, WES discovery and validation data sets used in the study.
| Study Design | Technology | Library Prep (input amount) | Platform | Sequecing | Number of Reads (coverage) | |
|---|---|---|---|---|---|---|
| HCC1395 | HCC1395BL | |||||
| Discovery | WGS (Fresh DNA) | TruSeq DNA PCR Free (1000 ng) | HiSeq | 6 centers (24 libraries) | 21 billion (750X) | 21 billion (750X) |
| TruSeq DNA PCR Free (1000 ng) | NovaSeq | 1 center (18 libraries) | 13 billion (400X) | 13 billion (400X) | ||
| WES (Fresh DNA) | Agilent SureSelect Human All Exon v6 + UTRs | HiSeq | 6 centers (24 libraries) | 3 billion (2,500X) | 3 billion (2,500X) | |
| Validation | WGS Protocol & Input Amount | TruSeq-DNA-Nano (1, 10, 100 ng), Nextera flex (1, 10, 100 ng), TruSeq PCR free (250 ng) | HiSeq | 2 center (14 libraries) | 9 billion (315X) | 9 billion (315X) |
| WGS Tumor Content | TruSeq DNA PCR Free (1000 ng) | HiSeq | 1 center (21 libraries) | 64 billion (Mixture of samples, total 2300X) | ||
| WGS FFPE | NEBNext Ultra II (200 ng) | HiSeq | 2 center (26 libraries) | 30 billion (980X) | 27 billion (900X) | |
| WGS 10x Linked-Read | 10X Chromium Genome Library Prep v2 (1250 ng) | 10X Genomics | 5 centers (22 libraries) | 20 billion (880X) | 20 billion (880X) | |
| WGS PacBio | Preparing >30 kbp SMRTbell Libraries | PacBio | 1 center (2 libraries) | 19 million (40X) | 22 million (44X) | |
| WES FFPE | Agilent SureSelect Human All Exon v6 + UTRs | HiSeq | 2 centers (17 libraries) | 3 billion (2600X) | 4 billion (3600X) | |
| WES | Agilent SureSelect Human All Exon v6 + UTRs | Ion Torrent | 1 centers (2 libraries) | 67 million (34X) | 82 million (47X) | |
| AmpliSeq | AmpliSeq Targeted Amplicon Panel | MiSeq | 1 center (2 libraries) | 25 million (2900x) | 22 million (3300x) | |
| Microarray | AffyChip CytoScan | AffyChip CytoScan HD | 1 center (2 libraries) | 2.1 million probes | ||
| Single Cell CNV | 10x Chromium Single Cell CNV Solution | HiSeq | 1 center (2 libraries) | 1.5 billion (1465 cells) | 1.3 billion (983 cells) | |
The table columns describe the data sets generated for either Discovery or Validation purpose of the study. For each experiment, the input DNA bio-sample preservation types and sequencing platforms are specified. There are total six sequencing centers for cross-site study. The total number of librarys are listed for HCC1395 and HCC1395BL together while the read coverages for each cell line are listed separately. For the QC statistics for each data set, please reference the Online-Only Tables 1–10 for details.
Whole Genome Sequencing (WGS) data sets for fresh DNA extracted from HCC1395BL and HCC1395 cell lines.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol (Input amount) | Total Reads | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads (Mapped Trimmed) | Mean Coverage Depth | Mean Coverage Depth SD | Percent of Coverage > = 5X | Percent of Coverage > = 15X | Percent of Coverage > = 30X | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| WGS_IL_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,922,974,720 | 1,921,659,788 | 99.93 | 1,913,037,464 | 99.55 | 95.20 | 58.98 | 351.92 | 93.68 | 93.12 | 87.97 | 39.70 | 419 |
| WGS_IL_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,552,485,544 | 1,551,295,230 | 99.92 | 1,546,324,619 | 99.68 | 93.78 | 50.04 | 370.95 | 93.64 | 92.41 | 82.08 | 39.22 | 393 |
| WGS_IL_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,672,746,794 | 1,672,206,086 | 99.97 | 1,664,765,047 | 99.56 | 94.50 | 50.79 | 384.54 | 93.66 | 92.53 | 82.59 | 39.18 | 402 |
| WGS_IL_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 2,152,864,926 | 2,151,176,006 | 99.92 | 2,141,479,936 | 99.55 | 95.11 | 67.00 | 292.90 | 93.05 | 92.38 | 86.88 | 39.90 | 417 |
| WGS_IL_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,742,944,436 | 1,741,213,400 | 99.90 | 1,735,985,947 | 99.70 | 92.73 | 56.10 | 288.78 | 92.99 | 91.40 | 80.83 | 39.63 | 395 |
| WGS_IL_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,745,338,388 | 1,744,749,370 | 99.97 | 1,737,271,357 | 99.57 | 93.05 | 53.38 | 289.37 | 93.00 | 91.22 | 78.86 | 39.40 | 401 |
| WGS_EA_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,740,311,982 | 1,737,882,004 | 99.86 | 1,731,224,725 | 99.62 | 87.04 | 64.22 | 337.23 | 93.65 | 93.31 | 90.53 | 40.05 | 412 |
| WGS_EA_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,885,118,894 | 1,883,492,596 | 99.91 | 1,874,626,618 | 99.53 | 92.32 | 63.85 | 246.40 | 93.03 | 92.48 | 87.75 | 40.20 | 422 |
| WGS_NC_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,232,116,762 | 1,231,187,894 | 99.93 | 1,226,338,013 | 99.61 | 96.27 | 42.69 | 52.04 | 99.09 | 92.12 | 79.19 | 42.95 | 417 |
| WGS_NC_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | TruSeq DNA PCR Free (1000 ng) | 1,202,406,138 | 1,201,304,904 | 99.91 | 1,196,430,615 | 99.59 | 96.53 | 41.61 | 44.76 | 98.84 | 89.65 | 67.71 | 43.15 | 408 |
| WGS_LL_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 1,267,721,898 | 1,267,074,486 | 99.95 | 1,260,762,149 | 99.50 | 79.06 | 48.62 | 272.02 | 93.57 | 87.36 | 50.48 | 39.95 | 377 |
| WGS_LL_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 1,248,345,468 | 1,247,771,438 | 99.95 | 1,240,175,659 | 99.39 | 79.82 | 47.80 | 196.07 | 92.89 | 80.01 | 35.94 | 40.20 | 372 |
| WGS_NV_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,737,041,074 | 2,736,265,592 | 99.97 | 2,725,324,100 | 99.60 | 93.22 | 93.15 | 555.75 | 93.72 | 93.51 | 92.88 | 40.03 | 400 |
| WGS_NV_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,755,558,524 | 2,753,791,014 | 99.94 | 2,743,237,055 | 99.62 | 93.17 | 95.22 | 557.97 | 93.72 | 93.51 | 92.91 | 39.98 | 390 |
| WGS_NV_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,765,895,594 | 2,764,785,302 | 99.96 | 2,752,475,146 | 99.56 | 92.14 | 94.43 | 555.64 | 93.72 | 93.51 | 92.90 | 39.96 | 395 |
| WGS_NV_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,708,598,970 | 2,706,761,164 | 99.93 | 2,695,472,614 | 99.58 | 92.77 | 92.97 | 436.57 | 93.08 | 92.81 | 91.27 | 40.18 | 404 |
| WGS_NV_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,752,035,508 | 2,751,442,992 | 99.98 | 2,741,523,523 | 99.64 | 92.23 | 95.13 | 496.20 | 93.08 | 92.82 | 91.42 | 40.18 | 394 |
| WGS_NV_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,752,204,218 | 2,751,260,128 | 99.97 | 2,741,889,519 | 99.66 | 91.95 | 92.97 | 425.71 | 93.08 | 92.81 | 91.24 | 40.2 | 387 |
| WGS_FD_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,140,214,238 | 1,139,328,958 | 99.92 | 1,135,160,204 | 99.63 | 92.15 | 42.43 | 219.92 | 93.56 | 92.44 | 82.02 | 40.15 | 367 |
| WGS_FD_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,197,153,800 | 1,196,576,644 | 99.95 | 1,192,633,894 | 99.67 | 93.15 | 44.46 | 237.23 | 93.57 | 92.63 | 83.31 | 40.06 | 371 |
| WGS_FD_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,174,631,698 | 1,174,023,848 | 99.95 | 1,170,212,823 | 99.68 | 92.75 | 43.43 | 230.61 | 93.57 | 92.55 | 82.72 | 40.1 | 368 |
| WGS_FD_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,100,936,840 | 1,100,540,416 | 99.96 | 1,096,693,277 | 99.65 | 91.24 | 41.16 | 155.16 | 92.88 | 90.27 | 69.33 | 40.34 | 377 |
| WGS_FD_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,136,080,154 | 1,135,450,092 | 99.95 | 1,131,808,658 | 99.68 | 91.65 | 43.07 | 160.70 | 92.88 | 90.65 | 72.60 | 40.43 | 375 |
| WGS_FD_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | TruSeq DNA PCR Free (1000 ng) | 1,166,219,086 | 1,165,667,134 | 99.95 | 1,162,681,088 | 99.74 | 89.02 | 44.95 | 171.40 | 92.89 | 90.90 | 75.11 | 40.33 | 371 |
| WGS_NS_N_1 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,406,727,868 | 1,404,557,816 | 99.85 | 1,400,188,308 | 99.69 | 92.63 | 45.95 | 262.43 | 93.61 | 92.86 | 84.87 | 41.26 | 423 |
| WGS_NS_N_2 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,505,296,280 | 1,504,606,110 | 99.95 | 1,497,164,368 | 99.51 | 93.97 | 41.48 | 215.63 | 93.63 | 92.53 | 82.24 | 40.86 | 400 |
| WGS_NS_N_3 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,357,261,200 | 1,357,106,672 | 99.99 | 1,350,908,040 | 99.54 | 94.72 | 40.61 | 216.19 | 93.61 | 92.37 | 81.33 | 41.20 | 463 |
| WGS_NS_N_4 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,367,955,412 | 1,367,339,034 | 99.96 | 1,362,514,522 | 99.65 | 94.13 | 41.75 | 228.73 | 93.61 | 92.54 | 82.51 | 41.11 | 419 |
| WGS_NS_N_5 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,501,530,522 | 1,501,116,932 | 99.97 | 1,495,542,591 | 99.63 | 93.60 | 45.91 | 252.24 | 93.63 | 92.91 | 84.98 | 41.15 | 416 |
| WGS_NS_N_6 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,564,194,248 | 1,563,842,202 | 99.98 | 1,559,297,663 | 99.71 | 91.12 | 51.52 | 285.71 | 93.63 | 93.13 | 87.07 | 41.23 | 413 |
| WGS_NS_N_7 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,283,402,840 | 1,282,990,744 | 99.97 | 1,278,660,640 | 99.66 | 92.62 | 41.37 | 265.40 | 93.59 | 92.46 | 82.12 | 41.21 | 410 |
| WGS_NS_N_8 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,462,304,546 | 1,461,608,958 | 99.95 | 1,454,550,980 | 99.52 | 94.39 | 40.66 | 209.18 | 93.62 | 92.41 | 81.52 | 40.93 | 399 |
| WGS_NS_N_9 | HCC1395BL | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,470,980,018 | 1,470,257,498 | 99.95 | 1,463,536,841 | 99.54 | 94.01 | 40.94 | 209.21 | 93.63 | 92.45 | 81.76 | 40.90 | 401 |
| WGS_NS_T_1 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,695,377,648 | 1,693,184,204 | 99.87 | 1,688,102,934 | 99.70 | 91.64 | 55.97 | 235.94 | 93.00 | 92.10 | 85.08 | 41.46 | 422 |
| WGS_NS_T_2 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,261,325,698 | 1,260,860,796 | 99.96 | 1,254,572,158 | 99.50 | 94.60 | 34.93 | 136.34 | 92.90 | 88.65 | 56.06 | 41.20 | 401 |
| WGS_NS_T_3 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,417,458,666 | 1,417,283,958 | 99.99 | 1,411,500,925 | 99.59 | 94.34 | 42.23 | 186.34 | 92.97 | 90.56 | 70.59 | 41.30 | 481 |
| WGS_NS_T_4 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,842,018,210 | 1,839,790,050 | 99.88 | 1,832,403,023 | 99.60 | 93.14 | 56.30 | 224.01 | 93.03 | 92.17 | 85.38 | 41.40 | 424 |
| WGS_NS_T_5 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,252,160,470 | 1,251,737,302 | 99.97 | 1,247,854,937 | 99.69 | 93.80 | 38.70 | 164.96 | 92.92 | 89.82 | 64.15 | 41.39 | 416 |
| WGS_NS_T_6 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,708,185,366 | 1,707,737,216 | 99.97 | 1,702,648,733 | 99.70 | 91.74 | 56.14 | 253.64 | 93.01 | 92.13 | 85.25 | 41.40 | 416 |
| WGS_NS_T_7 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,565,451,318 | 1,565,121,762 | 99.98 | 1,560,619,578 | 99.71 | 92.10 | 51.27 | 245.10 | 92.99 | 91.78 | 82.23 | 41.36 | 407 |
| WGS_NS_T_8 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,228,386,838 | 1,227,945,398 | 99.96 | 1,223,091,592 | 99.61 | 94.87 | 33.97 | 133.30 | 92.89 | 88.27 | 53.79 | 41.22 | 391 |
| WGS_NS_T_9 | HCC1395 | Fresh cells | NovaSeq | NovaSeq 6000 | TruSeq DNA PCR Free (1000 ng) | 1,368,505,404 | 1,367,590,086 | 99.93 | 1,361,753,887 | 99.57 | 94.29 | 38.33 | 148.99 | 92.94 | 89.80 | 63.81 | 41.14 | 395 |
Libraries were made from TruSeq DNA PCR Free (1000 ng) library protocol and sequenced by six sequencing centers on Illumina HiSeq 3000/4000, HiSeq X10 and NovaSeq 6000 for cross-site comparision.
Single cell libraries were prepared using 10X Genomics Chromium Single Cell CNV Solution for CNV profiling.
| Sample | total num reads | percent bases R1 Q30 | percent bases R2 Q30 | correct bc rate | percent non cell barcode | shortest primary contig | percent mappable bins | num cells | total num reads in cells | total num mapped dedup reads in cells | median percent mapped duplicates per cell | mean mapped dedup reads per cell | median effective reads per 1Mbp | median unmapped frac | mean ploidy p25 | mean ploidy p50 | mean ploidy p75 | raw mapd p25 | raw mapd p50 | raw mapd p75 | normalized mapd p25 | normalized mapd p50 | normalized mapd p75 | normalized dimapd p25 | normalized dimapd p50 | normalized dimapd p75 | raw dimapd p25 | raw dimapd p50 | raw dimapd p75 | percent noisy cells | median est cnv resolution mb |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| TGEN_T1 | 1,260,880,054 | 0.93586572 | 0.84779037 | 0.87253147 | 0.03572204 | 46709983 | 0.90348588 | 1465 | 1060857656 | 870930774 | 0.05134489 | 594491.996 | 188 | 0.03222793 | 2.71181349 | 2.75221867 | 2.79193475 | 0.12499719 | 0.13813647 | 0.1566726 | 0.12499719 | 0.13813647 | 0.1566726 | 0.90367638 | 0.92666699 | 0.95082236 | 0.90367638 | 0.92666699 | 0.95082236 | 0.08395904 | 1.46801758 |
| TGEN_N1 | 1,308,946,506 | 0.94857232 | 0.86500184 | 0.86586948 | 0.05646006 | 46709983 | 0.90344055 | 983 | 1069386312 | 817062608 | 0.12479393 | 831192.887 | 255 | 0.02080845 | 1.89378524 | 1.90302472 | 1.92504924 | 0.11509274 | 0.13183257 | 0.15403324 | 0.11509274 | 0.13183257 | 0.15403324 | 0.92951399 | 0.9629042 | 1.0362302 | 0.92951399 | 0.9629042 | 1.0362302 | 0.26347915 | 1.28393555 |
Fig. 2Overall data quality for WGS and WES data sets from Illumina platform. (a) Percentage of total reads mapped to reference genome (hg38) for WGS (Green) and WES (Red) across 6 sequencing sites. (b) Mean coverage depth for WGS libraries across 6 sequencing sites. (c) Mean coverage depth in target capture regions for WES libraries across 6 sequencing sites. (d) Percentage of non-duplicated reads mapped to reference genome across 6 sequencing sites. WGS (Green) and WES (Red). (e) Percent GC content from different library prep protocols. WGS (Green) and WES (Red). (f) Mean insert size distribution from different library prep protocols. WGS (Green) and WES (Red).
Fig. 3Genome coverage from WGS data from three technologies including Illumina, PacBio, and 10X Genomics. Outer rainbow color track: chromosomes, red track: HCC1395, green track: HCC1395BL. (a) Genome coverage from WGS data by reads from Illumina platform. (b) Genome coverage from WGS data by reads from 10X Chromium linked-read technology (c) Genome coverage from WGS data by reads from PacBio platform. (d) Genome coverage plots generated using Indexcov software for whole genome sequencing cross-site comparison libraries. The estimated coverages along chromosome 6 for HCC1395BL (top) and HCC1395 (bottom) are shown. The net loss of one copy of the short-arm of chr6 is shown for HCC1395BL on top. For tumor HCC1395 cell line, there are many copy number gain or loss as shown in bottom of the read coverage plot for chromosome 6.
Whole Genome Sequencing (WGS) data sets for Formalin-Fixed Paraffin-Embedded (FFPE) process with fixation time of 1, 2, 6, or 24 hours for DNA extracted from HCC1395BL and HCC1395 cell lines.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol (Input amount | Total Reads | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads (Mapped Trimmed) | Mean Coverage Depth | Mean Coverage Depth SD | Percent of Coverage > = 5X | Percent of Coverage > = 15X | Percent of Coverage > = 30X | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FFG_IL_N_1h | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,765,128,404 | 1,764,489,856 | 99.96 | 1,755,040,098 | 99.46 | 81.33 | 60.73 | 275.63 | 93.45 | 92.21 | 86.08 | 42.39 | 263 |
| FFG_IL_N_24h | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,639,966,382 | 1,639,566,366 | 99.98 | 1,630,268,029 | 99.43 | 83.76 | 58.14 | 273.91 | 93.45 | 92.28 | 85.66 | 40.35 | 258 |
| FFG_IL_N_2h | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,068,216,980 | 2,067,386,842 | 99.96 | 2,058,632,971 | 99.58 | 76.51 | 73.91 | 352.49 | 93.54 | 92.83 | 89.46 | 42.15 | 251 |
| FFG_IL_N_6h | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,524,211,818 | 1,523,752,982 | 99.97 | 1,516,101,932 | 99.50 | 81.66 | 53.68 | 279.78 | 93.44 | 91.89 | 83.37 | 41.94 | 264 |
| FFG_IL_T_1h | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,624,805,980 | 2,622,947,802 | 99.93 | 2,604,551,075 | 99.30 | 76.27 | 89.63 | 321.88 | 92.93 | 92.30 | 89.27 | 39.73 | 277 |
| FFG_IL_T_24h | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,721,612,052 | 1,721,231,716 | 99.98 | 1,710,443,429 | 99.37 | 84.68 | 59.41 | 477.62 | 92.99 | 91.52 | 81.57 | 39.43 | 269 |
| FFG_IL_T_2h | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,962,242,886 | 1,961,844,318 | 99.98 | 1,953,044,006 | 99.55 | 82.43 | 69.99 | 319.98 | 92.88 | 91.50 | 84.05 | 39.61 | 265 |
| FFG_IL_T_6h | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,943,133,326 | 2,941,257,406 | 99.94 | 2,918,794,880 | 99.24 | 75.07 | 100.42 | 608.82 | 93.06 | 92.71 | 90.90 | 40.19 | 306 |
| FFG_GZ_N_1h-B | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,149,948,070 | 2,149,439,522 | 99.98 | 2,136,427,075 | 99.40 | 86.88 | 67.84 | 319.70 | 93.54 | 92.37 | 85.63 | 40.12 | 247 |
| FFG_GZ_N_1h-F | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,403,027,922 | 2,401,640,844 | 99.94 | 2,392,089,625 | 99.60 | 85.68 | 77.14 | 194.74 | 93.38 | 92.37 | 89.44 | 41.62 | 238 |
| FFG_GZ_N_24h-B | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,460,398,476 | 2,459,792,090 | 99.98 | 2,448,071,395 | 99.52 | 85.20 | 78.05 | 649.98 | 93.68 | 93.33 | 91.38 | 41.19 | 244 |
| FFG_GZ_N_24h-C | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,761,455,916 | 1,756,523,044 | 99.72 | 1,746,804,745 | 99.45 | 84.75 | 54.76 | 190.92 | 93.28 | 91.67 | 84.17 | 41.39 | 232 |
| FFG_GZ_N_24h-F | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,113,753,208 | 2,112,823,428 | 99.96 | 2,104,899,010 | 99.63 | 87.70 | 68.93 | 250.14 | 93.44 | 92.41 | 87.92 | 40.72 | 227 |
| FFG_GZ_N_2h-A | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 3,121,263,208 | 3,109,608,214 | 99.63 | 3,095,791,797 | 99.56 | 75.15 | 97.51 | 620.28 | 93.71 | 93.34 | 91.95 | 41.61 | 242 |
| FFG_GZ_N_2h-E | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,844,864,910 | 1,842,574,250 | 99.88 | 1,835,180,122 | 99.60 | 87.31 | 58.71 | 163.92 | 93.29 | 91.83 | 85.43 | 40.76 | 253 |
| FFG_GZ_N_6h-A | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,348,616,522 | 2,347,656,712 | 99.96 | 2,336,897,178 | 99.54 | 85.56 | 74.84 | 552.08 | 93.66 | 93.27 | 90.84 | 41.44 | 249 |
| FFG_GZ_N_6h-E | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,324,253,696 | 2,322,277,434 | 99.92 | 2,312,897,831 | 99.60 | 85.59 | 74.02 | 270.78 | 93.48 | 92.60 | 89.34 | 41.11 | 248 |
| FFG_GZ_T_1h-A | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,173,073,798 | 2,172,389,654 | 99.97 | 2,161,992,309 | 99.52 | 85.94 | 66.90 | 653.36 | 93.01 | 91.95 | 84.29 | 41.64 | 256 |
| FFG_GZ_T_1h-B | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,631,699,638 | 1,618,249,558 | 99.18 | 1,610,013,338 | 99.49 | 82.37 | 51.10 | 105.70 | 92.36 | 89.26 | 75.89 | 40.67 | 253 |
| FFG_GZ_T_1h-E | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,441,354,752 | 2,440,848,016 | 99.98 | 2,431,265,220 | 99.61 | 86.43 | 78.56 | 156.28 | 92.75 | 91.44 | 86.19 | 40.18 | 252 |
| FFG_GZ_T_24h-B | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,517,517,314 | 2,516,534,444 | 99.96 | 2,506,328,630 | 99.59 | 84.68 | 80.08 | 489.37 | 93.04 | 92.53 | 88.90 | 41.49 | 249 |
| FFG_GZ_T_24h-F | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 3,096,658,362 | 3,095,956,264 | 99.98 | 3,082,562,699 | 99.57 | 84.29 | 98.75 | 295.06 | 92.97 | 92.31 | 89.34 | 39.91 | 257 |
| FFG_GZ_T_2h-A | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,281,259,460 | 2,279,880,072 | 99.94 | 2,265,837,473 | 99.38 | 82.69 | 71.81 | 577.42 | 93.02 | 92.19 | 86.23 | 41.71 | 243 |
| FFG_GZ_T_2h-B | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 1,844,049,210 | 1,833,063,922 | 99.40 | 1,823,246,662 | 99.46 | 80.78 | 59.04 | 146.98 | 92.69 | 90.75 | 81.60 | 41.32 | 242 |
| FFG_GZ_T_6h-A | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,133,526,796 | 2,132,509,862 | 99.95 | 2,123,023,988 | 99.56 | 82.75 | 66.89 | 556.68 | 93.01 | 92.08 | 85.27 | 41.58 | 246 |
| FFG_GZ_T_6h-B | HCC1395 | FFPE | HiSeq | HiSeq 4000 | NEBNext Ultra II (200 ng) | 2,891,641,362 | 2,879,535,806 | 99.58 | 2,863,935,742 | 99.46 | 75.53 | 92.64 | 245.11 | 92.93 | 92.21 | 89.40 | 40.97 | 236 |
Libraries were made from NEBNext Ultra II (200 ng) library protocol and sequenced on Illumina HiSeq 4000.
10X Genomics Chromium Genome Sequencing (10X WGS) data sets for fresh DNA extracted from HCC1395BL and HCC1395 cell lines.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol (Input amount) | Total Reads | Total Mapped Reads | molecule length mean (Kb) | %mapped reads | mean depth | zero coverage | % pcr duplication | large sv calls | short deletion calls | longest phase block (Mb) | n50 phase block (Mb) |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| CHR_IL_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,212,061,014 | 2,092,851,168 | 72.00 | 94.61 | 92 | 1.42 | 8.32 | 336 | 4033 | 32 | 0.9 |
| CHR_IL_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,266,945,480 | 2,147,917,753 | 76.00 | 94.75 | 94 | 1.41 | 7.03 | 338 | 4257 | 31.9 | 0.9 |
| CHR_IL_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,329,482,706 | 2,226,020,206 | 77.00 | 95.56 | 99 | 1.40 | 6.44 | 330 | 4626 | 26.7 | 0.8 |
| CHR_NC_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,406,971,948 | 1,366,399,528 | 54.00 | 97.12 | 61 | 1.53 | 5.84 | 304 | 4438 | 14.9 | 1.2 |
| CHR_EA_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,321,617,070 | 1,279,515,774 | 67.00 | 96.81 | 57 | 1.40 | 11.87 | 295 | 5267 | 16.9 | 0.6 |
| CHR_FD_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,064,469,438 | 1,015,823,822 | 64.00 | 95.43 | 45 | 1.45 | 5.63 | 266 | 4439 | 17.5 | 0.7 |
| CHR_FD_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,070,778,002 | 1,027,173,140 | 66.00 | 95.93 | 45 | 1.45 | 4.37 | 265 | 4313 | 15.2 | 0.7 |
| CHR_FD_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,070,863,374 | 1,020,779,776 | 66.00 | 95.32 | 45 | 1.50 | 7.10 | 258 | 4382 | 18.4 | 0.8 |
| CHR_NV_T_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,763,309,906 | 2,686,525,126 | 67.60 | 97.22 | 119 | 1.44 | 12.05 | 353 | 3863 | 22.9 | 0.7 |
| CHR_NV_T_2 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,604,341,620 | 2,529,765,476 | 62.77 | 97.14 | 113 | 1.43 | 14.52 | 354 | 3705 | 17.6 | 0.6 |
| CHR_NV_T_3 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,614,247,622 | 2,546,038,714 | 61.83 | 97.39 | 114 | 1.43 | 13.23 | 350 | 3701 | 20.0 | 0.6 |
| CHR_IL_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,134,749,498 | 2,025,432,377 | 70.00 | 94.88 | 90 | 0.82 | 7.50 | 25 | 3507 | 33.6 | 7.5 |
| CHR_IL_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,202,527,584 | 2,107,633,315 | 72.00 | 95.69 | 93 | 0.82 | 7.16 | 23 | 3532 | 30.3 | 6.7 |
| CHR_IL_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,380,468,964 | 2,279,808,496 | 70.00 | 95.77 | 101 | 0.81 | 6.98 | 24 | 3460 | 24.4 | 7 |
| CHR_NC_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 3000/4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,420,063,110 | 1,383,151,777 | 66.00 | 97.40 | 62 | 0.90 | 6.79 | 24 | 3751 | 19.8 | 4.5 |
| CHR_EA_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,204,338,932 | 1,157,467,982 | 67.00 | 96.11 | 52 | 0.87 | 10.70 | 21 | 4717 | 19.3 | 4.4 |
| CHR_FD_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,079,294,448 | 1,047,436,008 | 63.00 | 97.05 | 46 | 0.91 | 5.02 | 24 | 3824 | 27.8 | 3.8 |
| CHR_FD_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,074,129,460 | 1,037,897,214 | 63.00 | 96.63 | 46 | 0.92 | 4.47 | 20 | 3827 | 18.4 | 3.9 |
| CHR_FD_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq X10 | 10X Chromium Genome Library preparation v2 (1250 ng) | 1,069,034,706 | 1,026,145,348 | 61.00 | 95.99 | 45 | 0.79 | 4.65 | 20 | 3994 | 15.7 | 3.3 |
| CHR_NV_N_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,745,515,070 | 2,673,917,830 | 61.23 | 97.39 | 119 | 0.83 | 10.72 | 85 | 3063 | 25.4 | 4.7 |
| CHR_NV_N_2 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,610,495,528 | 2,542,805,354 | 55.45 | 97.41 | 114 | 0.83 | 14.18 | 93 | 3165 | 30.0 | 4.2 |
| CHR_NV_N_3 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | 10X Chromium Genome Library preparation v2 (1250 ng) | 2,614,793,942 | 2,538,150,020 | 56.52 | 97.07 | 113 | 0.83 | 12.25 | 93 | 3181 | 30.3 | 4.4 |
Libraries were made from Chromium Genome Library preparation v2 kit and sequenced by five sequencing centers on Illumina HiSeq 3000/4000, HiSeq X10 for cross-site comparision.
Fig. 4Evaluation of DNA damage for WGS and WES libraries. using GIV scores to capture the DNA damage due to the artifacts introduced during genomic library preparation. The estimation of damage is a global estimation based in an imbalance between R1 and R2 variant frequency. GIV score above 1.5 is defined as damaged. Undamaged DNA samples have a GIV score of 1. (a) DNA damage estimated for fresh cell prepared DNA for WGS Illumina libraries across different sites. (b) DNA damage estimated for FFPE WGS Illumina libraries. (c) DNA damage estimated for fresh cells prepared DNA for WES Illumina libraries across different sites (d) DNA damage estimated for FFPE WES Illumina libraries.
Fig. 5Reproducibility of somatic mutation calling from WES and WGS. The reproducibility UpSet plots for 12 repeated WES (a) and WGS runs (b). The number in each plot represents the reproducibility across the different replicates. (c) SNVs/indels calling concordance between WES and WGS from twelve repeated runs. For direct comparison, SNVs/indels from WGS runs were limited to genomic regions defined by an exome capturing kit (SureSelect V6 + UTR). WES is shown on the left in the Venn diagram and WGS is on the right. Shown coverage depths for WES and WGS were effective mean sequence coverage on exome region, i.e. coverage by total number of mapped reads after trimming. (d) Correlation of MAF in overlapping WGS and WES SNVs/indels from repeated runs.
| Measurement(s) | Somatic Mutation Analysis |
| Technology Type(s) | whole genome sequencing • Whole Exome Sequencing |
| Factor Type(s) | sequencing platform • sample prepration • library preparation • bioinformatics method |
| Sample Characteristic - Organism | Homo sapiens |
Whole Exome Sequencing (WES) data sets for fresh DNA extracted from HCC1395BL and HCC1395 cell lines.
| Sample ID | Sample | Biosample | Platform | Machine model | library prep protocol | Total Reads | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads (Mapped Trimmed) | Percent Reads Mapped On Target | Mean Coverage Inside Target (X) | Percent Coverage > = 15x | Percent Coverage > = 30x | Percent Coverage > = 100x | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| WES_IL_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 279,588,550 | 279,131,838 | 99.84 | 278,802,099 | 99.88 | 57.12 | 85.66 | 250 | 98.22 | 95.58 | 75.96 | 48.18 | 171 |
| WES_IL_N_2 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 701,443,812 | 700,345,738 | 99.84 | 699,269,688 | 99.85 | 72.62 | 76.73 | 554 | 99.52 | 99.23 | 95.44 | 47.26 | 188 |
| WES_IL_N_3 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 382,848,400 | 382,210,820 | 99.83 | 381,421,428 | 99.79 | 87.45 | 44.61 | 172 | 99.14 | 96.90 | 66.10 | 45.46 | 206 |
| WES_EA_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 176,997,814 | 175,449,424 | 99.13 | 175,389,529 | 99.97 | 71.88 | 86.94 | 165 | 96.47 | 92.23 | 61.91 | 50.28 | 161 |
| WES_NC_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 114,116,530 | 114,113,400 | 100.00 | 113,154,465 | 99.16 | 96.59 | 69.07 | 83 | 95.69 | 84.44 | 27.31 | 46.62 | 172 |
| WES_LL_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 69,810,764 | 69,803,466 | 99.99 | 69,388,404 | 99.41 | 75.04 | 76.08 | 89 | 87.00 | 78.31 | 14.15 | 49.57 | 174 |
| WES_NV_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 397,299,120 | 397,250,484 | 99.988 | 394,606,858 | 99.34 | 78.66 | 73.67 | 317 | 99.40 | 98.80 | 89.18 | 44.06 | 272 |
| WES_NV_N_2 | HCC1395BL | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 418,058,694 | 418,034,480 | 99.994 | 414,260,963 | 99.10 | 79.46 | 75.1 | 333 | 99.42 | 98.84 | 89.69 | 43.76 | 271 |
| WES_NV_N_3 | HCC1395BL | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 375,905,406 | 375,873,310 | 99.991 | 373,410,296 | 99.35 | 80.00 | 74.65 | 307 | 99.37 | 98.70 | 88.14 | 43.98 | 274 |
| WES_FD_N_1 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 68,458,296 | 68,453,794 | 99.993 | 68,364,347 | 99.87 | 77.26 | 73.85 | 62 | 91.19 | 73.51 | 16.91 | 48.71 | 173 |
| WES_FD_N_2 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 78,054,046 | 78,049,486 | 99.994 | 77,947,282 | 99.87 | 77.11 | 73.26 | 70 | 93.01 | 78.41 | 21.55 | 48.25 | 176 |
| WES_FD_N_3 | HCC1395BL | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 80,289,374 | 80,283,552 | 99.993 | 80,182,548 | 99.87 | 75.28 | 72.98 | 72 | 92.98 | 78.52 | 22.44 | 48.83 | 169 |
| WES_IL_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 247,464,114 | 247,122,814 | 99.86 | 246,359,571 | 99.69 | 57.07 | 82.19 | 214 | 98.28 | 95.16 | 68.80 | 47.46 | 171 |
| WES_IL_T_2 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 776,700,452 | 775,713,612 | 99.87 | 774,851,468 | 99.89 | 84.68 | 75.54 | 613 | 99.32 | 99.00 | 95.13 | 47.69 | 191 |
| WES_IL_T_3 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 342,123,218 | 341,680,540 | 99.87 | 341,089,266 | 99.83 | 82.45 | 69.44 | 246 | 98.95 | 97.37 | 77.31 | 45.92 | 208 |
| WES_EA_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 210,071,430 | 208,191,476 | 99.11 | 208,119,731 | 99.97 | 77.36 | 86.72 | 197 | 95.89 | 92.14 | 65.97 | 49.55 | 165 |
| WES_NC_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 109,097,098 | 109,094,012 | 100.00 | 108,356,145 | 99.32 | 96.88 | 70.26 | 81 | 93.98 | 80.35 | 26.21 | 47.02 | 171 |
| WES_LL_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 53,498,898 | 53,493,162 | 99.99 | 53,302,502 | 99.64 | 74.53 | 58.44 | 89 | 78.21 | 66.11 | 7.94 | 50.12 | 174 |
| WES_NV_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 380,994,344 | 380,922,176 | 99.981 | 378,402,248 | 99.34 | 78.76 | 73.91 | 307 | 99.13 | 98.30 | 85.63 | 44.57 | 261 |
| WES_NV_T_2 | HCC1395 | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 398,077,794 | 398,060,634 | 99.996 | 394,293,828 | 99.05 | 80.50 | 73.22 | 311 | 99.15 | 98.34 | 85.89 | 44.4 | 278 |
| WES_NV_T_3 | HCC1395 | fresh cell | HiSeq | HiSeq 2500 | Agilent SureSelect Human All Exon v6 + UTRs | 386,958,118 | 386,882,840 | 99.981 | 384,363,231 | 99.35 | 81.70 | 73.86 | 310 | 99.13 | 98.30 | 85.78 | 44.52 | 274 |
| WES_FD_T_1 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 72,787,240 | 72,783,222 | 99.994 | 72,699,310 | 99.89 | 77.54 | 74.4 | 67 | 89.90 | 72.02 | 19.35 | 49.15 | 168 |
| WES_FD_T_2 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 68,338,828 | 68,334,984 | 99.994 | 68,153,419 | 99.73 | 78.14 | 73.61 | 60 | 88.39 | 68.41 | 16.02 | 48.87 | 172 |
| WES_FD_T_3 | HCC1395 | fresh cell | HiSeq | HiSeq 3000/4000 | Agilent SureSelect Human All Exon v6 + UTRs | 68,729,270 | 68,724,882 | 99.994 | 68,625,572 | 99.86 | 78.80 | 72.85 | 61 | 88.77 | 69.10 | 16.50 | 48.97 | 172 |
Libraries were made by using Agilent SureSelect Human All Exon v6 + UTRs library protocol and sequenced by six sequencing centers on Illumina HiSeq 2500 and HiSeq 3000/4000 for cross-site comparision.
Whole Exome Sequencing (WES) data sets for DNA extracted from HCC1395BL and HCC1395 cell lines and processed via Formalin-Fixed Paraffin-Embedded (FFPE) process with fixation time of 1, 2, 6, or 24 hours.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol | Total Reads (PF) | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads (Trimmed) | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads (Mapped Trimmed) | Percent Reads Mapped On Target | Effect Mean Coverage Inside Target (X) | Percent Coverage > = 15x | Percent Coverage > = 30x | Percent Coverage > = 100x | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| FFX_GZ_T_1h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 72,688,712 | 72,595,030 | 99.87 | 72,386,458 | 99.71 | 87.95 | 89.23 | 71 | 61.71 | 48.90 | 20.99 | 58.17 | 174 |
| FFX_GZ_T_1h_3 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 104,142,794 | 103,976,012 | 99.84 | 103,701,635 | 99.74 | 86.35 | 83.89 | 96 | 68.06 | 63.55 | 35.76 | 53.83 | 173 |
| FFX_GZ_T_6h_3 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 83,420,206 | 83,313,508 | 99.87 | 83,083,057 | 99.72 | 86.73 | 85.97 | 79 | 67.01 | 60.58 | 28.64 | 53.86 | 173 |
| FFX_GZ_T_24h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 126,758,774 | 126,608,628 | 99.88 | 126,157,724 | 99.64 | 86.25 | 88.59 | 122 | 68.14 | 62.15 | 34.19 | 57.67 | 178 |
| FFX_GZ_T_24h_2 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 80,904,232 | 80,702,760 | 99.75 | 80,484,694 | 99.73 | 85.48 | 86.80 | 78 | 66.74 | 59.77 | 27.12 | 54.75 | 177 |
| FFX_IL_T_1h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 608,995,390 | 608,349,240 | 99.89 | 607,607,604 | 99.88 | 46.30 | 83.67 | 569 | 99.21 | 98.64 | 91.11 | 44.05 | 191 |
| FFX_IL_T_2h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 778,975,090 | 777,139,604 | 99.76 | 775,811,184 | 99.83 | 33.23 | 76.22 | 685 | 96.75 | 94.16 | 81.39 | 42.82 | 147 |
| FFX_IL_T_6h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 419,922,488 | 417,286,480 | 99.37 | 416,806,114 | 99.89 | 65.04 | 68.94 | 310 | 98.94 | 97.25 | 78.89 | 44.21 | 204 |
| FFX_IL_T_24h_1 | HCC1395 | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 878,303,836 | 872,276,568 | 99.31 | 871,782,275 | 99.94 | 47.68 | 64.61 | 644 | 98.42 | 97.19 | 88.47 | 44.00 | 168 |
| FFX_GZ_N_1h_1 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 110,247,342 | 109,510,666 | 99.33 | 109,202,128 | 99.72 | 85.61 | 87.30 | 105 | 68.78 | 64.80 | 39.58 | 53.89 | 178 |
| FFX_GZ_N_2h_3 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 97,716,616 | 97,226,646 | 99.50 | 96,965,642 | 99.73 | 87.49 | 87.88 | 94 | 68.50 | 64.46 | 36.05 | 54.31 | 174 |
| FFX_GZ_N_6h_1 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 96,772,432 | 95,979,612 | 99.18 | 95,684,076 | 99.69 | 85.75 | 88.64 | 92 | 67.53 | 60.93 | 28.26 | 57.13 | 172 |
| FFX_GZ_N_24h_1 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 90,151,054 | 90,056,824 | 99.90 | 89,810,047 | 99.73 | 87.25 | 87.86 | 88 | 67.16 | 59.81 | 27.36 | 57.10 | 173 |
| FFX_IL_N_1h_2 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 963,476,124 | 959,914,384 | 99.63 | 957,187,812 | 99.72 | 50.50 | 73.14 | 775 | 99.60 | 99.48 | 97.58 | 46.59 | 194 |
| FFX_IL_N_2h_2 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 535,913,728 | 533,170,624 | 99.49 | 532,149,369 | 99.81 | 23.80 | 75.59 | 434 | 99.00 | 97.42 | 81.56 | 55.67 | 202 |
| FFX_IL_N_6h_2 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 1,386,577,130 | 1,382,604,794 | 99.71 | 1,375,292,872 | 99.47 | 22.69 | 69.60 | 1058 | 99.64 | 99.56 | 98.55 | 46.66 | 186 |
| FFX_IL_N_24h_2 | HCC1395BL | FFPE | HiSeq | HiSeq 4000 | Agilent SureSelect Human All Exon v6 + UTRs | 1,041,349,526 | 1,040,887,252 | 99.96 | 1,039,335,968 | 99.85 | 25.82 | 81.78 | 942 | 99.63 | 99.52 | 97.93 | 44.65 | 206 |
Libraries were made from Agilent SureSelect Human All Exon v6 + UTRs protocol and sequenced on Illumina HiSeq 4000.
WGS Libraries were made from different library protocols such as TruSeq Nano, TruSeq PCR Free and Nextera Flex library protocol with different input amount and sequenced on Illumina HiSeq 3000/4000.
| Sample ID | Sample | Biosample | Platform | Machine model | library prep protocol (Input amount) | Total Reads | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads | Mean Coverage Depth | Mean Coverage Depth SD | Percent of Coverage > = 5X | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| LBP_LL_T_1 ng | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (1ng) | 926,143,044 | 892,177,246 | 96.3 | 887,751,031 | 99.5 | 17.4 | 32.3 | 128.4 | 88.2 | 40.3 | 411 |
| LBP_LL_T_10 ng | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (10ng) | 860,252,576 | 858,792,572 | 99.8 | 855,051,115 | 99.6 | 57.2 | 31.8 | 126.7 | 92.4 | 40.4 | 382 |
| LBP_LL_T_100 ng | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (100ng) | 840,067,566 | 834,660,190 | 99.4 | 830,420,093 | 99.5 | 83.4 | 30.7 | 127.7 | 92.6 | 40.3 | 384 |
| LBP_LL_T_250 ng | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-PCR-free(250ng) | 875,539,968 | 874,446,850 | 99.9 | 868,831,019 | 99.4 | 78.5 | 33.5 | 136.0 | 92.7 | 40.2 | 367 |
| LBP_LL_N_1 ng | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (1ng) | 863,716,968 | 857,386,678 | 99.3 | 852,997,339 | 99.5 | 26.4 | 31.6 | 169.3 | 92.3 | 40.1 | 382 |
| LBP_LL_N_10 ng | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (10ng) | 854,239,270 | 835,851,330 | 97.8 | 831,617,019 | 99.5 | 72.3 | 30.4 | 158.0 | 93.4 | 40.2 | 399 |
| LBP_LL_N_100 ng | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-Nano (100ng) | 1,230,957,576 | 1,227,609,086 | 99.7 | 1,221,602,690 | 99.5 | 76.5 | 44.7 | 237.4 | 93.6 | 40.1 | 405 |
| LBP_LL_N_250 ng | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | TruSeq-PCR-free(250ng) | 842,036,782 | 840,807,462 | 99.9 | 835,177,981 | 99.3 | 81.0 | 31.7 | 177.1 | 93.4 | 40.0 | 371 |
| LBP_HS_N_100 ng_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (100ng) | 1,582,185,218 | 1,581,324,668 | 99.95 | 1,575,553,437 | 99.64 | 87.64 | 51.98 | 291.6 | 93.6 | 40.7 | 371 |
| LBP_HS_N_10 ng_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (10ng) | 2,057,279,420 | 2,054,948,200 | 99.89 | 2,047,669,492 | 99.65 | 82.95 | 70.20 | 325.3 | 93.7 | 40.4 | 328 |
| LBP_HS_N_1 ng_1 | HCC1395BL | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (1ng) | 1,431,611,946 | 1,428,647,758 | 99.79 | 1,424,363,120 | 99.70 | 74.36 | 48.02 | 257.7 | 93.6 | 40.7 | 307 |
| LBP_HS_T_100 ng_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (100ng) | 1,723,259,958 | 1,722,198,188 | 99.94 | 1,716,524,152 | 99.67 | 84.09 | 57.79 | 238.8 | 93.0 | 41.0 | 364 |
| LBP_HS_T_10 ng_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (10ng) | 2,083,680,112 | 2,082,045,850 | 99.92 | 2,076,564,864 | 99.74 | 79.18 | 72.35 | 303.1 | 93.0 | 40.6 | 336 |
| LBP_HS_T_1 ng_1 | HCC1395 | Fresh cells | HiSeq | HiSeq 4000 | Nextera flex (1ng) | 1,794,810,974 | 1,792,451,636 | 99.87 | 1,788,984,708 | 99.81 | 52.23 | 63.46 | 259.0 | 93.0 | 40.9 | 341 |
WGS Libraries were made from pooling the HCC1395 and HCC1395BL cell lines with various ratios (3:1, 1:1, 1:4, 1:9 and 1:19) to create mixtures.
| Sample ID | Sample(Mix ratio) | Biosample | Platform | Machine model | library Protocol(Iinput amount) | Total Reads | Total Reads After Trimming | Percent Total Reads after Trimming | Total Mapped Reads | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads | Mean Coverage Depth | Mean Coverage Depth SD | Percent of Coverage > = 5X | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| SPP_GT_0-1_1 | HCC1395:HCC1395BL (0:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,076,097,436 | 3,076,097,436 | 98.6 | 3,066,844,772 | 99.7 | 83.0 | 115.0 | 657.4 | 93.7 | 40.0 | 380 |
| SPP_GT_0-1_2 | HCC1395:HCC1395BL (0:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,001,832,062 | 3,001,832,062 | 99.9 | 2,993,554,635 | 99.7 | 86.1 | 112.0 | 627.8 | 93.7 | 40.0 | 391 |
| SPP_GT_0-1_3 | HCC1395:HCC1395BL (0:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,903,255,740 | 2,903,255,740 | 99.9 | 2,893,798,551 | 99.7 | 86.7 | 105.1 | 586.8 | 93.7 | 39.9 | 378 |
| SPP_GT_1-0_1 | HCC1395:HCC1395BL (1:0) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,077,848,110 | 3,077,848,110 | 98.9 | 3,069,341,612 | 99.7 | 76.4 | 117.9 | 475.3 | 93.1 | 40.2 | 368 |
| SPP_GT_1-0_2 | HCC1395:HCC1395BL (1:0) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,090,006,648 | 3,090,006,648 | 99.9 | 3,082,407,730 | 99.8 | 83.5 | 115.7 | 476.2 | 93.1 | 40.2 | 370 |
| SPP_GT_1-0_3 | HCC1395:HCC1395BL (1:0) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,962,991,404 | 2,962,991,404 | 99.9 | 2,954,244,873 | 99.7 | 87.8 | 107.5 | 448.9 | 93.1 | 40.1 | 376 |
| SPP_GT_1-1_1 | HCC1395:HCC1395BL (1:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,055,617,586 | 3,055,617,586 | 99.6 | 3,046,293,478 | 99.7 | 79.9 | 114.2 | 538.5 | 93.8 | 40.0 | 379 |
| SPP_GT_1-1_2 | HCC1395:HCC1395BL (1:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,972,612,956 | 2,972,612,956 | 100.0 | 2,963,334,376 | 99.7 | 78.4 | 107.8 | 532.6 | 93.8 | 39.8 | 383 |
| SPP_GT_1-1_3 | HCC1395:HCC1395BL (1:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,070,633,366 | 3,070,633,366 | 100.0 | 3,058,122,892 | 99.6 | 89.7 | 106.9 | 493.9 | 93.8 | 40.3 | 396 |
| SPP_GT_1-4_1 | HCC1395:HCC1395BL (1:4) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,031,054,048 | 3,031,054,048 | 97.5 | 3,018,769,715 | 99.6 | 74.4 | 114.9 | 609.9 | 93.8 | 40.0 | 369 |
| SPP_GT_1-4_2 | HCC1395:HCC1395BL (1:4) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,984,886,368 | 2,984,886,368 | 99.9 | 2,974,891,252 | 99.7 | 86.5 | 107.9 | 574.8 | 93.8 | 40.0 | 395 |
| SPP_GT_1-4_3 | HCC1395:HCC1395BL (1:4) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,076,330,144 | 3,076,330,144 | 98.5 | 3,065,365,108 | 99.6 | 70.8 | 111.9 | 576.8 | 93.8 | 40.1 | 392 |
| SPP_GT_1-9_1 | HCC1395:HCC1395BL (1:9) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,169,041,440 | 3,169,041,440 | 98.9 | 3,160,284,662 | 99.7 | 65.8 | 117.6 | 662.4 | 93.8 | 39.9 | 383 |
| SPP_GT_1-9_2 | HCC1395:HCC1395BL (1:9) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,001,203,286 | 3,001,203,286 | 99.9 | 2,993,060,766 | 99.7 | 84.9 | 109.1 | 605.5 | 93.8 | 39.9 | 384 |
| SPP_GT_1-9_3 | HCC1395:HCC1395BL (1:9) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,142,707,394 | 3,142,707,394 | 99.2 | 3,133,546,333 | 99.7 | 68.5 | 114.0 | 626.6 | 93.8 | 39.9 | 389 |
| SPP_GT_1-19_1 | HCC1395:HCC1395BL (1:19) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,113,584,442 | 3,113,584,442 | 98.3 | 3,104,340,674 | 99.7 | 63.0 | 115.4 | 663.9 | 93.7 | 39.8 | 381 |
| SPP_GT_1-19_2 | HCC1395:HCC1395BL (1:19) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,927,043,064 | 2,927,043,064 | 99.9 | 2,917,934,331 | 99.7 | 88.5 | 107.8 | 594.4 | 93.7 | 39.9 | 395 |
| SPP_GT_1-19_3 | HCC1395:HCC1395BL (1:19) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,044,009,076 | 3,044,009,076 | 99.9 | 3,033,242,220 | 99.6 | 90.6 | 109.1 | 579.2 | 93.8 | 40.2 | 397 |
| SPP_GT_3-1_1 | HCC1395:HCC1395BL (3:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,065,804,606 | 3,065,804,606 | 99.5 | 3,056,714,849 | 99.7 | 79.3 | 117.3 | 509.2 | 93.8 | 40.1 | 363 |
| SPP_GT_3-1_2 | HCC1395:HCC1395BL (3:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 3,052,239,564 | 3,052,239,564 | 100.0 | 3,042,838,917 | 99.7 | 66.1 | 112.7 | 523.1 | 93.8 | 39.8 | 376 |
| SPP_GT_3-1_3 | HCC1395:HCC1395BL (3:1) | Fresh cells | HiSeq | HiSeq 4000 | TruSeq DNA PCR Free (1000 ng) | 2,975,199,024 | 2,975,199,024 | 99.9 | 2,966,022,353 | 99.7 | 86.4 | 108.0 | 481.4 | 93.8 | 40.0 | 375 |
Libraries were prepared by using TruSeq DNA PCR Free (1000 ng) protocol and sequenced on Illumina HiSeq 4000.
PacBio Sequel II Whole Genome Sequencing data sets for fresh DNA extracted from HCC1395BL and HCC1395 cell lines.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol (Input amount) | Total Reads | Total Bases (Bps) | Total Mapped Reads | Percent Total Mapped Reads | Percent of PCR duplidate(mapped reads) | Mean Coverage Depth | Mean Coverage Depth SD | Percent of Coverage > = 5X | Percent GC | Median Read Lengh | Raw Coverage |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| PBO_Normal | HCC1395BL | Fresh cells | PacBio_Sequel | PacBio_Sequel II | Preparing >30 kbp SMRTbell Libraries Using Megarupter Shearing and Blue Pippin Size-Selection for PacBio RS II and Sequel Systems | 21,935,561 | 131,842,758,047 | 16,759,622 | 76.40 | 2.46 | 44.17 | 33.25 | 92.29 | 41.14 | 8,050 | 44 |
| PBO_Tumor | HCC1395 | Fresh cells | PacBio_Sequel | PacBio_Sequel II | Preparing >30 kbp SMRTbell Libraries Using Megarupter Shearing and Blue Pippin Size-Selection for PacBio RS II and Sequel Systems | 18,835,934 | 116,009,882,995 | 16,273,225 | 86.39 | 2.58 | 38.95 | 47.35 | 91.62 | 41.57 | 8,050 | 39 |
Libraries were made from PacBio library preparation and sequencing library protocol.
AmpliSeq libraries were prepared using Illumina protocol and sequenced on MiSeq platform.
| Sample ID | Sample | Biosample | Platform | Machine model | library Protocol (Input amount) | Total Reads | Percent Total Reads after Trimming | Total Mapped Reads (Trimmed) | Percent Total Mapped Reads (Trimmed) | Percent Non-duplicated Reads (Mapped Trimmed) | Percent Reads Mapped On Target | Mean Coverage Inside Target (X) | Percent Coverage > = 30x | Percent GC | Median Insert Size |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| AMS_AB_N_1 | HCC1395BL | Fresh cells | MiSeq | MiSeq | AmpliSeq | 7,569,042 | 99.95 | 7,548,971 | 99.79 | 4.68 | 67.55 | 1173.88 | 98.61 | 41.66 | 191 |
| AMS_AB_N_2 | HCC1395BL | Fresh cells | MiSeq | MiSeq | AmpliSeq | 6,801,804 | 99.97 | 6,783,534 | 99.76 | 4.45 | 66.44 | 1049.06 | 98.52 | 41.02 | 196 |
| AMS_AB_N_3 | HCC1395BL | Fresh cells | MiSeq | MiSeq | AmpliSeq | 7,335,112 | 99.97 | 7,316,766 | 99.78 | 4.38 | 67.42 | 1155.25 | 98.52 | 40.95 | 196 |
| AMS_AB_T_1 | HCC1395 | Fresh cells | MiSeq | MiSeq | AmpliSeq | 8,245,680 | 99.96 | 8,225,203 | 99.79 | 4.34 | 66.10 | 1277.06 | 96.69 | 41.35 | 196 |
| AMS_AB_T_2 | HCC1395 | Fresh cells | MiSeq | MiSeq | AmpliSeq | 8,011,520 | 99.96 | 7,989,728 | 99.77 | 4.38 | 65.63 | 1238.92 | 96.71 | 41.46 | 196 |
| AMS_AB_T_3 | HCC1395 | Fresh cells | MiSeq | MiSeq | AmpliSeq | 9,163,270 | 99.96 | 9,141,920 | 99.81 | 4.33 | 65.89 | 1430.02 | 96.80 | 41.28 | 196 |