Anneleen Decock, Maté Ongenaert, Wim Van Criekinge, Frank Speleman, Jo Vandesompele.
Abstract
Comprehensive genome-wide DNA methylation studies in neuroblastoma (NB), a childhood tumor that originates from precursor cells of the sympathetic nervous system, are scarce. Recently, we profiled the DNA methylome of 102 well-annotated primary NB tumors by methyl-CpG-binding domain (MBD) sequencing, in order to identify prognostic biomarker candidates. In this data descriptor, we give details on how this data set was generated and which bioinformatics analyses were applied during data processing. Through a series of technical validations, we illustrate that the data are of high quality and that the sequenced fragments represent methylated genomic regions. Furthermore, genes previously described to be methylated in NB are confirmed. As such, these MBD sequencing data are a valuable resource to further study the association of NB risk factors with the NB methylome, and offer the opportunity to integrate methylome data with other -omic data sets on the same tumor samples, such as gene copy number and gene expression, which are also publicly available.
Year: 2016 PMID: 26836295 PMCID: PMC4736656 DOI: 10.1038/sdata.2016.4
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1. The MBD sequencing data of 102 primary neuroblastoma tumors are processed using different analysis tools.
Depicted are the available MBD sequencing data sets and downstream data processing and technical validation steps. These steps are represented as arrows and circles, respectively. For each step, the applied tool or analysis is indicated. For the technical validation steps, also the corresponding data descriptor figures and tables are indicated. DMA, differential methylation analysis; IGV, Integrative Genomics Viewer; PE, paired-end; RPKM, reads per kilobase CpG island per million.
In total, 102 annotated primary neuroblastoma DNA samples were profiled by MBD sequencing.

Each sample is characterized by a unique Sample Name and is assigned to a specific cohort (MBD cohort I or MBD cohort II). Clinical characteristics given are the age at diagnosis in months, International Neuroblastoma Staging System (INSS) stage,

| | | | | Sample Name | Cohort | Age at diagnosis (months) | INSS stage | | | | | | | |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 811 | MBD cohort I | 66.70684932 | 4 | non-amplified | 944 | 587 | died of disease | event | GSE21713—GSM541724 | id 4883 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1429 | MBD cohort I | 64.63561644 | 4 | non-amplified | 1188 | 867 | died of disease | event | GSE21713—GSM541703 | id 4758 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1467 | MBD cohort I | 0 | 4 | amplified | 1 | 1 | died of disease | event | GSE21713—GSM541689 | id 4817 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1473 | MBD cohort I | 7.528767123 | 4 | amplified | 239 | 116 | died of disease | event | GSE21713—GSM541691 | id 4882 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1477 | MBD cohort I | 78.70684932 | 4 | amplified | 1246 | 898 | died of disease | event | NA | id 4600 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1517 | MBD cohort I | 34.88219178 | 4 | non-amplified | 547 | 433 | died of disease | event | GSE21714—GSM541705 | id 11565 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1520 | MBD cohort I | 39.74794521 | 4 | amplified | 1279 | 552 | died of disease | event | GSE21713—GSM541694 | id 11771 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1522 | MBD cohort I | 22.61917808 | 3 | amplified | 728 | 594 | died of disease | event | GSE21713—GSM541696 | id 5098 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1527 | MBD cohort I | 107.9342466 | 4 | amplified | 319 | NA | died of disease | event | GSE21713—GSM541698 | id 4762 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1648 | MBD cohort I | 53.16164384 | 4 | non-amplified | 1221 | 341 | died of disease | event | GSE21713—GSM541701 | id 4603 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E061 | MBD cohort I | 30.70684932 | 4 | non-amplified | 1445 | 497 | died of disease | event | GSE32664—GSM810696 | id 5143 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E069 | MBD cohort I | 59.44109589 | 4 | non-amplified | 2836 | 2016 | died of disease | event | GSE32664—GSM810694 | id 5095 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E282 | MBD cohort I | 16.99726027 | 4 | amplified | 711 | 433 | died of disease | event | GSE32664—GSM810699 | id 5142 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E290 | MBD cohort I | 6.443835616 | 4 | amplified | 539 | 351 | died of disease | event | GSE32664—GSM810682 | id 5085 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 526 | MBD cohort I | 26.43287671 | 4 | amplified | 2009 | 2009 | alive | no event | GSE21713—GSM541725 | id 4866 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1017 | MBD cohort I | 22.32328767 | 3 | amplified | 1758 | 1758 | alive | no event | GSE21713—GSM541728 | id 4910 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1431 | MBD cohort I | 23.04657534 | 4 | amplified | 953 | 953 | alive | no event | GSE21713—GSM541704 | id 4550 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1521 | MBD cohort I | 38.03835616 | 4 | amplified | 2163 | 2163 | alive | no event | GSE21713—GSM541695 | id 4602 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1524 | MBD cohort I | 23.30958904 | 3 | amplified | 2387 | 2387 | alive | no event | GSE21713—GSM541697 | id 4766 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1616 | MBD cohort I | 18.34520548 | 4 | amplified | 2137 | 2137 | alive | no event | GSE21713—GSM541690 | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 2857 | MBD cohort I | 28.83287671 | 3 | amplified | 1295 | 1295 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 2863 | MBD cohort I | 10.81643836 | 4 | amplified | 1237 | 1237 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 2868 | MBD cohort I | 157.1835616 | 4 | non-amplified | 1159 | 1159 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E579 | MBD cohort I | 13.15068493 | 4 | non-amplified | 3534 | 3534 | alive | no event | GSE32664—GSM810692 | id 4913 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E598 | MBD cohort I | 48.85479452 | 4 | non-amplified | 3219 | 3219 | alive | no event | GSE32664—GSM810689 | id 5137 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E685 | MBD cohort I | 14.16986301 | 4 | non-amplified | 3011 | 3011 | alive | no event | GSE32664—GSM810685 | id 5043 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | E700 | MBD cohort I | 20.35068493 | 4 | non-amplified | 1536 | 1536 | alive | no event | GSE32664—GSM810680 | id 5041 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 278 | MBD cohort I | 14.53150685 | 1 | non-amplified | 3404 | 3404 | alive | no event | GSE21713—GSM541713 | id 4884 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 397 | MBD cohort I | 18.47671233 | 2 | non-amplified | 3555 | 3555 | alive | no event | GSE21713—GSM541720 | id 10138 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 410 | MBD cohort I | 1.676712329 | 1 | non-amplified | 2910 | 2910 | alive | no event | GSE21713—GSM541714 | id 4878 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 529 | MBD cohort I | 1.249315068 | 3 | non-amplified | 2264 | 2264 | alive | no event | GSE21713—GSM541707 | id 4863 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 530 | MBD cohort I | 1.545205479 | 1 | non-amplified | 2216 | 2216 | alive | no event | GSE21713—GSM541717 | id 4785 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 566 | MBD cohort I | 0.098630137 | 2 | non-amplified | 1615 | 1615 | alive | no event | GSE21713—GSM541718 | id 4826 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 711 | MBD cohort I | 16.99726027 | 2 | non-amplified | 1885 | 1885 | alive | no event | GSE21713—GSM541710 | id 11562 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 744 | MBD cohort I | 16.0109589 | 2 | non-amplified | 1850 | 1850 | alive | no event | GSE21713—GSM541716 | id 4870 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 747 | MBD cohort I | 7.989041096 | 1 | non-amplified | 2302 | 2302 | alive | no event | GSE21713—GSM541712 | id 11563 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 809 | MBD cohort I | 0.164383562 | 1 | non-amplified | 2904 | 2904 | alive | no event | GSE21713—GSM541723 | id 4868 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 914 | MBD cohort I | 1.249315068 | 1 | non-amplified | 2425 | 2425 | alive | no event | GSE21713—GSM541711 | id 11564 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 916 | MBD cohort I | 9.994520548 | 1 | non-amplified | 3830 | 3830 | alive | no event | GSE21713—GSM541726 | id 5372 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 926 | MBD cohort I | 0.920547945 | 1 | non-amplified | 1861 | 1861 | alive | no event | GSE21713—GSM541715 | id 4921 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1650 | MBD cohort I | 0.854794521 | 2 | non-amplified | 1264 | 187 | alive | event | GSE21713—GSM541702 | id 4813 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1699 | MBD cohort I | 7.463013699 | 3 | non-amplified | 2882 | 999 | alive | event | GSE21713—GSM541706 | id 10132 |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 41 | MBD cohort II | 49.90684932 | 4 | non-amplified | 569 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 610 | MBD cohort II | 34.02739726 | 4 | amplified | 850 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 928 | MBD cohort II | 11.7369863 | 4 | amplified | 412 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1430 | MBD cohort II | 35.4739726 | 4 | non-amplified | 520 | 391 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1507 | MBD cohort II | 14.33424658 | 4 | amplified | 285 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1713 | MBD cohort II | 40.99726027 | 4 | non-amplified | 581 | 449 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1780 | MBD cohort II | 30.90410959 | 3 | amplified | 569 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1782 | MBD cohort II | 41.49041096 | 4 | amplified | 707 | 441 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1783 | MBD cohort II | 13.6109589 | 4 | amplified | 316 | 288 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1784 | MBD cohort II | 76.5369863 | 4 | amplified | 1819 | 972 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1786 | MBD cohort II | 15.02465753 | 4 | amplified | 950 | 607 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1790 | MBD cohort II | 101.2931507 | 4 | non-amplified | 989 | 306 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1791 | MBD cohort II | 51.32054795 | 4 | non-amplified | 377 | 357 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1795 | MBD cohort II | 134.0383562 | 4 | non-amplified | 671 | 214 | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1796 | MBD cohort II | 24.2630137 | 4 | non-amplified | 414 | NA | died of disease | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 11 | MBD cohort II | 15.22191781 | 4 | non-amplified | 4396 | 4396 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1030 | MBD cohort II | 54.64109589 | 4 | non-amplified | 2653 | 2653 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1381 | MBD cohort II | 21.46849315 | 4 | non-amplified | 1981 | 1981 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1382 | MBD cohort II | 70.09315068 | 4 | non-amplified | 1891 | 1891 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1384 | MBD cohort II | 16.37260274 | 3 | amplified | 1620 | 1620 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1501 | MBD cohort II | 131.3753425 | 4 | non-amplified | 1558 | 1558 | alive | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1515 | MBD cohort II | 15.97808219 | 4 | non-amplified | 1687 | 1687 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1647 | MBD cohort II | 10.06027397 | 2 | amplified | 2136 | 2136 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1649 | MBD cohort II | 8.745205479 | 4 | amplified | 2174 | 2174 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1789 | MBD cohort II | 25.18356164 | 3 | amplified | 5616 | 5616 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1793 | MBD cohort II | 32.54794521 | 3 | amplified | 2328 | 2328 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1794 | MBD cohort II | 33.23835616 | 3 | amplified | 5096 | 5096 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1800 | MBD cohort II | 71.07945205 | 4 | amplified | 1862 | 1862 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1803 | MBD cohort II | 19.36438356 | 4 | amplified | 2410 | 2410 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1863 | MBD cohort II | 168.6246575 | 4 | non-amplified | 1053 | 560 | alive | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 820 | MBD cohort II | 10.88219178 | 2 | non-amplified | 4794 | 4794 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 822 | MBD cohort II | 12.82191781 | 1 | non-amplified | 1153 | 1153 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 823 | MBD cohort II | 4.24109589 | 1 | non-amplified | 1090 | 1090 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 912 | MBD cohort II | 0.164383562 | 1 | non-amplified | 2904 | 2904 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1028 | MBD cohort II | 3.484931507 | 2 | non-amplified | 2932 | 2932 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1038 | MBD cohort II | 11.4739726 | 2 | non-amplified | 1071 | 1071 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1039 | MBD cohort II | 7.956164384 | 3 | non-amplified | 1394 | 1394 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1469 | MBD cohort II | 1.446575342 | 1 | non-amplified | 3275 | 3275 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1476 | MBD cohort II | 22.22465753 | 2 | non-amplified | 2075 | 2075 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1483 | MBD cohort II | 0.690410959 | 1 | non-amplified | 2554 | 2554 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1484 | MBD cohort II | 1.347945205 | 1 | non-amplified | 2566 | 2566 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1486 | MBD cohort II | 11.50684932 | 1 | non-amplified | 2328 | 2328 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1488 | MBD cohort II | 2.432876712 | 2 | non-amplified | 1827 | 1827 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1509 | MBD cohort II | 1.019178082 | 3 | non-amplified | 2597 | 2597 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1646 | MBD cohort II | 4.109589041 | 2 | non-amplified | 2096 | 2096 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1530 | MBD cohort II | 0.098630137 | 4S | non-amplified | 2039 | 2039 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1494 | MBD cohort II | 1.084931507 | 4S | non-amplified | 1590 | 1590 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1615 | MBD cohort II | 0.295890411 | 4S | non-amplified | 1562 | 1562 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1613 | MBD cohort II | 0.460273973 | 4S | non-amplified | 1503 | 1503 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1191 | MBD cohort II | 1.709589041 | 4S | non-amplified | 2105 | 2105 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1750 | MBD cohort II | 0.723287671 | 4S | non-amplified | 1322 | 1322 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 277 | MBD cohort II | 2.367123288 | 4S | non-amplified | 2099 | 101 | alive | event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1392 | MBD cohort II | 3.484931507 | 4S | non-amplified | 3190 | 3190 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 821 | MBD cohort II | 2.038356164 | 4S | non-amplified | 2652 | 2652 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 750 | MBD cohort II | 8.482191781 | 4S | non-amplified | 1567 | 1567 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1383 | MBD cohort II | 0.493150685 | 4S | non-amplified | 2670 | 2670 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1013 | MBD cohort II | 2.432876712 | 4S | non-amplified | 2252 | 2252 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 511 | MBD cohort II | 3.682191781 | 4S | non-amplified | 3837 | 3837 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 520 | MBD cohort II | 0.328767123 | 4S | non-amplified | 3703 | 3703 | alive | no event | NA | NA |
| Neuroblastoma patient | Homo sapiens | Primary tumor | DNA sample collection | 1537 | MBD cohort II | 0.55890411 | 4S | amplified | 2178 | 2178 | alive | no event | NA | NA |
Using BamUtil, basic sequencing statistics of MBD cohort I and II are computed.

Total read number: the total number of reads in the two paired FASTQ files of a sample; duplicate reads as a percentage of the total read number; properly paired reads as a percentage of the total read number.

| | | | | | | | | | |
|---|---|---|---|---|---|---|---|---|---|
| total read number (e6) | 4.65–18.20 | 13.38 | 14.17 | 29.74–66.59 | 45.09 | 44.41 | 20.86–59.51 | 36.00 | 33.19 |
| duplicate reads (%) | 0.70–72.00 | 6.46 | 3.39 | 2.55–79.69 | 31.04 | 19.89 | 2.24–10.47 | 4.17 | 3.68 |
| properly paired reads (%) | 48.29–94.51 | 85.64 | 89.29 | 86.86–97.57 | 95.33 | 95.72 | 94.78–97.55 | 96.50 | 96.59 |
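
The two percentages in the table can in principle be derived from per-read SAM flags. The following is a minimal sketch, not BamUtil's implementation: it assumes the flags have already been extracted into a list of integers and relies only on the flag bits defined in the SAM specification (0x2 for properly paired, 0x400 for duplicate). The function name `read_flag_stats` and the example flags are hypothetical.

```python
from typing import Iterable

PROPER_PAIR = 0x2    # SAM flag bit: each segment properly aligned
DUPLICATE = 0x400    # SAM flag bit: PCR or optical duplicate


def read_flag_stats(flags: Iterable[int]) -> dict:
    """Summarize SAM flags as a total read count and two percentages."""
    total = duplicates = proper = 0
    for flag in flags:
        total += 1
        if flag & DUPLICATE:
            duplicates += 1
        if flag & PROPER_PAIR:
            proper += 1
    if total == 0:
        raise ValueError("no reads supplied")
    return {
        "total_reads": total,
        "duplicate_pct": 100.0 * duplicates / total,
        "properly_paired_pct": 100.0 * proper / total,
    }


# Illustrative flags only (not taken from the data set).
example = read_flag_stats([99, 147, 1123, 1171, 73, 133, 83, 163])
print(example)
```

Both percentages use the total read number as denominator, matching the definitions given for the table. A real pipeline may additionally need to decide how to treat secondary or supplementary alignments, which this sketch ignores.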
Figure 2. The per base sequence quality scores indicate that the raw sequencing data are of good quality.
Shown are the distributions of the median per base quality score (determined by FASTQC) of the enriched samples of MBD cohort I (a), and of the enriched (b) and input (c) samples of MBD cohort II. In the boxplots, the lower and upper hinges of the boxes represent the 25th and 75th percentile, respectively. The whiskers extend to the lowest and highest value that is within 1.5 times the interquartile range. Data beyond the end of the whiskers are outliers and plotted as dots.
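
The boxplot convention described in this caption can be sketched as follows. This is an illustration under stated assumptions rather than the plotting code used for the figures: quartiles are computed by linear interpolation (other quartile conventions exist and can shift the hinges slightly), and the function name `boxplot_stats` and the example scores are hypothetical.

```python
import statistics
from typing import Sequence


def boxplot_stats(values: Sequence[float]) -> dict:
    """Return hinges, whisker ends, and outliers for one box."""
    data = sorted(values)
    # 25th, 50th and 75th percentile by linear interpolation.
    q1, median, q3 = statistics.quantiles(data, n=4, method="inclusive")
    iqr = q3 - q1
    low_fence = q1 - 1.5 * iqr
    high_fence = q3 + 1.5 * iqr
    # Whiskers end at the most extreme values still inside the fences.
    inside = [v for v in data if low_fence <= v <= high_fence]
    return {
        "lower_hinge": q1,
        "median": median,
        "upper_hinge": q3,
        "whisker_low": min(inside),
        "whisker_high": max(inside),
        "outliers": [v for v in data if v < low_fence or v > high_fence],
    }


# Illustrative quality scores only (not taken from the data set).
stats = boxplot_stats([30, 34, 35, 36, 36, 37, 37, 38, 38, 20])
print(stats)
```

Values outside the 1.5 times interquartile range fences are returned separately, which corresponds to the dots in the plots.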
Figure 3. The mapping quality scores illustrate high mapping accuracy.
Shown are the distributions of the percentages of mapped reads across the different mapping quality ranges, as determined by SAMStat ((a) enriched samples of MBD cohort I, (b) enriched samples of MBD cohort II and (c) input samples of MBD cohort II). In the boxplots, the lower and upper hinges of the boxes represent the 25th and 75th percentile, respectively. The whiskers extend to the lowest and highest value that is within 1.5 times the interquartile range. Data beyond the end of the whiskers are outliers and plotted as dots.
Figure 4. Fragment CpG plots demonstrate that the MBD-enriched samples have a high fraction of CpG dense sequencing fragments.
Shown are the fractions of mapped MBD sequencing fragments with different CpG counts. Per cohort, 100,000 randomly selected fragments of each sample were used to construct the plots.
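
The idea behind a fragment CpG plot can be sketched in a few lines: randomly sample fragments, count the CpG dinucleotides in each, and tabulate the fraction of fragments at each count. The sketch below assumes that fragment sequences are already available as plain strings; the function names, the `seed` parameter, and the toy fragments are hypothetical and do not reproduce the tool used for the figure.

```python
import random
from collections import Counter
from typing import Dict, Sequence


def count_cpg(sequence: str) -> int:
    """Count CpG dinucleotides (a C immediately followed by a G)."""
    return sequence.upper().count("CG")


def cpg_count_fractions(
    fragments: Sequence[str], sample_size: int = 100_000, seed: int = 0
) -> Dict[int, float]:
    """Fraction of randomly sampled fragments at each CpG count."""
    rng = random.Random(seed)
    k = min(sample_size, len(fragments))
    sampled = rng.sample(list(fragments), k)
    counts = Counter(count_cpg(seq) for seq in sampled)
    return {n: counts[n] / k for n in sorted(counts)}


# Toy fragments only (not taken from the data set).
toy = ["ACGTCGCG", "ATATAT", "CGCG", "TTTTCG"]
fractions = cpg_count_fractions(toy)
print(fractions)
```

The default `sample_size` mirrors the 100,000 fragments mentioned in the caption; when fewer fragments are supplied, all of them are used.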
Figure 5. CpG island RPKM values confirm enrichment towards methylated DNA fragments upon MBD capture.
Shown are the densities of the median RPKM values per subcohort. RPKM: reads per kilobase CpG island per million.
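
As a worked illustration of the RPKM definition above, the read count in a CpG island is divided by the island length in kilobases and by the sample's read total in millions. The sketch assumes that "per million" refers to the total number of mapped reads of the sample, which the caption does not state explicitly; the function name `cpg_island_rpkm` and the example numbers are hypothetical.

```python
def cpg_island_rpkm(reads_in_island: int, island_length_bp: int,
                    total_mapped_reads: int) -> float:
    """Reads per kilobase of CpG island per million mapped reads."""
    if island_length_bp <= 0 or total_mapped_reads <= 0:
        raise ValueError("length and total read count must be positive")
    kilobases = island_length_bp / 1_000
    millions = total_mapped_reads / 1_000_000
    return reads_in_island / kilobases / millions


# Illustrative numbers only: 150 reads in a 1,500 bp island,
# 20 million mapped reads in the sample.
value = cpg_island_rpkm(150, 1_500, 20_000_000)
print(value)
```

Because both island length and sequencing depth are normalized away, values computed this way can be compared between islands and between samples.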
Figure 6. Visualization of the MBD sequencing data in IGV confirms methylation of the PCDHB gene cluster.
In (a) the data of MBD cohort I is shown, in (b) the data of MBD cohort II. The upper panels show the genes in the cluster, the location of CpG islands and the GC percentage. In the lower panels, sequence coverage of 6 high-risk patient samples is shown (peak pattern), as well as the location of identified peaks (horizontal bars).