| Literature DB >> 28398289 |
Sara J C Gosline1, Hubert Weinberg2, Pamela Knight3, Thomas Yu1, Xindi Guo1, Nripesh Prasad4, Angela Jones4, Shristi Shrestha4, Braden Boone4, Shawn E Levy4, Salvatore La Rosa3, Justin Guinney1, Annette Bakker3.
Abstract
Neurofibromatosis type 1 (NF1) is a genetic disorder with a range of clinical manifestations such as widespread growth of benign tumours called neurofibromas, pain, learning disorders, bone deformities, vascular abnormalities and even malignant tumours. With the establishment of the Children's Tumour Foundation biobank, neurofibroma samples can now be collected directly from patients to be analysed by the larger scientific community. This work describes a pilot study to characterize one class of neurofibroma, cutaneous neurofibromas, by molecularly profiling of ~40 cutaneous neurofibromas collected from 11 individual patients. Data collected from each tumour includes (1) SNP Arrays, (2) Whole genome sequencing (WGS) and (3) RNA-Sequencing. These data are now freely available for further analysis at http://www.synapse.org/cutaneousNF.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28398289 PMCID: PMC5387919 DOI: 10.1038/sdata.2017.45
Source DB: PubMed Journal: Sci Data ISSN: 2052-4463 Impact factor: 6.444
Figure 1From sample to dataset.
A description of the collaborative effort that enabled the Cutaneous NF data repository and subsequent analysis of the samples.
Description of patients profiled in this study.
| 1 | 48 | Female | White | Yes | SNV | 0 | 6 |
| 2 | 34 | Female | Asian | No | SNV | 1–5 | 5 |
| 3 | 48 | Female | Hispanic | No | SNV | 0 | 5 |
| 4 | 36 | Female | White | No | SNV | 1 | 4 |
| 5 | 65 | Male | White | No | SNV | 0 | 5 |
| 6 | 46 | Female | White | No | LOH | 0 | 4 |
| 8 | 27 | Female | Black | No | SNV | 0 | 5 |
| 9 | 35 | Male | White | No | SNV | 0 | 4 |
| 10 | 58 | Male | White | Yes | — | 0 | 4 |
| 11 | 30 | Female | Black | Yes | SNV | 0 | 6 |
| 13 | 41 | Female | White | No | — | 0 | 4 |
Description of tumours profiled in this study and the available data for each tumour.
| 1 | 1 | 15 | x | ||
| 1 | 2 | 13 | x | ||
| 1 | 3 | NA | x | x | |
| 1 | 4 | 14 | x | ||
| 1 | 6 | 14 | x | x | |
| 1 | 9 | 15 | x | x | |
| 1 | PBMC | x | x | ||
| 2 | 4 | 7 | X | x | |
| 2 | 6 | 6 | X | x | |
| 2 | 1 | 5 | x | x | x |
| 2 | 2 | 7 | x | ||
| 2 | 8 | 9 | x | x | |
| 2 | PBMC | x | x | ||
| 3 | 1 | 11 | x | x | x |
| 3 | 2 | 16 | x | x | x |
| 3 | 3 | 20 | x | x | |
| 3 | 4 | 15 | x | x | x |
| 3 | PBMC | x | x | ||
| 4 | 1 | 8 | x | x | |
| 4 | 10 | 7 | x | x | |
| 4 | 4 | 5 | x | x | |
| 4 | 9 | 10 | x | x | |
| 4 | PBMC | x | x | ||
| 5 | 11 | 10 | x | x | |
| 5 | 12 | 9 | x | x | |
| 5 | 13 | 11 | x | x | |
| 5 | 5 | 8 | x | x | x |
| 5 | 8 | 6 | x | ||
| 5 | PBMC | x | |||
| 6 | 4 | 15 | x | x | x |
| 6 | 5 | 10 | x | x | x |
| 6 | 6 | 15 | x | x | x |
| 6 | 7 | 13 | x | x | x |
| 6 | PBMC | x | x | ||
| 8 | 1 | 10 | X | ||
| 8 | 5 | 13 | X | X | X |
| 8 | 7 | 12 | X | X | |
| 8 | 4 | 10 | X | X | x |
| 8 | 6 | 10 | X | X | X |
| 8 | PBMC | X | |||
| 9 | 1 | 12 | X | x | X |
| 9 | 6 | 5 | X | X | x |
| 9 | 7 | 5 | X | X | |
| 9 | 3 | 13 | X | X | |
| 9 | PBMC | x | X | ||
| 10 | 1 | 11 | x | x | x |
| 10 | 2 | 20 | x | x | x |
| 10 | 3 | 10 | x | x | x |
| 10 | 4 | 5 | x | x | |
| 10 | PBMC | x | x | ||
| 11 | 1 | 13 | X | x | x |
| 11 | 7 | 10 | X | x | |
| 11 | 8 | 10 | X | x | |
| 11 | 14 | 20 | x | ||
| 11 | 2 | 5 | x | x | |
| 11 | 3 | 3 | x | ||
| 11 | PBMC | x | x | ||
| 13 | 1 | 18 | x | x | x |
| 13 | 2 | 7 | x | x | |
| 13 | 3 | 12 | x | x | x |
| 13 | 7 | 6 | x | x |
Figure 2Copy number quality control metrics.
Distribution of SNP array values across samples. Patient samples represented by different colours. Teal outline represents tumour samples while pink outline represents blood. (a) Represents B allele frequencies, (b) represents log R ratio values. (c) Clustering of segmented values, with rows below representing patient samples (colours) and tissue of origin (grey for tumour, black for blood).
Figure 3Number of genomic variant sites shared between all pairs of samples.
Samples are labelled by patient and tissue type and indicate that all samples aside from the outlier (Patient 10 blood) cluster by patient.
High throughput sequencing read statistics in patient tumours and associated peripheral blood mononuclear cells (PBMCs).
| 1 | 1 | NA | NA | 2.60E+09 | 99.99 | 8.1 |
| 1 | 2 | NA | NA | 3.24E+09 | 99.99 | 7.6 |
| 1 | 3 | 8.00E+08 | 95.35 | NA | NA | NA |
| 1 | 4 | NA | NA | 2.69E+09 | 99.99 | 8.4 |
| 1 | 6 | 6.62E+08 | 94.84 | NA | NA | NA |
| 1 | 9 | 6.85E+08 | 96.08 | NA | NA | NA |
| 1 | PBMC | 8.02E+08 | 93.37 | NA | NA | NA |
| 2 | 1 | 7.82E+08 | 96.05 | 2.60E+09 | 99.99 | 7.3 |
| 2 | 2 | NA | NA | 2.52E+09 | 99.99 | 7.1 |
| 2 | 4 | 7.99E+08 | 96.47 | NA | NA | NA |
| 2 | 6 | 7.62E+08 | 96.74 | NA | NA | NA |
| 2 | 8 | 7.45E+08 | 95.14 | NA | NA | NA |
| 2 | PBMC | 7.56E+08 | 95 | NA | NA | NA |
| 3 | 1 | 7.33E+08 | 95.45 | 2.63E+09 | 99.99 | 6.9 |
| 3 | 2 | 7.42E+08 | 95.63 | 2.43E+09 | 99.99 | 7.6 |
| 3 | 3 | 7.99E+08 | 95.49 | 2.98E+09 | 99.99 | 7.3 |
| 3 | 4 | 8.93E+08 | 94.64 | 2.98E+09 | 99.99 | 8 |
| 3 | PBMC | 5.77E+08 | 94.82 | NA | NA | NA |
| 4 | 1 | 7.94E+08 | 92.44 | 2.67E+09 | 99.99 | 6 |
| 4 | 4 | 7.37E+08 | 94.5 | 2.99E+09 | 99.99 | 6.7 |
| 4 | 9 | 7.61E+08 | 94.46 | 3.93E+09 | 99.99 | 7.9 |
| 4 | 10 | 7.41E+08 | 93.64 | 3.06E+09 | 99.99 | 6.8 |
| 4 | PBMC | 9.84E+08 | 95.43 | NA | NA | NA |
| 5 | 5 | 7.67E+08 | 95.69 | 2.08E+09 | 99.99 | 8.7 |
| 5 | 8 | NA | NA | 2.28E+09 | 99.99 | 8 |
| 5 | 11 | 7.21E+08 | 94.52 | NA | NA | NA |
| 5 | 12 | 6.82E+08 | 93.94 | NA | NA | NA |
| 5 | 13 | 7.45E+08 | 93.73 | NA | NA | NA |
| 5 | PBMC | 5.71E+08 | 94.97 | NA | NA | NA |
| 6 | 4 | 7.35E+08 | 95.97 | 2.45E+09 | 99.99 | 8.3 |
| 6 | 5 | 5.42E+08 | 94.56 | 2.71E+09 | 99.99 | 9 |
| 6 | 6 | 7.72E+08 | 94.89 | 2.69E+09 | 99.99 | 6.3 |
| 6 | 7 | 6.83E+08 | 95.07 | 2.39E+09 | 99.99 | 8.8 |
| 6 | PBMC | 7.81E+08 | 95.46 | NA | NA | NA |
| 8 | 1 | NA | NA | 2.39E+09 | 99.99 | 8.8 |
| 8 | 4 | 6.89E+08 | 95.72 | 2.15E+09 | 99.99 | 8.1 |
| 8 | 5 | 8.10E+08 | 94.9 | 2.38E+09 | 99.99 | 7.9 |
| 8 | 6 | 7.05E+08 | 95.37 | 2.56E+09 | 99.99 | 9 |
| 8 | 7 | 7.00E+08 | 92.64 | NA | NA | NA |
| 8 | PBMC | 6.90E+08 | 93.4 | NA | NA | NA |
| 9 | 1 | 7.17E+08 | 95.31 | 3.35E+09 | 99.99 | 6.9 |
| 9 | 6 | 8.16E+08 | 94.77 | 2.82E+09 | 99.99 | 6.8 |
| 9 | 7 | 7.97E+08 | 91.96 | NA | NA | NA |
| 9 | PBMC | 6.30E+08 | 93.77 | NA | NA | NA |
| 10 | 1 | 7.43E+08 | 94.67 | 2.34E+09 | 99.99 | 7.5 |
| 10 | 2 | 7.39E+08 | 93.63 | 2.51E+09 | 99.99 | 6.5 |
| 10 | 3 | 8.20E+08 | 95.46 | 2.68E+09 | 99.99 | 6.4 |
| 10 | 4 | 8.07E+08 | 93.78 | NA | NA | NA |
| 10 | PBMC | 7.57E+08 | 94.48 | NA | NA | NA |
| 11 | 1 | 8.42E+08 | 96.04 | NA | NA | NA |
| 11 | 2 | 6.88E+08 | 95.64 | NA | NA | NA |
| 11 | 3 | NA | NA | 2.20E+09 | 99.99 | 6.6 |
| 11 | 7 | 1.24E+09 | 94.37 | NA | NA | NA |
| 11 | 8 | 5.81E+08 | 95.08 | NA | NA | NA |
| 11 | 14 | NA | NA | NA | NA | NA |
| 11 | PBMC | 7.14E+08 | 94.61 | NA | NA | NA |
| 13 | 1 | 7.44E+08 | 94.52 | 2.57E+09 | 99.99 | 6 |
| 13 | 2 | 7.97E+08 | 94.96 | NA | NA | NA |
| 13 | 3 | 8.06E+08 | 94.77 | 2.79E+09 | 99.99 | 8.2 |
| 13 | 7 | 9.34E+08 | 92.81 | NA | NA | NA |
Figure 4RNA-Seq count distribution.
Distribution of RNA-Seq read counts per gene for both (a) DESeq normalized per-gene counts and (b) FPKM calculated by Cufflinks. Reads are distributed similarly across samples after filtering for unexpressed genes (<2 counts or FPKM of 0.1). Expression of NF1 is indicated by ‘x’ in each sample.