| Literature DB >> 23577124 |
Iacopo Petrini1, Arun Rajan, Trung Pham, Donna Voeller, Sean Davis, James Gao, Yisong Wang, Giuseppe Giaccone.
Abstract
Molecular pathology of thymomas is poorly understood. Genomic aberrations are frequently identified in tumors but no extensive sequencing has been reported in thymomas. Here we present the first comprehensive view of a B3 thymoma at whole genome and transcriptome levels. A 55-year-old Caucasian female underwent complete resection of a stage IVA B3 thymoma. RNA and DNA were extracted from a snap frozen tumor sample with a fraction of cancer cells over 80%. We performed array comparative genomic hybridization using Agilent platform, transcriptome sequencing using HiSeq 2000 (Illumina) and whole genome sequencing using Complete Genomics Inc platform. Whole genome sequencing determined, in tumor and normal, the sequence of both alleles in more than 95% of the reference genome (NCBI Build 37). Copy number (CN) aberrations were comparable with those previously described for B3 thymomas, with CN gain of chromosome 1q, 5, 7 and X and CN loss of 3p, 6, 11q42.2-qter and q13. One translocation t(11;X) was identified by whole genome sequencing and confirmed by PCR and Sanger sequencing. Ten single nucleotide variations (SNVs) and 2 insertion/deletions (INDELs) were identified; these mutations resulted in non-synonymous amino acid changes or affected splicing sites. The lack of common cancer-associated mutations in this patient suggests that thymomas may evolve through mechanisms distinctive from other tumor types, and supports the rationale for additional high-throughput sequencing screens to better understand the somatic genetic architecture of thymoma.Entities:
Mesh:
Year: 2013 PMID: 23577124 PMCID: PMC3618227 DOI: 10.1371/journal.pone.0060572
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Features of the B3 thymoma.
(A) Preoperative Chest CT-scan showing a mediastinal mass protruding in the left hemithorax. (B) Haematoxylin and Eosin staining of postoperative tumor section showing the lobular aspects typical of B3 thymomas, the almost complete absence of intratumoral thymocytes and the presence of cancer cells characterized by polygonal shape and round nuclei. (C) Whole-genome sequencing results showing (1) Sanger sequencing-confirmed SNVs and INDELs, (2) reference genomic coordinates, (3) copy number gain (green) and loss (red), and (4) structural variations.
Summary of Complete Genomics sequencing results.
| Gross mapping yield (Gb) | ||
| Tumor | 222.02 | |
| Normal | 234.53 | |
| Fully called genome fraction | ||
| Tumor | 95.30% | |
| Normal | 96.40% | |
| SNV Transition/transversion | ||
| Tumor | 2.14591 | |
| Normal | 2.14218 | |
| SNV total count | ||
| Tumor | 3,096,049 | |
| Normal | 3,314,611 | |
| Novel SNV+ | ||
| Tumor | 133,662 | |
| Normal | 145,929 | |
| High confidence somatic SNV | 963 | |
| Non-synonymous | 9 | |
| Splicing sites | 1 | |
| miRNA loci | 0 | |
| INDELi total count | ||
| Tumor | 395,724 | |
| Normal | 433,906 | |
| Novel INDEL+ | ||
| Tumor | 67,163 | |
| Normal | 74,562 | |
| High confidence somatic INDEL | 1101 | |
| Coding region | 2 | |
| Splicing sites | 0 | |
| miRNA loci | 0 |
+ Novel: not previously described in dbSNP131; IINDEL group includes insertion, deletion and substitutions.
Prediction of mutation effect on protein function.
| Gene | Position | Old/new nt | SIFTPrediction | SIFTScore | PolyPhen2Prediction | PolyPhen2 Score |
| BCOR L1067fs | chrX:39930263-39930263 | -TC | Frame shift | – | – | – |
| FBN3S1756del | chr19:8162193–8162195 | CTG/− | In frame | – | – | – |
| LDB3 | chr10:88476529 | G/T | Splicing site | – | – | – |
| PCNXL3 P327Q | chr11:65385813 | C/A | Tolerated | 0.57 | Damaging | 0.998 |
| PHF15L569Q | chr5:133914340 | T/A | Tolerated | 0.59 | Damaging | 1 |
| PIONE498G | chr7:76978720 | T/C | Tolerated | 0.1 | Damaging | 0.974 |
| PPP1R3AG1063A | chr7:113517959 | C/G | Tolerated | 0.24 | Tolerated | 0.001 |
| SFXN3F82S | chr10:102795325 | T/C | Damaging | 0 | Damaging | 0.996 |
| SRGAP1R585H | chr12:64491096 | G/A | Damaging | 0 | Damaging | 1 |
| TAF1R1435C | chrX:70627923 | C/T | Damaging | 0 | Damaging | 1 |
| VN1R5R193W | chr1:247419950 | C/T | – | – | Damaging | 0.986 |
| WDR70R608H | chr5:37727093 | G/A | Damaging | 0 | Damaging | 1 |
A coding sequence mutation can damage or preserve the function of the related protein. Several bioinformatics algorithms have been developed to predict the effect of mutations. We used 2 independent methods based on different assumptions. SIFT prediction is based on the degree of conservation of amino acid residues in sequence alignments derived from closely related species. Polyphen2 prediction is based on various sequence and structure-based features of the substitution site.
Transcriptome sequencing data of tier 1 mutations.
| Mutation | Ensembl ID | Cufflinks FPKM | Low CI | High CI | Mutated allele expression |
| BCOR L1067fs | ENSG00000183337 | 18.87 | 0.00 | 836.48 | 90% |
| FBN3S1756del | ENSG00000142449 | 1.66 | 1.53 | 1.79 | 67% |
| LDB3 | ENSG00000122367 | 0.00 | 0.00 | 0.00 | na* |
| PCNXL3 P327Q | ENSG00000197136 | 18.96 | 18.33 | 19.59 | 42% |
| PHF15L569Q | ENSG00000043143 | 37.99 | 0.00 | 214.34 | 59% |
| PIONE498G | ENSG00000186088 | 15.68 | 0.00 | 140.89 | 0% |
| PPP1R3AG1063A | ENSG00000154415 | 0.00 | 0.00 | 0.01 | no reads |
| SFXN3F82S | ENSG00000107819 | 6.01 | 0.00 | 13.42 | 14% |
| SRGAP1R585H | ENSG00000196935 | 1.29 | 0.00 | 27.89 | 27% |
| TAF1R1435C | ENSG00000147133 | 12.22 | 0.00 | 64.17 | 0% |
| VN1R5R193W | ENSG00000197617 | 0.03 | 0.00 | 0.06 | 50% |
| WDR70R608H | ENSG00000082068 | 25.67 | 24.21 | 27.13 | 34% |
The expression of tier1 genes was estimated using Transcriptome sequencing data. For each gene, identified by a unique Ensembl identity (ID) code, the estimation of expression was calculated using Cufflinks and reported as FPKM (number of fragments per kilobase of exon per million fragments mapped: Cufflinks FPKM). Each FPKM value was calculated with a confidence interval ranging between Low CI and High CI values. Mutated allele expression summarizes the percentage of mutated reads covering the mutation site.
Candidate structural variations identified by complete genome sequencing.
| Variation | Left position | Left strand | Right position | Right strand |
| 1 | chr11:49883579 | − | chrM:16083 | + |
| 2 | chr11:86017394 | + | chrX:123645300 | + |
| 3 | chr1:158725872 | + | chr2:148952671 | − |
Structural variations such as large deletions, insertions inversions or translocations were predicted using whole genome sequencing data. Structural variations were hypothesized in the presence of a junction sequence: a hybrid sequence composed of two non-adjacent fragment of DNA. In the table are reported the candidate structural variations of this B3 thymoma: the extremity position of the 2 joint sequences and the strand of orientation. Among these candidates only the translocation t(11;X) was confirmed by PCR.
Figure 2t(11;X) translocation.
(A) Schematic illustration of t(11;X); STOP: inframe TAA codon. (B) PCR amplification using primers spanning C11ORF73 exon 2-ODZ1 intron 19 junction (2168T), ODZ1 exon 18-intron 19 junction (ODZ1 int19), GAPDH exons 9 and 10 and introns across exon 3 of an unrelated gene (KIF5C) on genomic and cDNA of the t(11;X)-positive B3 thymoma and a t(11;X)-negative thymoma cDNA. Note that the presence of KIF5C PCR product in the B3 thymoma genomic DNA but not cDNA suggests that the cDNA is free of genomic DNA contamination.