| Literature DB >> 29677173 |
Yann Guillermin1, Jonathan Lopez2,3,4, Kaddour Chabane5, Sandrine Hayette6,7, Claire Bardel8,9, Gilles Salles10,11,12, Pierre Sujobert13,14,15, Sarah Huet16,17,18.
Abstract
High throughput sequencing (HTS) is increasingly important in determining cancer diagnoses, with subsequent prognostic and therapeutic implications. The biology of cancer is becoming increasingly deciphered and it is clear that therapy needs to be individually tailored. Whilst translational research plays an important role in lymphoid malignancies, few guidelines exist to guide biologists and routine laboratories through this constantly evolving field. In this article, we review the challenges of interpreting HTS in lymphoid malignancies and provide a toolkit to interpret single nucleotide variants obtained from HTS. We define the pre-analytical issues such as sequencing DNA obtained from formalin-fixed and paraffin-embedded tissue (FFPE), the acquisition of germline DNA, or the bioinformatic pitfalls, the analytical issues encountered and how to manage them. We describe the main constitutional and cancer databases, their characteristics and limitations, with an emphasis on variant interpretation in lymphoid malignancies. Finally, we discuss the challenges of predictions that one can make using in silico or in vitro modelling, pharmacogenomic screening, and the limits of those prediction tools. This description of the current status in genomic interpretation highlights the need for new large databases and international collaboration in the lymphoma field.Entities:
Keywords: lymphoid malignancies; next-generation sequencing; variant interpretation
Mesh:
Substances:
Year: 2018 PMID: 29677173 PMCID: PMC5979354 DOI: 10.3390/ijms19041251
Source DB: PubMed Journal: Int J Mol Sci ISSN: 1422-0067 Impact factor: 5.923
Catalogues of germline variants databases.
| Database | Cell of Origin | Healthy/Non Cancer Disease | Data | Number of Exome/Genome | URL |
|---|---|---|---|---|---|
| ExAC | Germline | both | Exome | 60,706 | |
| gnomAD | Germline | both | Exome/Genome | 136,632 * | |
| 1000 Genomes | Germline | Healthy | Exome/Genome | 2504 | |
| dbSNP | Germline | both | Exome/Genome | NA | |
| ESP | Germline | both | Exome | 6503 |
* 123,136 exomes and 15,496 genomes; NA: No data available.
Ethnic representation in each database of germline variants, and reported frequencies for the MYD88 p.L265P mutation.
| Database | % African/African American | % Latino/Mixed Americans | % East Asian | % Finnish | % Non-Finnish European | % South Asian | % Ashkenazi | % Other | MYD88 p.L265P Allele Frequency | Reference |
|---|---|---|---|---|---|---|---|---|---|---|
| ExAC | 8.57 | 9.53 | 7.13 | 5.45 | 54.97 | 13.6 | NA | 0.75 | 0.01% | [ |
| gnomAD | 8.80 | 12.60 | 6.91 | 9.44 | 46.38 | 11.26 | 3.72 | 2.37 | 0.0036% | [ |
| 1000 Genomes | 26.4 | 13.86 | 19.53 | 3.95 | 16.13 | 20.13 | NA | 0 | 0.02% | [ |
| dbSNP | NA | NA | NA | NA | NA | NA | NA | NA | * | [ |
| ESP | NA | NA | NA | NA | NA | NA | NA | NA | Not present | [ |
* dbSNP reports VAF from different studies; NA: No data available.
Catalogues of cancer databases.
| Database | Cell of Origin | Data | Number of Exome/Genome | Link | Reference |
|---|---|---|---|---|---|
| TCGA | Somatic | Exome/Genome | 11,077 | [ | |
| ICGC | Somatic | Exome/Genome | 17,000 | [ | |
| COSMIC | Somatic | Exome/Genome | 32,000 genomes + 25,000 peer reviewed papers (genomes and/or exomes) | [ |
Mutation hotspots in lymphoid neoplasms.
| Gene | Hotspot Mutation | Lymphoid Neoplasms (Frequency) | Commentary |
|---|---|---|---|
| V600E | HCL (>90%), MM (5%) | targeted therapy available | |
| Y646, A692 * | FL (30%), DLBCL (10%) | targeted therapy available;* Amino-acid numbering based on transcript NM_004456.4 (sometimes reported as Y641 and A687 with NM_001203247.1) | |
| R172 | AITL (40%) | targeted therapy available | |
| G12, G13, Q61 | MM (40%), DLBCL (10%) | targeted therapy available | |
| L265P ** | LPL (95%), MGUS (50%), DLBCL (10%), CLL (5%), PCNSL (50%), EMZL/MALT (5%), NMZL (5%) | ** Amino-acid numbering based on transcript NM_002468.4 (sometimes reported as L273P with NM_001172567.1) | |
| G17V | AITL (60%), PTCL-NOS (20%) | - | |
| K700E, K666 | CLL (15%) | Prognostic impact in CLL | |
| E571 | PMBL (25%), cHL (25%), CLL (5%) | targeted therapy available |
Only recurrent mutations observed with a frequency >10% are presented. Abbreviations: HCL: hairy cell leukemia; MM: multiple myeloma; FL: follicular lymphoma; DLBCL: diffuse large B-cell lymphoma; AITL: angio-immunoblastic T-cell lymphoma; LPL: lymphoplasmacytic lymphoma; MGUS: monoclonal gammapathy of undetermined significance; CLL: chronic lymphocytic leukemia; PCNSL: primary central nervous system lymphoma; EMZL/MALT: extranodal marginal zone lymphoma of mucosa-associated lymphoid tissue; NMZL: nodal marginal zone lymphoma; PTCL-NOS: peripheral T-cell lymphoma, not otherwise specified; PMBL: primary mediastinal B-cell lymphoma; cHL: classical Hodgkin lymphoma. * Amino-acid numbering based on transcript NM_004456.4 (sometimes reported as Y641 and A687 with NM_001203247.1). ** Amino-acid numbering based on transcript NM_002468.4 (sometimes reported as L273P with NM_001172567.1).
Figure 1Mutation frequencies in different lymphoma entities. Abbreviations: CLL: chronic lymphocytic leukemia; SLL: small lymphocytic lymphoma; MCL: mantle cell lymphoma; MZL: marginal zone lymphoma; WM: Waldenström’s macroglobulinemia; HCL: hairy cell leukemia; FL: follicular lymphoma; BL: Burkitt lymphoma; GCB-DLBCL: germinal-center B-cell-like diffuse large B-cell lymphoma; ABC-DLBCL: activated B-cell-like diffuse large B-cell lymphoma; HL: Hodgkin lymphoma; PMBL: primary mediastinal B-cell lymphoma. AITL: angioimmunoblastic T-cell lymphoma; T-PLL: T-cell prolymphocytic leukemia; LGL: large granular lymphocytic leukemia; MF: mycosis fungoides, SS: Sézary syndrome; ATLL: adult T-cell leukemia/lymphoma; PTCL-NOS: peripheral T-cell lymphoma not otherwise specified; NKTCL: extranodal NK/T-cell lymphoma, nasal-type.
Bioinformatic resources for prediction of variant functional impact.
| Resource | URL | References |
|---|---|---|
| SIFTSorting Intolerant From Tolerant | [ | |
| PROVEANProtein Variation Effect Analyzer | [ | |
| PolyPhen-2Polymorphism Phenotyping v2 | [ | |
| MutationAssessor | [ | |
| MutationTaster | [ | |
| CAROL *Combined Annotation scoRing toOL | [ | |
| Align GCGD ** | [ | |
| dbNSFP v3.0 ***database for Nonsynonymous SNPs’ Functional Predictions | [ |
* Combines SIFT and PolyPhen-2; ** Cancer-specific database where users can either supply their own protein multiple sequence alignments or select from the library of alignments (currently available for ATM, BRCA1, BRCA2, CHEK2, and TP53); *** compiles prediction scores from 20 prediction algorithms (SIFT, Polyphen2-HDIV, Polyphen2-HVAR, LRT, MutationTaster2, MutationAssessor, FATHMM, MetaSVM, MetaLR, CADD, VEST3, PROVEAN, FATHMM-MKL coding, fitCons, DANN, GenoCanyon, Eigen coding, Eigen-PC, M-CAP, REVEL, MutPred) and 6 conservation scores (PhyloP × 2, phastCons × 2, GERP++ and SiPhy).