| Literature DB >> 32127026 |
Giulia Federici1, Silvia Soddu2.
Abstract
The promising expectations about personalized medicine have opened the path to routine large-scale sequencing and increased the importance of genetic counseling for hereditary cancers, among which hereditary breast and ovary cancers (HBOC) have a major impact. High-throughput sequencing, or Next-Generation Sequencing (NGS), has improved cancer patient management, ameliorating diagnosis and treatment decisions. In addition to its undeniable clinical utility, NGS is also unveiling a large number of variants that we are still not able to clearly define and classify, the variants of uncertain significance (VUS), which account for about 40% of total variants. At present, VUS use in the clinical context is challenging. Medical reports may omit this kind of data and, even when included, they limit the clinical utility of genetic information. This has prompted the scientific community to seek easily applicable tests to accurately classify VUS and increase the amount of usable information from NGS data. In this review, we will focus on NGS and classification systems for VUS investigation, with particular attention on HBOC-related genes and in vitro functional tests developed for ameliorating and accelerating variant classification in cancer.Entities:
Keywords: ATM gene; Functional tests; Germline and somatic mutations; Next-generation sequencing; Variant classification; Variants of uncertain significance
Year: 2020 PMID: 32127026 PMCID: PMC7055088 DOI: 10.1186/s13046-020-01554-6
Source DB: PubMed Journal: J Exp Clin Cancer Res ISSN: 0392-9078
The NGS Paradox at a glance: a list of benefits and possible pitfalls
| BENEFITS | PITFALLS |
|---|---|
| ✓ High-throughput technology progress | ✓ Great challenge for clinical use |
| ✓ Use and development of targeted drugs | ✓ New mutations with no drugs |
| ✓ Broad tumor characterization | ✓ Uncertainty in patient outcome |
| ✓ Identification of many diagnostic sub-populations | ✓ Many small sub-populations with poor statistical power |
| ✓ Simultaneous gene sequencing | ✓ Time consuming |
| ✓ Cost decrease | ✓ Not always accessible |
| ✓ Growing number of data | ✓ Difficult data interpretation and classification |
| ✓ Many newly discovered variants | ✓ Increase of VUS |
IARC classification system for genetic variants
| Class | Description | Probability of being pathogenic (5) | Counseling consequences |
|---|---|---|---|
| 1 | Benign | < 0.001 | Consider as “no mutation detected” |
| 2 | Likely benign | 0.001–0.049 | Consider as “no mutation detected” |
| 3 | Uncertain | 0.05–0.949 | Survey depending on family history |
| 4 | Likely pathogenic | 0.95–0.99 | Ascertained high risk regimen |
| 5 | Pathogenic | > 0.99 | Ascertained high risk regimen |
Examples of disease and population databases
| Tools | Website | Description |
|---|---|---|
| ClinVar | Freely accessible, public archive of reports of the relationships among human variations and phenotypes, with supporting evidence. | |
| ClinVar Miner | Interface for viewing ClinVar data. Complements the existing ClinVar database by enabling exploration of the data at different levels of granularity and from different perspectives. Statistics for current data and for historical data can be viewed relative to all submissions, submitters, conflicting submissions and genes. | |
| dbSNP | Public-domain archive for a broad collection of simple genetic polymorphisms. This collection includes single-base substitutions, small-scale multi-base “indels”, and retroposable element insertion and microsatellite repeat variations. | |
| Leiden Open Variation Database (LOVD) | Flexible, freely available tool for gene-centered collection and display of DNA variants. | |
| Cosmic | Source of expert manually curated somatic mutation information relating to human cancers. | |
| cBioPortal | Open-access, open-source resource for interactive exploration of multidimensional cancer genomics datasets. Stores non-synonymous mutations, DNA copy-number data, mRNA and microRNA expression data, protein-level and phosphoprotein level data, DNA methylation data, and de-identified clinical data. | |
| BRCA Exchange | Open-access web portal resource to display | |
| GnomAD | Resource with the goal of aggregating and harmonizing both exome and genome sequencing data from a wide variety of large-scale sequencing projects, and making summary data available. | |
| Exome sequencing project | Database for discovering novel genes and disease mechanisms by pioneering the application of NGS of the protein coding regions of the human genome across diverse, richly-phenotyped populations. | |
| 1000Genomes | Public catalogue of human variation and genotype data for inferring a large complement of variants, including “indels” and structural variants, in panels of people worldwide for whom only a small subset of SNPs have been analyzed, using partial sequencing techniques such as genotyping arrays. | |
| Human Gene Mutation Database | Up-to-date and comprehensive reference source to the spectrum of inherited human gene lesions. It includes the first example of all mutations causing or associated with human inherited disease, plus disease-associated/functional polymorphisms reported in the literature. | |
| Human Genome Variation Society | Source for the discovery and characterization of genomic variations including population distribution and phenotypic associations, by promoting collection, documentation and free distribution of genomic variation information and associated clinical variations. |
“indel” = insertion and/or deletion. Database and algorithm descriptions were taken from respective websites.
Examples of in silico algorithms
| Tools | Website | Description |
|---|---|---|
| PolyPhen-2 | Predicts possible impact of an amino acid substitution on the structure and function of a human protein using straightforward physical and comparative considerations. | |
| Provean | Predicts whether an amino acid substitution or “indel” has an impact on the biological function of a protein. | |
| Sift | Predicts whether an amino acid substitution affects protein function based on sequence homology and the physical properties of amino acids. | |
| Mutation taster | Predicts the functional consequences of amino acid substitutions, intronic and synonymous alterations, short “indel” mutations and variant spanning intron-exon borders. | |
| Mutation assessor | Predicts the functional impact of amino-acid substitutions in proteins, such as mutations discovered in cancer or missense polymorphisms. The functional impact is assessed based on evolutionary conservation of the affected amino acid in protein homologs. | |
| FATHMM | High-throughput web-server capable of predicting the functional consequences of both coding variants, i.e., non-synonymous single nucleotide variants (nsSNVs), and non-coding variants through Hidden Markov Models. | |
| Align-GVGD | Freely available, web-based program that combines the biophysical characteristics of amino acids and protein multiple sequence alignments to predict where missense substitutions in genes of interest fall in a spectrum from enriched deleterious to enriched neutral. | |
| Human splicing finder | Predicts the effects of mutations on splicing signals and identifies splicing motifs in any human sequence. It contains all available matrices for auxiliary sequence prediction to identify exonic and intronic motifs. |
“indel” = insertion and/or deletion. Database and algorithm descriptions were taken from respective websites.
VUS in the top 10 genes with the highest number of submitted variants
| Gene | Variants | VUS |
|---|---|---|
| 12,923 | 7859 | |
| 10,941 | 5101 | |
| 7614 | 2913 | |
| 6082 | 3114 | |
| 4602 | 2046 | |
| 4563 | 1446 | |
| 4266 | 2361 | |
| 4015 | 2209 | |
| 3296 | 1641 | |
| 3228 | 1011 |
Data available on the ClinVar Miner website: https://clinvarminer.genetics.utah.edu/
Number of submitted variants per significance
| Submission significance | Variants | Genes |
|---|---|---|
| Uncertain significance | 266,759 | 13,346 |
| Likely benign | 203,141 | 9515 |
| Benign | 128,364 | 14,810 |
| Pathogenic | 91,322 | 9998 |
| Likely pathogenic | 41,404 | 4198 |
| Not provided | 17,066 | 1594 |
| Other | 2134 | 109 |
Data available on the ClinVar Miner website: https://clinvarminer.genetics.utah.edu/
Number of ATM submitted genetic variants
| Submission significance | Variants |
|---|---|
| Uncertain significance | 2361 |
| Likely benign | 1353 |
| Pathogenic | 539 |
| Benign | 135 |
| Likely pathogenic | 312 |
| Not provided | 48 |
| Total | 4265 |
Data available on the ClinVar Miner website: https://clinvarminer.genetics.utah.edu/. If a variant has more than one submission, it may be counted in more than one significance column. In this case, the total number of variants will be less than the total of the other cells.
Fig. 1Pie chart representing the percentages of ATM submitted genetic variants subgrouped into clinical classes as reported in ClinVar Miner