| Literature DB >> 34316407 |
Hannah McConnell1, T Daniel Andrews1, Matt A Field2,3.
Abstract
BACKGROUND: Pharmacogenetic variation is important to drug responses through diverse and complex mechanisms. Predictions of the functional impact of missense pharmacogenetic variants primarily rely on the degree of sequence conservation between species as a primary discriminator. However, idiosyncratic or off-target drug-variant interactions sometimes involve effects that are peripheral or accessory to the central systems in which a gene functions. Given the importance of sequence conservation to functional prediction tools-these idiosyncratic pharmacogenetic variants may violate the assumptions of predictive software commonly used to infer their effect.Entities:
Keywords: Functional inference prediction; Missense mutation; Off-target; Pharmacogenetics; Pharmacogenomics; Variant
Year: 2021 PMID: 34316407 PMCID: PMC8286708 DOI: 10.7717/peerj.11774
Source DB: PubMed Journal: PeerJ ISSN: 2167-8359 Impact factor: 2.984
Classification criteria used to identify off-target pharmacogenetic variants from the PharmGKB database.
| Step | Filter | Number variants |
|---|---|---|
| 1 | Exclude synonymous and non-coding variants | 561 |
| 2 | Include variants that have type:toxicity/ADR | 339 |
| 3 | For drug and gene pairs, exclude variants with additional effect types other than Toxicity/ADR | 273 |
| 4 | Gene containing variant is NOT an ADME process gene OR annotated in GO with ‘xenobiotic metabolic process’ OR ‘transporter’ | 196 |
| 5 | Variant not containing an entry in clinVAR | 142 |
Note:
For 30 variants used as a truth set, we performed an additional detailed literature search to definitively classify each variant as type A or type B.
Figure 1Functional inference scores across all PharmGKB variants grouped by confidence level.
Distribution of functional effect scores of PharmGKB variants predicted by six mutation effect inference tools. Boxplots shown are of (A) CADD Phred score, (B) PolyPhen2 score, (C) SIFT score, (D) Mutation Assessor score, (E) MutPred score and (F) REVEL score. Scores are plotted for each tool in variant confidence categories (from 1 (highest) to 4 (lowest)) assigned by the PharmGKB annotation. Each tool is annotated with the information types it employs to make predictions—Seq: sequence conservation, Struct: protein structural metrics, Ens: an ensemble tool that integrates results of several individual tools. Each tool employs categorical cutoffs with the most damaging category colored as red.
Figure 2Receiver operator curves (ROC) for six functional inference tools vs Type A and Type B enriched pharmacogenetic variants.
Receiver operator curves (ROC) for CADD, Mutation Assessor, MutPred, PolyPhen2, REVEL and SIFT generated with ROCR. All high-confident PharmGKB Category 1 missense variants (Type A enriched) were input as the positive set while a set of randomly selected common variants (MAF > 0.1) were input as the negative set using the Perl function rand() across the entire set of dbSNP missense variants. The same analysis was also performed for all low-confidence PharmGKB Category 3 or 4 missense variants (Type B enriched). Labels were inverted for SIFT due to lower scores representing likely damaging mutations and CADD and Mutation Assessor scores were scaled into the range of 0–1.
AUC and MCC from functional inference tools for PharmGKB category 1 (Type A enriched) vs PharmGKB category 3 and 4 (Type B enriched) pharmacogenetic variants.
| Algorithm | Type A or Type B enriched | Area under curve (AUC) | Matthews correlation coefficient (MCC) |
|---|---|---|---|
| PolyPhen2 | Type A | 0.852 | 0.489 |
| MutPred | Type A | 0.975 | 0.788 |
| REVEL | Type A | 0.942 | 0.794 |
| SIFT | Type A | 0.774 | 0.358 |
| CADD | Type A | 0.728 | 0.321 |
| Mutation assessor | Type A | 0.763 | 0.396 |
| PolyPhen2 | Type B | 0.397 | 0.00338 |
| MutPred | Type B | 0.682 | 0.300 |
| REVEL | Type B | 0.498 | 0.106 |
| SIFT | Type B | 0.410 | −0.0400 |
| CADD | Type B | 0.461 | 0.118 |
| Mutation assessor | Type B | 0.427 | 0.0764 |
| Average Type A | 0.839 | 0.524 | |
| Average Type B | 0.479 | 0.094 |