| Literature DB >> 28365738 |
Hedvig Tordai1, Kristóf Jakab1, Gergely Gyimesi2, Kinga András1, Anna Brózik3, Balázs Sarkadi3, Tamás Hegedus1.
Abstract
ABC (ATP-Binding Cassette) proteins with altered function are responsible for numerous human diseases. To aid the selection of positions and amino acids for ABC structure/function studies we have generated a database, ABCMdb (Gyimesi et al. , ABCMdb: a database for the comparative analysis of protein mutations in ABC transporters, and a potential framework for a general application. Hum Mutat 2012; 33:1547-1556.), with interactive tools. The database has been populated with mentions of mutations extracted from full text papers, alignments and structural models. In the new version of the database we aimed to collect the effect of mutations from databases including ClinVar. Because of the low number of available data, even in the case of the widely studied disease-causing ABC proteins, we also included the possible effects of mutations based on SNAP2 and PROVEAN predictions. To aid the interpretation of variations in non-coding regions, the database was supplemented with related DNA level information. Our results emphasize the importance of in silico predictions because of the sparse information available on variants and suggest that mutations at analogous positions in homologous ABC proteins have a strong predictive power for the effects of mutations. Our improved ABCMdb advances the design of both experimental studies and meta-analyses in order to understand drug interactions of ABC proteins and the effects of mutations on functional expression. Database URL: http://abcm2.hegelab.org.Entities:
Mesh:
Substances:
Year: 2017 PMID: 28365738 PMCID: PMC5467578 DOI: 10.1093/database/bax023
Source DB: PubMed Journal: Database (Oxford) ISSN: 1758-0463 Impact factor: 3.451
Figure 1Variations in coding regions of ABCB11, ABCC6 and CFTR from ClinVar (top row), locus specific databases and reviews (bottom row).
Figure 2Distribution of ABCB11, ABCC6 and CFTR variants between ClinVar, ABCM2, and other resources.
Performance of in silico tools SNAP2 and PROVEAN in predicting the effect of mutations in ABC proteins as compared to curated mutations from databases
| Gene | Effect | Curated | SNAP2 | PROVEAN | ||
|---|---|---|---|---|---|---|
| Deleterious | 153 | 98 | 100 | |||
| Neutral | 52 | 29 | 22 | |||
| Unspecified | 15 | |||||
| Deleterious | 182 | 123 | 117 | |||
| Neutral | 2 | 1 | 1 | |||
| Unspecified | 0 | |||||
| ABCC7/ | Deleterious | 349 | 182 | 161 | ||
| Neutral | 31 | 6 | 14 | |||
| Unspecified | 629 | |||||
The number of correctly predicted mutation effects (true positive hits).
Simple homology-based inference can be used to predict the effect of variations even in distant family members
| CFTR | ||
|---|---|---|
| ABCC6 | ABCB11 | |
| Mutations at homologous positions | 85 | 75 |
| Mutations with known effect | 45 (100%) | 35 (100%) |
| Matching effects | 43 (96%) | 30 (86%) |
Distribution of deleterious mutations in different regions of ABC proteins
| Gene | Region | Ratio of mutated positions | |
|---|---|---|---|
| ABCB11 | TMD1_EL | 10% | (8/83) |
| ABCB11 | TMD2_EL | 8% | (3/40) |
| ABCB11 | TMD1_TH | 9% | (12/133) |
| ABCB11 | TMD2_TH | 7% | (10/137) |
| ABCB11 | TMD1_CL | 11% | (12/106) |
| ABCB11 | TMD2_CL | 7% | (7/107) |
| ABCB11 | NBD1 | 16% | (39/237) |
| ABCB11 | NBD2 | 11% | (26/239) |
| ABCB11 | CL | 8% | (20/239) |
| ABCC6 | TMD1_EL | 3% | (1/29) |
| ABCC6 | TMD2_EL | 4% | (1/23) |
| ABCC6 | TMD1_TH | 10% | (12/121) |
| ABCC6 | TMD2_TH | 5% | (6/117) |
| ABCC6 | TMD1_CL | 15% | (18/121) |
| ABCC6 | TMD2_CL | 13% | (16/126) |
| ABCC6 | NBD1 | 15% | (34/225) |
| ABCC6 | NBD2 | 16% | (38/235) |
| ABCC6 | CL | 9% | (29/320) |
| CFTR | TMD1_EL | 39% | (11/28) |
| CFTR | TMD2_EL | 14% | (5/35) |
| CFTR | TMD1_TH | 30% | (36/121) |
| CFTR | TMD2_TH | 15% | (18/117) |
| CFTR | TMD1_CL | 15% | (18/121) |
| CFTR | TMD2_CL | 25% | (32/126) |
| CFTR | NBD1 | 25% | (55/224) |
| CFTR | NBD2 | 18% | (43/234) |
| CFTR | CL | 14% | (66/474) |
Number of positions with mutations/length of region.
TMD: transmembrane domain, EL: extracellular loop, CL: cytoplasmic loop, NBD: nucleotide binding domain, TH: transmembrane helix.