| Literature DB >> 29254494 |
Gustavo Glusman1, Peter W Rose2, Andreas Prlić2,3, Jennifer Dougherty4, José M Duarte3, Andrew S Hoffman5, Geoffrey J Barton6, Emøke Bendixen7, Timothy Bergquist8, Christian Bock8, Elizabeth Brunk9, Marija Buljan10, Stephen K Burley2,3,11, Binghuang Cai8, Hannah Carter9, JianJiong Gao12, Adam Godzik13, Michael Heuer14, Michael Hicks15, Thomas Hrabe13, Rachel Karchin16,17, Julia Koehler Leman18,19, Lydie Lane20, David L Masica16, Sean D Mooney8, John Moult21,22, Gilbert S Omenn4,23, Frances Pearl24, Vikas Pejaver8,25, Sheila M Reynolds4, Ariel Rokem25, Torsten Schwede26, Sicheng Song8, Hagen Tilgner27, Yana Valasatava3, Yang Zhang23, Eric W Deutsch4.
Abstract
The translation of personal genomics to precision medicine depends on the accurate interpretation of the multitude of genetic variants observed for each individual. However, even when genetic variants are predicted to modify a protein, their functional implications may be unclear. Many diseases are caused by genetic variants affecting important protein features, such as enzyme active sites or interaction interfaces. The scientific community has catalogued millions of genetic variants in genomic databases and thousands of protein structures in the Protein Data Bank. Mapping mutations onto three-dimensional (3D) structures enables atomic-level analyses of protein positions that may be important for the stability or formation of interactions; these may explain the effect of mutations and in some cases even open a path for targeted drug development. To accelerate progress in the integration of these data types, we held a two-day Gene Variation to 3D (GVto3D) workshop to report on the latest advances and to discuss unmet needs. The overarching goal of the workshop was to address the question: what can be done together as a community to advance the integration of genetic variants and 3D protein structures that could not be done by a single investigator or laboratory? Here we describe the workshop outcomes, review the state of the field, and propose the development of a framework with which to promote progress in this arena. The framework will include a set of standard formats, common ontologies, a common application programming interface to enable interoperation of the resources, and a Tool Registry to make it easy to find and apply the tools to specific analysis problems. Interoperability will enable integration of diverse data sources and tools and collaborative development of variant effect prediction methods.Entities:
Mesh:
Year: 2017 PMID: 29254494 PMCID: PMC5735928 DOI: 10.1186/s13073-017-0509-y
Source DB: PubMed Journal: Genome Med ISSN: 1756-994X Impact factor: 11.117
Classification of methods to predict the effect of missense mutations
| Method type | Prediction | Limitations |
|---|---|---|
| Protein stability | Predicts the difference in unfolding free energy between wild-type and mutant protein | Considers only one possible mechanism that may affect the phenotype |
| Protein–protein/protein–nucleic acid affinity | Predicts the difference in the binding affinity between binding partners upon mutation | Small training datasets limit the scope of these methods |
| Protein–ligand affinity | Predicts the difference in ligand-binding affinity upon mutation | Small training datasets limit the scope of these methods |
| Phenotypic effect | Predicts the likelihood that a mutation is deleterious without considering a specific molecular mechanism | Except for Mendelian disease phenotypes, the phenotype may only be observed in a subset of the population (partial penetrance). Databases use different annotation practices and contain contradictory information for some mutations |
| Mapping and 3D visualization | Provides a 3D context of the site of mutation and may give atomic-level insight into mechanism of action | Visual approach is not suitable for automated whole-exome predictions |
| 3D mutation hotspots | Clusters mutations by spatial proximity that are not necessarily close in protein sequence | Clustering may not explain the effect of specific mutations in a hotspot |
3D three-dimensional
Fig. 1Components of the GVto3D portal. The Tools Registry contains a searchable description and metadata for tools, resources, and reference data sets for third-party variant effect prediction and annotation services. Standardized application programming interfaces (APIs) provide interoperability for data input and output of these third-party tools. Custom adapters can provide limited interoperability for tools that cannot adopt the API. A mapping service provides bidirectional mappings from reference genome coordinates to UniProt protein positions and to Protein Data Bank (PDB) residue positions. The tools can use the mapping service to accept variant positions in any of the three coordinate systems. A beacon system enables queries about variant positions where three-dimensional (3D) structural information and annotation are available