| Literature DB >> 25506935 |
Inna Dubchak1, Sandhya Balasubramanian2, Sheng Wang3, Meydan Cem4, Cem Meyden4, Dinanath Sulakhe5, Alexander Poliakov6, Daniela Börnigen7, Bingqing Xie8, Andrew Taylor2, Jianzhu Ma3, Alex R Paciorkowski9, Ghayda M Mirzaa10, Paul Dave11, Gady Agam12, Jinbo Xu3, Lihadh Al-Gazali13, Christopher E Mason4, M Elizabeth Ross14, Natalia Maltsev5, T Conrad Gilliam5.
Abstract
An essential step in the discovery of molecular mechanisms contributing to disease phenotypes and efficient experimental planning is the development of weighted hypotheses that estimate the functional effects of sequence variants discovered by high-throughput genomics. With the increasing specialization of the bioinformatics resources, creating analytical workflows that seamlessly integrate data and bioinformatics tools developed by multiple groups becomes inevitable. Here we present a case study of a use of the distributed analytical environment integrating four complementary specialized resources, namely the Lynx platform, VISTA RViewer, the Developmental Brain Disorders Database (DBDB), and the RaptorX server, for the identification of high-confidence candidate genes contributing to pathogenesis of spina bifida. The analysis resulted in prediction and validation of deleterious mutations in the SLC19A placental transporter in mothers of the affected children that causes narrowing of the outlet channel and therefore leads to the reduced folate permeation rate. The described approach also enabled correct identification of several genes, previously shown to contribute to pathogenesis of spina bifida, and suggestion of additional genes for experimental validations. The study demonstrates that the seamless integration of bioinformatics resources enables fast and efficient prioritization and characterization of genomic factors and molecular networks contributing to the phenotypes of interest.Entities:
Mesh:
Substances:
Year: 2014 PMID: 25506935 PMCID: PMC4266634 DOI: 10.1371/journal.pone.0114903
Source DB: PubMed Journal: PLoS One ISSN: 1932-6203 Impact factor: 3.240
Figure 1Integration of services in the described analytical environment.
Lynx logo © 2013–2014 Department of Genetics, University of Chicago. RViewer logo © 2010–2012 The Regents of the University of California.
Figure 2Analytical workflow for identification of spina bifida candidate genes.
Identified genetic variants in folate metabolism genes in spina bifida patients and unaffected parents.
| Gene | NW P-value | Identified Variations | Variation Type | Variation occurrence in patients | Variation occurrence in parents | Reference of association with SB |
| FOLH1 | 0 | rs61886492 | Missense, possibly damaging | 6C1, 6C2 | Morin, Devlin et al. 2003 | |
| DMGDH | 0 | rs532964 | Missense, possibly damaging | 2C1, 2C2, 6C1, 6C2 | 2M, 2F, 6M, 6F | Marini, Hoffmann et al. 2011 |
| MTR | 0 | rs1805087 | Missense, possibly damaging | 2C1, 2C2, 6C1 | 2M, 2F, 6M, 6F | Doolin, Barbaux et al. 2002 |
| MSGN1 | 0.002 | rs35858730 | Missense, possibly damaging | 2C1, 2C2, 6C1, 6C2 | Chalamalasetty, Dunty et al. 2011 | |
| CUBN | 0.002 | rs1801228 | Benign | 2M, 6M | Kozyraki, Fyfe et al. 1999, Wahlstedt-Froberg, Pettersson et al. 2003, Whitehead, 2006 | |
| rs41289311 | ||||||
| SLC19A | 0.0028 | rs1051266 | Missense, possibly damaging | 2M, 2F, 6M, 6F | Shaw, Lammer et al. 2002, Morin, Devlin et al. 2003 | |
| rs2239911, rs2239908, rs2239907 | 2C1, 2C2, 6C2 |
*The P-values in Table 1 are generated by 10 000 random permutations of the input data scored according to the strength of association with the phenotype using DBDB recommendations (random reassignment of the scores to network nodes and computation of the corresponding randomized scores for all candidate genes) [38].
**Family 1: affected children 2C1, 2C2; mother 2M, father 2F. Family 2: affected children 6C1, 6C2; mother 6M, father 6F.
Figure 3Ribbon diagram of SLC19A1 protein model generated by RaptorX.
Rainbow coloring from blue to red indicates the N- to C-terminal positions of the residues in the model. The docking location of Folic Acid (FOL), shown in a spacefill form, was predicted by RaptorX-Binding. Numbers in black correspond to key residues, shown in spacefill form, which related to the functional impact of an exonic variant (rs1051266; Arg27His, 80G>A). The diagram was generated using PyMOL.
Figure 4The changes to the binding site caused by the mutation (left: native, right: H27R mutant).
The mutation results in changing contact landscape inside the cleft, especially for the Arg151 residue.
Volume and surface area of the of the cleft in the native and the mutant structures as calculated by 3V.
| Cleft Volume | Native | H27R Mutant |
| Volume | 807 Å3 | 541 Å3 |
| Surface area | 442 Å2 | 328 Å2 |
| Sphericity | 0.95 | 0.98 |
| Effective radius | 5.47 Å | 4.94 Å |
Figure 5Docked folate conformations (blue: native, red: H27R mutant-bound folate molecules) showing the distinct change in the optimal conformation between native and the mutant.
Figure 6Access to the analytical tools within the described bioinformatics environment.