| Literature DB >> 23819581 |
Li Xie1, Clara Ng, Thahmina Ali, Raoul Valencia, Barbara L Ferreira, Vincent Xue, Maliha Tanweer, Dan Zhou, Gabriel G Haddad, Philip E Bourne, Lei Xie.
Abstract
BACKGROUND: It is a great challenge of modern biology to determine the functional roles of non-synonymous Single Nucleotide Polymorphisms (nsSNPs) on complex phenotypes. Statistical and machine learning techniques establish correlations between genotype and phenotype, but may fail to infer the biologically relevant mechanisms. The emerging paradigm of Network-based Association Studies aims to address this problem of statistical analysis. However, a mechanistic understanding of how individual molecular components work together in a system requires knowledge of molecular structures, and their interactions.Entities:
Mesh:
Substances:
Year: 2013 PMID: 23819581 PMCID: PMC3665574 DOI: 10.1186/1471-2164-14-S3-S9
Source DB: PubMed Journal: BMC Genomics ISSN: 1471-2164 Impact factor: 3.969
Figure 1A multiscale modeling strategy to integrate statistical machine learning, protein structural analysis, and biological network analysis.
Figure 2Workflow to determine core pathways and driver mutations. (A) mutated genes (blue filled circle) and differentially expressed genes (green filled circle) are mapped to a protein-protein interaction network in which the circles and lines represent proteins and interactions between them, respectively. (B) The shortest-path algorithm is applied to construct subnetworks by linking the mutated genes to up-, or down-regulated genes, respectively. (C) BiNGO is applied to identify overrepresented biological pathways. (D) The experimentally validated driver pathway is used to rank the driver mutations.
Predicted driver mutations and core pathways for hypoxia tolerance in Drosophila melanogaster from multiple evidences.
| Mutated Gene (Annotation Symbol) | Molecular Function | FDR Corrected | Shortest-path Distance (z-score) up/down | Functional role of nsSNP inferred from structural modeling | Human ortholog and hypoxia association | ||||
|---|---|---|---|---|---|---|---|---|---|
| Up-regulation | Down-regulation | ||||||||
| Notch* | Gurken/EGFR | Toll | Torso/RTK | ||||||
| Hairless (CG5460) | transcription corepressor | 1.01e-5 | 2.50e-3 | 8.23e-3 | 5.76e-5 | 2.44/4.42 | Possible DNA binding | 82 | Yes [ |
| Rad51D (CG6318) | DNA-dependent ATPase | 3.36e-2 | 1.20e-2 | 1.95e-2 | 1.42e-3 | 2.54/4.09 | PPI | <50 | Yes [ |
| Ulp1 (CG12359) | SUMO-specific protease | 4.68e-2 | 1.87e-2 | >0.05 | >0.05 | 1.78/3.86 | unknown | 63 | Yes [ |
| Wnt5 (CG6407) | receptor binding | 2.67e-2 | 1.97e-3 | 1.06e-2 | 1.16e-7 | 1.26/3.41 | unknown | 58 | Yes [ |
| HDAC4 (CG1770) | histone deacetylase 4 | 2.70e-2 | 4.10e-4 | >0.05 | 5.76e-5 | 1.11/3.13 | AR of catalytic activity | <50 | Yes [ |
| Sol (CG1391) | calcium-dependent cysteine-type endopeptidase | 1.51e-2 | 1.69e-2 | 1.55e-2 | 2.11e-3 | 0.33/2.82 | unknown | <50 | unknown |
| Dys (CG34157) | Dystrophin | 8.28e-5 | 8.05e-5 | >0.05 | 3.17e-3 | 0.38/0.72 | AR of substrate binding | 70 | Yes [ |
| GalNAc-T2 (CG6394) | N-acetylgalactosaminyl transferase | 2.59e-3 | 2.71e-10 | 1.30e-2 | 3.28e-3 | -1.51/0.81 | AR of substrate binding | <50 | Yes [ |
| CG33714 (CG33714) | mRNA binding | 8.59e-5 | 5.61e-3 | 1.51e-2 | >0.05 | -1.59/0.69 | mRNA binding | 87 | unknown |
*We experimentally validate that the up-regulation of Notch signaling, one of the most frequently overrepresented pathways, confers the hypoxia tolerance in Drosophila melanogaster [23, 26]. PPI: Protein-Protein Interaction, AR: Allosteric Regulation.
Figure 3Distribution of sequence identities between proteins containing nsSNPs and templates in the RCSB PDB.
Figure 4The model structure of HDAC4. The yellow spheres represent mutated amino acid A1075. Green sticks represent residues co-evolved with A1075. Red circle represents zinc binding sites of HDAC4. This model structure is built using Modeller [65] based on the sequence alignment between HDAC4 and PDB structure 2VQW. The sequence identity between HDAC4 (819-1223) and 2VQW is 58%.
Figure 5Model structure for Rad51D. The green and cyan cartoon represents the model structure of Rad51D and BRCA2, respectively. Dotted spheres represent the mutated amino acids. The red circle indicates the region of the protein-protein interaction interface. Ser55 mutation may directly impact the oligomerization of Rad51D.