| Literature DB >> 27907895 |
Pora Kim1, Junfei Zhao1, Pinyi Lu1, Zhongming Zhao2,3.
Abstract
Mutations at the ligand binding sites (LBSs) can influence protein structure stability, binding affinity with small molecules, and drug resistance in cancer patients. Our recent analysis revealed that ligand binding residues had a significantly higher mutation rate than other parts of the protein. Here, we built mutLBSgeneDB (mutated Ligand Binding Site gene DataBase) available at http://zhaobioinfo.org/mutLBSgeneDB We collected and curated over 2300 genes (mutLBSgenes) having ∼12 000 somatic mutations at ∼10 000 LBSs across 16 cancer types and selected 744 drug targetable genes (targetable_mutLBSgenes) by incorporating kinases, transcription factors, pharmacological genes, and cancer driver genes. We analyzed LBS mutation information, differential gene expression network, drug response correlation with gene expression, and protein stability changes for all mutLBSgenes using integrated genetic, genomic, transcriptomic, proteomic, network and functional information. We calculated and compared the binding affinities of 20 carefully selected genes with their drugs in wild type and mutant forms. mutLBSgeneDB provides a user-friendly web interface for searching and browsing through seven categories of annotations: Gene summary, Mutated information, Protein structure related information, Differential gene expression and gene-gene network, Phenotype information, Pharmacological information, and Conservation information. mutLBSgeneDB provides a useful resource for functional genomics, protein structure, drug and disease research communities.Entities:
Mesh:
Substances:
Year: 2016 PMID: 27907895 PMCID: PMC5210621 DOI: 10.1093/nar/gkw905
Source DB: PubMed Journal: Nucleic Acids Res ISSN: 0305-1048 Impact factor: 16.971
Figure 1.Overview of mutLBSgeneDB. mutLBSgeneDB is composed of seven annotation categories that cover the genotypic, transcriptomic, proteomic, phenotypic, pharmacological data and network analyses of 2372 mutLBSgenes with manual curation of 744 targetable_mutLBSgenes.
Annotation entry statistics for mutLBSgenes and targetable_mutLBSgenes
| Data type | # Entries | # mutLBSgenesa | # targetable_mutLBSgenesb |
|---|---|---|---|
| Total 2372 (%) | Total 744 (%) | ||
| Targetable genes | # genes | ||
| Human Kinomec | 267 | 267 (11.3%) | 267 (35.9%) |
| TRANSFACd | 216 | 216 (9.1%) | 216 (29.0%) |
| IUPHARe | 579 | 579 (24.4%) | 579 (77.8%) |
| Cancer driver genesf | 179 | 179 (7.5%) | 179 (24.1%) |
| Ligand binding site | # LBS | ||
| BioLiPg | 10 108 | 2372 (100.0%) | 744 (100.0%) |
| Mutation | # nsSNV | ||
| TCGAh | 11 873 | 2372 (100.0%) | 744 (100.0%) |
| Expression | # genes | ||
| TCGAi | 20 502 | 2372 (100.0%) | 744 (100.0%) |
| Expression with drug treatment | # genes | ||
| CCLEj | 19 931 | 2372 (100.0%) | 744 (100.0%) |
| Molecule | # molecules | ||
| DrugBankk | 8206 drugs | 865 (36.5%) | 378 (50.8%) |
| UniProtl | 2374 proteins | 2372 (100.0%) | 743 (99.9%) |
| BioLiPm | 6108 ligands | 1780 (75.0%) | 572 (76.9%) |
| Phenotype | # phenotype | ||
| DisGeNetn | 6761 disease ID | 1449 (61.1%) | 662 (85.5%) |
| ClinVaro | 107 phenotype ID | 80 (3.4%) | 49 (6.6%) |
| Conservation | # LBS | ||
| MUSCLE resultsp | 27 269 | 2371 (100.0%) | 744 (100.0%) |
aNumber of genes having ligand binding site mutations (mutLBSgenes).
bTargetable mutLBSgenes.
cKinases from The Human Kinome database.
dTranscription factors from TRANSFAC database.
eDrug target genes from IUPHAR database.
fSignificantly mutated genes in 18 TCGA cancer types.
gLigand binding sites from Ligand-protein binding database (BioLiP).
hSomatic non-synonymous single nucleotide variants (nsSNVs) from TCGA in 16 cancer types.
iExpression values from TCGA.
jAnti-cancer drug treated cell-line's gene expression data.
kmutLBSgenes related drug IDs from DrugBank database.
lProtein accession data from UniProt database.
mLigand binding to mutLBSgenes.
nGene-level disease annotation from DisGeNet database.
oMutation-level pathogenic information from ClinVar.
pConservation information across 8 species from MUSCLE.
Figure 2.Ligand binding site mutation information category and protein structure related information category. (A) Ligand binding site mutation information category. The lollipop plot shows non-synonymous single nucleotide variants (nsSNVs) at the ligand binding sites only in amino acid sequence of protein B-Raf. Cancer type specific mutLBS table shows the frequency of each LBS mutation in each cancer type. (B) Protein structure related information category. The dot plot shows the relative protein stability change after occurrence of each mutation. Red dots represent ligand binding site mutations and grey dots for other nsSNVs. This category also shows the predicted protein structure of wild type and mutant protein with their drugs. For the top 20 selected genes, we also calculated the binding affinity.