| Literature DB >> 24894379 |
Hayley Dingerdissen, Daniel S Weaver, Peter D Karp, Yang Pan, Vahan Simonyan, Raja Mazumder1.
Abstract
BACKGROUND: We have previously suggested a method for proteome wide analysis of variation at functional residues wherein we identified the set of all human genes with nonsynonymous single nucleotide variation (nsSNV) in the active site residue of the corresponding proteins. 34 of these proteins were shown to have a 1:1:1 enzyme:pathway:reaction relationship, making these proteins ideal candidates for laboratory validation through creation and observation of specific yeast active site knock-outs and downstream targeted metabolomics experiments. Here we present the next step in the workflow toward using yeast metabolic modeling to predict human metabolic behavior resulting from nsSNV.Entities:
Mesh:
Substances:
Year: 2014 PMID: 24894379 PMCID: PMC4057618 DOI: 10.1186/1745-6150-9-9
Source DB: PubMed Journal: Biol Direct ISSN: 1745-6150 Impact factor: 4.540
Human proteins with active site nsSNV and their yeast ortholog match
| Q9GZR2 | Q08237 | Q9P2J9 | Q12511 | Q15067 | P13711 |
| P49247 | Q12189 | Q9BZP6 | Q06350 | Q96S44 | P53323 |
| Q9UBZ4 | P38207 | O60825 | P32604 | O95154 | P42884 |
| Q14410 | P32190 | O15305 | P07283 | P78368 | P23292 |
| P51659 | Q02207 | Q9UNI6 | Q02256 | Q6PI48 | P15179 |
| P20618 | P23724 | Q96RR4 | P43637 | Q13164 | Q00772 |
| Q96T52 | P46972 | Q9NUW8 | P38319 | P06744 | P12709 |
| P08397 | P28789 | P00480 | P05150 | P11498 | P32327 |
| P04424 | P04076 | Q6IA69 | P38795 | P05186 | P11491 |
| P40818 | P32571 | Q04760 | P50107 | Q8TF76 | P32789 |
| Q8IWW8 | P10127 | P36873 | P32598 | P29120 | P13134 |
| P37268 | P29704 | Q9P2K8 | P15442 | Q8IWX5 | P23501 |
| Q96G46 | Q06053 | Q96GX9 | P47095 | Q96DP5 | P32785 |
| P48637 | Q08220 | Q13332 | P25044 | Q9Y3E5 | P34222 |
| P09467 | P09201 | P12955 | P43590 | P25789 | P23638 |
| Q9UGM6 | P04803 | P06132 | P32347 | P60484 | P53916 |
| P78549 | P31378 | O75191 | P42826 | P49917 | Q08387 |
| O96017 | P39009 | Q9H3S4 | P35202 | P32320 | Q06549 |
| Q7L3T8 | P39965 | P06737 | P06738 | Q9NSY1 | P53974 |
| Q9NYY3 | P32562 | Q13907 | P15496 | O60942 | Q01159 |
| O43426 | P50942 | P49841 | P38615 | Q9UGP5 | P25615 |
| O95363 | P08425 | P36871 | P37012 | Q9BUP3 | P40008 |
| P34949 | P29952 | Q9UBZ9 | P12689 | P54098 | P15801 |
| Q7L211 | P42840 | Q9HBY8 | P12688 | Q9UJM8 | P00175 |
| P07741 | P49435 | P07902 | P08431 | Q8WUX2 | P32656 |
| P13716 | P05373 | Q9Y2L1 | Q08162 | Q14249 | P08466 |
| P48449 | P38604 | Q96C11 | Q04585 | Q5VTY9 | Q08929 |
| P49336 | P39073 | O14818 | P40303 | Q86YJ6 | P16120 |
| Q8IZ73 | Q12362 | O00743 | P20604 | P12081 | P07263 |
| Q69YN2 | P53255 | Q5T2R2 | P18900 | Q9UQB9 | P38991 |
| Q99447 | P33412 | P04180 | P40345 | O14734 | P41903 |
| P30793 | P51601 | Q9Y3Q0 | P47161 | P14550 | P14065 |
| Q9BV23 | P53750 | O43175 | P40054 | P08243 | P49090 |
| P56937 | Q12452 | P22830 | P16622 | Q86V88 | P40081 |
| Q16769 | P43599 | O95336 | P38858 | Q8IXB1 | P40564 |
| Q15386 | P53119 | Q969P6 | P04786 | Q9Y3B8 | P54964 |
| Q9Y2H1 | P53894 | Q9Y2Z4 | P48527 | P00813 | P53909 |
| Q9UI42 | P38836 | Q9Y6R4 | P53599 |
This table lists the pool of 113 reciprocal best match human-yeast ortholog pairs, a subset from the original 559 proteins with active site nsSNV.
34 proteins with active site nsSNV with substrate/product/pathway relationships ideal for use in metabolomics
| t | a | 532 | L-threonine | CID: 6288 | L-threonyl-tRNA | SID: 5901 | |
| r | c | 236 | 3-phospho-D-glycerate | CID: 439183 | 3-phosphonooxypyruvate | CID: 105 | |
| p | l | 107 | 10-formyltetrahydrofolate | CID: 122347 | Tetrahydrofolate | CID: 91443 | |
| h | q | 434 | cholesterol | CID: 5997 | Cholesterol ester | SID: 5537 | |
| h | q | 347 | 4-trimethylammoniobutanoate | CID: 134 | Carnitine | CID: 85 | |
| r | c | 185 | 6-phospho-D-glucono-1,5-lactone | CID: 439452 | 6-phospho-D-gluconate | CID: 91493 | |
| q | x | 160 | L-phenylalanine | CID: 6140 | L-phenylalanyl-tRNA(Phe) | SID: 6321 | |
| r | c | 728 | Beta-D-glucose 6-phosphate | CID: 439427 | 6-phospho-D-gluconate | CID: 91493 | |
| a | v | 221 | PAPS | CID: 10214 | Adenylyl sulfate | CID: 10238 | |
| s | n | 205 | Phosphatidyl choline | SID: 3457 | 2-lysolecithin | SID: 6900 | |
| y | c | 150 | Tetrahydrobiopterin | CID: 44257 | Dihydrobiopterin | CID: 133246 | |
| r | p | 221 | 5-aminolevulinate | CID: 137 | Porphobilinogen | CID: 1021 | |
| d | g | 383 | Ferrous ion | protoporphyrin IX | CID: 27284 | CID: 4971 | Heme | CID: 444097 | |
| h | q | 305 | Adenylic acid | CID: 6083 | Ammonia | inosinic acid | CID: 222 | CID: 8582 | |
| h | r | 66 | Angiotensin I | CID: 3081372 | Angiotensin | CID: 172198 | |
| t | a | 618 | Guanosine 5'-triphosphate | CID: 6830 | Cyclic gmp | CID: 24316 | |
| g | s | 165 | L-histidine | CID: 6274 | Urocanic acid | CID: 736715 | |
| a | s | 271 | Biocytin | CID: 440721 | Biotin | L-lysine | CID: 171548 |CID: 5962 | |
| r | c | 125 | Glycine | gamma-glutamylcysteine | CID: 750 | CID: 123938 | Glutathione | CID: 124886 | |
| d | y | 160 | D-ribulose 5-phosphate | CID: 439184 | Ribose 5-phosphate | CID: 439167 | |
| t | i | 284 | L-dihydroorotic acid | CID: 439216 | Orotic acid | CID: 967 | |
| r | h | 718 | Protein lysine | SID: 5259 | Procollagen 5-hydroxy-L-lysine | SID: 4433 | |
| r | k | 104 | Leukotriene A4 | CID: 5280383 | Leukotriene C4 | CID: 5280493 | |
| y | x | 127 | 10-formyltetrahydrofolate | CID: 122347 | Tetrahydrofolate | CID: 91443 | |
| r | x | 266 | L-aspartic acid | CID: 5960 | L-aspartyl-tRNA(Asp) | SID: 5893 | |
| s | t | 153 | Malonyl-CoA | CID: 10663 | Malonyl-[acyl-carrier-protein] | SID: 4431 | |
| y | x | 121 | Globoside | SID: 124490726 | IV3GalNAca-Gb4Cer | SID: 124490727 | |
| p | s | 182 | D-glucosamine phosphate | CID: 440997 | Beta-D-fructose 6-phosphate | CID: 440641 | |
| r | x | 82 | Alpha-D-glucose | CID: 79025 | Beta-D-glucose | CID: 64689 | |
| g | r | 47 | S-methyl-5-thio-D-ribulose 1-phosphate | CID: 174549 | 5-(methylthio)-2,3-dioxopenyl phosphate | CID: 561 | |
| h | r | 314 | Platelet-activating factor | SID: 7195 | 1-alkyl-sn-glycero-3-phosphocholine | SID: 6975 | |
| n | s | 94 | Ethanolamine | CID: 700 | Phosphoethanolamine | CID: 1015 | |
| g | d | 242 | Guanylic acid | CID: 6804 | Inosinic acid | CID: 8582 | |
| g | s | 243 | L-tyrosine | CID: 6057 | L-tyrosyl-tRNA(Tyr) | SID: 5781 |
This table provides the UniProtKB accession ID, normal and variable residues, position of the variation and substrate/product pairs with PubChem IDs for 34 proteins with annotated nsSNVs at the active site. These proteins were chosen for this study based on a one-protein, one-variation, one-substrate/product pair relationship to simplify preliminary modeling by ensuring no complex interactions for the given protein pathway. NOTE: This table has been updated since the prior publication to include 2 additional proteins which were later identified to meet the criteria.
Figure 1Manual verification of residue conservation at the active site in both human and yeast orthologs. This is the reciprocal best BLAST hits alignment using yeast protein P05373, Delta-aminolevulinic acid dehydratase for S. cerevisiae, as the query searching against the entire human proteome. The only human hit is P13716, also delta-aminolevulinic acid for H. sapiens. From genbank annotations, we know the active site should occur at position 221 in the human sequence, corresponding to position 232 in the yeast sequence. Here we see conservation not only among the active site residues, but also in the surrounding region, which can also be important in facilitating active site binding.
Conserved residues between human and yeast orthologs at and around the active site
| P13716 | P05373 | 221 R | 232 R | GDR + CYQLPP |
| P22830 | P16622 | 383 D | 361 D | +ADLV |
| P48637 | Q08220 | 125 R | 128 R | RSDY+ |
| P49247 | Q12189 | 160 D | 107 D | DGADEVD |
| Q96GX9 | P47095 | 47 G | 39 G | TGTGGGIS + K |
| Q9Y2Z4 | P48527 | 243 G | 261 G | +Q + GG + DQ |
Pathway comparison between human candidate proteins and yeast orthologs using KEGG pathway annotations
| P13716 | ALAD | Porphyrin and clorophyll metabolism | P05373 | HEM2 | Yes | Same |
| | | Biosynthesis of secondary metabolites | | | No | Pathway only annotated in yeast; same substrate/product |
| P22830 | FECH | Porphyrin and clorophyll metabolism | P16622 | HEM15 | Yes | Same |
| | | Biosynthesis of secondary metabolites | | | No | Pathway only annotated in yeast; same substrate/product |
| P48637 | GSS | Glutathione metabolism | Q08220 | GSH2 | Yes | Same |
| P49247 | RPIA | Pentose phosphate pathway | Q12189 | RKI1 | Yes | Same |
| | | Biosynthesis of secondary metabolites | | | No | Pathway only annotated in yeast; same substrate/product |
| Q96GX9 | APIP | Cysteine and methionine metabolism | P47095 | MDE1 | Yes | Same |
| Q9Y2Z4 | YARS2 | Aminoacyl-tRNA biosynthesis | P48527 | MSY1 | Yes | Same |
This table summarizes the pathway involvement of the six identified candidate proteins. For all 6 proteins, the primary pathway involvement is the same, down to the same products and substrates and the same place within a pathway. For three of the yeast proteins, however, the pathway annotation in KEGG listed an additional pathway not annotated for humans: “Biosynthesis of secondary metabolites.” This is likely due to differential annotation between yeast and humans within KEGG. Because the substrates and products annotated for these pathways were identical to those of the primary pathways, these three proteins were not excluded and went onto analysis in the yeast FBA modeling.
Figure 2Similar protein interactions between human and yeast counterparts. Known and predicted protein interactions were retrieved from IntAct [39] and BioGrid [40] using both human and yeast gene names as query terms. Interactions were organized and edited following suggestions by Lim et al. [37]. It is interesting to note that although a sizeable portion of predicted interactors are also predicted to have functional orthology between species, very few of the inter-protein interactions among predicted orthologs were reported for both species.
Figure 3Schema of identification of candidate proteins for yeast modeling. The overall method used in this paper was manually performed, but consists of simple steps and queries which could easily be automated for the performance of future similar studies.
Summary of FBA results
| P13716 | P05373 | Porphobilinogen synthesis | No FBA growth | Ferroheme b + Yeast 7.0 biomass equation |
| P22830 | P16622 | Protoheme ferrochelatase | No FBA growth | Ferroheme b + Yeast 7.0 biomass equation |
| P48637 | Q08220 | Glutathione synthesis | WT growth (μ = 0.1405 hr−1) | 0 flux through reaction in WT (r_0485) |
| P49247 | Q12189 | Ribose-5-phosphate isomerization | No FBA growth | Phosphoribosyl-pyrophosphate block |
| Q96GX9 | P47095 | Methylthioribulose 1-phosphate dehydratase | Not determined | Gene not present in Yeast 7.0 model |
| Q9Y2Z4 | P48527 | Mitochondrial tyrosine-tRNA-ligase reaction | No FBA growth | Mitochondrial tRNA requirement + Yeast 7.0 biomass equation |
P47095 impact could not be modeled as the Yeast 7.0 model contains no corresponding gene.