Current methods of structure identification in mass-spectrometry-based nontargeted metabolomics rely on matching experimentally determined features of an unknown compound to those of candidate compounds contained in biochemical databases. A major limitation of this approach is the relatively small number of compounds currently included in these databases. If the correct structure is not present in a database, it cannot be identified, and if it cannot be identified, it cannot be included in a database. Thus, there is an urgent need to augment metabolomics databases with rationally designed biochemical structures using alternative means. Here we present the In Vivo/In Silico Metabolites Database (IIMDB), a database of in silico enzymatically synthesized metabolites, to partially address this problem. The database, which is available at http://metabolomics.pharm.uconn.edu/iimdb/, includes ~23,000 known compounds (mammalian metabolites, drugs, secondary plant metabolites, and glycerophospholipids) collected from existing biochemical databases plus more than 400,000 computationally generated human phase-I and phase-II metabolites of these known compounds. IIMDB features a user-friendly web interface and a programmer-friendly RESTful web service. Ninety-five percent of the computationally generated metabolites in IIMDB were not found in any existing database. However, 21,640 were identical to compounds already listed in PubChem, HMDB, KEGG, or HumanCyc. Furthermore, the vast majority of these in silico metabolites were scored as biological using BioSM, a software program that identifies biochemical structures in chemical structure space. These results suggest that in silico biochemical synthesis represents a viable approach for significantly augmenting biochemical databases for nontargeted metabolomics applications.
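The database-matching step described above can be illustrated with a minimal sketch, assuming the matched feature is the neutral monoisotopic mass and that candidates are held in a small in-memory list. The `Candidate` class, the `match_by_mass` function, the 5 ppm tolerance, and the three example compounds are hypothetical choices made for illustration; they are not part of IIMDB or its web service.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Candidate:
    """A database entry reduced to the one feature used for matching here."""
    name: str
    monoisotopic_mass: float  # neutral monoisotopic mass in Da


def ppm_error(observed: float, theoretical: float) -> float:
    """Signed mass error of an observation, in parts per million."""
    return (observed - theoretical) / theoretical * 1e6


def match_by_mass(observed_mass, candidates, tol_ppm=5.0):
    """Return (candidate, ppm error) pairs within tol_ppm, best match first."""
    scored = [(c, ppm_error(observed_mass, c.monoisotopic_mass)) for c in candidates]
    hits = [(c, err) for c, err in scored if abs(err) <= tol_ppm]
    return sorted(hits, key=lambda hit: abs(hit[1]))


# Illustrative stand-in for a biochemical database (assumption, not IIMDB data).
CANDIDATES = [
    Candidate("glucose", 180.06339),
    Candidate("caffeine", 194.08038),
    Candidate("acetaminophen", 151.06333),
]

if __name__ == "__main__":
    for candidate, err in match_by_mass(180.0634, CANDIDATES):
        print(f"{candidate.name}: {err:+.2f} ppm")
    # A mass with no database entry yields no candidates at all.
    print(match_by_mass(200.0000, CANDIDATES))
```

The second call shows the limitation the abstract describes: a compound that is absent from the candidate list returns no hits, regardless of how accurately its mass was measured. Adding in silico metabolites enlarges the candidate list that such a search runs against.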