| Literature DB >> 27485425 |
Yuki Ohtana1, Azian Azamimi Abdullah1, Md Altaf-Ul-Amin1, Ming Huang1, Naoaki Ono1, Tetsuo Sato1, Tadao Sugiura1, Hisayuki Horai2, Yukiko Nakamura1, Aki Morita Hirai1, Klaus W Lange3, Nelson K Kibinge1, Tetsuo Katsuragi1, Tsuyoshi Shirai4, Shigehiko Kanaya5.
Abstract
Developing database systems connecting diverse species based on omics is the most important theme in big data biology. To attain this purpose, we have developed KNApSAcK Family Databases, which are utilized in a number of researches in metabolomics. In the present study, we have developed a network-based approach to analyze relationships between 3D structure and biological activity of metabolites consisting of four steps as follows: construction of a network of metabolites based on structural similarity (Step 1), classification of metabolites into structure groups (Step 2), assessment of statistically significant relations between structure groups and biological activities (Step 3), and 2-dimensional clustering of the constructed data matrix based on statistically significant relations between structure groups and biological activities (Step 4). Applying this method to a data set consisting of 2072 secondary metabolites and 140 biological activities reported in KNApSAcK Metabolite Activity DB, we obtained 983 statistically significant structure group-biological activity pairs. As a whole, we systematically analyzed the relationship between 3D-chemical structures of metabolites and biological activities.Keywords: Biological activities; Clustering of 3D-structure similarity; KNApSAcK family databases; Metabolites; Network of secondary metabolites; Phytochemistry; Structureproperty relationships; Visualization, Cheminformatics; ′Bioinformatics
Year: 2014 PMID: 27485425 DOI: 10.1002/minf.201400123
Source DB: PubMed Journal: Mol Inform ISSN: 1868-1743 Impact factor: 3.353